Data Science, Database, AI Startups and Domain Learning's (Video-Image-Text-Data-Database): Lecture - Visual Question Answering Based on Image and Video

May 30, 2021

Lecture - Visual Question Answering Based on Image and Video - Thao Minh Le

Visual Question Answering Application

demo - Link

Pics and videos are everywhere, words are how humans communicate

Vision + ML + NLP - Interesection of all fields
The Flow

Low level image processing
Objects and Shapes
Object recognition, Relationship between objects
Relationship between object, events
Pizza, Type of pizza

Applications

Visually impaired assistance

Video Analytics analysis
Check a piece of information

Open-ended questions
Choice-based questions
Counting type questions

Next Steps

Perception - Reasoning - Multistep reasoning

Difficult for a single model to address
Obtain knowledge - Form Relationships
Dataset - 1 Million questions
bag of words to embed

BOW + LSTM

Reasoning - Chaining of relative predicates to arrive at the conclusion

Objects - RCNN
Contextual words - Bidirectional LSTM

Connect all objects in sequence

Semantic similarities representation

Relational Reasoning on Visial QA

Conditional Relation Network Unit

Every weekend makes me feel guilty about vision current state of art vs what I am working on when I will bridge the knowledge gap!!!

Data Science, Database, AI Startups and Domain Learning's (Video-Image-Text-Data-Database)

May 30, 2021

Lecture - Visual Question Answering Based on Image and Video - Thao Minh Le

No comments:

Git Code Repository

About Me

What is your Expertise

Search This Blog

Translate

About Me and Disclaimer

Labels

Data Science Good Reads

Cloud, Datacentre, BigData and NOSQL Blogs

SQL Links

Archecture Blog List

Programming Problems

Startup - Reads

Perl-Python-Ruby-Linux-Oracle

Management + Leadership Blogs

Research Papers & Podcasts

My Wordpress

Interesting Reads

Useful Links - C# and .NET

Java, Selenium, QTP and Test Tools Learning

Agile Testing

Reverse Logistics Reads

Biztalk Blogs

MS BI Links

Process - Learnt it :)

Usability Guidelines - Building Better Sites

.NET Test Tools and Other Interesting Reads

Review Checklist

Blog Archive

Live Traffic

Total Pageviews

Popular Posts