Data Science, Database, AI Startups and Domain Learning's (Video-Image-Text-Data-Database): Learning Notes - Action Recognition

June 06, 2020

Learning Notes - Action Recognition - Part II

Paper #1 - Learning Motion in Feature Space: Locally-Consistent Deformable Convolution Networks for Fine-Grained Action Detection

Key Notes

Extraction of local spatio-temporal features followed by temporal modeling

Spatio-temporal feature extraction

Sample consecutive frames
Optical flow for temporal modeling
Dense Trajectory (IDT), Motion History Image (MHI)

Network Architecture

Bi-directional LSTM
Spatial-temporal CNN (STCNN) with Segmentation models
Temporal convolutional networks (TCN)
Temporal deformable residual networks (TDRN)

Different Convolution Strategies

Standard convolution - The standard convolutions use the box, unchangeable shape of the filters
Dilated convolution - Dilating the filter means expanding its size filling the empty positions with zeros.
#out = Conv2D(10, (3, 3), dilation_rate=2)(input_tensor)
Deformable convolution - he deformable convolutions learn the filter shapes and adjust shapes to the most frequent cases

Implementation

Downsampled to 6fps
Frames were resized to 224x224 and augmented using random cropping and mean removal
Each video snippet contained 16 frames after sampling

Follow the Attention: Combining Partial Pose and Object Motion for Fine-Grained Action Detection

Key Notes

Generative Adversarial Network (GAN) to generate exact joint locations from noisy probability heat maps
Detection classification is applied to a continuous sequence of videos of multiple activities
Generative adversarial network (GAN) to produce potential body joint locations in an unsupervised manner

Features

Optical flow (OF) and feature matching
Picking from shelf vs putting back
Joint location estimation results using GAN-based approach.
Actions - Reach, Retract, Hand in, Insp. Product, Insp. Shelf
Fashion Dataset Keypoint detection similar approach can be leveraged here too

Paper #3 - Temporal Convolutional Networks for Action Segmentation and Detection

Key Notes

Temporal Convolutional Networks (TCNs)
Two types of TCNs
First, our EncoderDecoder TCN (ED-TCN) only uses a hierarchy of temporal convolutions, pooling, and upsampling but can efficiently capture long-range temporal patterns.
Second, Dilated TCN uses dilated convolutions

Code Temporal Convolutional Networks
More Reads
An introduction to ConvLSTM
Keras Convolutional LSTM network
Dense-Optical-Flow
Anomaly Detection in Videos using LSTM Convolutional Autoencoder
Attention Based CNN-ConvLSTM for Pedestrian Attribute Recognition

#Drones can monitor when fights break out.
by @Seeker #AI #ArtificialIntelligence #IoT #InternetOfThings #DeepLearning #DataScience #DataAnalytics

Cc: @pawlowskimario @randal_olson @hackingdata @revodavid pic.twitter.com/e2Vuas8GTs
— Ronald van Loon (@Ronald_vanLoon) July 14, 2020

#AI #Technology now on the lookout for shoplifters
by @mashable #ArtificialIntelligence #Tech #IT

Cc: @mikequindazzi @stratorob @moegmida @andy_fitze @wotnot_io pic.twitter.com/8nwhflxHv8
— Ronald van Loon (@Ronald_vanLoon) July 13, 2020

Happy Learning!!!

Data Science, Database, AI Startups and Domain Learning's (Video-Image-Text-Data-Database)

June 06, 2020

Learning Notes - Action Recognition - Part II

No comments:

Git Code Repository

About Me

What is your Expertise

Search This Blog

Translate

About Me and Disclaimer

Labels

Data Science Good Reads

Cloud, Datacentre, BigData and NOSQL Blogs

SQL Links

Archecture Blog List

Programming Problems

Startup - Reads

Perl-Python-Ruby-Linux-Oracle

Management + Leadership Blogs

Research Papers & Podcasts

My Wordpress

Interesting Reads

Useful Links - C# and .NET

Java, Selenium, QTP and Test Tools Learning

Agile Testing

Reverse Logistics Reads

Biztalk Blogs

MS BI Links

Process - Learnt it :)

Usability Guidelines - Building Better Sites

.NET Test Tools and Other Interesting Reads

Review Checklist

Blog Archive

Live Traffic

Total Pageviews

Popular Posts