Data Science, Database, AI Startups and Domain Learning's (Video-Image-Text-Data-Database): Day #156

November 30, 2018

Day #156 - Reinforcement Learning

"Rewards for right moves, Starve for wrong moves"

Key Summary

Intelligence Systems Stack
Agents to Effectors
Raw Data - Features - Gain Knowledge - Reason - Short term and Long Term Actions
Sensory Data - Create Representations
Raw Sensory Data - Feature Learning (Higher Order Representations) - Extract Actionable usable Knowledge
Supervised learning - Memorizers
Reinforcement learning - brute force reasoning
Reinforcement learning components (Goal - State - Actions - Reward)

Step 1 - Reinforcement Learning Stack

Step 2 - Data Sources

Step 3 - Feature Extraction

Step 4 - Representations

Step 5 - Reasoning

Step 6 - Actions

Types of Deep Learning

Reinforcement Learning Components

Learning States Logic

Markov Decision Process

State - Action - Reward - State
Policy - Behavior function
Value Function - How good is state / function
Model - Agents representation of Environment
Stochastic System (having a random probability distribution or pattern that may be analysed statistically but may not be predicted precisely)
Reward structure changes the next step strategy
Encourage Exploration with positive reward
Goal is to Optimize reward

Summary
Intelligence - Ability to accomplish complex goals
Understanding - Ability to turn complex information into simple, useful information

DQN - Deep Q Learning

Neural Network injected into Q
Q function injected into Neural Network
Deep Mind uses DQN
Greedy way pick the best action

Policy Gradients

DQN - Q Learning - Off Policy
Policy Gradient - Directly optimizing policy space

DeepStack

To beat poker players

"Deep Learning for Perception tasks but not for forming actions"

Happy Mastering DL!!!

Data Science, Database, AI Startups and Domain Learning's (Video-Image-Text-Data-Database)

November 30, 2018

Day #156 - Reinforcement Learning

No comments:

Git Code Repository

About Me

What is your Expertise

Search This Blog

Translate

About Me and Disclaimer

Labels

Data Science Good Reads

Cloud, Datacentre, BigData and NOSQL Blogs

SQL Links

Archecture Blog List

Programming Problems

Startup - Reads

Perl-Python-Ruby-Linux-Oracle

Management + Leadership Blogs

Research Papers & Podcasts

My Wordpress

Interesting Reads

Useful Links - C# and .NET

Java, Selenium, QTP and Test Tools Learning

Agile Testing

Reverse Logistics Reads

Biztalk Blogs

MS BI Links

Process - Learnt it :)

Usability Guidelines - Building Better Sites

.NET Test Tools and Other Interesting Reads

Review Checklist

Blog Archive

Live Traffic

Total Pageviews

Popular Posts