- Decoder-only model.
- A decoder-only architecture has no explicit encoder that summarizes the input into a context vector.
- In a decoder-only model, the input sequence is fed directly into the decoder, which generates the output sequence by attending to the input through causal self-attention (see the sketch below).
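To make this concrete, here is a minimal sketch of single-head causal self-attention in plain NumPy (the function name, weights, and dimensions are made up for illustration): each position can attend only to itself and earlier positions, which is how a decoder-only model conditions on the input it has seen so far.

```python
import numpy as np

def causal_self_attention(x, Wq, Wk, Wv):
    """Single-head causal self-attention over a sequence x of shape (T, d).
    Each position attends only to itself and earlier positions, which is how
    a decoder-only model conditions on everything it has seen so far."""
    T = x.shape[0]
    q, k, v = x @ Wq, x @ Wk, x @ Wv                # project tokens to queries/keys/values
    scores = q @ k.T / np.sqrt(k.shape[-1])         # relevance of every position to every other
    mask = np.triu(np.ones((T, T), dtype=bool), k=1)
    scores = np.where(mask, -np.inf, scores)        # causal mask: no peeking at future tokens
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)  # softmax over the allowed positions
    return weights @ v                              # mix values from the attended positions

# Toy usage with random weights, purely illustrative
rng = np.random.default_rng(0)
T, d = 5, 8
x = rng.normal(size=(T, d))
Wq, Wk, Wv = (rng.normal(size=(d, d)) for _ in range(3))
print(causal_self_attention(x, Wq, Wk, Wv).shape)  # (5, 8)
```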
Transformer Key blocks
- In the attention step, words “look around” for other words that have relevant context and share information with one another.
- In the feed-forward step, each word “thinks about” the information gathered in previous attention steps and tries to predict the next word (a sketch of how the two steps combine follows this list).
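Here is a minimal sketch of how these two steps fit together in one transformer block (plain NumPy, illustrative names and shapes; layer normalization and multiple heads are omitted for brevity):

```python
import numpy as np

def softmax(z):
    e = np.exp(z - z.max(axis=-1, keepdims=True))
    return e / e.sum(axis=-1, keepdims=True)

def attention(x, Wq, Wk, Wv):
    """Attention step: every position looks at the others and gathers relevant context."""
    q, k, v = x @ Wq, x @ Wk, x @ Wv
    return softmax(q @ k.T / np.sqrt(k.shape[-1])) @ v

def feed_forward(x, W1, b1, W2, b2):
    """Feed-forward step: each position independently processes what it gathered."""
    return np.maximum(0.0, x @ W1 + b1) @ W2 + b2

def transformer_block(x, attn_weights, ff_weights):
    """One block = attention (share information) then feed-forward (think about it).
    Residual connections keep the original signal flowing through the block."""
    x = x + attention(x, *attn_weights)
    x = x + feed_forward(x, *ff_weights)
    return x
```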
What we know about transformers
"What differentiates the Transformer from its predecessors is it’s ability to learn the contextual relationship of values within a sequence through a mechanism called self-attention.
Transformers can generally be categorized into one of three groups (see the example after this list):
- encoder-only, a la BERT,
- decoder-only, a la GPT, and
- encoder-decoder, a la T5.
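A quick way to see the three families side by side is through the Hugging Face transformers library (a sketch, assuming the library is installed; the checkpoint names are the standard public ones):

```python
from transformers import AutoModel, AutoModelForCausalLM, AutoModelForSeq2SeqLM

# Encoder-only: produces contextual embeddings of the input (a la BERT)
encoder_only = AutoModel.from_pretrained("bert-base-uncased")

# Decoder-only: generates text left to right with causal self-attention (a la GPT)
decoder_only = AutoModelForCausalLM.from_pretrained("gpt2")

# Encoder-decoder: an encoder reads the input, a decoder generates the output (a la T5)
encoder_decoder = AutoModelForSeq2SeqLM.from_pretrained("t5-small")
```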
Ref - Link
Keep Exploring!!!