Five Things about GPT
#1. GPT-3, or the third generation Generative Pre-trained Transformer, is a neural network machine learning model trained using internet data to generate any type of text
ChatGPT is a chatbot technology developed by 𝐎𝐩𝐞𝐧𝐀𝐈. It is designed to assist with a variety of tasks and functions, including answering questions, providing information, and completing tasks.
#2. Number of layers and parameters
#3. Parameters
- GPT-2 was released in February 2019 with 1.5 billion parameters
- GPT-3 was released in June 2020 with 175 billion parameters (~120x improvement)
- GPT-4 will be released soon and is expected to have 100 trillion parameters (~500x improvement)
#4. How intelligent can it be?
#5. GPT-4 is built on the Transformer architecture, which has been effective for a variety of machine-learning tasks, including computer vision. This means that GPT-4 might be used for tasks such as image and video generation
#6. Training Approach
No comments:
Post a Comment