ChatGPT is not all you need. A State of the Art Review of large Generative AI models
- DALL·E 2 model - text to images
- ChatGPT - text to text
- Flamingo model - images to text
- Phenaki model - text to video
- DALL·E 2, created by OpenAI, generates original, realistic images and art from a prompt consisting of a text description
- Imagen is a text-to-image diffusion model [17] built on large transformer language models
- Stable Diffusion is an open-source latent diffusion model developed by the CompVis group at LMU Munich
Models Timeline
Generator
- ResNet-50
- Downsampling - Strided convolution
- Residual blocks - Do not change width or height of activation map
- Downsampling - Dilated convolution
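
To make the generator notes above concrete, here is a small sketch of how each layer type changes the activation map size, using the standard convolution output-size formula. The input size (256), kernel sizes, strides and paddings are assumptions chosen for illustration; they are not taken from any specific model in these notes.

```python
def conv_out_size(size, kernel, stride=1, padding=0, dilation=1):
    """Output width (or height) of a convolution along one spatial axis."""
    # A dilated kernel covers dilation * (kernel - 1) + 1 input positions.
    effective_kernel = dilation * (kernel - 1) + 1
    return (size + 2 * padding - effective_kernel) // stride + 1


size = 256  # hypothetical input width/height

# Strided convolution: stride 2 halves the activation map.
after_strided = conv_out_size(size, kernel=3, stride=2, padding=1)

# Residual block conv: 3x3, stride 1, padding 1 keeps width and height.
after_residual = conv_out_size(after_strided, kernel=3, stride=1, padding=1)

# Dilated convolution: dilation 2 widens the kernel footprint to 5 positions;
# with stride 2 it also downsamples (assumed setting).
after_dilated = conv_out_size(after_residual, kernel=3, stride=2, padding=2, dilation=2)

print(after_strided, after_residual, after_dilated)
```

With stride 1 and padding equal to the dilation, the same dilated 3x3 convolution would leave the size unchanged and only enlarge the receptive field, so whether it downsamples depends on the stride and padding used.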
Discriminator
- Pixel Exact Images
- Multi-head self-attention
- Multi-Head Attention
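
As a rough illustration of the multi-head self-attention idea listed above, the sketch below splits each token's features into heads, runs scaled dot-product attention inside each head, and concatenates the results. It is a simplified, pure-Python version: the learned query/key/value and output projections are left out (treated as identity), which is an assumption made to keep the example short, and the function names are hypothetical.

```python
import math


def softmax(row):
    """Numerically stable softmax over a list of scores."""
    m = max(row)
    exps = [math.exp(x - m) for x in row]
    total = sum(exps)
    return [e / total for e in exps]


def attention(q, k, v):
    """Scaled dot-product attention for one head; q, k, v are lists of vectors."""
    scale = math.sqrt(len(q[0]))
    out = []
    for qi in q:
        # Similarity of this query with every key, scaled by sqrt(head_dim).
        scores = [sum(a * b for a, b in zip(qi, kj)) / scale for kj in k]
        weights = softmax(scores)
        # Weighted average of the value vectors.
        out.append([sum(w * vj[d] for w, vj in zip(weights, v))
                    for d in range(len(v[0]))])
    return out


def multi_head_self_attention(x, num_heads):
    """Split features into heads, attend within each head, then concatenate.

    Assumption: no learned projections, so each head sees a raw feature slice.
    """
    dim = len(x[0])
    assert dim % num_heads == 0, "feature size must divide evenly into heads"
    head_dim = dim // num_heads
    heads = []
    for h in range(num_heads):
        part = [tok[h * head_dim:(h + 1) * head_dim] for tok in x]
        heads.append(attention(part, part, part))
    # Concatenate the per-head outputs back into one vector per token.
    return [sum((heads[h][i] for h in range(num_heads)), [])
            for i in range(len(x))]


tokens = [[1.0, 0.0, 0.0, 1.0], [0.0, 1.0, 1.0, 0.0], [1.0, 1.0, 0.0, 0.0]]
mixed = multi_head_self_attention(tokens, num_heads=2)
print(len(mixed), len(mixed[0]))
```

Each output vector has the same size as the input token, and every value is a weighted average of the corresponding values across all tokens.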
UNETGAN
MagGAN
MaskGAN
LOHO
Keep Exploring!!!