"No one is harder on a talented person than the person themselves" - Linda Wilkinson ; "Trust your guts and don't follow the herd" ; "Validate direction not destination" ;

January 19, 2023

GAN Models Study

ChatGPT is not all you need. A State of the Art Review of large Generative AI models

  • DALLE-2 model - text to 3D images,
  • ChatGPT - texts to code
  • Flamingo model - texts to video
  • Phenaki model - texts to audio





  • DALL·E 2, created by OpenAI, is able to generate original, genuine and realistic images and art from a prompt consisting on a text description
  • Imagen is a text-to-image diffusion model [17] consisting on large transformer language models
  • Stable Diffusion : Stable Diffusion is a latent-diffusion model that is opensource and has been developed by the CompVis group at LMU Munich

Models Timeline


Generator
  • Resnet50
  • Downsampling - Strided convolution
  • Residual blocks - Do not change width or height of activation map
  • Downsampling - Dialted Convolution

Discriminator
  • Pixel Exact Images
  • Multihead self attention
  • Multi-Head Attention

UNETGAN
MagGAN
MaskGAN
LOHO



Keep Exploring!!!

No comments: