"No one is harder on a talented person than the person themselves" - Linda Wilkinson ; "Trust your guts and don't follow the herd" ; "Validate direction not destination" ;

October 11, 2023

Evaluating LLM Models

A good checklist for evaluating LLMs:

  • LLM Type - Open source / Proprietary / Datasets
  • Deployment Options - Private cloud or API model
  • Infra needs - Self-hosting
  • Retrieval Augmentation - Support for RAG
  • Scalability - Performance on large RAG datasets
  • Hallucinations - Handling Hallucinations
  • Benchmarks - Comparison with competing models
  • Legal Compliance - IP / ownership of prompts / PII 
  • Output Compliance - Bias / Toxicity
  • Output Filtering - Content filters
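
The checklist above can be turned into a simple scorecard for comparing candidate models side by side. This is only a minimal sketch: the criterion keys, the 0 to 5 scoring scale, the `ModelEvaluation` class, and the sample scores for "model-a" are all assumptions made for illustration, not part of any standard evaluation framework.

```python
from dataclasses import dataclass, field

# One key per checklist item above; the key names are illustrative.
CRITERIA = [
    "llm_type",
    "deployment_options",
    "infra_needs",
    "retrieval_augmentation",
    "scalability",
    "hallucinations",
    "benchmarks",
    "legal_compliance",
    "output_compliance",
    "output_filtering",
]


@dataclass
class ModelEvaluation:
    """Hypothetical scorecard for one candidate model."""
    name: str
    # criterion -> score from 0 (poor) to 5 (strong); the scale is an assumption
    scores: dict = field(default_factory=dict)

    def missing(self):
        """Checklist items that have not been rated yet."""
        return [c for c in CRITERIA if c not in self.scores]

    def average(self):
        """Mean score over the items rated so far (0.0 if none are rated)."""
        rated = [self.scores[c] for c in CRITERIA if c in self.scores]
        return sum(rated) / len(rated) if rated else 0.0


# Made-up scores, purely to show usage.
candidate = ModelEvaluation(
    "model-a", {"llm_type": 4, "scalability": 2, "hallucinations": 3}
)
print(candidate.missing())   # items still to be assessed
print(candidate.average())   # mean of the rated items
```

Listing the unrated items separately keeps a partially completed review from looking better or worse than it is.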

Keep Exploring!!!
