"No one is harder on a talented person than the person themselves" - Linda Wilkinson ; "Trust your guts and don't follow the herd" ; "Validate direction not destination" ;

November 30, 2023

Basics - Multivariate testing vs A/B test

Multivariate testing - A technique for testing a hypothesis in which multiple variables are modified. A Vs B Vs C Vs D Vs E. . An MVT allows marketers to understand the impact of changing a number of elements at once (e.g. headline and image

A/B test has isolated variations made beforehand. A vs B. A/B tests are best for testing the global maximum.

  • Optimizely - One of the most well-known optimization platforms that offer a wide range of testing options, including A/B testing, multivariate testing, and personalization.
  • VWO (Visual Website Optimizer) - Provides a suite of testing tools, including A/B testing, split testing, multivariate testing, and other features like heatmaps, session recording, and more.
  • Adobe Target - Part of Adobe's marketing cloud, Adobe Target offers A/B and multivariate testing along with personalization and is a tool used by many enterprise-level businesses.
  • Google Optimize - Google's testing and personalization platform integrates with Google Analytics and offers both A/B and multivariate testing capabilities, as well as redirect tests.
  • Unbounce - While primarily known for its landing page builder, Unbounce also offers A/B and multivariate testing tools specifically geared towards optimizing landing pages for higher conversions.
We used to do something similar called weblabs experimentation during my days at Amazon

Keep Exploring!!!

The Future of Product Innovation is Here! 🌟 #NextGenIdeas #CollaborativeMagic

  • 🎯 Go niche with a Domain-Specific GPT. Why settle for generic when you can bring your own data and craft a model that knows your field inside out? #CustomizedAI #FocusedExcellence
  • 💡 It's a Multimodal World! Knowledge isn't just text; it's Images + Text + Data. Embrace the power of combined data forms to receive enriched, multimodal insights that tell the complete story. #HolisticAI #MultimodalKnowledge
  • 🚀 Supercharge your innovation engine with Ideas an ensemble of Models. Watch your prototyping speed take off as diverse AI models converge to refine your visions faster than ever! #SpeedyPrototyping #AIEnsemble
  • 🛡️ Wave goodbye to one-size-fits-all solutions and hello to Personal Data + Private LLM. Your privacy remains intact while you enjoy tailor-made advice crafted just for you. #PersonalizedAdvice #PrivacyMatters
  • 🛠️ Be part of the vanguard shaping AI Infrastructures. It's not just about building models; it's about building them right—robust, reliable, and fair. For all infrastructure providers, evaluators, and advocates for responsible AI, your insights are invaluable. #AIEthics #ResponsibleAI
  • Let's merge creativity with cutting-edge tech to craft products that not only resonate but revolutionize. Your privacy, our innovation—let's create the future, together.
Make the leap into a new era of product development and #TransformativeCollaboration today! 

Keep Exploring!!!

November 27, 2023

Roles vs Positions vs Titles

  • Developer - Ownership of module - Tech knowledge
  • Sr.Developer - Ownership of code + module design + QA - Tech + QA + Domain knowledge
  • Lead - Ownership of certain functionality + Support - Tech + QA + Support + Domain knowledge
  • Manager - Ownership + Design + Delivery - Tech + QA + Support + Domain knowledge + Delivery
  • Sr.Manager - Ownership and initiatives - Tech + QA + Support + Domain knowledge + Delivery + Architecture 
  • Director - Run the Team + Envision New Initiatives + Code / Adopt New tech - Tech + QA + Support + Domain knowledge + Delivery + Architecture 

Keep Exploring!!!

Ideas - Solutions - New Perspectives

  • Some knowledge is from what we worked
  • Some knowledge is from what we read
  • Some knowledge is from what we were referred
  • The solution is always evolving

Ideas may not always have past references :)

Something Trendy <> Good

War <> Advanced Soceity
Kindness, Empathy Creativity is more important than Grudge, Crooked, Wicked intentions




Keep Exploring!!!

ML to predict Load, Handle requests, Anamoly + Load Forecast

Mitigating DDoS with data science using AWS Shield Advanced and AWS WAF

  • Time series - events are fed into a time-series database in near real time and generate insights using machine learning (ML) models
  • Generating confidence percentage that helps in defining further action, verifies consumer authenticity, and serves the request. It also blocks malicious requests at the edge. Identify malicious patterns
  • Forecast Load - Derive pattern-based rate limits: Deriving rate limits based on a larger set of data—including consumer and IP address—by looking at weekly and monthly patterns.

  • Our data science and eng teams build rigorous models based on historical data at both a Stripe-wide and user-by-user level.
  • We build resilient systems to support spikes and flash sales, and scale our systems to handle more than the predicted peak.

How Razorpay handled significant transaction bursts during events like IPL

  • Rate-limiting and throttling were implemented to safeguard their system against a deluge of requests also DDoS attacks
  • The machine learning system consumed payment success and failure events to predict in real-time where the payment requests should be directed.

The Making of Developer-Console

Keep Exploring!!!

November 26, 2023

LLM Notes

LLM Notes from Andre Karpathy Slides






Keep Exploring!!!

November 24, 2023

Instant Demo Creation - Tools and Ideas

Step #1- Tool - https://scre.io/

Step #2 - Change Format - ffmpeg -i demo1.webm -strict experimental video.mp4

Step #3 Trim it - ffmpeg -i video.mp4 -ss 00:00:08 -t 00:02:43 -async 1 cut.mp4

Step #4 - Tweak Edges - ffmpeg -i Case_Study.mp4 -ss 00:00:00 -t 00:02:37 -async 1 Case_Studyv1.mp4

Keep Exploring!!!


November 22, 2023

Secure Google App Engine App - Streamlit App

1. Steps to build/ deploy sample App



2. Enable IAP for App Engine


Add users
To grant access to the application, 
click "Add Principal" 
select the IAP-secured Web App User role






Keep Exploring!!!

November 21, 2023

Learning Materials - DL - ML - DE - MLOps - NLP

Ready to Learn collated examples
Good Data related papers - Link
Neatly organized here: Link

Good Data Engineering Papers
Updated list of engineering papers worth reading.

1. Google File System - https://lnkd.in/d2-wnyqZ
2. Map Reduce Big Data Algorithm - https://lnkd.in/dvE8-s8M
3. BigTable NoSQL Document Store - https://lnkd.in/drmvvSAK
4. Colossus Next Gen File Store - https://lnkd.in/dERKhwMf
5. Megastore Large Object Store - https://lnkd.in/d5JDs2-K
6. Monarch Time Series DB - https://lnkd.in/d3kH_NCp
7. Chubby Distributed Lock Management - https://lnkd.in/dYy-w5rW
8. Spanner Distributed Database - https://lnkd.in/d6Emnycp
9. Spanner - CAP theorem considerations - https://lnkd.in/dq29BAWQ
10. Dapper Tracing System - https://lnkd.in/dm36-6jn
11. Borg Cluster Management - https://lnkd.in/dnveV-HU
12. Zanzibar Authentication System - https://lnkd.in/d5Vf7sRD
13. Pregel Graph Processing - https://lnkd.in/daq4576Y
14. Napa - Data Warehousing - https://lnkd.in/dbEfsa5B
15. Napa - Partitioning Algorithm - https://lnkd.in/dkhA7efJ
16. TensorFlow - Machine Learning at Scale - https://lnkd.in/d-4NfV2Z
17. Google F1 - Fast Analytics - https://lnkd.in/dbZqEKuf
18. HALP - YouTube Content Delivery Network - https://lnkd.in/dHzJtUc7
19. Mesa - Data Warehousing - https://lnkd.in/dFJ_Jrz6
20. Google Firestore - https://lnkd.in/drtEN9qR
21. Amazon Aurora DB Architecture - https://lnkd.in/dcevpwFt
22. Dynamo DB NoSQL Database - https://lnkd.in/dMD8C_WK
23. Apple Foundation DB - NewSQL database - https://lnkd.in/dG75i_9K
24. TikTok Monolith - Embedding in real-time - https://lnkd.in/dcjBXCnc
25. Scalability at what COST - https://lnkd.in/dJ9ScYKq
26. Gorilla - Time Series DB - https://lnkd.in/d3AeN2kB
27. Cassandra - NoSQL DB - https://lnkd.in/d-_nhtED
28. FlexiRaft - Distributed Consensus Tradeoffs - https://lnkd.in/dX3nMvmt
29. Memcache - In-memory Cache at Facebook - https://lnkd.in/dKeYK67g
30. Millisampler Network Sampling - https://lnkd.in/dsj9FuD6
31. TAO Graph Database - https://lnkd.in/daasJpYf
32. MineSweeper - Root Cause Analysis - https://lnkd.in/dEsd6iwj
33. Facebook Prophet - Forecasting at Scale - https://lnkd.in/daCmAjak
34. Facebook ShardManager - https://lnkd.in/dDy9Dp2h
35. Hive - Map Reduce Jobs - https://lnkd.in/dpV8BM2R
36. Apache Thrift - Definition Language - https://lnkd.in/d7NzhP54
37. Meta Twine - Cluster Management System - https://lnkd.in/d5t7VFKE
38. Meta ServiceRouter - Service mesh - https://lnkd.in/dVnkv_bV
39. Apache Hadoop - Distributed File System - https://lnkd.in/dHsQu9FN
40. Apache Kafka - Event Bus - https://lnkd.in/dyxuKbMb
41. Apache Flink - https://lnkd.in/dn_gMvaR


NLP


Keep Exploring!!!

Forecasting - nixtla

A ton of Algo's consolidated and released - nixtla

There's an ever-growing forest of related projects out there, including:

  • TimeGPT does zero-shot inference
  • Allows forecasting of multiple time series.
  • TimeGPT leverages the Transformer for forecasting tasks.
Ref - Link








Temporian is a library for safe, simple and efficient preprocessing and feature engineering of temporal data in Python.


Keep Exploring!!!

November 20, 2023

Image Generation DALLE 3 vs Stable Diffusion

Prompt - A picturesque countryside in autumn splendor, with rolling hills adorned in warm fall colors, a quaint farmhouse sitting amidst harvested fields, Latest 2023 Tesla Car nestled under a canopy of russet leaves.

DALLE 3




Stable Diffusion


Keep Exploring!!!

The New Stack and Ops for AI

Consistency, User Exp - LLMOps Best practices

  • Framework
  • Repeatable/precise ways
  • AI augments human capabilities
  • User Experience
  • Keep humans in loop
  • Iterate and improve
  • Communicate capabilities
  • Suggestive prompts








  • Consistency

  • Output from JSON grammar


  • Seed parameter


  • Ground Truth



  • Microservice and Keeping track
  • Function Calling

  • DB / Browse data

  • Evaluate performance







  • Semantic Cache
  • Lookup if already in the cache







Keep Exploring!!!


November 19, 2023

Knowing, learning, applying, and predicting

Each skill contributes to the depth of a person's experience or expertise.

  • Knowing, as a skill, involves the ability to recall and understand information accurately. 
  • Learning is a skill that entails not only the acquisition of new information but also the ability to integrate it with what is already known, the ability to adapt to new situations or problems.
  • Applying is a skill that encompasses the practical use of knowledge and learning in real-world situations.
  • Predicting the future is a skill that draws from an understanding of patterns, trends, and the extrapolation of current knowledge to forecast what may happen.

Keep Exploring!!!

November 17, 2023

3PL - Touch points - Logistics - Visibility

3PL, How much visibility do you have over your Logistics? As a consumer / 3PL company track and trace inventory across the chain



The key to the supply chain is visibility.

A look at different touch points collected by FedEx and the level of visibility :)

Touchpoints

  • Item picked up
  • Left the facility
  • On the way
  • At the destination sort facility
  • Arrived at Hub
Keep Exploring!!!

November 14, 2023

Implement OAuth with GCP

This learning is more around adding OAuth for streamlit App. This video and code shared was a good reference to customize.

Creating a custom GCP project, providing OAuth Consent, Adding user info, onboarding is the process.

Step 1 - Fill Consent Screen and Add Test Users







Step 2 - Create Credentials



Sample code




Keep Exploring!!!