Reposting Summary from Quora Answer with my perspective added
What you don't learn in Kaggle Competitions
- Determining business problem to solve with data
- Real world data imbalance, Accuracy issues, Maintaining Models
- Miss the challenges of data engineering (What features to select, causational vs correlation in domain context)
- Identifying / Reusing Existing data for first level models
- Identifying pipelines to build for more relevant variables
- ETL / Data Consolidation / Aggregation, Eliminating outliers / Redundant Data
Happy Learning!!!
No comments:
Post a Comment