Practical advice for analysis of large, complex data sets
Technical - Ideas to Analyse Data
- Look at distributions within data
- Look for examples for validate understanding
- Consider outliers
- Check for consistency over time (Validity over period of time)
- Data collection setup
- Reproducible
- Exploratory Data Analysis
- Data Analysis starts with questions not with code or data
- Accept ignorance and mistakes
- Be skeptical
- Educate Consumers
- Identify and compute the refresh rate pattern and accordingly refresh data
Very interesting article on data related risks / challenges.
- Unstable Data Dependencies
- Underutilized Data Dependencies
- Legacy Features
- Correction Cascades
- When Correlations No Longer Correlate
Happy Learning!!!
No comments:
Post a Comment