Data Science, Database, AI Startups and Domain Learning's (Video-Image-Text-Data-Database): Data Perspectives

March 18, 2020

Data Perspectives

Different perspectives to consider when choosing the right database:

Strict data types - Schema on write
Schemaless data - Schema on read
Read-only immutable data
Eventually consistent data
Dirty read vs Committed data
Multi-version concurrency control
Replicate data based on logs
Replay committed logs
Data sharding
Read-heavy, consistent data - RDBMS
Write-heavy, read-light data - HBase, Cassandra
Document-based storage - MongoDB, CouchDB
CAP, ACID Properties
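
To make the first two perspectives concrete, here is a minimal sketch contrasting schema on write with schema on read. The table name, field names, and sample records are assumptions made up for illustration; SQLite with CHECK constraints stands in for a strictly typed store, and a list of JSON strings stands in for a schemaless one.

```python
import json
import sqlite3


def schema_on_write_demo():
    """Types are enforced when data is written; bad rows never get stored."""
    conn = sqlite3.connect(":memory:")
    # Hypothetical table; the CHECK constraints enforce the declared types.
    conn.execute(
        "CREATE TABLE users ("
        " name TEXT NOT NULL CHECK (typeof(name) = 'text'),"
        " age INTEGER NOT NULL CHECK (typeof(age) = 'integer'))"
    )
    conn.execute("INSERT INTO users VALUES (?, ?)", ("alice", 30))
    try:
        # A non-numeric age violates the schema and is rejected at write time.
        conn.execute("INSERT INTO users VALUES (?, ?)", ("bob", "unknown"))
        rejected = False
    except sqlite3.IntegrityError:
        rejected = True
    rows = conn.execute("SELECT name, age FROM users").fetchall()
    conn.close()
    return rejected, rows


def schema_on_read_demo():
    """Anything can be stored; the reader decides how to interpret it."""
    raw_documents = [
        '{"name": "alice", "age": 30}',
        '{"name": "bob", "age": "unknown"}',
    ]
    parsed = []
    for line in raw_documents:
        doc = json.loads(line)
        age = doc.get("age")
        # The schema is applied here, at read time: bad ages become None.
        parsed.append((doc["name"], age if isinstance(age, int) else None))
    return parsed


if __name__ == "__main__":
    print(schema_on_write_demo())
    print(schema_on_read_demo())
```

The trade-off shown here is where the validation cost lands: the strict store fails fast on writes, while the schemaless store accepts everything and pushes the cleanup onto every reader.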

Things I Wished More Developers Knew About Databases

Want to Debug Latency?

I wrote an initial draft on the things I wished more developers knew about DBs. It touches a variety of topics: write skews, external consistency, clock skews, database-generated IDs, nested transaction issues, caches & more. Is there anything you wished more devs knew about DBs?
— Jaana Dogan (@rakyll) April 13, 2020

Similar and deeper-dive topics from the tweet conversation

Read-heavy vs write-heavy. Inserts vs updates. Vacuuming
Replication or not, transaction logging, why indexes matter, performance tuning, I/O scheduler, Unicode, gender isn't binary
Locks, cache effects, isolation levels
I/O bound vs network bound, especially in the case of replication; scaling strategy; concurrency vs distributed.
Materialized views, and the dangers of invalidating them unexpectedly.
Connection pools, scaling techniques for distributed applications and systems, performance improvement, query optimization, etc.
I'd be interested in how this applies to a distributed system. Concurrency (specifically MVCC), connections, DB threading, backpressure handling
Disk storage implementation and optimization
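
Since MVCC comes up both in the list of perspectives and in the conversation, a toy sketch may help show the core idea: writers append new versions instead of overwriting, and each reader sees only the versions committed at or before its snapshot. The class and method names below are hypothetical, and this in-memory model ignores conflicts, aborts, and garbage collection (vacuuming) that real databases must handle.

```python
class MVCCStore:
    """Toy multi-version store: one logical clock, append-only versions."""

    def __init__(self):
        self._versions = {}  # key -> list of (commit_ts, value), oldest first
        self._clock = 0      # logical commit timestamp

    def begin(self):
        # A reader's snapshot is simply the latest commit timestamp so far.
        return self._clock

    def commit_write(self, key, value):
        # Each write commits a new version; old versions stay readable.
        self._clock += 1
        self._versions.setdefault(key, []).append((self._clock, value))
        return self._clock

    def read(self, key, snapshot_ts):
        # Return the newest version committed at or before the snapshot.
        visible = None
        for commit_ts, value in self._versions.get(key, []):
            if commit_ts <= snapshot_ts:
                visible = value
        return visible


if __name__ == "__main__":
    store = MVCCStore()
    store.commit_write("balance", 100)
    snapshot = store.begin()           # reader starts here
    store.commit_write("balance", 50)  # a later writer commits
    print(store.read("balance", snapshot))       # reader still sees its snapshot
    print(store.read("balance", store.begin()))  # a new reader sees the update
```

Because readers never block on the later write, the older snapshot keeps returning the value it started with, which is the property that makes MVCC attractive for read-heavy workloads.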

Keep Thinking!!!
