Next Post is learning about Streaming Data Access in HDFS
Summarising from the link
- HDFS is targeted for batch processing
- Emphasis is high throughput for Data Reads
The answer is very easy to interpret and understand. Please find below answer and underlined is important lines from the answer.
Since Data Stored Sequentials and Read Sequentially. Cost of Random Reads, Time to locate Data Node is minimal.
Another beautiful explanation from Google Research Paper - Google File System
Please feel free to add your comments.
Happy Learning!!!
No comments:
Post a Comment