8. Database Harddrive
Unstructured ( 61.7% growths )
time to find one record = logb N * 10ms
log100(100,000,000) * 10ms = 40ms
time to read record = 10ms
10,000,000 * 50ms = 5.8 days
9. Hadoop Harddrive
throughput = 10MB/s
time to transfer record = 10ms
10,000,000 * 10ms = 1.5 days
random reads = (5.8 days)
10. Laws of Physics
Random
Sequential Values/Sec.
316
Disk
53,200,000
1,924
SSD
42,200,000
36,700,000
Memory
358,200,000
1 10 100 1,000 10,000 100,000 1,000,000 10,000,000 100,000,000 1,000,000,000
Adam Jacobs
The Pathologies of Big Data
15. Game Changer
Slow Static Barrier
Business
ETL Data Warehouse
Intelligence
Fast Dynamic View
Raw Load Hadoop Data Pipeline
16. NO - SQL
RDBMS
Standard SQL Structured Data Response in sec.
No SQL Unstructured Data Batch
Hadoop
17. Common Applications
Asset Management Analytics
Security Analytics
Product Cohort Analytics
Advanced Web Analytics
Structured +
Many
Unstructured Decision Makers
Data Sources
Data