11. Andrew Pavlo, Erik Paulson, Alexander Rasin, Daniel J. Abadi, David J. DeWitt, Samuel Madden, Michael Stonebraker To appear in SIGMOD ‘09 *Basic ideas from MapReduce - a major step backwards , D. DeWitt and M. Stonebraker
12.
13. Architectural Element Parallel Databases MapReduce Schema Support Structured Unstructured Indexing B-Trees or Hash based None Programming Model Relational Codasyl Data Distribution Projections before aggregation Logic moved to data, but no optimizations Execution Strategy Push Pull Flexibility No, but Ruby on Rails, LINQ Yes Fault Tolerance Transactions have to be restarted in the event of a failure Yes: Replication, Speculative execution
18. SELECT pageURL , pageRank FROM Rankings WHERE pageRank > X ;
19. SELECT INTO Temp sourceIP , AVG ( pageRank ) as avgPageRank , SUM ( adRevenue ) as totalRevenue FROM Rankings AS R , UserVisits AS UV WHERE R . pageURL = UV . destURL AND UV . visitDate BETWEEN Date ( ‘2000-01-15’ ) AND Date ( ‘2000-01-22’ ) GROUP BY UV.sourceIP ; SELECT sourceIP , totalRevenue , avgPageRank FROM Temp ORDER BY totalRevenue DESC LIMIT 1 ;
20.
21. Albert Greenberg, James Hamilton, David A. Maltz, Parveen Patel MSR Redmond Presented by: Saurabh Nangia
22.
23.
24. Amortized Costs (one time purchases amortized over reasonable lifetimes, assuming 5% cost of money) 45% 25% 15% 15%
Details may vary somewhat by site or by moment in time.
Video download – external traffic Search application – internal traffice
Requests from Internet are IP (layer 3) routed through border and access routers to a layer 2 domain based on destination VIP address Single layer 2 domain – 4000 servers Layer 2 domain divided up into subnets using VLANs configured on Layer 2 switches