Big Data: Scale Down, Scale Up, Scale Out Phillip B. Gibbons

43 Slides1.18 MB

Big Data: Scale Down, Scale Up, Scale Out Phillip B. Gibbons Intel Science & Technology Center for Cloud Computing Keynote Talk at IPDPS’15 May 28, 2015

ISTC for Cloud Computing 11.5M over 5 years 4 Intel researchers. Launched Sept 2011 25 faculty 87 students (CMU Berkeley, GA Tech, Princeton, Washington Underlying Infrastructure enabling the future of cloud computing www.istc-cc.cmu.edu Big Data: Scale Down, Scale Up, Scale Out Phillip B. Gibbons 2

Big Data Performance Challenge whenever the volume or velocity of data overwhelms current processing systems/techniques, resulting in performance that falls far short of desired This talk: Focus on performance as key challenge Many other challenges, including: variety of data, veracity of data analytics algorithms that scale programming security, privacy insights from the data, visualization Big Data: Scale Down, Scale Up, Scale Out Phillip B. Gibbons 4

How to Tackle the Big Data Performance Challenge Three approaches to improving performance by orders of magnitude are: Scale down the amount of data processed or the resources needed to perform the processing Scale up the computing resources on a node, via parallel processing & faster memory/storage Scale out the computing to distributed nodes in a cluster/cloud or at the edge Big Data: Scale Down, Scale Up, Scale Out Phillip B. Gibbons 5

Scale down the amount of data processed or the resources needed to perform the processing Goal: Answer queries much faster/cheaper than brute force Specific query? memoized answer Family of queries? Retrieval? good index With underlying common subquery (table)? materialized view Aggregation? data cube Important Scale Down tool: approximation (w/error guarantees) Big Data: Scale Down, Scale Up, Scale Out Phillip B. Gibbons 6

Big Data Queries circa 1995 Decision Support Systems (DSS) SQL Query Exact Answer Long Response Times! Scale Down Insight: Often EXACT answers not required – DSS applications usually exploratory: early feedback to help identify “interesting” regions – Preview answers while waiting. Trial queries – Aggregate queries: precision to “last decimal” not needed Big Data: Scale Down, Scale Up, Scale Out Phillip B. Gibbons 7

Fast Approximate Answers Often, only interested in leading digits of answer E.g., Average salary for 59,152.25 (exact) in 10 minutes 59,000 /- 500 (with 95% confidence) Original Data (PB/TB) statistical summarization in 10 seconds Synopsis (GB/MB) Orders of magnitude speed-up because synopses are orders of magnitude smaller than original data Big Data: Scale Down, Scale Up, Scale Out Phillip B. Gibbons 8

The Aqua Architecture SQL Query Q [Sigmod’98, ] Q Network HTML XML Result Data Warehouse Browser Excel Picture without Aqua Big Data: Scale Down, Scale Up, Scale Out Warehouse Data Updates Phillip B. Gibbons 9

The Aqua Architecture SQL Query Q [Sigmod’98, ] Q’ Network HTML XML Rewriter Result (w/ error bounds) Browser Excel Picture with Aqua: Data Warehouse AQUA Synopses Warehouse Data Updates AQUA Tracker – Aqua is middleware, between client & warehouse (Client: error bound reporting. Warehouse SW: unmodified) – Aqua Synopses are stored in the warehouse – Aqua intercepts the user query and rewrites it to be a B.query Phillip Gibbons Q’ 10 Big Data: Scale Down, Scale Up, Scale Out

Precomputed, Streaming Synopses Our Insights (circa 1996) Precomputed is often faster than on-the-fly – Better access pattern than sampling – Small synopses can reside in memory Compute synopses via one pass streaming – Seeing entire data is very helpful: provably & in practice (Biased sampling for group-bys, Distinct value sampling, Join sampling, Sketches & other statistical functions) – Incrementally update synopses as new data arrives Bottom Line: Orders of magnitude faster on DSS queries Big Data: Scale Down, Scale Up, Scale Out Phillip B. Gibbons 11

Example: Distinct-Values Queries select count(distinct target-attr) from rel Template where P select count(distinct o custkey) Example using from orders TPC-D/H/R where o orderdate ‘2014-05-28’ schema How many distinct customers placed orders in past year? – Orders table has many rows for each customer, but must only count each customer once & only if has an order in past year Big Data: Scale Down, Scale Up, Scale Out Phillip B. Gibbons 12

Distinct-Values Query Approaches 10% sample Estimate from Random Sample 7 3 – Statistics, Databases, etc 3791 76 – Lousy in practice 5 distinct? 50 distinct? – [Charikar’00] Need linear sample size Flajolet-Martin‘85 u universe size – One-pass algorithm, stores O(log u) bits – Only produces count, can’t apply a predicate Our Approach: Distinct Sampling [VLDB’01] – One-pass, stores O(t * log u) tuples – Yields sample of distinct values, with up to t-size uniform sample of rows for each value – First to provide provably good error guarantees Big Data: Scale Down, Scale Up, Scale Out Phillip B. Gibbons 13

Accuracy vs. Data Skew Data set size 1M Sample sizes 1% 7 Ratio Error 6 5 4 Distinct Sampling 3 GEE 2 AE 1 0 0.5 1 1.5 2 2.5 3 3.5 4 Zipf Parameter [VLDB’01] Over the entire range of skew : Distinct Sampling has 1.00-1.02 ratio error At least 25 times smaller relative error than GEE and AE Big Data: Scale Down, Scale Up, Scale Out Phillip B. Gibbons 14

Scale Down Today Hundreds and hundreds of clever algorithms – Synopsis-based approximations tailored to query families – Reduce data size, data dimensionality, memory needed, etc Synopses routinely used in Big Data analytics applications at Google, Twitter, Facebook, etc – E.g., Twitter’s open source Summingbird toolkit Hyperloglog – number of unique users who perform a certain action; followers-of-followers CountMin Sketch – number of times each query issued to Twitter search in a span of time; building histograms Bloom Filters – keep track of users who have been exposed to an event to avoid duplicate impressions (10 8 events/day for 10 8 users) [Boykin et al, VLDB’14] Big Data: Scale Down, Scale Up, Scale Out Phillip B. Gibbons 15

How to Tackle the Big Data Performance Challenge Scale Down Scale Up the computing resources on a node, via parallel processing & faster memory/storage Scale Out Big Data: Scale Down, Scale Up, Scale Out Phillip B. Gibbons 16

Why Scale Up when you can Scale Out? Much of Big Data focus has been on Scale Out – Hadoop, etc But if data fits in memory of multicore then often order of magnitude better performance – GraphLab1 (multicore) is 1000x faster than Hadoop (cluster) – Multicores now have 1-12 TB memory: most graph analytics problems fit! Even when data doesn’t fit, will still want to take advantage of Scale Up whenever you can Big Data: Scale Down, Scale Up, Scale Out Phillip B. Gibbons 17

Multicore: 144-core Xeon Haswell E7-v3 socket socket 2 HW threads 32KB 256KB 18 2 HW threads 2 HW threads 32KB 256KB 45MB Shared L3 Cache 8 32KB 18 256KB 2 HW threads 32KB 256KB 45MB Shared L3 Cache up to 12 TB Main Memory Attach: Hard Drives & Flash Devices Big Data: Scale Down, Scale Up, Scale Out Phillip B. Gibbons 18

Hierarchy Trends Good performance [energy] requires effective use of hierarchy Hierarchy getting richer – More cores – More levels of cache – New memory/storage technologies Flash/SSDs, emerging PCM Bridge gaps in hierarchies – can’t just look at last level of hierarchy Big Data: Scale Down, Scale Up, Scale Out Phillip B. Gibbons 19

Hi-Spade: Hierarchy-Savvy Sweet Spot Platform 1 performance HierarchySavvy (Pain)-Fully Aware Platform 2 Ignoring programming effort Goals: Modest effort, good performance in practice, robust, strong theoretical foundation Big Data: Scale Down, Scale Up, Scale Out Phillip B. Gibbons 20

What Yields Good Hierarchy Performance? Spatial locality: use what’s brought in Temporal locality: reuse it Constructive sharing: don’t step on others’ toes Stepping on toes CPU1 L1 CPU2 L1 CPU3 L1 e.g., all CPUs write B at the same time B Shared L2 Cache L2 Cache Two design options Cache-aware: Focus on the bottleneck level Cache-oblivious: Design for any cache size Big Data: Scale Down, Scale Up, Scale Out Phillip B. Gibbons 21

Multicore Hierarchies’ Key New Dimension: Scheduling Scheduling of parallel threads has LARGE impact on hierarchy performance Recall our problem scenario: Key reason: Caches not fully shared CPU3 CPU2 CPU1 all CPUs want to write B at the same time L1 B L1 L1 Shared L2 Cache L2 Cache Big Data: Scale Down, Scale Up, Scale Out Can mitigate (but not solve) if can schedule the writes to be far apart in time Phillip B. Gibbons 22

Program-centric Analysis Start with a portable program description: dynamic Directed Acyclic Graph (DAG) Analyze DAG without reference to cores, caches, connections Program-centric metrics Number of operations (Work, W) Length of Critical Path (Depth, D) Data reuse patterns (Locality) Our Goal: Program-centric metrics Smart thread scheduler delivering provably good performance on many platforms Big Data: Scale Down, Scale Up, Scale Out Phillip B. Gibbons 23

Parallel Cache Complexity Model Decompose task into maximal subtasks that fit in space M & glue operations M M M Cache Complexity Q*(M,B) Σ Space for M-fitting subtasks Σ Cache miss for every access in glue M,B parameters either used in algorithm (cache-aware) or not (cache-oblivious) [Simhadri, 2013] Big Data: Scale Down, Scale Up, Scale Out Phillip B. Gibbons 24

Space-Bounded Scheduler [Chowdhury, Silvestri, Blakeley, Ramachandran IPDPS‘10] Key Ideas: Assumes space use (working set sizes) of tasks are known (can be suitably estimated) C Assigns a task to a cache C that fits the task’s working set. Reserves the space in C. Recurses on the subtasks, using the CPUs and caches that share C (below C in the diagram) Cache costs: optimal levels Q*(Mi) x Ci where Ci is the miss cost for level i caches [SPAA’11] [SPAA’14] Experiments on 32-core Nehalem: reduces cache misses up to 65% vs. work-stealing Big Data: Scale Down, Scale Up, Scale Out Phillip B. Gibbons 25

Sharing vs. Contention Sharing: operations that share the same memory location (or possibly other resource) Contention: serialized access to a resource (potential performance penalty of sharing) Replace concurrent update with Priority Update: updates only if higher priority than current Big Data: Scale Down, Scale Up, Scale Out Phillip B. Gibbons 26

Priority Update has Low [SPAA’13] Contention under High Sharing Perform poorly under high sharing * Perform well under high sharing *Random priorities 5 runs of 108 operations on 40-core Intel Nehalem

Further Research Directions Determinism at function call abstraction, Commutative Building Blocks, Deterministic Reservations for loops, Use of priority update [PPoPP’12, SPAA’13, SODA’15] Scaling Up by redesigning algorithms & data structures to take advantage of new storage/memory technologies [VLDB’08, SIGMOD’10, CIDR’11, SIGMOD’11, SPAA’15] Big Data: Scale Down, Scale Up, Scale Out Phillip B. Gibbons 28

How to Tackle the Big Data Performance Challenge Scale Down Scale Up Scale Out the computing to distributed nodes in a cluster/cloud or at the edge Big Data: Scale Down, Scale Up, Scale Out Phillip B. Gibbons 29

Big Learning Frameworks & Systems Goal: Easy-to-use programming framework for Big Data Analytics that delivers good performance on large (and small) clusters Idea: Discover & take advantage of distinctive properties of Big Learning algorithms - Use training data to learn parameters of a model - Iterate until Convergence approach is common - E.g., Stochastic Gradient Descent for Matrix Factorization or Multiclass Logistic Regression; LDA via Gibbs Sampling; Page Rank; Deep learning; Big Data: Scale Down, Scale Up, Scale Out Phillip B. Gibbons 30

Parameter Servers for Distributed ML Provides all machines with convenient access to global model parameters Enables easy conversion of single-machine parallel ML algorithms “Distributed shared memory” programming style Replace local memory access with PS access Worker 1 Worker 2 Paramet er Table (one or more machines) Worker 3 Worker 4 Ahmed et al. (WSDM’12), Power and Li (OSDI’10) Single Machine Parallel UpdateVar(i) { old y[i] delta f(old) y[i] delta } UpdateVar(i) { old PS.read(y,i) Distributed delta f(old) with PS PS.inc(y,i,delta) } 31

The Cost of Bulk Synchrony Wasted computing time! Thread 1 1 2 Thread 2 1 2 Thread 3 1 Thread 4 1 3 3 2 2 3 3 Time Threads must wait for each other End-of-iteration sync gets longer with larger clusters Precious computing time wasted But: Fully asynchronous No algorithm convergence guarantees 32

Stale Synchronous Parallel (SSP) Staleness Threshold 3 Thread 1 waits until Thread 2 has reached iter 4 Thread 1 Thread 1 will always see these updates Thread 2 Thread 3 Thread 1 may not see these updates (possible error) Thread 4 0 1 2 3 4 5 6 7 8 9 Iteration [NIPS’13] Allow threads to usually run at own pace Fastest/slowest threads not allowed to drift S iterations apart Protocol: check cache first; if too old, get latest version from network Consequence: fast threads must check network every iteration Slow threads check only every S iterations – fewer network accesses, so catch up!

Staleness Sweet Spot

Enhancements to SSP Early transmission of larger parameter [SoCC’15] changes, up to bandwidth limit Find sets of parameters with weak dependency to compute on in parallel – Reduces errors from parallelization Low-overhead work migration to eliminate transient straggler effects Exploit repeated access patterns of iterative [SoCC’14] algorithms (IterStore) – Optimizations: prefetching, parameter data placement, static cache policies, static data structures, NUMA memory management Big Data: Scale Down, Scale Up, Scale Out Phillip B. Gibbons 35

IterStore: Exploiting Iterativeness [SoCC’14] Collaborative Filtering (CF) on NetFlix data set, 8 machines x 64 cores

Big Learning Systems Big Picture Framework approaches: – BSP-style approaches: Hadoop, Spark – Think-like-a-vertex: Pregel, GraphLab – Parameter server: Yahoo!, SSP Tend to revisit the same problems Ad hoc solutions Machine Learning problems Scale Down/ Up/Out techniques What is the entire big picture? Phillip B. Gibbons Big Data: Scale Down, Scale Up, Scale Out 37

Unified Scale Down, Scale Up, Scale Out Big Data System? No system combines all three Research questions: – How best to combine: Programming & Performance challenges – Scale down techniques for Machine Learning? E.g., Early iterations on data synopses – Scale up techniques more broadly applied? Lessons from decades of parallel computing research – Scale out beyond the data center? Lessons from IrisNet project? Big Data: Scale Down, Scale Up, Scale Out [Sigmod’03, PC 2003] Phillip B. Gibbons 38

How to Tackle the Big Data Performance Challenge Three approaches to improving performance by orders of magnitude are: Scale down the amount of data processed or the resources needed to perform the processing Scale up the computing resources on a node, via parallel processing & faster memory/storage Scale out the computing to distributed nodes in a cluster/cloud or at the edge Acknowledgment: Thanks to MANY collaborators Big Data: Scale Down, Scale Up, Scale Out Phillip B. Gibbons 39

Appendix Big Data: Scale Down, Scale Up, Scale Out Phillip B. Gibbons 40

Slides 9-11: References (1/3) [Sigmod’98] P. B. Gibbons and Y. Matias. New sampling-based summary statistics for improving approximate query answers. ACM SIGMOD, 1998. S. Acharya, P. B. Gibbons, V. Poosala, and S. Ramaswamy. Join synopses for approximate query answering. ACM SIGMOD, 1999. S. Acharya, P. B. Gibbons, V. Poosala, and S. Ramaswamy. The Aqua approximate query answering system. ACM SIGMOD, 1999. Demo paper. S. Acharya, P. B. Gibbons, and V. Poosala. Congressional samples for approximate answering of group-by queries. ACM SIGMOD, 2000. N. Alon, P. B. Gibbons, Y. Matias, and M. Szegedy. Tracking join and self-join sizes in limited storage. J. Comput. Syst. Sci., 2002. Special issue on Best of PODS’99. M. Garofalakis and P. B. Gibbons. Probabilistic wavelet synopses. ACM TODS, 2004. Slides 13-14: [Charikar’00] M. Charikar, S. Chaudhuri, R. Motwani, and V. R. Narasayya. Towards Estimation Error Guarantees for Distinct Values. ACM PODS, 2000. [Flajolet-Martin’85] P. Flajolet and G. N. Martin. Probabilistic Counting Algorithms for Data Base Applications. J. Comput. Syst. Sci., 1985. [VLDB’01] P. B. Gibbons. Distinct sampling for highly-accurate answers to distinct values queries and event reports. VLDB, 2001. Slide 15: [Boykin et al. VLDB’14] P. O. Boykin, S. Ritchie, I. O'Connell, and J. Lin. Summingbird: A Framework for Integrating Batch and Online MapReduce Computations. PVLDB 2014. Slide 24: [Simhadri, 2013] H. V. Simhadri. Program-Centric Cost Models for Parallelism and Locality. Ph.D. Thesis, 2013. Big Data: Scale Down, Scale Up, Scale Out Phillip B. Gibbons 41

Slide 25: References (2/3) [Chowdhury, Silvestri, Blakeley, Ramachandran IPDPS‘10] R. A. Chowdhury, F. Silvestri, B. Blakeley, and V. Ramachandran. Oblivious algorithms for multicores and network of processors. IEEE IPDPS, 2010. [SPAA’11] G. E. Blelloch, J. T. Fineman, P. B. Gibbons, and H. V. Simhadri. Scheduling Irregular Parallel Computations on Hierarchical Caches. ACM SPAA, 2011. [SPAA’14] H. V. Simhadri, G. E. Blelloch, J. T. Fineman, P. B. Gibbons, and A. Kyrola. Experimental analysis of spacebounded schedulers. ACM SPAA, 2014. Slide 27: [SPAA’13] J. Shun, G. E. Blelloch, J. T. Fineman, and P. B. Gibbons. Reducing contention through priority updates. ACM SPAA, 2013. Slide 28: [PPoPP’12] G. E. Blelloch, J. T. Fineman, P. B. Gibbons, and J. Shun. Internally deterministic algorithms can be fast. ACM PPoPP, 2012. [SPAA’13] see above [SODA’15] J. Shun, Y. Gu, G. E. Blelloch, J. T. Fineman, and P. B. Gibbons. Sequential Random Permutation, List Contraction and Tree Contraction are Highly Parallel. ACM-SIAM SODA, 2015. [VLDB’08] S. Nath and P. B. Gibbons. Online maintenance of very large random samples on flash storage. VLDB, 2008. [SIGMOD’10] S. Chen, P. B. Gibbons, and S. Nath. PR-join: A non-blocking join achieving higher result rate with statistical guarantee. ACM SIGMOD, 2010. [CIDR’11] S. Chen, P. B. Gibbons, S. Nath. Rethinking database algorithms for phase change memory. CIDR, 2011 [SIGMOD’11] M. Athanassoulis, S. Chen, A. Ailamaki, P. B. Gibbons, and R. Stoica. MASM: Efficient online updates in data warehouses. ACM SIGMOD, 2011. Big Data: Scale Down, Scale Up, Scale Out Phillip B. Gibbons 42

References (3/3) [SPAA’15] G. E. Blelloch, J. T. Fineman, P. B. Gibbons, Y. Gu, and J. Shun. Sorting with Asymmetric Read and Write Costs. ACM SPAA, 2015. Slide 31: [Ahmed et al. (WSDM’12)] A. Ahmed, M. Aly, J. Gonzalez, S. M. Narayanamurthy, and A. J. Smola. Scalable inference in latent variable models. ACM WSDM, 2012. [Power and Li (OSDI’10)] R. Power and J. Li. Piccolo: Building Fast, Distributed Programs with Partitioned Tables. Usenix OSDI, 2010. Slide 33: [NIPS’13] Q. Ho, J. Cipar, H. Cui, S. Lee, J. K. Kim, P. B. Gibbons, G. Gibson, G. Ganger, and E. Xing. More effective distributed ML via a state synchronous parallel parameter server. NIPS, 2013. Slide 34: [ATC’14] H. Cui, J. Cipar, Q. Ho, J. K. Kim, S. Lee, A. Kumar, J. Wei, W. Dai, G. R. Ganger, P. B. Gibbons, G. A. Gibson, and E. P. Xing. Exploiting Bounded Staleness to Speed Up Big Data Analytics. Usenix ATC, 2014. Slides 35-36: [SoCC’14] H. Cui, A. Tumanov, J. Wei, L. Xu, W. Dai, J. Haber-Kucharsky, Q. Ho, G. R. Ganger, P. B. Gibbons, G. A. Gibson, and E. P. Xing. Exploiting iterative-ness for parallel ML computations. ACM SoCC, 2014. [SoCC’15] J. Wei, W. Dai, A. Qiao, Q. Ho, H. Cui, G. R. Ganger, P. B. Gibbons, G. A. Gibson, and E. P. Xing. Managed Communication and Consistency for Fast Data-Parallel Iterative Analytics. ACM SoCC, 2015. Slide 38: [Sigmod’03] A. Deshpande, S. Nath, P. B. Gibbons, and S. Seshan. Cache-and-query for wide area sensor databases. ACM SIGMOD, 2003. [PC 2003] P. B. Gibbons, B. Karp, Y. Ke, S. Nath, and S. Seshan. Irisnet: An architecture for a worldwide sensor web. IEEE Pervasive Computing, 2003. Big Data: Scale Down, Scale Up, Scale Out Phillip B. Gibbons 43

Acknowledgments The work presented in this talk resulted from various collaborations with a large number of , students, and colleagues. I thank all of my coauthors, whose names appear in the list of References. A number of these slides were adapted from slides created by my co-authors, and I thank them for those slides. Big Data: Scale Down, Scale Up, Scale Out Phillip B. Gibbons 44

Back to top button