Aurojit Panda ▪, Mosharaf Chowdhury▴, Aditya Akella⬩, Scott Shenker ⋆, Ion Stoica ⋆ ⋆ UC Berkeley ▪ NYU ⬩ University of Wisconsin ▴ University of Michigan HotCloud, July 09, 2018
focuses on simple task placement/queries § Graph analytics iterative in nature § Flexibility over data placement and join sites § Graph partitioning difﬁcult § Estimating intermediate data § Difﬁcult in graph algorithms Geo-Distributed Analytics on Graphs
Idea: approximate the graph using a sparse, much smaller graph § Drop edges/vertices § Sparsify without accuracy loss § Only worry about reducing cross-DC entities § Leverage graph-parallel model and algorithm properties 0 1 4 2 3 0 1 4 2 3 DC1 DC2
Idea: approximate the graph using a sparse, much smaller graph § Drop edges/vertices § Sparsify without accuracy loss § Only worry about reducing cross-DC entities § Leverage graph-parallel model and algorithm properties 0 1 4 2 3 0 1 4 2 3 DC1 DC2
execution model § Better execution models at bootstrap stage § How would the global sync work? § Multi-tenancy § Would it provide opportunities to leverage existing GDA techniques? § Graph updates § What is an incremental model in this case?
geo-distributed fashion § Can benefit from geo-distributed graph analytics. § Our proposal Monarch: § Early attempt at bringing geo-distributed analytics to graph processing. § Initial results are encouraging. http://www.cs.berkeley.edu/~api api@cs.berkeley.edu