You are here

Scuba: Diving into Data at Facebook

April 23, 2014 -
4:00pm to 5:00pm
MIT CSAIL, Stata Center, Bldg 32 Kiva Conference Room, 4th Floor, G449
Speaker Name: 
Janet Wiener, Software Engineer, Facebook

Facebook engineers query multiple databases to monitor and analyze Facebook products and services. The fastest of these databases is Scuba, which achieves sub second query response time and latencies of under a minute from events occurring (a client request on a phone, a bug report filed, a code change checked in) to graphs showing those events on engineers’ monitors.
Scuba is a fast, scalable, distributed, in-memory database built at Facebook. It currently ingests millions of rows (events) per second and expires data at the same rate. Scuba stores data completely in memory on hundreds of servers each with 144 GB RAM. To process each query, Scuba aggregates data from all servers. Scuba processes almost a million queries per day. Scuba is used extensively for interactive, ad hoc, analysis queries that run in under a second over live data. In addition, Scuba is the workhorse behind Facebook’s code regression analysis, bug report monitoring, ads revenue monitoring, and performance debugging.
This talk will include content from papers in VLDB 2013 and Sigmod 2014.

Janet Wiener is a software engineer at Facebook, where she works on Scuba and other data analysis tools. She also teaches Facebook employees how to make product decisions and trouble shoot live systems issues by asking questions, running experiments, and using data tools to collect and analyze the data. Her previous work includes database algorithms, distributed systems performance, and web exploration at Stanford University, DEC, Compaq, and HP. She earned a PhD in databases from the U. of Wisconsin-Madison in 1995 and a BA from Williams College in 1989.