Facebook often uses analytics for data-driven decision making. Over the past few years, user and product growth has pushed our analytics engines to operate on data sets in the tens of terabytes for a single query. Some of our batch analytics is executed through the venerable Hive platform (contributed to Apache Hive by Facebook in 2009) and Corona, our custom MapReduce implementation. Facebook has
![Apache Spark @Scale: A 60 TB+ production use case](https://cdn-ak-scissors.b.st-hatena.com/image/square/a2e3c8645d62231848bbff0edee7aa34bbeb093d/height=288;version=1;width=512/https%3A%2F%2Fengineering.fb.com%2Fwp-content%2Fuploads%2F2016%2F08%2FApache-spark2.jpeg)