The ability to perform timely and scalable analytical processing of large datasets is now a critical ingredient for the success of most organizations. The Hadoop software stack (which consists of an extensible MapReduce execution engine, pluggable distributed storage engines, and a range of procedural to declarative interfaces) is a popular choice for big data analytics. Unfortunately, Hadoop's po