Today, we’re excited to open source Scalding, a Scala API for Cascading. Cascading is a thin Java library and API that sits on top of Apache Hadoop's MapReduce layer. Scalding is comprised of two main components: a DSL to make MapReduce computations look very similar to Scala's collection API A wrapper for Cascading to make it simpler to define the typical use cases of jobs, tests and describing d