Pig is an Apache library that interprets scripts written in a language called Pig Latin and then runs them on a Hadoop cluster. The Pig Latin language is a high level data transformation language that allows you to concentrate on the data transformations you require rather than begin concerned with individual map and reduce functions. You can find the Pig project home page at http://hadoop.