This document discusses Apache Tez, a framework for accelerating Hadoop query processing. Some key points: - Tez is a dataflow framework that expresses computations as directed acyclic graphs (DAGs) of tasks, allowing for optimizations like container reuse and locality-aware scheduling. - It is built on YARN and provides a customizable execution engine as well as APIs for applications like Hive an
![Apache Tez: Accelerating Hadoop Query Processing](https://cdn-ak-scissors.b.st-hatena.com/image/square/5be7b92accbc7efce55c44495fabf3cd4bd38f8e/height=288;version=1;width=512/https%3A%2F%2Fcdn.slidesharecdn.com%2Fss_thumbnails%2Ftez-131115140651-phpapp02-thumbnail.jpg%3Fwidth%3D640%26height%3D640%26fit%3Dbounds)