Apache Tez: A Framework for YARN-based Data Processing Applications in Hadoop
Apache™ Tez is an extensible framework for high-performance batch and interactive data processing applications in Hadoop, designed for terabyte- to petabyte-scale datasets. It allows projects in the Hadoop ecosystem (including Apache Hive, Apache Pig, and various third-party vendor software) to express fit-to-purpose data processing applications in a way that meets