Presto is an open source distributed SQL query engine for running queries against large datasets stored in Hadoop/HDFS clusters. It uses in-memory parallel processing, pipelining, data locality, caching, and dynamic compilation to byte code for low query latency. Key techniques include caching frequently used metadata and compiled plans, processing data locally on nodes where it resides, and contr
