This document discusses Hivemall, a scalable machine learning library for Apache Hive. It gives concise summaries of the machine learning algorithms that Hivemall implements as user-defined functions, which can run on large datasets in Hive. The document outlines the motivation for Hivemall, the algorithms it supports, how to use it for tasks such as data preparation, model training, and prediction, and how it handles