Machine Learning Library (MLlib) MLlib is Spark’s scalable machine learning library consisting of common learning algorithms and utilities, including classification, regression, clustering, collaborative filtering, dimensionality reduction, as well as underlying optimization primitives, as outlined below: Data types Basic statistics summary statistics correlations stratified sampling hypothesis te