[This is work presented at SIGMOD'13.] The use of large-scale data mining and machine learning has proliferated through the adoption of technologies such as Hadoop, with its simple programming semantics and rich and active ecosystem. This paper presents LinkedIn's Hadoop-based analytics stack, which allows data scientists and machine learning researchers to extract insights and build product featu
![Data Applications and Infrastructure at LinkedIn (Hadoop Summit 2010)](https://cdn-ak-scissors.b.st-hatena.com/image/square/0a35fdb0be8cb578e303fc072b4938ce92225b39/height=288;version=1;width=512/https%3A%2F%2Fcdn.slidesharecdn.com%2Fss_thumbnails%2F6dataapplicationlinkedinhadoopsummmit2010-100630140404-phpapp02-thumbnail.jpg%3Fwidth%3D640%26height%3D640%26fit%3Dbounds)