LSHTC: A Benchmark for Large-Scale Text Classification

LSHTC is a series of challenges that aims to assess the performance of classification systems on large-scale problems with a large number of classes (up to hundreds of thousands). Four editions of the LSHTC challenge were organized from 2010 to 2014. On this page you can download the datasets used in the different editions of the challe