Pre-computed feature sets

These per-host feature sets are provided to encourage participation in the Web Spam Challenge 2008. They are also available in Matlab and ARFF (for Weka) formats. In the data, host IDs are assigned in the same order as in the uk-2007-05.hostnames.txt.gz file. The collection contains 114,529 diffe