Wikipedia processing. PyPy vs CPython benchmark

Lately I’ve done some data mining tasks on Wikipedia. The work consists of:

- processing the enwiki-pages-articles.xml Wikipedia dump
- storing pages and categories in MongoDB
- using Redis for mapping category titles

I ran a benchmark on these real tasks for CPython 2.7.3 and PyPy 2b. Libraries I used:

- redis 2.7.2
- pymongo 2.4.2

Furthermore, CPython was supported by: h