I saw a post on HN demonstrating how to scrape a blog with Scrapy (Python web crawler) and MongoDB. Interested in seeing what kind of Ruby crawlers were out there, I found Anemone and decided to replicate the functionality.

The crawler is going to:

- Start at the blog root URL: http://bullsh.it
- Only crawl page links ("/page/4") and blog post links ("/2012/04/this-is-a-title")
- Store blog post titles
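
The link-filtering rule is the part most worth pinning down before wiring up a crawler, so here is a minimal sketch of it in plain Ruby, using only the standard library. The two regular expressions and the helper names (`page_link?`, `post_link?`, `crawlable?`) are my own assumptions, inferred from the two example paths above; the real blog's URL scheme may differ.

```ruby
require "uri"

# Assumed root URL, taken from the list above.
ROOT_URL = "http://bullsh.it"

# Hypothetical patterns inferred from the example paths:
#   "/page/4"                     -> pagination link
#   "/2012/04/this-is-a-title"    -> blog post link
PAGE_PATTERN = %r{\A/page/\d+/?\z}
POST_PATTERN = %r{\A/\d{4}/\d{2}/[^/]+/?\z}

# True when the URL's path looks like a pagination link.
def page_link?(url)
  PAGE_PATTERN.match?(URI(url).path)
end

# True when the URL's path looks like a blog post link.
def post_link?(url)
  POST_PATTERN.match?(URI(url).path)
end

# True when the URL is on the blog's host and is either a page or a post link.
def crawlable?(url)
  URI(url).host == URI(ROOT_URL).host && (page_link?(url) || post_link?(url))
end

crawlable?("http://bullsh.it/page/4")                    # => true
crawlable?("http://bullsh.it/2012/04/this-is-a-title")   # => true
crawlable?("http://bullsh.it/about")                     # => false
crawlable?("http://example.com/page/4")                  # => false
```

In Anemone, a predicate like `crawlable?` would most likely go inside a `focus_crawl` block to select which of a page's links to follow, with `post_link?` (or the equivalent pattern passed to `on_pages_like`) deciding which pages get their titles stored. Treat that wiring as a rough plan rather than tested code.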