I wanted to make some crawlers that I wrote in Ruby run faster. I thought of running them on Hadoop and have just started trying out Hadoop Streaming for the purpose, so I decided to share a short how-to, since others might be looking for one. Hadoop Streaming lets users write the map and reduce code in any language that can read from standard input and write to standard output, and run it on a Hadoop grid easily. I assume that you ha