First, you need to get a copy of the Nutch code. You can download a release from http://lucene.apache.org/nutch/release/. Unpack the release and connect to its top-level directory. Or, check out the latest source code from subversion and build it with Ant. Try the following command: bin/nutch This will display the documentation for the Nutch command script. Now we're ready to crawl. There are two