How to Use

Text extraction with Webstemmer has the following steps:

1. Obtain a number of "seed" pages from a particular site.
2. Learn the layout patterns from the obtained pages.
3. Later on, obtain updated pages from the same site.
4. Extract texts from the newly obtained pages using the learned patterns.

Steps 1 and 2 are required only the first time. Once you have learned the layout patterns, you can use