Jericho HTML Parser is a java library allowing analysis and manipulation of parts of an HTML document, including server-side tags, while reproducing verbatim any unrecognised or invalid HTML. It also provides high-level HTML form manipulation functions. It is an open source library released under the Eclipse Public License (EPL), GNU Lesser General Public License (LGPL), and Apache Licence. You ar