Jericho HTML Parser
Jericho HTML Parser is a simple but powerful java library allowing analysis and manipulation of parts of an HTML document, including some common server-side tags, while reproducing verbatim any unrecognised or invalid HTML. It can als
Tutorial Details:
The following sample programs are available:
ConvertStyleSheets Demonstrates how to detect all external style sheets and place them inline into the document.
DisplayAllElements Demonstrates the behaviour of the library when retrieving all elements from a document containing a mix of normal HTML, different types of server tags, and badly formatted HTML.
DisplayFormFields Demonstrates the use of the Segment.findFormFields() method.
DisplaySpecialTags Demonstrates how to search for special tags such as document type declarations, XML declarations, processing instructions, common server tags, PHP tags, Mason tags, and HTML comments.
JSPTest Demonstrates how to parse a document containing JSP tags without the server tags interfering with the syntax of the HTML.
SplitLongLines Demonstrates how to reformat a document so that lines
the library does a good job of analysing documents containing invalid or badly formatted HTML in areas irrelevant to the analysis, any attempt to analyse the badly formatted HTML itself will yield unpredictable results, which may or may not correspond with the interpretation of the majority of user agents. Furthermore, the behaviour of the library in relation to badly formatted HTML is not guaranteed to remain consistent in future versions. An exception to this is where any of the sample files containing badly formatted HTML produce particular results in any of the sample applications.
exceeding a certain number of characters are split into multiple lines.
Read
Tutorial at: Click here to view the tutorial
Rate Tutorial: Jericho HTML Parser
View Tutorial: Jericho HTML Parser
Related
Tutorials:
JavaWorld article about
JavaCC
JavaWorld article about
JavaCC |
Programming XML in Java, Part 1 - JavaWorld March 2000
Programming XML in Java, Part 1 - JavaWorld March 2000 |
Programming XML in Java, Part 3 - JavaWorld July
2000
Programming XML in Java, Part 3 - JavaWorld July
2000 |
Mapping XML to Java, Part 1 - JavaWorld August 2000
Mapping XML to Java, Part 1 - JavaWorld August 2000 |
Build your own
languages with
JavaCC - JavaWorld December
2000
Build your own
languages with
JavaCC - JavaWorld December
2000 |
An open alternative to JSP - The faults of JSP So what's wrong with JSP?
How the template-based, open source API FreeMarker trumps JSP |
XML APIs for databases - JavaWorld January 2000
XML APIs for databases - JavaWorld January 2000 |
Take the sting out of SAX
Take the sting out of SAX |
Create a quick-and-dirty XML parser
Create a quick-and-dirty XML parser |
XML glossary
XML glossary |
AurigaDoclet: Javadoc doclet for generating javadoc in pdf, postscript, etc
What Is AurigaDoclet?
AurigaDoclet is a Javadoc doclet which can generate Java API document in fo, pdf, postscript, pcl, and svg format. AurigaDoclet accepts command line options which can be used to further customize the generated output. |
a-visual-llk-parser-generator VisualLangLab
a-visual-llk-parser-generator: VisualLangLab
A Visual IDE-Style LL(k) Parser Generator that uses an editable tree with icons for tokens and non-terminals to represent the grammar symbols and grammar rules.
|
FastParser 1.6.3
FastParser 1.6.9.1
XML Edition
FastParser is a Java Xml parser
High performance XML parser (benchmarks* : up to +100% faster compared to Xerces and JDK1.4 integrated parser)
SAX Level 1 and 2 compliant
DOM support
JAXP compatibility
Names |
Real World HTML Parser
Real World HTML Parser
The two fundamental use-cases that are handled by the parser are extraction and transformation (the syntheses use-case, where HTML pages are created from scratch, is better handled by other tools closer to the source of data). Whil |
JEP - Java Mathematical Expression Parser
JEP - Java Mathematical Expression Parser
JEP is a Java API for parsing and evaluating mathematical expressions. With this library you can allow your users to enter an arbitrary formula as a string, and instantly evaluate it. JEP supports user defined |
XML Document Validation with an XML Schema
This tutorial explains the procedure of validating an XML document with an XML schema. |
Building Java Server Pages
A detailed look at building JSP pages. Should you use JSP or servlets? It mainly depends on the ratio of markup to code. Here you'll also find a guide to the different varieties of tag, and details about the main tags such as and |
HtmlCalendar bean ver. 1.4
Java bean allows you to generate Calendar for your HTML pages. |
Adding search to your applications
The Lucene search engine is an open source, Jakarta project used to build and search indexes. Lucene can index any text-based information you like and then find it later based on various search criteria. |
Struts HTML Tags
Struts HTML Tags
Struts HTML Tags
Struts provides HTML tag library for easy creation of user interfaces. In this lesson I will show you what all Struts HTML Tags are available to the JSP for the development of user interfaces.
To use the Struts |
|
|
|