Package org.htmlparser.nodes
The nodes package has the concrete node implementations.
These classes are derived from the
The classes in this package all implement the Node
interface. They individually implement the more specific interfaces for text,
remarks and tags. The three node types are mapped as follows:
Interface | Concrete Implementation |
Text | TextNode |
Remark | RemarkNode |
Tag | TagNode |
AbstractNode
class, which provides
basic functionality common to all nodes, such as maintaining start and end positions
and child and parent references. Only the TagNode
objects
contain a list of Attribute
objects.
The Lexer
parses an HTML stream into a
contiguous stream of these nodes.
The Parser
returns either these nodes or specific
Tag
objects (which are subclasses of TagNode)
for tags with names that have been registered via
registerTag()
.
-
Class Summary Class Description AbstractNode The concrete base class for all types of nodes (tags, text remarks).RemarkNode The remark tag is identified and represented by this class.TagNode TagNode represents a generic tag.TextNode Normal text in the HTML document is represented by this class.