Package org.htmlparser.visitors
Class TextExtractingVisitor
java.lang.Object
  org.htmlparser.visitors.NodeVisitor
    org.htmlparser.visitors.TextExtractingVisitor
public class TextExtractingVisitor extends NodeVisitor
Extracts text from a web page. Usage:
    Parser parser = new Parser(...);
    TextExtractingVisitor visitor = new TextExtractingVisitor();
    parser.visitAllNodesWith(visitor);
    String textInPage = visitor.getExtractedText();
Constructor Summary
Constructor                Description
TextExtractingVisitor()
-
Method Summary
Modifier and Type    Method                              Description
java.lang.String     getExtractedText()
void                 visitEndTag(Tag tag)                Called for each Tag visited that is an end tag.
void                 visitStringNode(Text stringNode)    Called for each StringNode visited.
void                 visitTag(Tag tag)                   Called for each Tag visited.
Methods inherited from class org.htmlparser.visitors.NodeVisitor
beginParsing, finishedParsing, shouldRecurseChildren, shouldRecurseSelf, visitRemarkNode
Method Detail
-
getExtractedText
public java.lang.String getExtractedText()
-
visitStringNode
public void visitStringNode(Text stringNode)
Description copied from class: NodeVisitor
Called for each StringNode visited.
Overrides:
    visitStringNode in class NodeVisitor
Parameters:
    stringNode - The string node being visited.
-
visitTag
public void visitTag(Tag tag)
Description copied from class: NodeVisitor
Called for each Tag visited.
Overrides:
    visitTag in class NodeVisitor
Parameters:
    tag - The tag being visited.
-
visitEndTag
public void visitEndTag(Tag tag)
Description copied from class: NodeVisitor
Called for each Tag visited that is an end tag.
Overrides:
    visitEndTag in class NodeVisitor
Parameters:
    tag - The end tag being visited.
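
Example (illustrative sketch, not the library's implementation): the three overridden callbacks above follow the visitor pattern, in which the traversal calls back into the visitor for each node and the visitor accumulates state until getExtractedText() is read. The self-contained sketch below mirrors that shape with hypothetical stand-in types (Node, Visitor, TagNode, TextNode, CollectingVisitor) so that it runs without the org.htmlparser jar. Skipping text inside script tags is an assumption made only to show why a text extractor might track tags as well as string nodes.

```java
import java.util.Arrays;
import java.util.List;

public class TextExtractionSketch {

    // Hypothetical stand-in for a parsed node; not the org.htmlparser Node type.
    interface Node {
        void accept(Visitor visitor);
    }

    // Hypothetical stand-in for NodeVisitor: every callback is a no-op by default.
    static class Visitor {
        public void visitTag(String tagName) {}
        public void visitEndTag(String tagName) {}
        public void visitStringNode(String text) {}
    }

    // A start or end tag; dispatches to visitTag or visitEndTag accordingly.
    static final class TagNode implements Node {
        private final String name;
        private final boolean endTag;

        TagNode(String name, boolean endTag) {
            this.name = name;
            this.endTag = endTag;
        }

        @Override
        public void accept(Visitor visitor) {
            if (endTag) {
                visitor.visitEndTag(name);
            } else {
                visitor.visitTag(name);
            }
        }
    }

    // A run of text between tags; dispatches to visitStringNode.
    static final class TextNode implements Node {
        private final String text;

        TextNode(String text) {
            this.text = text;
        }

        @Override
        public void accept(Visitor visitor) {
            visitor.visitStringNode(text);
        }
    }

    // Overrides the same three callbacks as TextExtractingVisitor and
    // accumulates text. Ignoring script content is an assumption of this sketch.
    static class CollectingVisitor extends Visitor {
        private final StringBuilder buffer = new StringBuilder();
        private boolean insideScript = false;

        @Override
        public void visitTag(String tagName) {
            if ("script".equalsIgnoreCase(tagName)) {
                insideScript = true;
            }
        }

        @Override
        public void visitEndTag(String tagName) {
            if ("script".equalsIgnoreCase(tagName)) {
                insideScript = false;
            }
        }

        @Override
        public void visitStringNode(String text) {
            if (!insideScript) {
                buffer.append(text);
            }
        }

        public String getExtractedText() {
            return buffer.toString();
        }
    }

    // Plays the role of parser.visitAllNodesWith(visitor) for a flat node list.
    static String extractText(List<Node> nodes) {
        CollectingVisitor visitor = new CollectingVisitor();
        for (Node node : nodes) {
            node.accept(visitor);
        }
        return visitor.getExtractedText();
    }

    public static void main(String[] args) {
        List<Node> page = Arrays.asList(
                new TagNode("p", false),
                new TextNode("Hello, "),
                new TagNode("script", false),
                new TextNode("var x = 1;"),
                new TagNode("script", true),
                new TextNode("world"),
                new TagNode("p", true));
        System.out.println(extractText(page)); // prints "Hello, world"
    }
}
```

With the real library, the equivalent flow is the one shown in the class description: pass a TextExtractingVisitor to parser.visitAllNodesWith(visitor), then read the result with getExtractedText().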