Uses of Class
net.htmlparser.jericho.TextExtractor
Packages that use TextExtractor
net.htmlparser.jericho
Uses of TextExtractor in net.htmlparser.jericho
Methods in net.htmlparser.jericho that return TextExtractor
Modifier and Type | Method | Description
TextExtractor | Segment.getTextExtractor() | Extracts the textual content from the HTML markup of this segment.
TextExtractor | TextExtractor.setConvertNonBreakingSpaces(boolean convertNonBreakingSpaces) | Sets whether non-breaking space (&nbsp;) character entity references are converted to spaces.
TextExtractor | TextExtractor.setExcludeNonHTMLElements(boolean excludeNonHTMLElements) | Sets whether the content of non-HTML elements is excluded from the output.
TextExtractor | TextExtractor.setIncludeAttributes(boolean includeAttributes) | Sets whether any attribute values are included in the output.
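Because each of the setters listed here returns the TextExtractor itself, the calls can be chained on the object obtained from Segment.getTextExtractor(). The following is a minimal sketch of that pattern, not an official usage example. It assumes the Jericho HTML Parser jar is on the classpath; the class name TextExtractorExample, the helper method extract, and the sample markup are hypothetical and chosen only for illustration.

```java
import net.htmlparser.jericho.Source;
import net.htmlparser.jericho.TextExtractor;

// Hypothetical example class; not part of the Jericho library.
public class TextExtractorExample {

    // Parses the given markup and returns its text content.
    // The option values below are illustrative choices, not recommendations.
    static String extract(String html) {
        // Source is a Segment, so getTextExtractor() is available on it.
        Source source = new Source(html);

        // Each setter returns the same TextExtractor, which allows chaining.
        TextExtractor extractor = source.getTextExtractor()
                .setConvertNonBreakingSpaces(true)   // treat &nbsp; as an ordinary space
                .setExcludeNonHTMLElements(true)     // skip content of non-HTML elements
                .setIncludeAttributes(false);        // leave attribute values out

        // toString() performs the extraction and returns the resulting text.
        return extractor.toString();
    }

    public static void main(String[] args) {
        System.out.println(extract("<p>Hello&nbsp;<b>world</b></p>"));
    }
}
```

The extractor is configured once and then converted to a string, so the same helper can be reused for any Segment, not only a whole Source.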