Modifier and Type | Method and Description |
---|---|
TextExtractor |
Segment.getTextExtractor()
Extracts the textual content from the HTML markup of this segment.
|
TextExtractor |
TextExtractor.setConvertNonBreakingSpaces(boolean convertNonBreakingSpaces)
Sets whether non-breaking space (
) character entity references are converted to spaces. |
TextExtractor |
TextExtractor.setExcludeNonHTMLElements(boolean excludeNonHTMLElements)
Sets whether the content of non-HTML elements is excluded from the output.
|
TextExtractor |
TextExtractor.setIncludeAttributes(boolean includeAttributes)
Sets whether any attribute values are included in the output.
|