Uses of Class
org.apache.nutch.parse.ParseText
-
Packages that use ParseText
Package | Description
org.apache.nutch.parse | The Parse interface and related classes.
org.apache.nutch.segment | A segment stores all data from one generate/fetch/update cycle: fetch list, protocol status, raw content, parsed content, and extracted outgoing links.
-
Uses of ParseText in org.apache.nutch.parse
Methods in org.apache.nutch.parse that return ParseText
Modifier and Type | Method | Description
static ParseText | ParseText.read(DataInput in) |
Methods in org.apache.nutch.parse with parameters of type ParseText
Modifier and Type | Method | Description
void | ParseResult.put(String key, ParseText text, ParseData data) | Store a result of parsing.
void | ParseResult.put(Text key, ParseText text, ParseData data) | Store a result of parsing.

Constructors in org.apache.nutch.parse with parameters of type ParseText
Constructor | Description
ParseImpl(ParseText text, ParseData data) |
ParseImpl(ParseText text, ParseData data, boolean isCanonical) |
-
Uses of ParseText in org.apache.nutch.segment
Methods in org.apache.nutch.segment with parameters of type ParseText
Modifier and Type | Method | Description
boolean | SegmentMergeFilter.filter(Text key, CrawlDatum generateData, CrawlDatum fetchData, CrawlDatum sigData, Content content, ParseData parseData, ParseText parseText, Collection<CrawlDatum> linked) | The filtering method, which receives all information being merged for a given key (URL).
boolean | SegmentMergeFilters.filter(Text key, CrawlDatum generateData, CrawlDatum fetchData, CrawlDatum sigData, Content content, ParseData parseData, ParseText parseText, Collection<CrawlDatum> linked) | Iterates over all SegmentMergeFilter extensions; if any of them returns false, it returns false as well.
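The "any filter returns false, so the result is false" behavior described for SegmentMergeFilters.filter can be sketched roughly as follows. This is a minimal illustration, not Nutch's actual code: SimpleMergeFilter is a hypothetical two-argument stand-in for the real eight-argument SegmentMergeFilter.filter, the class and method names are invented for the example, and stopping at the first rejection (as well as accepting when no filters are registered) is an assumption rather than something the description above confirms.

```java
import java.util.List;

class MergeFilterChainSketch {

    /** Hypothetical, simplified stand-in for SegmentMergeFilter (the real method takes eight arguments). */
    interface SimpleMergeFilter {
        boolean filter(String key, String parseText);
    }

    /** Returns false as soon as any filter rejects the entry; returns true if every filter accepts it. */
    static boolean filterAll(List<SimpleMergeFilter> filters, String key, String parseText) {
        for (SimpleMergeFilter f : filters) {
            if (!f.filter(key, parseText)) {
                return false; // one rejection is enough to drop the entry
            }
        }
        return true; // no filter objected (also the case for an empty list)
    }

    public static void main(String[] args) {
        // Two illustrative filters: one on the parsed text, one on the key (URL).
        SimpleMergeFilter nonEmptyText = (key, text) -> text != null && !text.isEmpty();
        SimpleMergeFilter httpsOnly = (key, text) -> key.startsWith("https://");
        List<SimpleMergeFilter> chain = List.of(nonEmptyText, httpsOnly);

        System.out.println(filterAll(chain, "https://example.org/", "hello")); // true
        System.out.println(filterAll(chain, "http://example.org/", "hello"));  // false
    }
}
```

In a real SegmentMergeFilter implementation, the same accept/reject decision would be made from the full set of merged records (CrawlDatum entries, Content, ParseData, ParseText, and linked data) passed to filter.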
-