Package org.creativecommons.nutch
Class CCParseFilter
- java.lang.Object
  - org.creativecommons.nutch.CCParseFilter

All Implemented Interfaces:
Configurable, HtmlParseFilter, Pluggable
 
public class CCParseFilter extends Object implements HtmlParseFilter

Adds metadata identifying the Creative Commons license used, if any.

Nested Class Summary
- static class CCParseFilter.Walker: Walks the DOM tree, looking for RDF in comments and licenses in anchors.

Field Summary
- Fields inherited from interface org.apache.nutch.parse.HtmlParseFilter: X_POINT_ID

Constructor Summary
- CCParseFilter()

Method Summary
- ParseResult filter(Content content, ParseResult parseResult, HTMLMetaTags metaTags, DocumentFragment doc): Adds metadata or otherwise modifies a parse of an HTML document, given the DOM tree of a page.
- Configuration getConf()
- void setConf(Configuration conf)
 
Method Detail

filter

public ParseResult filter(Content content, ParseResult parseResult, HTMLMetaTags metaTags, DocumentFragment doc)

Adds metadata or otherwise modifies a parse of an HTML document, given the DOM tree of a page.
- Specified by:
- filter in interface HtmlParseFilter
- Parameters:
- content - the Content for a given response
- parseResult - the result of running one or more Parsers on the content
- metaTags - a populated HTMLMetaTags object
- doc - a DocumentFragment (DOM) which can be processed in the filtering process
- Returns:
- a filtered ParseResult
- See Also:
- Parser.getParse(Content)
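
The DOM-walking idea behind filter and the nested CCParseFilter.Walker can be illustrated with a small standalone sketch. It is not the plugin's actual code: it uses only the JDK's XML parser rather than Nutch's Content, ParseResult, or HTMLMetaTags types, it covers only license links in anchors (RDF in comments is left out), and the class name CcLicenseSketch, the method findLicenseUrls, and the LICENSE_MARKER substring test are all assumptions made for illustration.

```java
import java.io.StringReader;
import java.util.ArrayList;
import java.util.List;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.Node;
import org.w3c.dom.NodeList;
import org.xml.sax.InputSource;

// Hypothetical illustration only; not part of org.creativecommons.nutch.
class CcLicenseSketch {

  // Assumption: a link counts as a license link if its href contains this text.
  static final String LICENSE_MARKER = "creativecommons.org/licenses/";

  // Parses well-formed XHTML and returns the href of every license-like anchor.
  static List<String> findLicenseUrls(String xhtml) throws Exception {
    Document doc = DocumentBuilderFactory.newInstance()
        .newDocumentBuilder()
        .parse(new InputSource(new StringReader(xhtml)));
    List<String> found = new ArrayList<>();
    walk(doc.getDocumentElement(), found);
    return found;
  }

  // Depth-first walk: inspect the current node, then recurse into its children.
  static void walk(Node node, List<String> found) {
    if (node.getNodeType() == Node.ELEMENT_NODE
        && "a".equalsIgnoreCase(node.getNodeName())) {
      // getAttribute returns an empty string when the attribute is absent.
      String href = ((Element) node).getAttribute("href");
      if (href.contains(LICENSE_MARKER)) {
        found.add(href);
      }
    }
    NodeList children = node.getChildNodes();
    for (int i = 0; i < children.getLength(); i++) {
      walk(children.item(i), found);
    }
  }

  public static void main(String[] args) throws Exception {
    String page = "<html><body>"
        + "<a href=\"http://example.com/\">home</a>"
        + "<a href=\"http://creativecommons.org/licenses/by/4.0/\">CC BY</a>"
        + "</body></html>";
    System.out.println(findLicenseUrls(page));
  }
}
```

In a real filter, the license found this way would be recorded as metadata on the parse; how CCParseFilter does that is not shown on this page, so the sketch stops at collecting the URLs.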
 
setConf

public void setConf(Configuration conf)
- Specified by:
- setConf in interface Configurable

getConf

public Configuration getConf()
- Specified by:
- getConf in interface Configurable
 
 