Package org.apache.nutch.crawl
Class CrawlDbReducer
java.lang.Object
  org.apache.hadoop.mapreduce.Reducer<Text,CrawlDatum,Text,CrawlDatum>
    org.apache.nutch.crawl.CrawlDbReducer

public class CrawlDbReducer extends Reducer<Text,CrawlDatum,Text,CrawlDatum>
Merge new page entries with existing entries.
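
As a rough illustration of what merging the entries for one key can look like, the standalone sketch below collapses several entries into one. It does not use Nutch or Hadoop: MergeSketch, Entry, its fields, and the merge rule itself are hypothetical stand-ins chosen for the example, and the actual logic of CrawlDbReducer is not documented on this page and may differ substantially.

```java
// Standalone sketch: MergeSketch and Entry are hypothetical stand-ins,
// not Nutch or Hadoop classes.
final class MergeSketch {

    // Hypothetical simplified entry; the real CrawlDatum carries more state.
    static final class Entry {
        final boolean existing;  // true if the entry was already stored
        final long fetchTime;    // illustrative timestamp in milliseconds

        Entry(boolean existing, long fetchTime) {
            this.existing = existing;
            this.fetchTime = fetchTime;
        }
    }

    // Collapse all entries seen for one key into a single entry.
    // Assumed rule (illustration only): the result counts as existing if any
    // input was existing, and it carries the latest fetchTime of all inputs.
    static Entry merge(Iterable<Entry> values) {
        boolean sawAny = false;
        boolean known = false;
        long latest = Long.MIN_VALUE;
        for (Entry e : values) {
            sawAny = true;
            known |= e.existing;
            latest = Math.max(latest, e.fetchTime);
        }
        // No values for this key: nothing to emit.
        return sawAny ? new Entry(known, latest) : null;
    }
}
```

A reducer following this shape would call something like merge(values) once per key and write the single result to its context; here the method simply returns it so the sketch can be exercised without a Hadoop runtime.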
Nested Class Summary
Nested classes/interfaces inherited from class org.apache.hadoop.mapreduce.Reducer
Reducer.Context
Constructor Summary
Constructor         Description
CrawlDbReducer()
Method Summary
Modifier and Type   Method                                                                    Description
void                reduce(Text key, Iterable<CrawlDatum> values, Reducer.Context context)
void                setup(Reducer.Context context)
Method Detail
setup
public void setup(Reducer.Context context)
Overrides: setup in class Reducer<Text,CrawlDatum,Text,CrawlDatum>
reduce
public void reduce(Text key, Iterable<CrawlDatum> values, Reducer.Context context) throws IOException, InterruptedException
Overrides: reduce in class Reducer<Text,CrawlDatum,Text,CrawlDatum>
Throws: IOException, InterruptedException