Package org.apache.nutch.util
Class CrawlCompletionStats
java.lang.Object
  org.apache.hadoop.conf.Configured
    org.apache.nutch.util.CrawlCompletionStats
All Implemented Interfaces: Configurable, Tool
public class CrawlCompletionStats extends Configured implements Tool
Extracts some simple crawl completion stats from the crawldb. Stats will be sorted by host/domain and will be of the form:
1 www.spitzer.caltech.edu FETCHED
50 www.spitzer.caltech.edu UNFETCHED
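The output form described above can be post-processed with a few lines of Java. The following is a minimal sketch, not part of Nutch: the class name CompletionStatsExample and its methods are hypothetical, and it assumes each output line holds exactly three whitespace-separated fields in the order count, host/domain, status, as the description suggests.

```java
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical helper for illustration only; not a Nutch class.
final class CompletionStatsExample {

    /** Parses lines of the assumed form "<count> <host> <STATUS>" into host -> status -> count. */
    static Map<String, Map<String, Long>> parse(List<String> lines) {
        Map<String, Map<String, Long>> stats = new LinkedHashMap<>();
        for (String line : lines) {
            String[] parts = line.trim().split("\\s+");
            if (parts.length != 3) {
                continue; // skip blank lines and lines without exactly three fields
            }
            long count = Long.parseLong(parts[0]); // throws NumberFormatException on a non-numeric count
            stats.computeIfAbsent(parts[1], h -> new LinkedHashMap<>())
                 .merge(parts[2], count, Long::sum);
        }
        return stats;
    }

    /** Fraction of a host's URLs with status FETCHED, or 0.0 if the host has no rows. */
    static double fetchedFraction(Map<String, Map<String, Long>> stats, String host) {
        Map<String, Long> byStatus = stats.getOrDefault(host, Map.of());
        long fetched = byStatus.getOrDefault("FETCHED", 0L);
        long total = 0;
        for (long c : byStatus.values()) {
            total += c;
        }
        return total == 0 ? 0.0 : (double) fetched / total;
    }

    public static void main(String[] args) {
        List<String> lines = List.of(
            "1 www.spitzer.caltech.edu FETCHED",
            "50 www.spitzer.caltech.edu UNFETCHED");
        Map<String, Map<String, Long>> stats = parse(lines);
        System.out.println(stats);
        System.out.println(fetchedFraction(stats, "www.spitzer.caltech.edu"));
    }
}
```

Keeping the per-host counts in a nested map makes it straightforward to derive other ratios (for example, the UNFETCHED share) without re-reading the output.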
Nested Class Summary
Modifier and Type   Class                                               Description
static class        CrawlCompletionStats.CrawlCompletionStatsCombiner
Constructor Summary
Constructor              Description
CrawlCompletionStats()
Method Summary
Modifier and Type   Method                Description
static void         main(String[] args)
int                 run(String[] args)
Methods inherited from class org.apache.hadoop.conf.Configured
getConf, setConf
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
Methods inherited from interface org.apache.hadoop.conf.Configurable
getConf, setConf