Class ProtocolStatusStatistics

  • All Implemented Interfaces:
    Configurable, Tool

    public class ProtocolStatusStatistics
    extends Configured
    implements Tool
    Extracts protocol status code information from the crawl database. ProtocolStatusStatistics will give you information on the count of all status codes encountered on your crawl. This can be useful for checking a number of things. An example output run showing the number of encountered status codes such as 200, 300, and a count of un-fetched record. 38 200 19 301 2 302 665 UNFETCHED