Class Http
- java.lang.Object
-
- org.apache.nutch.protocol.http.api.HttpBase
-
- org.apache.nutch.protocol.htmlunit.Http
-
- All Implemented Interfaces:
Configurable
,Pluggable
,Protocol
public class Http extends HttpBase
-
-
Field Summary
Fields Modifier and Type Field Description protected static org.slf4j.Logger
LOG
-
Fields inherited from class org.apache.nutch.protocol.http.api.HttpBase
accept, acceptCharset, acceptLanguage, BUFFER_SIZE, COOKIE, enableCookieHeader, enableIfModifiedsinceHeader, maxContent, maxCrawlDelay, maxDuration, partialAsTruncated, proxyException, proxyHost, proxyPort, proxyType, RESPONSE_TIME, responseTime, storeHttpHeaders, storeHttpRequest, storeIPAddress, timeout, tlsCheckCertificate, tlsPreferredCipherSuites, tlsPreferredProtocols, useHttp11, useHttp2, useProxy, userAgent
-
Fields inherited from interface org.apache.nutch.protocol.Protocol
X_POINT_ID
-
-
Constructor Summary
Constructors Constructor Description Http()
Default constructor.
-
Method Summary
All Methods Static Methods Instance Methods Concrete Methods Modifier and Type Method Description protected Response
getResponse(URL url, CrawlDatum datum, boolean redirect)
static void
main(String[] args)
void
setConf(Configuration conf)
Set theConfiguration
object.-
Methods inherited from class org.apache.nutch.protocol.http.api.HttpBase
getAccept, getAcceptCharset, getAcceptLanguage, getConf, getCookie, getMaxContent, getMaxDuration, getProtocolOutput, getProxyHost, getProxyPort, getRobotRules, getTimeout, getTlsPreferredCipherSuites, getTlsPreferredProtocols, getUseHttp11, getUserAgent, isCookieEnabled, isIfModifiedSinceEnabled, isStoreHttpHeaders, isStoreHttpRequest, isStoreIPAddress, isStorePartialAsTruncated, isTlsCheckCertificates, logConf, main, processDeflateEncoded, processGzipEncoded, useProxy, useProxy, useProxy
-
-
-
-
Method Detail
-
setConf
public void setConf(Configuration conf)
Set theConfiguration
object.- Specified by:
setConf
in interfaceConfigurable
- Overrides:
setConf
in classHttpBase
- Parameters:
conf
- a popultaedConfiguration
-
getResponse
protected Response getResponse(URL url, CrawlDatum datum, boolean redirect) throws ProtocolException, IOException
- Specified by:
getResponse
in classHttpBase
- Throws:
ProtocolException
IOException
-
-