Class ContentAsTextInputFormat


  • public class ContentAsTextInputFormat
    extends SequenceFileInputFormat<Text,​Text>
    An input format that takes Nutch Content objects and converts them to text while converting newline endings to spaces. This format is useful for working with Nutch content objects in Hadoop Streaming with other languages.