public class TextInputFormat extends DelimitedInputFormat
Nested classes inherited from class DelimitedInputFormat: DelimitedInputFormat.AbstractConfigBuilder<T>, DelimitedInputFormat.ConfigBuilder

Nested classes inherited from class FileInputFormat: FileInputFormat.FileBaseStatistics, FileInputFormat.InputSplitOpenThread

| Modifier and Type | Field and Description |
|---|---|
| protected boolean | ascii |
| protected ByteBuffer | byteWrapper |
| static String | CHARSET_NAME |
| protected CharsetDecoder | decoder |
| static String | DEFAULT_CHARSET_NAME |
| static String | FIELD_POS |
| protected int | pos |
| protected StringValue | theString |

Fields inherited from class DelimitedInputFormat: RECORD_DELIMITER

Fields inherited from class FileInputFormat: DEFLATE_SUFFIX, enumerateNestedFiles, filePath, minSplitSize, numSplits, openTimeout, READ_WHOLE_SPLIT_FLAG, splitLength, splitStart, stream, unsplittable

| Constructor and Description |
|---|
| TextInputFormat() |

| Modifier and Type | Method and Description |
|---|---|
| void | configure(Configuration parameters) |
| Record | readRecord(Record reuse, byte[] bytes, int offset, int numBytes) This function parses the given byte array which represents a serialized key/value pair. |
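
The `readRecord` signature passes a byte array together with an `offset` and `numBytes`, and the class holds a `CharsetDecoder` (`decoder`) and a `ByteBuffer` (`byteWrapper`). The sketch below illustrates one way such a byte range could be decoded into text using only `java.nio`. It is not the class's actual implementation and uses no Flink types; the names `LineDecodingSketch` and `decodeLine` are hypothetical, and the choice of UTF-8 in the usage is an assumption made for the example.

```java
import java.nio.ByteBuffer;
import java.nio.CharBuffer;
import java.nio.charset.CharacterCodingException;
import java.nio.charset.Charset;
import java.nio.charset.CharsetDecoder;
import java.nio.charset.StandardCharsets;

public class LineDecodingSketch {

    /**
     * Hypothetical helper: decodes the region bytes[offset .. offset + numBytes)
     * into a String using the given charset.
     */
    static String decodeLine(byte[] bytes, int offset, int numBytes, Charset charset)
            throws CharacterCodingException {
        // Wrap only the requested region; no bytes are copied.
        ByteBuffer wrapper = ByteBuffer.wrap(bytes, offset, numBytes);
        // A fresh decoder reports malformed input by throwing CharacterCodingException.
        CharsetDecoder decoder = charset.newDecoder();
        CharBuffer chars = decoder.decode(wrapper);
        return chars.toString();
    }

    public static void main(String[] args) throws CharacterCodingException {
        byte[] buffer = "first line\nsecond line\n".getBytes(StandardCharsets.UTF_8);
        // "first line\n" occupies 11 bytes, so the second record starts at offset 11
        // and spans 11 bytes (delimiter excluded).
        String line = decodeLine(buffer, 11, 11, StandardCharsets.UTF_8);
        System.out.println(line); // prints "second line"
    }
}
```

Wrapping the array region rather than copying it keeps the per-record cost low, which is presumably why a reusable `ByteBuffer` wrapper is kept as a field.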

Methods inherited from class DelimitedInputFormat: close, configureDelimitedFormat, getBufferSize, getDelimiter, getLineLengthLimit, getNumLineSamples, getStatistics, loadGloablConfigParams, nextRecord, open, reachedEnd, readLine, setBufferSize, setDelimiter, setDelimiter, setDelimiter, setDelimiter, setDelimiter, setLineLengthLimit, setNumLineSamples

Methods inherited from class FileInputFormat: acceptFile, configureFileFormat, createInputSplits, getFilePath, getFileStats, getInputSplitAssigner, getMinSplitSize, getNumSplits, getOpenTimeout, getSplitLength, getSplitStart, setFilePath, setFilePath, setMinSplitSize, setNumSplits, setOpenTimeout, testForUnsplittable, toString

public static final String CHARSET_NAME
public static final String FIELD_POS
public static final String DEFAULT_CHARSET_NAME
protected final StringValue theString
protected transient CharsetDecoder decoder
protected transient ByteBuffer byteWrapper
protected transient int pos
protected transient boolean ascii
public void configure(Configuration parameters)
Specified by: configure in interface InputFormat<Record,FileInputSplit>
Overrides: configure in class DelimitedInputFormat<Record>

public Record readRecord(Record reuse, byte[] bytes, int offset, int numBytes)
Description copied from class: DelimitedInputFormat
Specified by: readRecord in class DelimitedInputFormat
Parameters:
reuse - The optional reusable holder for the line that is read.
bytes - The serialized record.

Copyright © 2015 The Apache Software Foundation. All rights reserved.