Class Summary |
AcidInputFormat.Options |
Options for controlling the record readers. |
AcidOutputFormat.Options |
Options to control how the files are written. |
AcidUtils |
Utilities that are shared by all of the ACID input and output formats. |
AcidUtils.ParsedDelta |
|
BucketizedHiveInputFormat<K extends WritableComparable,V extends Writable> |
BucketizedHiveInputFormat serves a similar function to HiveInputFormat, but
its getSplits() always groups splits from one input file into one wrapper
split. |
BucketizedHiveInputSplit |
HiveInputSplit encapsulates an InputSplit with its corresponding
inputFormatClass. |
BucketizedHiveRecordReader<K extends WritableComparable,V extends Writable> |
BucketizedHiveRecordReader is a wrapper on a list of RecordReaders. |
CodecPool |
A global compressor/decompressor pool used to save and reuse (possibly
native) compression/decompression codecs. |
CombineHiveInputFormat<K extends WritableComparable,V extends Writable> |
CombineHiveInputFormat is a parameterized InputFormat which looks at the path
name and determines the correct InputFormat for that path name from
mapredPlan.pathToPartitionInfo(). |
CombineHiveInputFormat.CombineHiveInputSplit |
CombineHiveInputSplit encapsulates an InputSplit with its corresponding
inputFormatClassName. |
CombineHiveRecordReader<K extends WritableComparable,V extends Writable> |
CombineHiveRecordReader. |
DefaultHivePartitioner<K2,V2> |
Partition keys by their Object.hashCode(). |
FlatFileInputFormat<T> |
An InputFormat for plain files with
Deserializer records. |
FlatFileInputFormat.RowContainer<T> |
A work-around until HADOOP-1230 is fixed. |
FlatFileInputFormat.SerializationContextFromConf<S> |
An implementation of FlatFileInputFormat.SerializationContext that reads the
Serialization class and specific subclass to be deserialized from the
JobConf. |
HiveBinaryOutputFormat<K extends WritableComparable,V extends Writable> |
HiveBinaryOutputFormat writes out the values consecutively without any
separators. |
HiveContextAwareRecordReader<K,V> |
This class prepares an IOContext, and provides the ability to perform a binary search on the
data. |
HiveFileFormatUtils |
A utility class for various Hive file format tasks. |
HiveIgnoreKeyTextOutputFormat<K extends WritableComparable,V extends Writable> |
HiveIgnoreKeyTextOutputFormat replaces the key with null before feeding the key/value pair to TextOutputFormat.RecordWriter. |
HiveInputFormat<K extends WritableComparable,V extends Writable> |
HiveInputFormat is a parameterized InputFormat which looks at the path name
and determines the correct InputFormat for that path name from
mapredPlan.pathToPartitionInfo(). |
HiveInputFormat.HiveInputSplit |
HiveInputSplit encapsulates an InputSplit with its corresponding
inputFormatClass. |
HiveKey |
HiveKey is a simple wrapper on Text which allows us to set the hashCode
easily. |
HiveKey.Comparator |
A Comparator optimized for HiveKey. |
HiveNullValueSequenceFileOutputFormat<K,V> |
A HiveOutputFormat that writes SequenceFiles with the
content saved in the keys, and null in the values. |
HiveOutputFormatImpl<K extends WritableComparable<K>,V extends Writable> |
Hive does not use OutputFormats in a conventional way, but constructs and uses
the defined OutputFormat for each table from FileSinkOperator. |
HivePassThroughOutputFormat<K,V> |
This pass-through class is used to wrap OutputFormat implementations so that new OutputFormats not derived from
HiveOutputFormat get through the checker. |
HivePassThroughRecordWriter<K extends WritableComparable<?>,V extends Writable> |
|
HiveRecordReader<K extends WritableComparable,V extends Writable> |
HiveRecordReader is a simple wrapper on RecordReader. |
HiveSequenceFileOutputFormat<K,V> |
A HiveOutputFormat that writes SequenceFiles. |
IgnoreKeyTextOutputFormat<K extends WritableComparable,V extends Writable> |
Deprecated. Use HiveIgnoreKeyTextOutputFormat instead. |
IOConstants |
|
IOContext |
IOContext basically contains the position information of the current
key/value. |
IOPrepareCache |
IOPrepareCache is used to cache pre-query io-related objects. |
NonSyncDataInputBuffer |
A non-thread-safe version of Hadoop's DataInputBuffer, which removes all
synchronized modifiers. |
NonSyncDataOutputBuffer |
A non-thread-safe version of Hadoop's DataOutputBuffer, which removes all
synchronized modifiers. |
NullRowsInputFormat |
NullRowsInputFormat outputs null rows, up to a maximum of 100. |
NullRowsInputFormat.DummyInputSplit |
|
NullRowsInputFormat.NullRowsRecordReader |
|
OneNullRowInputFormat |
OneNullRowInputFormat outputs one null row. |
OneNullRowInputFormat.OneNullRowRecordReader |
|
RCFile |
RCFiles, short for Record Columnar Files, are flat files
consisting of binary key/value pairs, which share much similarity with
SequenceFile. |
RCFile.KeyBuffer |
KeyBuffer is the key of each record in RCFile. |
RCFile.Reader |
Read KeyBuffer/ValueBuffer pairs from an RCFile. |
RCFile.ValueBuffer |
ValueBuffer is the value of each record in RCFile. |
RCFile.Writer |
Write KeyBuffer/ValueBuffer pairs to an RCFile. |
RCFileInputFormat<K extends LongWritable,V extends BytesRefArrayWritable> |
RCFileInputFormat. |
RCFileOutputFormat |
RCFileOutputFormat. |
RCFileRecordReader<K extends LongWritable,V extends BytesRefArrayWritable> |
RCFileRecordReader. |
RecordIdentifier |
Gives the record identifier information for the current record. |
SchemaAwareCompressionInputStream |
SchemaAwareCompressionInputStream adds the ability to inform the compression
stream what column is being read. |
SchemaAwareCompressionOutputStream |
SchemaAwareCompressionOutputStream adds the ability to inform the compression stream
of the current column being compressed. |
SequenceFileInputFormatChecker |
SequenceFileInputFormatChecker. |
SymbolicInputFormat |
|
SymlinkTextInputFormat |
A symlink file is a text file that contains a list of file names / directory names. |
SymlinkTextInputFormat.SymlinkTextInputSplit |
This input split wraps the FileSplit generated from
TextInputFormat.getSplits(), while setting the original link file path
as job input path. |
VectorizedRCFileInputFormat |
A MapReduce/Hive Vectorized input format for RC files. |
VectorizedRCFileRecordReader |
RCFileRecordReader. |
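
The HiveKey and DefaultHivePartitioner entries above describe a key whose hash code can be set explicitly and a partitioner that assigns keys by Object.hashCode(). The following is a minimal, self-contained sketch of that idea and does not use the Hive or Hadoop classes. SettableHashKey, partition, and the choice to throw when no hash code has been set are all hypothetical names and behaviors introduced for illustration; the sign-bit mask followed by a modulo is a common way to turn a hash code into a partition index and is assumed here rather than taken from the Hive source.

```java
public class HashPartitionSketch {

    /** Hypothetical stand-in for a key whose hash code is assigned by the caller. */
    static final class SettableHashKey {
        private final String value;
        private int hash;
        private boolean hashSet;

        SettableHashKey(String value) {
            this.value = value;
        }

        String getValue() {
            return value;
        }

        /** Stores the hash code that hashCode() will report from now on. */
        void setHashCode(int hash) {
            this.hash = hash;
            this.hashSet = true;
        }

        @Override
        public int hashCode() {
            // Assumption for this sketch: refuse to guess a hash that was never set.
            if (!hashSet) {
                throw new IllegalStateException("hash code was never set");
            }
            return hash;
        }
    }

    /** Maps a key to a partition in [0, numPartitions) using only its hashCode(). */
    static int partition(Object key, int numPartitions) {
        // Clear the sign bit so negative hash codes still give a non-negative index.
        return (key.hashCode() & Integer.MAX_VALUE) % numPartitions;
    }

    public static void main(String[] args) {
        SettableHashKey key = new SettableHashKey("row-1");
        key.setHashCode(-7);
        System.out.println(partition(key, 4)); // prints 1
    }
}
```

Because the partition depends only on hashCode(), setting the hash code on the key controls which partition a record lands in, independent of the key's contents.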