Package org.apache.hadoop.hive.ql.io

Interface Summary
AcidInputFormat<V> The interface required for input formats that what to support ACID transactions.
AcidInputFormat.RawReader<V>  
AcidInputFormat.RowReader<V>  
AcidOutputFormat<V> An extension for OutputFormats that want to implement ACID transactions.
AcidUtils.Directory  
ContentSummaryInputFormat ContentSummayInputFormat provides an interface to let the input format itself figure the content summary for a give input path.
FlatFileInputFormat.SerializationContext<S> An implementation of SerializationContext is responsible for looking up the Serialization implementation for the given RecordReader.
HiveOutputFormat<K,V> HiveOutputFormat describes the output-specification for Hive's operators.
HivePartitioner<K2,V2> Partition keys by their Object.hashCode().
InputFormatChecker Check for validity of the input files.
RecordUpdater API for supporting updating records.
ReworkMapredInputFormat  
StatsProvidingRecordReader If a file format internally gathers statistics (like ORC) then it can expose the statistics through this interface.
StatsProvidingRecordWriter If a file format internally gathers statistics (like ORC) while writing then it can expose the statistics through this record writer interface.
 

Class Summary
AcidInputFormat.Options Options for controlling the record readers.
AcidOutputFormat.Options Options to control how the files are written
AcidUtils Utilities that are shared by all of the ACID input and output formats.
AcidUtils.ParsedDelta  
BucketizedHiveInputFormat<K extends WritableComparable,V extends Writable> BucketizedHiveInputFormat serves the similar function as hiveInputFormat but its getSplits() always group splits from one input file into one wrapper split.
BucketizedHiveInputSplit HiveInputSplit encapsulates an InputSplit with its corresponding inputFormatClass.
BucketizedHiveRecordReader<K extends WritableComparable,V extends Writable> BucketizedHiveRecordReader is a wrapper on a list of RecordReader.
CodecPool A global compressor/decompressor pool used to save and reuse (possibly native) compression/decompression codecs.
CombineHiveInputFormat<K extends WritableComparable,V extends Writable> CombineHiveInputFormat is a parameterized InputFormat which looks at the path name and determine the correct InputFormat for that path name from mapredPlan.pathToPartitionInfo().
CombineHiveInputFormat.CombineHiveInputSplit CombineHiveInputSplit encapsulates an InputSplit with its corresponding inputFormatClassName.
CombineHiveRecordReader<K extends WritableComparable,V extends Writable> CombineHiveRecordReader.
DefaultHivePartitioner<K2,V2> Partition keys by their Object.hashCode().
FlatFileInputFormat<T> An InputFormat for Plain files with Deserializer records.
FlatFileInputFormat.RowContainer<T> A work-around until HADOOP-1230 is fixed.
FlatFileInputFormat.SerializationContextFromConf<S> An implementation of FlatFileInputFormat.SerializationContext that reads the Serialization class and specific subclass to be deserialized from the JobConf.
HiveBinaryOutputFormat<K extends WritableComparable,V extends Writable> HiveBinaryOutputFormat writes out the values consecutively without any separators.
HiveContextAwareRecordReader<K,V> This class prepares an IOContext, and provides the ability to perform a binary search on the data.
HiveFileFormatUtils An util class for various Hive file format tasks.
HiveIgnoreKeyTextOutputFormat<K extends WritableComparable,V extends Writable> HiveIgnoreKeyTextOutputFormat replaces key with null before feeding the to TextOutputFormat.RecordWriter.
HiveInputFormat<K extends WritableComparable,V extends Writable> HiveInputFormat is a parameterized InputFormat which looks at the path name and determine the correct InputFormat for that path name from mapredPlan.pathToPartitionInfo().
HiveInputFormat.HiveInputSplit HiveInputSplit encapsulates an InputSplit with its corresponding inputFormatClass.
HiveKey HiveKey is a simple wrapper on Text which allows us to set the hashCode easily.
HiveKey.Comparator A Comparator optimized for HiveKey.
HiveNullValueSequenceFileOutputFormat<K,V> A HiveOutputFormat that writes SequenceFiles with the content saved in the keys, and null in the values.
HiveOutputFormatImpl<K extends WritableComparable<K>,V extends Writable> Hive does not use OutputFormat's in a conventional way, but constructs and uses the defined OutputFormat for each table from FileSinkOperator.
HivePassThroughOutputFormat<K,V> This pass through class is used to wrap OutputFormat implementations such that new OutputFormats not derived from HiveOutputFormat gets through the checker
HivePassThroughRecordWriter<K extends WritableComparable<?>,V extends Writable>  
HiveRecordReader<K extends WritableComparable,V extends Writable> HiveRecordReader is a simple wrapper on RecordReader.
HiveSequenceFileOutputFormat<K,V> A HiveOutputFormat that writes SequenceFiles.
IgnoreKeyTextOutputFormat<K extends WritableComparable,V extends Writable> Deprecated. use HiveIgnoreKeyTextOutputFormat instead}
IOConstants  
IOContext IOContext basically contains the position information of the current key/value.
IOPrepareCache IOPrepareCache is used to cache pre-query io-related objects.
NonSyncDataInputBuffer A thread-not-safe version of Hadoop's DataInputBuffer, which removes all synchronized modifiers.
NonSyncDataOutputBuffer A thread-not-safe version of Hadoop's DataOutputBuffer, which removes all synchronized modifiers.
NullRowsInputFormat NullRowsInputFormat outputs null rows, maximum 100.
NullRowsInputFormat.DummyInputSplit  
NullRowsInputFormat.NullRowsRecordReader  
OneNullRowInputFormat OneNullRowInputFormat outputs one null row.
OneNullRowInputFormat.OneNullRowRecordReader  
RCFile RCFiles, short of Record Columnar File, are flat files consisting of binary key/value pairs, which shares much similarity with SequenceFile.
RCFile.KeyBuffer KeyBuffer is the key of each record in RCFile.
RCFile.Reader Read KeyBuffer/ValueBuffer pairs from a RCFile.
RCFile.ValueBuffer ValueBuffer is the value of each record in RCFile.
RCFile.Writer Write KeyBuffer/ValueBuffer pairs to a RCFile.
RCFileInputFormat<K extends LongWritable,V extends BytesRefArrayWritable> RCFileInputFormat.
RCFileOutputFormat RCFileOutputFormat.
RCFileRecordReader<K extends LongWritable,V extends BytesRefArrayWritable> RCFileRecordReader.
RecordIdentifier Gives the Record identifer information for the current record.
SchemaAwareCompressionInputStream SchemaAwareCompressionInputStream adds the ability to inform the compression stream what column is being read.
SchemaAwareCompressionOutputStream SchemaAwareCompressionOutputStream adds the ability to inform the comression stream the current column being compressed.
SequenceFileInputFormatChecker SequenceFileInputFormatChecker.
SymbolicInputFormat  
SymlinkTextInputFormat Symlink file is a text file which contains a list of filename / dirname.
SymlinkTextInputFormat.SymlinkTextInputSplit This input split wraps the FileSplit generated from TextInputFormat.getSplits(), while setting the original link file path as job input path.
VectorizedRCFileInputFormat A MapReduce/Hive Vectorized input format for RC files.
VectorizedRCFileRecordReader RCFileRecordReader.
 

Enum Summary
IOContext.Comparison  
 



Copyright © 2014 The Apache Software Foundation. All rights reserved.