org.apache.hadoop.hive.ql.exec
Class StatsNoJobTask
java.lang.Object
org.apache.hadoop.hive.ql.exec.Task<StatsNoJobWork>
org.apache.hadoop.hive.ql.exec.StatsNoJobTask
- All Implemented Interfaces:
- Serializable, Node
public class StatsNoJobTask
- extends Task<StatsNoJobWork>
- implements Serializable
StatsNoJobTask is used in cases where stats collection is the only task for the given query (no
parent MR or Tez job). It is used in the following cases 1) ANALYZE with partialscan/noscan for
file formats that implement StatsProvidingRecordReader interface: ORC format (implements
StatsProvidingRecordReader) stores column statistics for all columns in the file footer. Its much
faster to compute the table/partition statistics by reading the footer than scanning all the
rows. This task can be used for computing basic stats like numFiles, numRows, fileSize,
rawDataSize from ORC footer.
- See Also:
- Serialized Form
Nested classes/interfaces inherited from class org.apache.hadoop.hive.ql.exec.Task |
Task.FeedType |
Fields inherited from class org.apache.hadoop.hive.ql.exec.Task |
BACKUP_COMMON_JOIN, COMMON_JOIN, CONVERTED_MAPJOIN, CONVERTED_MAPJOIN_LOCAL, CONVERTED_SORTMERGEJOIN, HINTED_MAPJOIN, HINTED_MAPJOIN_LOCAL, MAPJOIN_ONLY_NOBACKUP, NO_TAG, taskCounters, taskHandle |
Methods inherited from class org.apache.hadoop.hive.ql.exec.Task |
addDependentTask, done, equals, executeTask, getAndInitBackupTask, getBackupChildrenTasks, getBackupTask, getChildren, getChildTasks, getCounters, getDependentTasks, getDriverContext, getFeedSubscribers, getId, getInitialized, getJobID, getParentTasks, getQueryPlan, getQueued, getReducer, getResultSchema, getTaskHandle, getTaskTag, getTopOperators, getWork, hashCode, hasReduce, ifRetryCmdWhenFail, isLocalMode, isMapRedLocalTask, isMapRedTask, isRootTask, isRunnable, removeDependentTask, removeFromChildrenTasks, requireLock, setBackupChildrenTasks, setBackupTask, setChildTasks, setConsole, setDone, setFeedSubscribers, setId, setInitialized, setLocalMode, setParentTasks, setQueryPlan, setQueued, setRetryCmdWhenFail, setRootTask, setStarted, setTaskTag, setWork, shutdown, started, subscribeFeed, toString |
StatsNoJobTask
public StatsNoJobTask()
initialize
public void initialize(HiveConf conf,
QueryPlan queryPlan,
DriverContext driverContext)
- Overrides:
initialize
in class Task<StatsNoJobWork>
execute
public int execute(DriverContext driverContext)
- Description copied from class:
Task
- This method is overridden in each Task. TODO execute should return a TaskHandle.
- Returns:
- status of executing the task
getType
public org.apache.hadoop.hive.ql.plan.api.StageType getType()
- Description copied from class:
Task
- Should be overridden to return the type of the specific task among the types in StageType.
- Specified by:
getType
in class Task<StatsNoJobWork>
- Returns:
- StageType.* or null if not overridden
getName
public String getName()
- Description copied from interface:
Node
- Gets the name of the node. This is used in the rule dispatchers.
- Specified by:
getName
in interface Node
- Returns:
- String
Copyright © 2014 The Apache Software Foundation. All rights reserved.