org.apache.hadoop.hive.ql.optimizer.physical
Class GenMRSkewJoinProcessor

java.lang.Object
  extended by org.apache.hadoop.hive.ql.optimizer.physical.GenMRSkewJoinProcessor

public final class GenMRSkewJoinProcessor
extends Object

GenMRSkewJoinProcessor.


Method Summary
static void processSkewJoin(JoinOperator joinOp, Task<? extends Serializable> currTask, ParseContext parseCtx)
          Create tasks for processing skew joins.
static boolean skewJoinEnabled(HiveConf conf, JoinOperator joinOp)
           
 
Methods inherited from class java.lang.Object
equals, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Method Detail

processSkewJoin

public static void processSkewJoin(JoinOperator joinOp,
                                   Task<? extends Serializable> currTask,
                                   ParseContext parseCtx)
                            throws SemanticException
Create tasks for processing skew joins. The idea (HIVE-964) is to handle skew joins with separate jobs and map-joins.

For each table, we launch one map-join job that takes as input the directory containing the big keys of that table and the corresponding directories of the other tables. (Actually, one job for one row in the above.)

For further discussion, see https://issues.apache.org/jira/browse/HIVE-964.
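
The per-table job layout described above can be illustrated with a small, self-contained sketch. It is not Hive's implementation and uses no Hive classes: the class SkewJoinPlanSketch, the method planSkewJoinJobs, and the directory naming scheme are all hypothetical, chosen only to show which directories each follow-up map-join job would read if one job is planned per table.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

public class SkewJoinPlanSketch {

    /** Hypothetical description of one follow-up map-join job. */
    static final class MapJoinJobPlan {
        final String bigTable;          // table whose big keys this job handles
        final List<String> inputDirs;   // one input directory per joined table

        MapJoinJobPlan(String bigTable, List<String> inputDirs) {
            this.bigTable = bigTable;
            this.inputDirs = inputDirs;
        }

        @Override
        public String toString() {
            return "map-join job for big keys of " + bigTable + " reads " + inputDirs;
        }
    }

    /**
     * Plans one map-join job per table. Each job reads the directory holding
     * that table's big keys plus, for every other table, the directory holding
     * the rows with the same keys. Directory names are illustrative only.
     */
    static List<MapJoinJobPlan> planSkewJoinJobs(List<String> tables) {
        List<MapJoinJobPlan> plans = new ArrayList<>();
        for (String big : tables) {
            List<String> dirs = new ArrayList<>();
            for (String t : tables) {
                if (t.equals(big)) {
                    dirs.add("dir-" + big + "-bigkeys");
                } else {
                    dirs.add("dir-" + t + "-keys-for-" + big);
                }
            }
            plans.add(new MapJoinJobPlan(big, dirs));
        }
        return plans;
    }

    public static void main(String[] args) {
        // Three joined tables yield three planned jobs in this sketch.
        for (MapJoinJobPlan plan : planSkewJoinJobs(Arrays.asList("T1", "T2", "T3"))) {
            System.out.println(plan);
        }
    }
}
```

In this sketch every job sees exactly one big-key directory, so the skewed side of each join is isolated in its own job while the matching rows from the other tables are read from their own directories.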

Throws:
SemanticException

skewJoinEnabled

public static boolean skewJoinEnabled(HiveConf conf,
                                      JoinOperator joinOp)


Copyright © 2014 The Apache Software Foundation. All rights reserved.