Developing Apache Spark Applications
Also available as:
PDF

Read Hive ORC Tables

For existing Hive tables, Spark can read them without createOrReplaceTempView. If the table is stored as ORC format (the default), predicate Push-down, partition pruning, and vectorized query execution are also applied according to the configuration.

spark.sql("SELECT * FROM hiveTable WHERE age = 20")