Tuning Impala The following sections explains procedures for tuning Impala queries and other SQL operations. Setting Up HDFS CachingSet up HDFS caching with Impala for improved performance.Setting up Data Cache for Remote ReadsWhen Impala compute nodes and its storage are not co-located, the network bandwidth requirement goes up as the network traffic includes the data fetch as well as the shuffling exchange traffic of intermediate results. To mitigate the pressure on the network, you can enable the compute nodes to cache the working set read from remote filesystems, such as, remote HDFS data node, S3, ABFS, ADLS.Configuring Dedicated Coordinators and ExecutorsConfigure a dedicated coordinator and a dedicated executor roles to improve scalability of Impala.