Setting the scratch space limit for spilling Impala queries
If a query runs out of space to store its data on the SSD, the query fails. To prevent these failures, configure scratch space limits for Impala Virtual Warehouses in the Cloudera Data Warehouse (CDW) service.
- Attached solid-state drive (SSD) storage, using the NVMe (Non-Volatile Memory Express) protocol.
- Instance storage of 2 x 300 GB, allocated as follows:
  - Data cache (200 GiB default value)
  - Scratch space (200 GiB default value)
Scratch space must be specified in a base-2 storage unit, such as gibibytes (GiB).
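Because disk capacity is often quoted in decimal gigabytes (GB) while scratch space uses a base-2 unit, a quick conversion can help when sizing. The following Python sketch is illustrative only; the function names are hypothetical and are not part of any Impala or CDW tooling.

```python
GB = 10**9   # decimal gigabyte, in bytes
GIB = 2**30  # binary gibibyte, in bytes


def gb_to_gib(gigabytes: float) -> float:
    """Convert decimal gigabytes (GB) to binary gibibytes (GiB)."""
    return gigabytes * GB / GIB


def gib_to_bytes(gibibytes: int) -> int:
    """Return the exact byte count for a whole number of GiB."""
    return gibibytes * GIB


if __name__ == "__main__":
    print(f"300 GB is about {gb_to_gib(300):.1f} GiB")  # about 279.4 GiB
    print(f"200 GiB is {gib_to_bytes(200)} bytes")      # 214748364800 bytes
```

The gap between the two units is roughly 7% at this scale, which is worth keeping in mind when comparing a limit in GiB against a drive size in GB.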
If you have spilling queries that require more scratch space than your compute nodes provide, you need to configure additional scratch space for Impala Virtual Warehouses on Amazon Elastic Block Store (EBS). You configure scratch space limits when you create an Impala Virtual Warehouse. After the Virtual Warehouse is created, you cannot add scratch space limits by editing its configuration. Configuring scratch space on EBS volumes incurs additional cost. See Amazon EBS Pricing for details.
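As a rough sizing aid, the sketch below estimates how much scratch space a workload might need beyond the local default. It is a hypothetical helper, not part of CDW. It assumes the 200 GiB default scratch space listed earlier applies per compute node, and the peak spill figure is a value you would have to measure or estimate yourself.

```python
import math

LOCAL_SCRATCH_GIB = 200  # default local scratch space (assumed to be per node)


def extra_scratch_gib(peak_spill_gib: float, local_gib: int = LOCAL_SCRATCH_GIB) -> int:
    """Return the whole GiB of additional scratch space needed, or 0 if local space suffices."""
    shortfall = peak_spill_gib - local_gib
    # Round up so the result never undersizes the extra space.
    return max(0, math.ceil(shortfall))


if __name__ == "__main__":
    for peak in (150, 200, 260.5):
        print(f"peak spill {peak} GiB -> extra scratch {extra_scratch_gib(peak)} GiB")
```

Because the limits cannot be added after the Virtual Warehouse is created, it can be safer to round such an estimate up before creation rather than size it exactly.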