How to Use Antivirus Software on CDP Hosts
If you use antivirus software on your servers, consider configuring it to skip scans on certain types of Hadoop-specific resources.
It can take a long time to scan large files or directories with a large number of files. In
addition, if your antivirus software locks files or directories as it scans them, those
resources will be unavailable to your Hadoop processes during the scan, and can cause latency
or unavailability of resources in your cluster. Consider skipping scans on the following types
of resources:
- Scratch directories used by services such as Impala
- Log directories used by various Hadoop services
- Data directories which can grow to petabytes in size