Using the S3Guard CLI

The S3Guard CLI offers other maintenance commands. For information on how to use them, refer to the Apache documentation.

One useful command is hadoop s3guard prune -tombstone, which removes "tombstone markers" from the table. These entries are created when files are deleted, so that recently deleted files can be identified and omitted from listings. After the S3 store has become consistent with these deletions, the markers are no longer needed. The following example prunes tombstone markers that are more than one day old:
> hadoop s3guard prune -tombstone -days 1 s3a://hwdev-steve-ireland-new/

2019-08-20 16:55:40,790 [main] INFO  s3guard.S3GuardTool - Metadata store DynamoDBMetadataStore{region=eu-west-1, tableName=example1, tableArn=arn:aws:dynamodb:eu-west-1:98067886600:table/example1} is initialized.
2019-08-20 16:55:40,810 [main] INFO  s3guard.DynamoDBMetadataStore - Starting: Pruning DynamoDB Store
2019-08-20 16:55:40,846 [main] INFO  s3guard.DynamoDBMetadataStore - Pruning DynamoDB Store: duration 0:00.036s
2019-08-20 16:55:40,846 [main] INFO  s3guard.DynamoDBMetadataStore - Finished pruning 0 items in batches of 25
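
If the prune command is run regularly, it can help to wrap it in a small shell function. The following is a minimal sketch, not part of the S3Guard tooling: the function name prune_tombstones, the DRY_RUN variable, and the bucket s3a://example-bucket/ are hypothetical names chosen for illustration. It uses only the flags shown in the example above.

```shell
#!/bin/sh
# Hypothetical helper (not part of S3Guard): wraps the prune command shown above.
# Arguments:
#   $1  bucket URI, for example s3a://example-bucket/ (placeholder name)
#   $2  age in days of tombstone markers to prune (defaults to 1)
# If DRY_RUN=1 is set, the command is printed instead of executed.
prune_tombstones() {
    bucket="$1"
    days="${2:-1}"

    # Refuse to run without a bucket URI.
    if [ -z "$bucket" ]; then
        echo "usage: prune_tombstones <s3a://bucket/> [days]" >&2
        return 2
    fi

    if [ "${DRY_RUN:-0}" = "1" ]; then
        # Dry run: only show what would be executed.
        echo "hadoop s3guard prune -tombstone -days ${days} ${bucket}"
    else
        hadoop s3guard prune -tombstone -days "${days}" "${bucket}"
    fi
}
```

Setting DRY_RUN=1 before calling prune_tombstones s3a://example-bucket/ 1 prints the command without contacting the metadata store, which is a convenient check before scheduling the real run.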