Calculates a fuzzy/locality-sensitive hash value for the Content of a FlowFile and puts that hash value on the FlowFile as an attribute whose name is determined by the <Hash Attribute Name> property.Note: this processor only offers non-cryptographic hash algorithms. And it should be not be seen as a replacement to the HashContent processor.Note: The underlying library loads the entirety of the streamed content into and performs result evaluations in memory. Accordingly, it is important to consider the anticipated profile of content being evaluated by this processor and the hardware supporting it especially when working against large files.
hashing, fuzzy-hashing, cyber-security
In the list below, the names of required properties appear in bold. Any other properties (not in bold) are considered optional. The table also indicates any default values.
Display Name | API Name | Default Value | Allowable Values | Description |
---|---|---|---|---|
Hash Attribute Name | ATTRIBUTE_NAME | fuzzyhash.value | The name of the FlowFile Attribute that should hold the Fuzzy Hash Value | |
Hashing Algorithm | HASH_ALGORITHM |
| The hashing algorithm utilised |
Name | Description |
---|---|
success | Any FlowFile that is successfully hashed will be sent to this Relationship. |
failure | Any FlowFile that is successfully hashed will be sent to this Relationship. |
Name | Description |
---|---|
<Hash Attribute Name> | This Processor adds an attribute whose value is the result of Hashing the existing FlowFile content. The name of this attribute is specified by the <Hash Attribute Name> property |