Creating Parsers

Parsers transform raw data into JSON messages suitable for downstream enrichment and indexing by Cloudera Cybersecurity Platform (CCP). There is one parser for each data source and CCP pipes the information to the Enrichment/Threat Intelligence topology.

You can transform the field output in the JSON messages into information and formats that make the output more useful. For example, you can change the timestamp field output from GMT to your timezone.

You must make two decisions before you parse a new data source:

  • Type of parser to use

    CCP supports three types of parsers:

    Built-in
    CCP features several built-in parsers that support many common security devices.
    General Purpose

    CCP supports three general purpose parsers: Grok, CSV, and JSON map.

    • Grok - Regular expression-based parser extracts CCP values; ideal for ingesting structured or semi-structured logs that are well understood and telemetries with lower volumes of traffic
    • CSV - Maps CSV columns to CCP events
    • JSON Map - Maps JSON documents into CCP events
    Java

    A Java parser is appropriate for a telemetry type that is complex to parse, with high volumes of traffic.

  • How to parse

    CCP enables you to parse a new data source and transform data fields using the CCP Management module or the command line interface