This processor is deprecated and may be removed in future releases.
Please consider using one of the following alternatives: PutBigQuery
Loads data into a Google BigQuery table using the streaming API. This processor is not intended for large FlowFiles, because it loads the full content into memory. To insert large FlowFiles, consider using PutBigQueryBatch instead.
google, google cloud, bq, gcp, bigquery, record
In the list below, the names of required properties appear in bold. Any other properties (not in bold) are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.
Display Name | API Name | Default Value | Allowable Values | Description |
---|---|---|---|---|
Project ID | gcp-project-id | | | Google Cloud Project ID. Supports Expression Language: true (will be evaluated using variable registry only) |
GCP Credentials Provider Service | GCP Credentials Provider Service | | Controller Service API: GCPCredentialsService. Implementation: GCPCredentialsControllerService | The Controller Service used to obtain Google Cloud Platform credentials. |
Number of retries | gcp-retry-count | 6 | | How many retry attempts should be made before routing to the failure relationship. |
Proxy host | gcp-proxy-host | | | IP or hostname of the proxy to be used. You might need to set the following properties in bootstrap for https proxy usage: -Djdk.http.auth.tunneling.disabledSchemes= and -Djdk.http.auth.proxying.disabledSchemes= Supports Expression Language: true (will be evaluated using variable registry only) |
Proxy port | gcp-proxy-port | | | Proxy port number. Supports Expression Language: true (will be evaluated using variable registry only) |
HTTP Proxy Username | gcp-proxy-user-name | | | HTTP Proxy Username. Supports Expression Language: true (will be evaluated using variable registry only) |
HTTP Proxy Password | gcp-proxy-user-password | | | HTTP Proxy Password. Sensitive Property: true. Supports Expression Language: true (will be evaluated using variable registry only) |
Proxy Configuration Service | proxy-configuration-service | | Controller Service API: ProxyConfigurationService. Implementation: StandardProxyConfigurationService | Specifies the Proxy Configuration Controller Service to proxy network requests. If set, it supersedes proxy settings configured per component. Supported proxies: HTTP + AuthN |
Dataset | bq.dataset | ${bq.dataset} | | BigQuery dataset name (Note: the dataset must exist in GCP). Supports Expression Language: true (will be evaluated using flow file attributes and variable registry) |
Table Name | bq.table.name | ${bq.table.name} | | BigQuery table name. Supports Expression Language: true (will be evaluated using flow file attributes and variable registry) |
Ignore Unknown Values | bq.load.ignore_unknown | false | | Sets whether BigQuery should allow extra values that are not represented in the table schema. If true, the extra values are ignored. If false, records with extra columns are treated as bad records, and if there are too many bad records, an invalid error is returned in the job result. By default, unknown values are not allowed. Supports Expression Language: true (will be evaluated using flow file attributes and variable registry) |
Record Reader | bq.record.reader | | Controller Service API: RecordReaderFactory. Implementations: Syslog5424Reader, CEFReader, ReaderLookup, CiscoEmblemSyslogMessageReader, CSVReader, GrokReader, SyslogReader, JsonTreeReader, JsonPathReader, XMLReader, AvroReader, JASN1Reader, ExcelReader, ParquetReader, EBCDICRecordReader, WindowsEventLogReader, IPFIXReader, ScriptedReader | Specifies the Controller Service to use for parsing incoming data. |
Skip Invalid Rows | bq.skip.invalid.rows | false | | Sets whether to insert all valid rows of a request, even if invalid rows exist. If not set, the entire insert request will fail if it contains an invalid row. Supports Expression Language: true (will be evaluated using flow file attributes and variable registry) |
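
As a rough illustration of how the Dataset, Table Name, Ignore Unknown Values, and Skip Invalid Rows properties relate to a streaming insert, the sketch below builds the URL and JSON body for BigQuery's `tabledata.insertAll` REST method. The processor presumably goes through the Google Cloud client library rather than issuing this request directly, so the exact mapping is an assumption. `resolve` and `build_insert_all` are hypothetical helpers; `resolve` is a simplified stand-in for Expression Language evaluation and handles only plain `${attribute}` references. The project, dataset, and table names are made up.

```python
import json
import re


def resolve(value, attributes):
    """Hypothetical stand-in for Expression Language evaluation.

    Replaces each ${name} with the matching FlowFile attribute
    (empty string if the attribute is missing).
    """
    return re.sub(r"\$\{([^}]+)\}", lambda m: attributes.get(m.group(1), ""), value)


def build_insert_all(project_id, properties, attributes, records):
    """Build the URL and body of a tabledata.insertAll request (assumed mapping)."""
    dataset = resolve(properties["bq.dataset"], attributes)
    table = resolve(properties["bq.table.name"], attributes)
    url = (
        "https://bigquery.googleapis.com/bigquery/v2/"
        f"projects/{project_id}/datasets/{dataset}/tables/{table}/insertAll"
    )
    body = {
        "ignoreUnknownValues": resolve(properties["bq.load.ignore_unknown"], attributes) == "true",
        "skipInvalidRows": resolve(properties["bq.skip.invalid.rows"], attributes) == "true",
        # One entry per record produced by the Record Reader.
        "rows": [{"json": record} for record in records],
    }
    return url, body


# Property values as listed in the table above (defaults shown).
properties = {
    "bq.dataset": "${bq.dataset}",
    "bq.table.name": "${bq.table.name}",
    "bq.load.ignore_unknown": "false",
    "bq.skip.invalid.rows": "false",
}
# Hypothetical FlowFile attributes.
attributes = {"bq.dataset": "analytics", "bq.table.name": "events"}

url, body = build_insert_all("my-project", properties, attributes, [{"id": 1, "name": "a"}])
print(url)
print(json.dumps(body, indent=2))
```

With the default property values, the dataset and table are taken from the FlowFile attributes `bq.dataset` and `bq.table.name`, so an upstream processor can route different FlowFiles to different tables by setting those attributes.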
Name | Description |
---|---|
success | FlowFiles are routed to this relationship after a successful Google BigQuery operation. |
failure | FlowFiles are routed to this relationship if the Google BigQuery operation fails. |
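
The interaction between the Number of retries property and these two relationships can be pictured with the following sketch. It is a simplified model, not the processor's code: `send` is a hypothetical callable standing in for the BigQuery operation, and the sketch assumes the configured value is the total number of attempts, which this page does not state explicitly.

```python
def route(send, retry_count=6):
    """Call send() up to retry_count times and return the relationship name.

    Assumption: retry_count is treated as the total number of attempts.
    """
    for _ in range(retry_count):
        try:
            send()
            return "success"
        except Exception:
            # Treat every error as retryable in this simplified model.
            continue
    return "failure"


calls = {"n": 0}


def flaky():
    """Hypothetical operation that fails twice, then succeeds."""
    calls["n"] += 1
    if calls["n"] < 3:
        raise RuntimeError("transient error")


def always_fails():
    """Hypothetical operation that never succeeds."""
    raise RuntimeError("permanent error")


first = route(flaky)
second = route(always_fails, retry_count=2)
print(first, second)
```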
Name | Description |
---|---|
bq.records.count | Number of records successfully inserted |
Resource | Description |
---|---|
MEMORY | An instance of this component can cause high usage of this system resource. Multiple instances or high concurrency settings may result in a degradation of performance. |