GetFTP

Description:

Fetches files from an FTP Server and creates FlowFiles from them

Tags:

FTP, get, retrieve, files, fetch, remote, ingest, source, input

Properties:

In the list below, the names of required properties appear in bold. Any other properties (not in bold) are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

Display NameAPI NameDefault ValueAllowable ValuesDescription
HostnameHostnameThe fully qualified hostname or IP address of the remote system
Supports Expression Language: true (will be evaluated using Environment variables only)
PortPort21The port that the remote system is listening on for file transfers
Supports Expression Language: true (will be evaluated using Environment variables only)
UsernameUsernameUsername
Supports Expression Language: true (will be evaluated using Environment variables only)
PasswordPasswordPassword for the user account
Sensitive Property: true
Supports Expression Language: true (will be evaluated using Environment variables only)
Connection ModeConnection ModePassive
  • Active
  • Passive
The FTP Connection Mode
Transfer ModeTransfer ModeBinary
  • Binary
  • ASCII
The FTP Transfer Mode
Remote PathRemote PathThe path on the remote system from which to pull or push files
Supports Expression Language: true (will be evaluated using Environment variables only)
File Filter RegexFile Filter RegexProvides a Java Regular Expression for filtering Filenames; if a filter is supplied, only files whose names match that Regular Expression will be fetched
Path Filter RegexPath Filter RegexWhen Search Recursively is true, then only subdirectories whose path matches the given Regular Expression will be scanned
Polling IntervalPolling Interval60 secDetermines how long to wait between fetching the listing for new files
Search RecursivelySearch Recursivelyfalse
  • true
  • false
If true, will pull files from arbitrarily nested subdirectories; otherwise, will not traverse subdirectories
Follow symlinkfollow-symlinkfalse
  • true
  • false
If true, will pull even symbolic files and also nested symbolic subdirectories; otherwise, will not read symbolic files and will not traverse symbolic link subdirectories
Ignore Dotted FilesIgnore Dotted Filestrue
  • true
  • false
If true, files whose names begin with a dot (".") will be ignored
Delete OriginalDelete Originaltrue
  • true
  • false
Determines whether or not the file is deleted from the remote system after it has been successfully transferred
Connection TimeoutConnection Timeout30 secAmount of time to wait before timing out while creating a connection
Data TimeoutData Timeout30 secWhen transferring a file between the local and remote system, this value specifies how long is allowed to elapse without any data being transferred between systems
Max SelectsMax Selects100The maximum number of files to pull in a single connection
Remote Poll Batch SizeRemote Poll Batch Size5000The value specifies how many file paths to find in a given directory on the remote system when doing a file listing. This value in general should not need to be modified but when polling against a remote system with a tremendous number of files this value can be critical. Setting this value too high can result very poor performance and setting it too low can cause the flow to be slower than normal.
Use Natural OrderingUse Natural Orderingfalse
  • true
  • false
If true, will pull files in the order in which they are naturally listed; otherwise, the order in which the files will be pulled is not defined
Proxy Configuration Serviceproxy-configuration-serviceController Service API:
ProxyConfigurationService
Implementation: StandardProxyConfigurationService
Specifies the Proxy Configuration Controller Service to proxy network requests. If set, it supersedes proxy settings configured per component. Supported proxies: SOCKS + AuthN, HTTP + AuthN
Proxy TypeProxy TypeDIRECT
  • DIRECT
  • HTTP
  • SOCKS
Proxy type used for file transfers
Proxy HostProxy HostThe fully qualified hostname or IP address of the proxy server
Supports Expression Language: true (will be evaluated using Environment variables only)
Proxy PortProxy PortThe port of the proxy server
Supports Expression Language: true (will be evaluated using Environment variables only)
Http Proxy UsernameHttp Proxy UsernameHttp Proxy Username
Supports Expression Language: true (will be evaluated using Environment variables only)
Http Proxy PasswordHttp Proxy PasswordHttp Proxy Password
Sensitive Property: true
Supports Expression Language: true (will be evaluated using Environment variables only)
Internal Buffer SizeInternal Buffer Size16KBSet the internal buffer size for buffered data streams
Use UTF-8 Encodingftp-use-utf8false
  • true
  • false
Tells the client to use UTF-8 encoding when processing files and filenames. If set to true, the server must also support UTF-8 encoding.

Relationships:

NameDescription
successAll FlowFiles that are received are routed to success

Reads Attributes:

None specified.

Writes Attributes:

NameDescription
filenameThe filename is set to the name of the file on the remote server
pathThe path is set to the path of the file's directory on the remote server. For example, if the <Remote Path> property is set to /tmp, files picked up from /tmp will have the path attribute set to /tmp. If the <Search Recursively> property is set to true and a file is picked up from /tmp/abc/1/2/3, then the path attribute will be set to /tmp/abc/1/2/3
file.lastModifiedTimeThe date and time that the source file was last modified
file.lastAccessTimeThe date and time that the file was last accessed. May not work on all file systems
file.ownerThe numeric owner id of the source file
file.groupThe numeric group id of the source file
file.permissionsThe read/write/execute permissions of the source file
absolute.pathThe full/absolute path from where a file was picked up. The current 'path' attribute is still populated, but may be a relative path

State management:

This component does not store state.

Restricted:

This component is not restricted.

Input requirement:

This component does not allow an incoming relationship.

System Resource Considerations:

None specified.

See Also:

PutFTP