Command Line Installation

Configuring Falcon Entities

Falcon uses the following XML entity definitions to build your data pipeline:

  • Cluster: Defines where your data and processes are stored.

  • Feed: Defines the datasets to be cleaned and processed.

  • Process: Consumes feeds, invokes processing logic, and produces further feeds.

After you have installed Falcon, edit the example entities shown in "Defining Data Pipelines" (in Data Governance with Apache Falcon), or create your own based on Falcon Schemas (also in the Data Governance guide).
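
For orientation, here is a minimal sketch of what a cluster entity might look like. It follows the general shape of the Falcon cluster schema, but the cluster name, host names, ports, component versions, and HDFS paths are placeholder assumptions, not values from this guide. Check the Falcon Schemas reference for the authoritative element list before using it.

```xml
<?xml version="1.0" encoding="UTF-8"?>
<!-- Hypothetical cluster entity: every name, endpoint, version, and path
     below is a placeholder to be replaced with values for your environment. -->
<cluster name="primaryCluster" description="Example cluster" colo="default"
         xmlns="uri:falcon:cluster:0.1">
  <interfaces>
    <!-- Read-only and write access to HDFS -->
    <interface type="readonly" endpoint="hftp://namenode.example.com:50070" version="2.2.0"/>
    <interface type="write" endpoint="hdfs://namenode.example.com:8020" version="2.2.0"/>
    <!-- Job submission endpoint (YARN ResourceManager) -->
    <interface type="execute" endpoint="resourcemanager.example.com:8050" version="2.2.0"/>
    <!-- Oozie, which Falcon uses to schedule workflows -->
    <interface type="workflow" endpoint="http://oozie.example.com:11000/oozie/" version="4.0.0"/>
    <!-- Message broker used for Falcon notifications -->
    <interface type="messaging" endpoint="tcp://falcon.example.com:61616?daemon=true" version="5.1.6"/>
  </interfaces>
  <locations>
    <!-- HDFS directories Falcon uses for its own artifacts on this cluster -->
    <location name="staging" path="/apps/falcon/primaryCluster/staging"/>
    <location name="temp" path="/tmp"/>
    <location name="working" path="/apps/falcon/primaryCluster/working"/>
  </locations>
</cluster>
```

Feed and process entities reference a cluster entity by its name attribute, so the cluster is normally defined first.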