The Data tab serves as a central hub for managing data integration, processing, and configuration, providing tools to handle data sources, agents, pipelines, replication, and sample datasets.
Data overview
The Data tab is a central hub for managing and configuring data-related components within the Data Integrity Suite. It provides access to settings that facilitate data integration, processing, and management. You can drill down into each section for more information:
- Connect to your datasources, such as databases, object storage, analytics systems, and more. Data Integrity Suite supports a wide range of connections through native connectors or JDBC.
- Refer to Datasources for more information.
- Install data agents on infrastructure like ETL servers to execute pipeline processes and integration tasks. Agents can be downloaded, configured, and updated.
- Refer to Agents for more information.
- Set up execution engines to run data pipelines for transformation and integration. You can leverage managed services such as Databricks and Dataproc or configure your own Spark environment.
- Refer to Pipeline engines for more information.
- Configure connections to handle continuous and mainframe data replication.
- Refer to Replication connections for more information.
- Install runtime engines for continuous data replication from sources such as mainframe and enterprise databases. These engines enable real-time data ingestion through change data capture.
- Refer to Replication engines for more information.
- Set policies for pipeline runs, such as data sampling percentages, output file formats, and data storage locations.
- Refer to Data samples for more information.