Snowflake

Data Integrity Suite

  • Pipeline engine name: Enter a meaningful name for the pipeline engine.
  • Type: Specifies Snowflake as the datasource type for this pipeline engine.
  • Connection: Lists the available connections for the selected datasource.
  • Schema: Select a schema within the Snowflake datasource where the pipeline engine has write permissions. This schema is used by the pipeline engine for reading and writing temporary staging data during processing, not just for accessing the data to be processed.
  • Session query timeout in seconds (optional): Specifies the session-level Snowflake query timeout, which is the maximum time, in seconds, that a query can run before it is automatically stopped.
    Note: The session-level query timeout takes precedence over timeouts set at the user, warehouse, and account levels, so a value entered here overrides any timeout configured at those levels.
  • Enrich datasets database (optional): Specifies the name of the data share database in your Snowflake environment. Enter the exact name of the data share database so that the pipeline can access its datasets when it runs an Enrich step. Because you choose this name when you create the database from the data share, you can save the data share under any name that suits your environment.
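
For readers who want to see what a session-level timeout looks like on the Snowflake side, the following minimal sketch builds the corresponding SQL statement. It assumes the field maps to Snowflake's STATEMENT_TIMEOUT_IN_SECONDS session parameter, which this guide does not confirm; the function name and the validation bounds (0 to 604800 seconds, taken from Snowflake's parameter reference) are illustrative only. Snowflake's reference for this parameter also notes that when a timeout is set on the warehouse as well, the lowest non-zero value is enforced, so it may be worth checking warehouse settings.

```python
# Hypothetical helper: not part of the Data Integrity Suite.
# Assumes the timeout field corresponds to STATEMENT_TIMEOUT_IN_SECONDS.

MAX_TIMEOUT_SECONDS = 604800  # 7 days, the documented upper bound in Snowflake


def session_timeout_statement(timeout_seconds: int) -> str:
    """Return the ALTER SESSION statement for a session-level query timeout."""
    # bool is a subclass of int in Python, so reject it explicitly.
    if isinstance(timeout_seconds, bool) or not isinstance(timeout_seconds, int):
        raise TypeError("timeout_seconds must be an integer")
    if not 0 <= timeout_seconds <= MAX_TIMEOUT_SECONDS:
        raise ValueError(
            f"timeout_seconds must be between 0 and {MAX_TIMEOUT_SECONDS}"
        )
    return f"ALTER SESSION SET STATEMENT_TIMEOUT_IN_SECONDS = {timeout_seconds}"


if __name__ == "__main__":
    # Example: stop any query in this session after five minutes.
    print(session_timeout_statement(300))
```

Running the statement in a Snowflake worksheet before configuring the pipeline engine is one way to check how a given timeout behaves with your own queries.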

Before using "Enrich datasets database" for Snowflake

Note:
  1. Set up data share: Ensure you have set up a data share that contains the datasets you intend to use for data enrichment. This share should include all the relevant datasets required for the Enrich step.
  2. Create a database from the data share: If you are using the data share for the first time, you must create a database from it within your workspace. To do this:
    • Navigate to the workspace's Data > Private Sharing > Shared With You section and locate the data share containing the enrich datasets.
    • Select Get Data associated with the data share. This creates a database that includes the share within your workspace, making it accessible for future enrichment steps.
  3. Name your database: While creating the database, you can provide a name for it. This name identifies the collection of datasets used by the Enrich step, and it is the value to enter in the Enrich datasets database field.
  4. Access the database: Once the data share database is created, you can access the datasets within your workspace's Data > Databases section. The datasets will be organized under the database name you provided earlier.
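
The steps above use the Snowflake web interface. As a rough SQL equivalent, the sketch below builds a CREATE DATABASE ... FROM SHARE statement and a SHOW SCHEMAS statement for checking the result. The function names and the example values (ENRICH_DATA, PROVIDER_ACCT, ENRICH_SHARE) are hypothetical placeholders, not names used by the Data Integrity Suite; substitute the provider account and share name shown for your own data share.

```python
import re

# Unquoted Snowflake identifiers: a letter or underscore, followed by
# letters, digits, underscores, or dollar signs.
_IDENTIFIER = re.compile(r"[A-Za-z_][A-Za-z0-9_$]*")


def _check_identifier(value: str, label: str) -> str:
    """Raise ValueError unless value is a valid unquoted identifier."""
    if not _IDENTIFIER.fullmatch(value):
        raise ValueError(f"{label} is not a valid unquoted identifier: {value!r}")
    return value


def create_database_from_share(database: str, provider_account: str, share: str) -> str:
    """Build the statement that creates a database from an inbound share."""
    _check_identifier(database, "database")
    _check_identifier(share, "share")
    # A provider account may be written as organization.account.
    for part in provider_account.split("."):
        _check_identifier(part, "provider_account")
    return f"CREATE DATABASE {database} FROM SHARE {provider_account}.{share}"


def show_schemas_statement(database: str) -> str:
    """Build a statement that lists the schemas in the new database."""
    _check_identifier(database, "database")
    return f"SHOW SCHEMAS IN DATABASE {database}"


if __name__ == "__main__":
    # Placeholder names for illustration only.
    print(create_database_from_share("ENRICH_DATA", "PROVIDER_ACCT", "ENRICH_SHARE"))
    print(show_schemas_statement("ENRICH_DATA"))
```

Whichever way the database is created, the name given to it (ENRICH_DATA in this sketch) is the value that the Enrich datasets database field expects.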