Table lookup is a transformation step to search matching fields in a reference dataset (also called as Lookup Dataset) and add fields to the pipeline based on the matching fields.
The matching criteria is defined by mapping the input data field (Pipeline Field) with the lookup dataset field (Table Field). You can add multiple fields to map. The lookup dataset is used to retrieve matching information and add additional fields to the input dataset.
- Select the Table Lookup step from the Enrich type step.
- Choose Match Fields.
- Select the Output configuration fields that you want to include in the output.
- Lookup Dataset must be a cataloged asset.
- In case the mapped field has two different values, the system selects
the first value as the reference. For example, the mapped field
DeptCodehas two values for101asHRandHumanResource. In this case, the first value selected as reference will beHR, considering the lookup dataset is sorted as low to high. - The dataset lookup supports all data types and is case-sensitive.
Usage example:
| EmpId | EmployeeName | Office_Address | Age | Designation | Dept_ID |
|---|---|---|---|---|---|
| 4234855 | John Smith Jr | 5600 Cottle Rd Bldg 24 | 33 | SoftwareDeveloper | 101 |
| 4234930 | William Bill | 1400 S Gloster St | 38 | TechnologyArchitect | 102 |
| 4231180 | Mark Waugh | 7 Barnett Plaza | 47 | Cybersecurity | 103 |
Lookup dataset (reference dataset):
| DeptCode | DeptName |
|---|---|
| 101 | HumanResources |
| 102 | InformationTechnology |
Match fields:
| Pipeline Field | Table Field |
|---|---|
| Dept_ID | DeptCode |
Step preview:
Dept_ID and DeptCode as match
fields. The step preview displays the mapped pipeline fields and the selected fields
in output configuration. Here, the Pipeline Field is
Dept_ID and the field selected in Output
configuration is DeptName.
| Step Preview | |
|---|---|
| Dept_ID | DeptName |
| 101 | HumanResources |
| 102 | InformationTechnology |
Output dataset:
| EmpId | EmployeeName | Office_Address | Age | Designation | Dept_ID | DeptName |
|---|---|---|---|---|---|---|
| (Input field) | (Input field) | (Input field) | (Input field) | (Input field) | (Input field/pipeline field) | (Field selected in output configuration) |
| 4234855 | John Smith Jr | 5600 Cottle Rd Bldg 24 | 33 | SoftwareDeveloper | 101 | HumanResources |
| 4234930 | William Bill | 1400 S Gloster St | 38 | TechnologyArchitect | 102 | InformationTechnology |
Step properties
Specify Properties as below for transformation step.
Lookup Dataset: Select the cataloged reference dataset used to lookup mapped fields from the data catalog. In case the data sample is not available, you can upload or generate a new sample dataset.
- Pipeline Field: Select the field from input dataset.
- Table Field: Select the field from reference dataset or lookup dataset.
Add Field to Map: Adds additional fields to map pipeline and table fields.
Sort lookup dataset: Arranges the items in the selected match field. You can select the sort order as low to high or high to low.
Output configuration
Select fields that you want to include in the pipeline from the lookup dataset. You can either choose to include all fields or specific fields from the lookup dataset. When you save the transformation step, the selected lookup dataset fields under output configuration are added to the pipeline.
Output fields
The output dataset contains input dataset fields and the selected lookup dataset fields (under output configuration). The dataset field added to the output is blank when the match is not identified.