Learn how to create custom match rules in the Data Quality software by defining match scenarios beyond predefined entities. This process involves selecting custom options on the Match and Group screen, configuring matching options, and applying algorithms to ensure accurate data matching.
Go beyond the predefined entities, Location and Customer, to define match scenarios for non-entities. You define custom match rules alongside the existing predefined entities. Familiarity with the matching domain and with rule building is helpful when you work with the custom approach. Building a custom match rule requires an understanding of your data so that you can choose the correct configurations.
To build a custom match rule:
- On the Match and Group screen, select Custom.
- Click Match Key to define the match key options.
- Click Create Match Rule to add match rules.
- On the Group Options screen, enter a Name for the group.
- Configure the matching options for the group: Matching Method, Missing Data, Threshold Score, and Scoring Method.
- In the left pane, click the plus icon to add a field to the group.
- On the Field Options screen, select a field from the Name dropdown.
- Configure the matching options for the field: Missing Data, Threshold Score, and Scoring Method.
- Click Add algorithm to select the criteria for matching. By default, the Exact Match algorithm is selected. You can add more groups and fields from the left pane.
- Click Apply to save your settings and return to the Match and Group panel.
- Click Preview to view your output, and then click Save to add the transformation step to the pipeline.
For example, consider the custom match rule built as follows:
Here, the Area and Name group is the parent group and contains two child groups: the Area group and the Name group. The Area group contains the State and Address fields, and the Name group contains the FirstName and LastName fields. You define a matching algorithm and a Scoring Method for each added field based on your data.
- The Area and Name group is the parent and has Matching Method set to All True, so a match is detected only if both child groups satisfy the criteria.
- The Area group has Matching Method set to All True, so a match is detected only if both of its fields satisfy the criteria.
- The Name group has Matching Method set to Any True, so a match is detected if either of its fields satisfies the criteria.
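The way the nested groups in this example combine can be sketched in a few lines of Python. This is only an illustration of the All True and Any True logic described above, not the product's implementation or API: the `FieldRule` and `Group` classes, the `exact_match` helper, and the sample records are all hypothetical. The sketch also assumes that State and Address belong to the Area group and FirstName and LastName to the Name group, and it leaves out Missing Data, Threshold Score, and Scoring Method entirely.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Union

Record = Dict[str, str]


def exact_match(a: str, b: str) -> bool:
    # Hypothetical stand-in for the default Exact Match algorithm.
    return a == b


@dataclass
class FieldRule:
    # Hypothetical model of a field added to a group.
    name: str
    algorithm: Callable[[str, str], bool] = exact_match

    def matches(self, left: Record, right: Record) -> bool:
        return self.algorithm(left[self.name], right[self.name])


@dataclass
class Group:
    # Hypothetical model of a group; children are fields or other groups.
    name: str
    matching_method: str  # "All True" or "Any True"
    children: List[Union["Group", FieldRule]] = field(default_factory=list)

    def matches(self, left: Record, right: Record) -> bool:
        results = [child.matches(left, right) for child in self.children]
        if self.matching_method == "All True":
            return all(results)
        if self.matching_method == "Any True":
            return any(results)
        raise ValueError(f"Unknown matching method: {self.matching_method}")


# The rule from the example: a parent group with two child groups.
area = Group("Area group", "All True", [FieldRule("State"), FieldRule("Address")])
name = Group("Name group", "Any True", [FieldRule("FirstName"), FieldRule("LastName")])
rule = Group("Area and Name group", "All True", [area, name])

# Made-up records for illustration only.
rec_a = {"State": "NY", "Address": "1 Main St", "FirstName": "Ann", "LastName": "Lee"}
rec_b = {"State": "NY", "Address": "1 Main St", "FirstName": "Anne", "LastName": "Lee"}
rec_c = {"State": "NJ", "Address": "1 Main St", "FirstName": "Ann", "LastName": "Lee"}

same_area_and_last_name = rule.matches(rec_a, rec_b)
different_state = rule.matches(rec_a, rec_c)
```

In this sketch, `rec_a` and `rec_b` match because both Area fields agree and LastName agrees, which is enough for the Any True Name group. `rec_a` and `rec_c` do not match because State differs, so the All True Area group fails and the parent fails with it.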