Options in this dialog box enable you to define and edit the match keys that the Match and Group step uses to compare and group records. Match keys reduce the number of records that match scenarios must compare, which reduces processing time.
Match keys consist of rules that apply algorithms to schema fields in a dataset. A typical match key is generated from a combination of fields based on the match scenario. For example, a person's first, middle, and last names characterize individuals in a contact database, while the account name characterizes account records. Use options in this dialog box to define or edit match keys. For each field in a match key, you specify the algorithm to use for matching values.
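The following Python sketch illustrates the general idea of a match key as a list of field and algorithm rules, and how grouping records by key value limits the pairs that need detailed comparison. It is not the product's implementation: the algorithm functions (`first_letter`, `lowercase_exact`), the field names, and the sample records are all hypothetical.

```python
from collections import defaultdict
from itertools import combinations

# Hypothetical algorithms for illustration; the product's algorithms may differ.
def first_letter(value):
    """Reduce a value to its first letter, uppercased."""
    return value.strip()[:1].upper()

def lowercase_exact(value):
    """Compare on the whole value, ignoring case and surrounding spaces."""
    return value.strip().lower()

# A match key modeled as a list of rules: one (field, algorithm) pair per rule.
contact_key = [("first_name", first_letter), ("last_name", lowercase_exact)]

def key_value(record, match_key):
    """Apply each rule's algorithm to its field and combine the results."""
    return tuple(algorithm(record[field]) for field, algorithm in match_key)

def candidate_pairs(records, match_key):
    """Return index pairs of records that share the same match key value."""
    groups = defaultdict(list)
    for index, record in enumerate(records):
        groups[key_value(record, match_key)].append(index)
    pairs = []
    for indexes in groups.values():
        pairs.extend(combinations(indexes, 2))
    return pairs

records = [
    {"first_name": "Jon", "last_name": "Smith"},
    {"first_name": "John", "last_name": "smith"},
    {"first_name": "Mary", "last_name": "Jones"},
    {"first_name": "Jane", "last_name": "Smith "},
]

pairs = candidate_pairs(records, contact_key)
print(pairs)  # [(0, 1), (0, 3), (1, 3)]
```

With four records, comparing every record against every other would require six comparisons. In this sketch, only the three pairs that share a key value would be passed on for detailed comparison.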
Select Match Key: Select the match key that you want to apply to the data. The fields in the match key, and the algorithm that the match key applies to each field, are displayed in a table. You can add, edit, or delete match keys in this list, except for the Auto-Generated Match Key. The Match and Group step initially creates the Auto-Generated Match Key based on distinguishing fields from entities in the sample data. It is empty if the sample data contains no entities. You cannot rename, edit, or delete the Auto-Generated Match Key. You can, however, copy the Auto-Generated Match Key and edit the copy's field specifications to match the fields evaluated by a scenario.
- To add a new match key, click the New Match Key button next to the currently selected match key. This displays an empty field table for the new match key.
- To duplicate the selected match key, first click the menu button, then click Duplicate Match Key. This lets you use an existing key as the template for a new key. You can then rename the duplicated match key and edit fields and algorithms to create a variation of the duplicated match key.
- To delete the selected match key, first click the menu button, then click Delete.
- To rename the selected match key, click the field name box above the field table and edit the name.
- To add a field to a match key, click Add field. This expands the Match Key Field Options panel on which you can specify the field and algorithm. Click the Done button to add the new field to the field table.
- To delete a field and its algorithm from the field table, click the Delete Rule button in the Algorithm column of the table.
Match Key Field Options: This panel is expanded when you click Add field. Use it to select a field and the match algorithm that you want to apply to the field. The field and algorithm settings constitute a single match key rule.
- Field: Select a field from the box.
- Algorithm: Select the match algorithm from those that are available for the selected field. There may be additional settings for an algorithm.
- Remove noise characters: Select this check box to remove noise characters before comparing values. This increases matching accuracy, as noise characters can detract from the ability to find matching records. For example, the values "Maverick (Aerospace) Ltd" and "Maverick Aerospace Ltd" are a strong match with a character edit distance of 2, but they are an exact match after this option removes the parentheses.
- Done: Saves changes to an existing match key rule or adds the new rule to the match key.
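The effect of the Remove noise characters option on the example values can be checked with a short Python sketch. This is an illustration only: the set of noise characters used here (parentheses) and the function names are assumptions, and the product's own noise list and distance algorithm may differ. The edit distance shown is the standard Levenshtein distance.

```python
def edit_distance(a, b):
    """Levenshtein distance: minimum single-character inserts, deletes, or substitutions."""
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]
        for j, char_b in enumerate(b, start=1):
            cost = 0 if char_a == char_b else 1
            current.append(min(
                previous[j] + 1,         # delete char_a
                current[j - 1] + 1,      # insert char_b
                previous[j - 1] + cost,  # substitute or keep
            ))
        previous = current
    return previous[-1]

# Assumption: only parentheses are treated as noise in this sketch.
NOISE_CHARACTERS = "()"

def remove_noise(value, noise=NOISE_CHARACTERS):
    """Drop every character that appears in the noise set."""
    return "".join(ch for ch in value if ch not in noise)

raw_a = "Maverick (Aerospace) Ltd"
raw_b = "Maverick Aerospace Ltd"

print(edit_distance(raw_a, raw_b))                              # 2
print(edit_distance(remove_noise(raw_a), remove_noise(raw_b)))  # 0
```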
Discard: Closes the Match Key Options dialog box and discards any changes.
Save and Select: Saves any changes and the match key selection.