Configuring Datasource Pairs
Configuring Datasource Pairs
After selecting a matching strategy, the Match Configuration page displays a table of all possible datasource pair combinations. Each row represents a comparison that MatchLogic can perform. You control which pairs are active by checking or unchecking them.
Understanding the Pairs Table
The table contains the following columns:
- Row number — a sequential identifier for each pair
- Checkbox — check to enable the pair, uncheck to disable it
- Source name — the first datasource in the comparison
- Connection type arrow — a visual indicator showing the direction of comparison
- Target name — the second datasource in the comparison
Within-File vs. Between-File Pairs
There are two types of pairs you may see in the table:
- Within-file pairs — indicated by a
Selfbadge next to the datasource name. These compare records inside the same datasource to find internal duplicates. The source and target are the same datasource. - Between-file pairs — compare records from one datasource against records from a different datasource. These are used to find cross-source matches and overlapping records.
Tip
Look for the Self badge to quickly distinguish within-file deduplication pairs from cross-file comparison pairs.
Enabling and Disabling Pairs
Click the checkbox next to any pair to toggle it on or off. Enabled pairs will be included when you run the matching process. Disabled pairs are skipped entirely. This gives you fine-grained control over which comparisons are performed.
Common reasons to disable a pair:
- Two datasources contain completely unrelated data
- A within-file pair is unnecessary because the source is already deduplicated
- You want to reduce processing time by limiting comparisons
Coverage Requirements
MatchLogic requires that every imported datasource participates in at least one enabled pair. If any datasource is left out entirely, the Save & Continue button will be disabled. This safeguard prevents you from accidentally excluding a datasource from the matching process. See #pair-coverage-validation for more details.
Important
You must have at least one pair enabled for every datasource before you can proceed. Check the table carefully to ensure full coverage.
Saving Your Configuration
Once you are satisfied with your pair selections, click Save & Continue to persist the configuration and move on to #what-are-match-definitions. You can return to this page at any time to adjust pairs before running a match.