Configuring Datasource Pairs

Configuring Datasource Pairs

After selecting a matching strategy, the Match Configuration page displays a table of all possible datasource pair combinations. Each row represents a comparison that MatchLogic can perform. You control which pairs are active by checking or unchecking them.

Datasource pairs table showing row numbers, checkboxes, source names, connection type arrows, and target names with Self badges on within-file pairs

Close-up of the pairs table showing enabled and disabled checkboxes next to datasource pair rows, with the bidirectional arrow icon for within-file and directional arrow for between-file pairs

Understanding the Pairs Table

The table contains the following columns:

  1. Row number — a sequential identifier for each pair
  2. Checkbox — check to enable the pair, uncheck to disable it
  3. Source name — the first datasource in the comparison
  4. Connection type arrow — a visual indicator showing the direction of comparison
  5. Target name — the second datasource in the comparison

Within-File vs. Between-File Pairs

There are two types of pairs you may see in the table:

  • Within-file pairs — indicated by a Self badge next to the datasource name. These compare records inside the same datasource to find internal duplicates. The source and target are the same datasource.
  • Between-file pairs — compare records from one datasource against records from a different datasource. These are used to find cross-source matches and overlapping records.

Tip

Look for the Self badge to quickly distinguish within-file deduplication pairs from cross-file comparison pairs.

Enabling and Disabling Pairs

Click the checkbox next to any pair to toggle it on or off. Enabled pairs will be included when you run the matching process. Disabled pairs are skipped entirely. This gives you fine-grained control over which comparisons are performed.

Common reasons to disable a pair:

  • Two datasources contain completely unrelated data
  • A within-file pair is unnecessary because the source is already deduplicated
  • You want to reduce processing time by limiting comparisons

Coverage Requirements

MatchLogic requires that every imported datasource participates in at least one enabled pair. If any datasource is left out entirely, the Save & Continue button will be disabled. This safeguard prevents you from accidentally excluding a datasource from the matching process. See #pair-coverage-validation for more details.

Important

You must have at least one pair enabled for every datasource before you can proceed. Check the table carefully to ensure full coverage.

Saving Your Configuration

Once you are satisfied with your pair selections, click Save & Continue to persist the configuration and move on to #what-are-match-definitions. You can return to this page at any time to adjust pairs before running a match.