Create a Joiner configuration
A Joiner configuration defines how two datasets should be matched and what the merged output should contain.
The most important part of a Joiner setup is not the button sequence. It is deciding what relationship the two files are supposed to have.
Before you begin
Before creating the configuration, write down:
- The main dataset - Which file drives the output?
- The secondary dataset - Which file provides additional fields or comparison records?
- The match keys - Which Fields should connect the two Files?
- The output Fields - Which Fields should appear in the result?
- The unmatched behavior - What should happen when a record does not match?
If any of these are unclear, pause and confirm the business rule first.
Recommended approach
Build the configuration in this order:
- define the purpose of the join
- select the two datasets
- identify the key Fields
- choose the Fields to include in the output
- decide how to handle unmatched Records
- test with known matched and unmatched examples
- save with a clear name
Step 1: Define the business goal
State the goal in plain language.
Examples:
- Add product categories to the monthly sales file.
- Attach account owner details to open customer cases.
- Match vendor payments against the approved vendor list.
- Identify transactions that do not have a known customer account.
A vague goal like "merge these Files" usually hides important choices about keys, Fields, and unmatched Records.
Step 2: Choose the datasets and their roles
Decide which dataset is the primary input and which dataset is the secondary lookup or comparison input.
The primary dataset is usually the File whose Records you want to preserve, such as:
- transactions
- cases
- orders
- employees
- accounts
The secondary dataset usually provides context, such as:
- account details
- product attributes
- approved reference values
- location or owner metadata
Step 3: Select reliable match keys
Choose the Fields that should identify the same business entity in both Files.
Common match keys include:
- customer ID
- account number
- product code
- employee ID
- vendor ID
- location code
Use the fewest keys that correctly express the relationship. If one key is not enough, use a clear combination of fields.
Step 4: Choose output fields intentionally
Include the Fields needed for the next step, but avoid carrying unnecessary Fields into the result.
Ask:
- Which primary Fields must remain?
- Which secondary Fields add useful context?
- Are any similarly named Fields likely to confuse reviewers?
- Should temporary helper Fields be excluded from the final output?
Step 5: Plan for unmatched Records
Unmatched Records are often the most important part of a join review.
Before saving, decide:
- whether unmatched primary Records should remain in the output
- whether unmatched secondary Records should be reviewed
- which Fields will help explain why a match failed
- whether match issues should be fixed by cleaning, deduplicating, or correcting source data
Step 6: Test with representative examples
Use a sample that includes:
- Records that should match cleanly
- Records that should not match
- keys with blanks or unusual formatting
- cases where multiple Records might share the same key
Testing should prove both the successful matches and the exception handling.
Naming guidance
Name the configuration by business purpose.
Better examples:
- Add Product Category to Monthly Sales
- Attach Account Owners to Open Cases
- Match Payments to Approved Vendors
Weaker examples:
- Join 1
- Lookup Config
- Merge Test
Save checklist
Before saving, confirm:
- the two dataset roles are clear
- the match keys are correct
- the output Fields are intentional
- unmatched Records have a review path
- the setup has been tested with realistic examples
Next step
After the Configuration is ready, continue to Run Joiner.