Algorithm & Match Codes - Household
Matching Algorithm
The Household Matching Algorithm delivered as part of the initial configuration is designed to match Household entities. It is most relevant in a Business to Consumer (B2C) style implementation and consists of two normalizers and two matchers, with an Auto Threshold of 90.0 and a Clerical Review Threshold of 80.0. The Clerical Review Threshold is both higher and tighter to ensure that only customer records that are extremely likely to belong to the same household are considered.
Note that the Household Matching Algorithm uses the Link Golden Record solution engine rather than the Merge Golden Record approach. This ensures that individual customers who are determined to belong to the same household are linked to a separate household entity object rather than merged.
For details regarding the Household entity and its purpose, refer to the Data Modeling topic of this documentation.
For more information on configuring a Matching Algorithm, refer to the Configuring Matching Algorithms topic of the Matching, Linking, and Merging documentation.
Normalizers
Normalizers are used to standardize values that are being compared. This ensures equal formatting is applied, increasing the accuracy of the comparisons being made. For more information, refer to the Match Criteria Data Elements topic of the Matching, Linking, and Merging documentation.
Last Name Normalizer
The Last Name Normalizer uses the Words Normalizer, and only evaluates the Last Name attribute of an individual record. This is because households are referred to only by the Last Name, or household name.
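As a rough illustration of why normalizing the Last Name before comparison helps, the sketch below standardizes a few differently formatted spellings to one value. It is not the product's Words Normalizer: the function name `normalize_last_name` and the specific steps (accent stripping, punctuation removal, whitespace collapsing, upper-casing) are assumptions chosen for the example.

```python
import re
import unicodedata


def normalize_last_name(raw: str) -> str:
    """Illustrative normalization only; NOT the product's Words Normalizer."""
    # Decompose accented characters, then drop the combining marks.
    decomposed = unicodedata.normalize("NFKD", raw)
    unaccented = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    # Replace anything that is not a word character or whitespace with a space.
    cleaned = re.sub(r"[^\w\s]", " ", unaccented)
    # Collapse runs of whitespace and upper-case the result.
    return " ".join(cleaned.split()).upper()


if __name__ == "__main__":
    for name in ["  O'Brien-Smith ", "o brien smith", "Müller"]:
        print(repr(name), "->", normalize_last_name(name))
```

With these assumed steps, the first two inputs produce the same normalized value, so a matcher would compare them as equal.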
Address Normalizer
Because it is recommended to model addresses as data containers, configure the Address Normalizer to normalize data container attributes as defined within the Address component model.
Matchers
For general information on configuring Matchers, refer to the Match Criteria Matchers topic of the Matching, Linking, and Merging documentation.
Name Matcher
The Last Name Matcher is largely left with the default settings. An equivalent names look-up table is not utilized for Household entities since last names generally do not have shortened versions or a nickname alternative.
Address Matcher
The Machine Learning Matcher version 2.0 is used for address matching.
Rules
When considering match rules, the recommended strategy is to dissect the customer’s information into the smallest possible portions of data. These rules should not weigh the sum of all the customer’s input data, which is likely to be inefficient. Instead, the customer dataset requires careful analysis to determine which combinations of attributes present the best chance of uniqueness.
There is only one rule associated with the Household algorithm, which evaluates the scores of the Name and Address matchers. The resulting score is then weighed against the defined algorithm thresholds to determine whether the individual record should be created, auto-linked to an existing record, or sent for clerical review.
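The threshold comparison described above can be sketched as follows. The threshold values (90.0 and 80.0) come from this topic; the function name `classify`, the outcome labels, and the choice to treat a score exactly equal to a threshold as meeting it are assumptions made for the example.

```python
AUTO_THRESHOLD = 90.0             # from the initial configuration
CLERICAL_REVIEW_THRESHOLD = 80.0  # from the initial configuration


def classify(score: float) -> str:
    """Map a rule score to an outcome (labels and boundary handling are assumed)."""
    if score >= AUTO_THRESHOLD:
        return "auto-link"
    if score >= CLERICAL_REVIEW_THRESHOLD:
        return "clerical-review"
    return "create"


if __name__ == "__main__":
    for s in (95.0, 85.0, 60.0):
        print(s, "->", classify(s))
```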
The initial configuration matches on Last Name and Address for households. Matching on Last Names may be advantageous in identifying family units within a large population such as a retirement home or a school. However, in some cases a married couple within a household may not have the same Last Name. It may be beneficial to make Address weigh more in scenarios like this.
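To show the effect of making Address weigh more, the sketch below combines the two matcher scores as a weighted average. The function `household_score` and the weighted-average formula are assumptions for illustration; the rule in an actual configuration may combine scores differently.

```python
def household_score(name_score: float, address_score: float,
                    name_weight: float = 1.0, address_weight: float = 1.0) -> float:
    """Weighted average of two matcher scores (an assumed combination method)."""
    total_weight = name_weight + address_weight
    if total_weight <= 0:
        raise ValueError("weights must sum to a positive number")
    return (name_weight * name_score + address_weight * address_score) / total_weight


if __name__ == "__main__":
    # Same address, different last names (for example, a married couple).
    print(household_score(40.0, 100.0))            # equal weights
    print(household_score(40.0, 100.0, 1.0, 3.0))  # Address weighted three times as much
```

In this example, equal weights give 70.0, which falls below both thresholds, while weighting Address three times as heavily gives 85.0, which would reach the Clerical Review Threshold.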
Conversely, sometimes a large number of customer records with the same Address should not be considered the same household. This can include college campuses, retirement homes, and similar institutions. These scenarios should be handled on a case-by-case basis. However, one possible approach is to utilize an anonymous words table to anonymize such addresses, which eliminates the address from the matching logic for those records.
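The idea of excluding institutional addresses from matching can be sketched as below. This is only a conceptual stand-in for an anonymous words table: the set `ANONYMOUS_ADDRESSES`, its entries, and the function `address_for_matching` are hypothetical, and returning `None` is simply this example's way of signaling that the address should not contribute to the match.

```python
from typing import Optional

# Hypothetical entries; a real list would come from analysis of the customer dataset.
ANONYMOUS_ADDRESSES = {
    "100 CAMPUS DRIVE",
    "1 SUNSET RETIREMENT WAY",
}


def address_for_matching(address: str) -> Optional[str]:
    """Return the address to compare, or None if it should be left out of matching."""
    key = " ".join(address.split()).upper()
    return None if key in ANONYMOUS_ADDRESSES else key


if __name__ == "__main__":
    print(address_for_matching(" 100 campus  drive "))  # excluded
    print(address_for_matching("12 Elm St"))            # kept
```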
Survivorship
The following survivorship rules are used by the Household Algorithm:
- Value: Most Recent
  - Attribute / Attribute Group: Last Name
  - Last Edit Date Attribute: Last Edit Date - Record
- Data Container: Most Recent
  - Business Condition: DataContainer Survivorship Address
  - Data Container Type: Main Address
  - Last Edit Date Attribute: Last Edit Date - Main Address
Note: Data Containers require their own survivorship rules. Additionally, each Survivorship rule requires a unique Last Edit Date attribute.
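Conceptually, a Most Recent rule keeps the candidate whose Last Edit Date is latest. The sketch below illustrates that selection with a hypothetical helper, `most_recent`, operating on (value, last edit date) pairs; skipping candidates with a missing value or date is an assumption of the example, not documented product behavior.

```python
from datetime import date
from typing import Optional, Sequence, Tuple


def most_recent(candidates: Sequence[Tuple[Optional[str], Optional[date]]]) -> Optional[str]:
    """Return the value with the latest last-edit date, or None if none qualify."""
    # Assumption: candidates lacking a value or a date are ignored.
    dated = [(v, d) for v, d in candidates if v is not None and d is not None]
    if not dated:
        return None
    return max(dated, key=lambda pair: pair[1])[0]


if __name__ == "__main__":
    last_names = [
        ("Smith", date(2020, 1, 1)),
        ("Smyth", date(2022, 5, 3)),
        (None, date(2023, 1, 1)),
    ]
    print(most_recent(last_names))
```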
Match Codes
The Household Match Code is generated on the Individual Customer entity type along with the Individual Match Codes. Since a household is identified by its last name, the Household Match Code is composed of the zip code followed by the Metaphone 3 representation of the individual’s last name.
The Household Match Code contains the prefix 'ZILNM~'.
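The composition of the match code can be sketched as follows. Only the prefix 'ZILNM~' and the ordering (zip code, then phonetic last name) come from this topic. The function `household_match_code`, the absence of any separator between the zip code and the phonetic part, and the `placeholder_phonetic` encoder are assumptions. In particular, `placeholder_phonetic` is NOT Metaphone 3; it merely upper-cases the letters so the example can run without a Metaphone 3 implementation.

```python
from typing import Callable

PREFIX = "ZILNM~"  # prefix stated in this topic


def placeholder_phonetic(name: str) -> str:
    """Stand-in encoder: keeps letters and upper-cases them. NOT Metaphone 3."""
    return "".join(ch for ch in name if ch.isalpha()).upper()


def household_match_code(zip_code: str, last_name: str,
                         phonetic: Callable[[str], str]) -> str:
    """Build prefix + zip code + phonetic last name (no separator is assumed)."""
    return f"{PREFIX}{zip_code.strip()}{phonetic(last_name.strip())}"


if __name__ == "__main__":
    print(household_match_code("30301", "Smith", placeholder_phonetic))
```

Passing the encoder as a parameter keeps the sketch independent of any particular phonetic library; a real Metaphone 3 encoder would produce a different code for the last name portion.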