Data Element: Address Normalizer v1 (superseded)

The 'Address Normalizer v1 (superseded)' produces a normalized set of addresses for use in address matching.

Important: The Address Normalizer v1 was superseded in the 2024.4 update by the Address Normalizer v2, which supports the Machine Learning Matcher for address matching. This matcher uses a pre-trained machine learning model to match addresses and provides substantially more accurate scores. The v1 normalizer is still available for on-premises systems, but we recommend that users of SaaS v2 systems transition to the Address Normalizer v2.

For details, refer to the topics Matcher: Machine Learning Matcher and Data Element: Address Normalizer v2 in the Matching, Linking, and Merging documentation.

Prerequisites

Configure the Address Component Model (defined in the Address Component Model topic of the Data Integration documentation).

Input

The following data is provided by the address element attributes mapped in the Address Component Model. The address object is populated from the input attribute values unless the Standardized Zip is set, in which case it is populated from the standardized attributes.

Input Attributes

  • Input City

  • Input Country

  • Input State

  • Input Street

  • Input Zip

Standardized Attributes

  • Standardized City

  • Standardized Country

  • Standardized Country ISO Code

  • Standardized State

  • Standardized Street

  • Standardized Zip
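The choice between input and standardized attributes described above can be sketched as follows. This is a minimal illustration, not the product's implementation: the class AddressSourceSketch, the method selectSource, and the use of a plain map keyed by attribute name are all hypothetical, and "set" is assumed to mean "not null and not blank".

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical sketch: picks which attribute group populates the address object.
final class AddressSourceSketch {
    static final String[] FIELDS = {"City", "Country", "State", "Street", "Zip"};

    static Map<String, String> selectSource(Map<String, String> attrs) {
        // Assumption: "set" means the Standardized Zip value is neither null nor blank.
        String stdZip = attrs.get("Standardized Zip");
        boolean useStandardized = stdZip != null && !stdZip.trim().isEmpty();
        String prefix = useStandardized ? "Standardized " : "Input ";

        Map<String, String> address = new LinkedHashMap<>();
        for (String field : FIELDS) {
            address.put(field, attrs.get(prefix + field));
        }
        // The ISO code exists only in the standardized attribute group.
        if (useStandardized) {
            address.put("Country ISO Code", attrs.get("Standardized Country ISO Code"));
        }
        return address;
    }
}
```

With only input attributes present, the sketch returns the input values. Once a Standardized Zip value is present, every field is taken from the standardized group instead.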

When configuring the data element, the Input Parameters field allows selection of:

  1. 'Use Attribute on Object' – by default, this option is set to 'True', which means attributes are read from the object itself. Click the Value dropdown to manually set it to 'False' when using information from a Data Container or an Input Normalizer.

  2. 'Data Container' – read attributes from the data container.

  3. 'Input Normalizer' – read outputs from the selected Match Expression, as defined in the topic Matching Algorithms and Match Expressions.

Output

The output of the Address Normalizer is an object of the class: java.util.Set<com.stibo.partydatamatching.domain.address.Address>

For more information on the contents of the class, refer to the Technical Documentation on the STEP Start Page and review the documents linked from within the Scripting API section.

Functionality

The Address Normalizer v1 (superseded) automatically makes the following modifications to the address, in the order listed. The modifications are made for comparison purposes only:

  1. All elements – remove leading and trailing white space

  2. Country – lower-case text

  3. Country ISO – remove characters other than Latin letters and numbers

  4. Region – lower-case text

  5. City – lower-case text

  6. Postal code – remove all spaces, remove dash (-) characters, lower-case text

  7. Street – lower-case text
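The steps above can be sketched in Java as follows. This is a minimal illustration under stated assumptions, not the normalizer's actual code: SimpleAddress is a hypothetical stand-in for the com.stibo.partydatamatching Address class, null values are assumed to become empty strings, "Latin letters" is assumed to mean A-Z and a-z, and "dash" is assumed to mean the ASCII hyphen-minus.

```java
import java.util.Locale;

// Hypothetical stand-in for an address; not the com.stibo.partydatamatching Address class.
final class SimpleAddress {
    final String country, countryIso, region, city, postalCode, street;

    SimpleAddress(String country, String countryIso, String region,
                  String city, String postalCode, String street) {
        this.country = country;
        this.countryIso = countryIso;
        this.region = region;
        this.city = city;
        this.postalCode = postalCode;
        this.street = street;
    }
}

final class AddressNormalizerSketch {
    // Step 1: remove leading and trailing white space (assumption: null becomes "").
    static String trim(String s) {
        return s == null ? "" : s.trim();
    }

    // Steps 2, 4, 5, 7: lower-case text.
    static String lower(String s) {
        return trim(s).toLowerCase(Locale.ROOT);
    }

    // Step 3: keep only Latin letters and digits (assumption: ASCII range only).
    static String isoCode(String s) {
        return trim(s).replaceAll("[^A-Za-z0-9]", "");
    }

    // Step 6: remove spaces and dash characters, then lower-case.
    static String postalCode(String s) {
        return trim(s).replaceAll("[ -]", "").toLowerCase(Locale.ROOT);
    }

    // Returns a new normalized copy; the original address is left unchanged.
    static SimpleAddress normalize(SimpleAddress a) {
        return new SimpleAddress(
                lower(a.country),
                isoCode(a.countryIso),
                lower(a.region),
                lower(a.city),
                postalCode(a.postalCode),
                lower(a.street));
    }
}
```

The sketch returns a new object rather than changing its input, in keeping with the statement that the modifications are for comparison purposes only.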

Because address information varies between systems and countries, it is sometimes necessary to chain address normalizers. For an example of adding a custom address normalizer business function that further normalizes the address after the standard normalizer runs, refer to the topic Data Element: Business Function Normalizer.
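The idea of chaining can be sketched in plain Java as function composition. This is a hypothetical illustration only: it does not use the business function API, and the custom step (collapsing repeated internal white space in the street) is an invented example of further normalization that the standard steps do not perform.

```java
import java.util.Locale;
import java.util.function.Function;

// Hypothetical sketch: a custom step applied after a standard street normalization.
final class ChainedNormalizerSketch {
    // Stand-in for the standard street handling: trim, then lower-case.
    static final Function<String, String> STANDARD_STREET =
            s -> s.trim().toLowerCase(Locale.ROOT);

    // Invented custom step: collapse runs of white space into a single space.
    static final Function<String, String> COLLAPSE_SPACES =
            s -> s.replaceAll("\\s+", " ");

    // The custom step runs on the output of the standard step.
    static final Function<String, String> CHAINED =
            STANDARD_STREET.andThen(COLLAPSE_SPACES);
}
```

Because the custom step receives already trimmed, lower-cased text, it only needs to handle the cases the standard step leaves untouched.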

Configuring an Address Normalizer Data Element

After adding the Address Normalizer in the Data Elements flipper of the Decision Table dialog (defined in the Match Criteria topic), configure it as follows:

  1. Click into the Data Elements column and click the ellipsis button to access the configuration dialog.

  2. On the Address Normalizer dialog:

    • For the Input Parameters, define the source of the data to be normalized. Refer to the Input section above for details.

      Right-click the ellipsis button in the first column of the Input Parameters table for additional display and edit options. Although it appears that the default 'Use Attribute On Object' parameter can be removed, it is displayed again after the dialog is closed. Instead, if a different input parameter is used, click the Value dropdown and manually set the 'Use Attribute On Object' option to 'False'.

      Click the Add Input Parameter link to add other input parameters.

  3. To test the configuration, use the Select Nodes parameters:

    • Click the item picker button for each field and select two objects for comparison.

    • Click the Evaluate button.

      An empty result field indicates the value is not available in the selected node. Adjust as indicated by the Evaluator results and repeat the evaluation.

  4. Click OK to save and display the configuration in the Data Elements flipper. Click into a Comment cell to add relevant information as desired.