SOLEADIFY FUZZY MATCH SYNTH

Description

The SOLEADIFY FUZZY MATCH SYNTH processor performs fuzzy matching of company data during the consolidation phase of data processing. It compares attributes such as company name, address, and website, so that entries from different sources that are similar but not identical can be identified and consolidated.


Config Location

To configure the SOLEADIFY FUZZY MATCH SYNTH processor, define it in the consDPProcessors section of a data point in your schema, specifying the processor name and entity.


Supported Data Types

  • String

Config Requirements

A config ({}) block is required for the SOLEADIFY FUZZY MATCH SYNTH processor. If the configuration is missing or improperly set up, a block violation will be thrown. For more information on Apiro violations and how they appear in the Apiro UI or logs, refer to the violations section.


Example Config

Below is an example of how to configure the SOLEADIFY FUZZY MATCH SYNTH processor in XML format:

<dataPoint name="ADDRESS" dataType="STRING">
    <consDPProcessors>
        <consDPProcessor name="MATCH_SOLEADIFY_DATA" entity="SOLEADIFY_FUZZY_MATCH_SYNTH">
            <config>
                <![CDATA[
                    {
                        "schema":"COMPANY_DATA",
                        "companyName":"ABC Corporation",
                        "address":"18 John Street",
                        "website":"website.com.au",
                        "synthSrcDD":"",
                        "copyToDD":"Company_Name",
                        "copyFromDD":"NAME",
                        "fuzzyInfoDD":"Company_Address",
                        "apiKey":"key12345"
                    }
                ]]>
            </config>
        </consDPProcessor>
    </consDPProcessors>
</dataPoint>

Example Result

Upon successful configuration and execution, the SOLEADIFY FUZZY MATCH SYNTH processor applies fuzzy matching to the "ADDRESS" data point, matching company data on the configured attributes (company name, address, and website). Similar but not identical company entries can then be identified and consolidated.
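
The matching algorithm used by the processor is not described on this page. As a rough illustration of what "similar but not identical" means, the sketch below scores two strings with difflib.SequenceMatcher from Python's standard library. This is a stand-in technique, not the processor's own method, and the helper names and the 0.8 threshold are hypothetical.

```python
from difflib import SequenceMatcher


def normalise(value):
    """Lower-case and collapse whitespace so trivial differences are ignored."""
    return " ".join(value.lower().split())


def similarity(a, b):
    """Return a similarity ratio between 0.0 and 1.0 for two strings."""
    return SequenceMatcher(None, normalise(a), normalise(b)).ratio()


def is_fuzzy_match(a, b, threshold=0.8):
    """Treat two strings as the same entry when their ratio meets the threshold.

    The default threshold is a hypothetical cut-off chosen for illustration.
    """
    return similarity(a, b) >= threshold


if __name__ == "__main__":
    pairs = [
        ("18 John Street", "18 John St"),
        ("ABC Corporation", "XYZ Holdings"),
    ]
    for left, right in pairs:
        print(left, "|", right, "|", is_fuzzy_match(left, right))
```

A ratio-based score like this tolerates abbreviations and small typos, but the right threshold depends on the data being matched.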


Config Parameters

name | acceptable values | comment
schema | A string representing the schema or structure for the company data. | This defines the schema for the company data.
companyName | A string representing the company name to be matched. | This is the company name used for matching.
address | A string representing the address to be matched. | This is the address used for matching.
website | A string representing the website to be matched. | This is the website used for matching.
synthSrcDD | A string representing the data dictionary for synthetic source data. | This is used for storing and retrieving synthetic source data.
copyToDD | A string representing the data dictionary to copy matched data to. | This defines the destination for matched data.
copyFromDD | A string representing the data dictionary to copy data from. | This defines the source of data for the match process.
fuzzyInfoDD | A string representing the data dictionary for fuzzy matching information. | This is used for storing and retrieving fuzzy matching data.
apiKey | A string representing the API key for accessing the fuzzy matching service. | This is used for authentication with the fuzzy matching service.

Common Mistakes

  • Check that the processor name and entity are correctly defined, so that the processor is identified and applied during the data processing pipeline.
  • Verify that the configuration parameters within the <config> tag are correctly written and match the expected format and values.
  • Remember that configuration for the SOLEADIFY FUZZY MATCH SYNTH processor is required. If the configuration is missing or improper, a block violation will be thrown.
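
Some of these checks can be automated before a schema is deployed. The following is a minimal sketch, not an Apiro tool: it parses a dataPoint snippet with Python's standard library and reports a missing name or entity, a missing or empty config, invalid JSON, and absent keys. The function name check_data_point is hypothetical, and treating every key in the Config Parameters table as mandatory is an assumption.

```python
import json
import xml.etree.ElementTree as ET

# Keys taken from the Config Parameters table. Treating all of them as
# mandatory is an assumption of this sketch.
EXPECTED_KEYS = {
    "schema", "companyName", "address", "website", "synthSrcDD",
    "copyToDD", "copyFromDD", "fuzzyInfoDD", "apiKey",
}
ENTITY = "SOLEADIFY_FUZZY_MATCH_SYNTH"


def check_data_point(xml_text):
    """Return a list of human-readable problems found in a dataPoint snippet."""
    try:
        root = ET.fromstring(xml_text)
    except ET.ParseError as exc:
        return [f"XML is not well-formed ({exc})"]

    processors = [
        p for p in root.iter("consDPProcessor") if p.get("entity") == ENTITY
    ]
    if not processors:
        return [f"no consDPProcessor with entity {ENTITY}"]

    problems = []
    for proc in processors:
        label = proc.get("name") or "<unnamed>"
        if not proc.get("name"):
            problems.append("processor is missing its name attribute")

        # CDATA content is exposed as ordinary element text by ElementTree.
        config = proc.find("config")
        if config is None or not (config.text or "").strip():
            problems.append(f"{label}: <config> is missing or empty")
            continue

        try:
            parsed = json.loads(config.text)
        except json.JSONDecodeError as exc:
            problems.append(f"{label}: config is not valid JSON ({exc.msg})")
            continue

        if not isinstance(parsed, dict):
            problems.append(f"{label}: config must be a JSON object")
            continue

        missing = sorted(EXPECTED_KEYS - set(parsed))
        if missing:
            problems.append(f"{label}: missing keys {missing}")
    return problems
```

An empty list means the snippet passed these checks. It does not guarantee that Apiro will accept the configuration.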