SOLEADIFY FUZZY MATCH SYNTH
Description
The SOLEADIFY FUZZY MATCH SYNTH processor is designed to perform fuzzy matching of company data during the consolidation phase of data processing. This processor leverages advanced fuzzy matching algorithms to identify and match company data based on various attributes such as company name, address, and website. It is particularly useful for reconciling and matching company data from different sources, ensuring that similar but not identical entries can be identified and consolidated.
Config Location
To configure the SOLEADIFY FUZZY MATCH SYNTH processor, you need to define it within the consDPProcessors
section of your data point configuration within the schema. This involves specifying the processor name and entity.
Supported Data Types
- String
Config Requirements
Config (
Example Config
Below is an example of how to configure the SOLEADIFY FUZZY MATCH SYNTH processor in XML format:
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 |
|
Example Result
Upon successful configuration and execution, the SOLEADIFY FUZZY MATCH SYNTH processor will apply fuzzy matching algorithms to the "ADDRESS" data point, identifying and matching company data based on the specified attributes. This allows for the consolidation of similar but not identical company entries, ensuring that the data is accurately and precisely aligned according to the specified matching logic.
Config Parameters
name | acceptable values | comment |
---|---|---|
schema | A string representing the schema or structure for the company data. | This defines the schema for the company data. |
companyName | A string representing the company name to be matched. | This is the company name used for matching. |
address | A string representing the address to be matched. | This is the address used for matching. |
website | A string representing the website to be matched. | This is the website used for matching. |
synthSrcDD | A string representing the data dictionary for synthetic source data. | This is used for storing and retrieving synthetic source data. |
copyToDD | A string representing the data dictionary to copy matched data to. | This defines the destination for matched data. |
copyFromDD | A string representing the data dictionary to copy data from. | This defines the source of data for the match process. |
fuzzyInfoDD | A string representing the data dictionary for fuzzy matching information. | This is used for storing and retrieving fuzzy matching data. |
apiKey | A string representing the API key for accessing the fuzzy matching service. | This is used for authentication with the fuzzy matching service. |
Common Mistakes
- Ensure that the processor name and entity are correctly defined in the configuration to ensure that the processor is correctly identified and applied during the data processing pipeline.
- Verify that the configuration parameters within the
<config>
tag are correctly written and match the expected format and values. - Remember that the configuration for the SOLEADIFY FUZZY MATCH SYNTH processor is required. If no configuration, or improper configuration is provided then a block violation will be thrown.