Require a component based application to perform accuracy matching (for addresses, occupation, contacts) between two data sources.
Given a threshold (e.g. 70%) attempt to match between the two data sources. Break down sentence to word tokens, for each token, match against words in the other data source. If more than 70% match, then information is verified. First source contain one record, second source contain up to three records. E.g. customer has provided his home address, system will match up to three other addresses logged from other external systems.
Issues the program needs to tackle:
a. Variation of spellings - requires phonetic algorithms such as Double Metaphone, Levenshtein Distance.
b. Translation or associative words (same meaning or a translation from another written language) - requires a data dictionary to be built that can be further enhanced either via human manual input or auto system feedback (learning).
c. Sequencing of words
d. Filtering of insignificant elements such as symbols, numbers.
Requires an audit trace so we know which area system has matched/not matched. Preferred if you can do this in VB.
## Deliverables
1) Complete and fully-functional working program(s) in executable form as well as complete source code of all work done.
2) Deliverables must be in ready-to-run condition, as follows (depending on the nature of the deliverables):
a) For web sites or other server-side deliverables intended to only ever exist in one place in the Buyer's environment--Deliverables must be installed by the Seller in ready-to-run condition in the Buyer's environment.
b) For all others including desktop software or software the buyer intends to distribute: A software installation package that will install the software in ready-to-run condition on the platform(s) specified in this bid request.
3) All deliverables will be considered "work made for hire" under U.S. Copyright law. Buyer will receive exclusive and complete copyrights to all work purchased. (No GPL, GNU, 3rd party components, etc. unless all copyright ramifications are explained AND AGREED TO by the buyer on the site per the coder's Seller Legal Agreement).
## Platform
VB, SQL Server, Windows.