I am undertaking a project which requires the ability to perform
phonetic searching across business and personal names and addresses.
To date this data is largely uncleansed, and existing data analysis
shows that the amount of work needed to cleanse it is tremendous. My
thinking is that with the right tools we should be able to perform
matching and searching across the data without implementing
heavy-duty cleansing prior to the search. When I talk about
uncleansed data in terms of names, I am talking about things like the
following:
**Note the variations in the abbreviation of Inc. The same thing
occurs with other words such as Company, Association, Limited, DBA
and others not yet identified.
**Note the variations in the phonetic spelling of words.
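To make the two kinds of variation concrete, here is a minimal Python
sketch of one possible approach: fold suffix abbreviations to a
canonical form, then compare a phonetic key per word. The suffix table,
the helper names (normalize_name, match_key) and the sample names are
all hypothetical and not taken from my data, and the Soundex shown is
the standard American variant, which may or may not be what any
particular product uses.

```python
import re

# Hypothetical suffix table; a real one would be built from the data.
SUFFIXES = {
    "INCORPORATED": "INC", "INC": "INC",
    "COMPANY": "CO", "CO": "CO",
    "ASSOCIATION": "ASSN", "ASSOC": "ASSN", "ASSN": "ASSN",
    "LIMITED": "LTD", "LTD": "LTD",
    "DBA": "DBA",
}

# Standard American Soundex letter groups.
_CODES = {}
for digit, group in {"1": "BFPV", "2": "CGJKQSXZ", "3": "DT",
                     "4": "L", "5": "MN", "6": "R"}.items():
    for ch in group:
        _CODES[ch] = digit


def soundex(word):
    """Return the 4-character Soundex code, or "" if there are no letters."""
    letters = [c for c in word.upper() if "A" <= c <= "Z"]
    if not letters:
        return ""
    result = letters[0]
    prev = _CODES.get(letters[0], "")
    for c in letters[1:]:
        if c in "HW":
            continue  # H and W do not separate letters with the same code
        code = _CODES.get(c, "")
        if code and code != prev:
            result += code
        prev = code  # a vowel resets prev, so a repeated code counts again
    return (result + "000")[:4]


def normalize_name(name):
    """Upper-case, strip punctuation, and fold known suffix variants."""
    text = re.sub(r"[./]", "", name.upper())    # "Inc." -> "INC", "d/b/a" -> "DBA"
    text = re.sub(r"[^A-Z0-9]+", " ", text)     # other punctuation becomes a space
    return " ".join(SUFFIXES.get(tok, tok) for tok in text.split())


def match_key(name):
    """Phonetic key: one Soundex code per normalized word."""
    tokens = normalize_name(name).split()
    return " ".join(soundex(tok) or tok for tok in tokens)


if __name__ == "__main__":
    for sample in ["Smith Company", "Smyth Co.", "Acme, Incorporated", "ACME INC"]:
        print(sample, "->", normalize_name(sample), "|", match_key(sample))
```

With this sketch, "Smith Company" and "Smyth Co." produce the same key,
as do "Acme, Incorporated" and "ACME INC". SQL Server also ships
built-in SOUNDEX and DIFFERENCE functions, so the phonetic half could
in principle be tried directly in T-SQL.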
Ultimately, what I am wondering is how much success people have had
with fuzzy searches like this using solely Microsoft's Full-Text
Indexing. I would also be interested to know whether SQL Server 2005
improves on the capabilities for fuzzy matching. In addition, I found
some information regarding SQL Turbo http://www.yqcomputer.com/
which seems to be an extender or a different implementation of FTS
that does support phonetic matching, and I would be interested to know
if anyone has used this tool as well. Any thoughts on how to approach
this matter would be appreciated.