C# · 16mo ago
Foxtrek_64

❔ Recommendations for Fuzzy Equation algorithm

Hi all, I'm looking for recommendations on NuGet packages or LINQ queries I could use to implement a fuzzy equality (fuzzy matching) algorithm. In my case, we are comparing company names to find a match, but because the names are entered by hand, they are inconsistent. For example, you could have "Contoso," "Contoso LLC," and "Contoso llc." We also get spelling errors because some names are typed by someone listening over the phone, so "Cantoso" is a possibility. Because we use this comparison for verification purposes, it needs to be tunable so that we can eliminate false positives. AI is also a potential candidate, but that sounds like a huge can of worms I don't want to get into. I don't mind rolling my own, but ideally I would like to use something actively maintained since this is not my area of expertise.
5 Replies
Foxtrek_64 · 16mo ago
Oh, and it needs to support NetFx 4. I hate doing new development on a legacy framework, but it is what it is.
I may be able to use FuzzySharp. I need to double-check whether I'm using 4.6.1 or 4.6.2.
Actually, it has direct Net46 and Net461 support.
Anton · 16mo ago
In the simplest form, I'd do a diff against each element.
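One way to read "a diff against each element" is an edit-distance comparison. Below is a minimal sketch, assuming Levenshtein distance as the diff and a made-up list of legal suffixes to strip before comparing. `NameMatcher`, `LegalSuffixes`, and the threshold parameter are hypothetical names for illustration, not part of any library, and it sticks to plain LINQ so it should build on .NET Framework 4.6.x.

```csharp
using System;
using System.Collections.Generic;
using System.Linq;

public static class NameMatcher
{
    // Hypothetical suffix list; extend it for your own data.
    private static readonly HashSet<string> LegalSuffixes =
        new HashSet<string> { "llc", "inc", "ltd", "corp", "co" };

    // Lowercase, drop periods, turn other punctuation into spaces,
    // and remove any token that is a known legal suffix.
    public static string Normalize(string name)
    {
        var chars = name.ToLowerInvariant().Replace(".", "")
            .Select(c => char.IsLetterOrDigit(c) ? c : ' ')
            .ToArray();
        var tokens = new string(chars)
            .Split(new[] { ' ' }, StringSplitOptions.RemoveEmptyEntries)
            .Where(t => !LegalSuffixes.Contains(t));
        return string.Join(" ", tokens);
    }

    // Classic two-row dynamic-programming Levenshtein distance.
    public static int Levenshtein(string a, string b)
    {
        var prev = new int[b.Length + 1];
        var curr = new int[b.Length + 1];
        for (int j = 0; j <= b.Length; j++) prev[j] = j;

        for (int i = 1; i <= a.Length; i++)
        {
            curr[0] = i;
            for (int j = 1; j <= b.Length; j++)
            {
                int cost = a[i - 1] == b[j - 1] ? 0 : 1;
                curr[j] = Math.Min(
                    Math.Min(curr[j - 1] + 1, prev[j] + 1),
                    prev[j - 1] + cost);
            }
            var tmp = prev; prev = curr; curr = tmp;
        }
        return prev[b.Length];
    }

    // Score in [0, 1]; 1 means identical after normalization.
    // Empty normalized names score 0 so they never count as a match.
    public static double Similarity(string a, string b)
    {
        string x = Normalize(a), y = Normalize(b);
        if (x.Length == 0 || y.Length == 0) return 0.0;
        int maxLen = Math.Max(x.Length, y.Length);
        return 1.0 - (double)Levenshtein(x, y) / maxLen;
    }

    // Returns the highest-scoring candidate, or null if none reaches the threshold.
    public static string BestMatch(string query, IEnumerable<string> candidates, double threshold)
    {
        var best = candidates
            .Select(c => new { Name = c, Score = Similarity(query, c) })
            .OrderByDescending(x => x.Score)
            .FirstOrDefault();
        return best != null && best.Score >= threshold ? best.Name : null;
    }
}
```

Something like `NameMatcher.BestMatch("Cantoso", names, 0.8)` would then pick "Contoso LLC" out of a list, while raising the threshold toward 1.0 trades recall for fewer false positives. The right cutoff is something you'd have to tune against your own data.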
jcotton42 · 16mo ago
If this is in a database, see if the engine has support for this. In Postgres there's citext, nondeterministic collations, and trigrams.
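The trigram idea can also be approximated in memory when the data never reaches a database. This is a rough sketch loosely modelled on how Postgres trigram matching is usually described (pad each word, collect three-character windows, divide shared trigrams by total distinct trigrams). `TrigramSimilarity` is a hypothetical helper, and its scores should not be expected to match Postgres exactly.

```csharp
using System;
using System.Collections.Generic;
using System.Linq;

public static class TrigramSimilarity
{
    // Split into lowercase words, pad each with two leading spaces and one
    // trailing space, and collect every distinct 3-character window.
    public static HashSet<string> Trigrams(string text)
    {
        var result = new HashSet<string>();
        var words = text.ToLowerInvariant()
            .Split(new[] { ' ' }, StringSplitOptions.RemoveEmptyEntries);
        foreach (var word in words)
        {
            string padded = "  " + word + " ";
            for (int i = 0; i + 3 <= padded.Length; i++)
                result.Add(padded.Substring(i, 3));
        }
        return result;
    }

    // Shared trigrams divided by distinct trigrams in either string (Jaccard index).
    // Returns 0 when either input produces no trigrams.
    public static double Similarity(string a, string b)
    {
        var x = Trigrams(a);
        var y = Trigrams(b);
        if (x.Count == 0 || y.Count == 0) return 0.0;
        int shared = x.Count(t => y.Contains(t));
        return (double)shared / (x.Count + y.Count - shared);
    }
}
```

With this sketch, "Contoso" and "Cantoso" share 5 of 11 distinct trigrams, so a single-letter typo in a short name already drops the score to roughly 0.45. Punctuation is not stripped here, so "llc." and "llc" produce different trigrams unless the names are cleaned first.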
Foxtrek_64 · 16mo ago
Unfortunately, "database" is relative. On one side is Salesforce; on the other is data scraped from a webpage.