❔ Recommendations for Fuzzy Equation algorithm
Hi all, I'm looking for recommendations on nuget packages or Linq queries I could use to accomplish a fuzzy equation algorithm.
In my case, are comparing company names to find a match, but because of human entry, the names are inconsistent. For example, you could have "Contoso," "Contoso LLC," and "Contoso llc." And sometimes we have some spelling errors because they're typed by someone listening over the phone. So "Cantoso" could be a possibility, for instance.
Because we use this comparison for verification purposes, we need it to be able to be tuned to eliminate false positives. AI is also a potential candidate but that sounds like a huge can of worms I don't want to get into.
I don't mind rolling my own but ideally I would like to use something actively maintained since this is not an area of expertise.
5 Replies
Oh, and it needs to support NetFx 4. I hate doing new development on a legacy framework but it is how it is
I may be able to use FuzzySharp. I need to double check and see if I'm using 4.6.1 or 4.6.2
Actually it has direct Net46 and Net461 support
In the simplest form, I'd do a diff against each element.
if this is in a database, see if the engine has support for this
in Postgres there's citext, nondeterministic collations, and trigrams
Unfortunately "database" is relative. On one side is Salesforce, on the other is data scraped from a webpage
Was this issue resolved? If so, run
/close
- otherwise I will mark this as stale and this post will be archived until there is new activity.