Algorithm to match short text strings (equivalent product names from disparate sources)

Cerrado Publicado Jun 20, 2013 Pagado a la entrega
Cerrado Pagado a la entrega

We have an existing MS SQL database which is populated with products retrieved from various sources (portals).

The various portals usually have slightly different names for the products.

E.G

Portal 1:

PortalProduct 1: CocaCola

PortalProduct 2: Manchester United

PortalProduct 3: Maplin Electronics

Portal 2:

PortalProduct 1: Coca-Cola

PortalProduct 2: Man Utd

PortalProduct 3: Maplin Elec Ltd.

- We need to correlate these PortalProducts to a central list of Products.

- One Product can have 1-to-many PortalProducts

- The database is populated with around 10,000 PortalProducts

- The database is not populated with any Products. Products need to be added for each new PortalProduct that doesn't match an existing Product.

This must be implemented as a C# console application so that a) we can see the results when the correlate is run and b) so that we can ultimately run it on a task-schedule.

The developer must develop a text matching algorithm to correlate PortalProduct names with Product names (and create a product where there is no match). The emphasis is on the accuracy of the text matching algorithm. Speed is less important than [login to view URL] developer should have a mathematical mind and preferably has experience in developing mathematical text matching algorithms.

Examples of mathematical text matching algorithms that could be used (and potentially refined and combined) to meet our requirements are:

String metric

Locality-sensitive hashing

Needleman–Wunsch algorithm

Smith–Waterman algorithm

Levenshtein distance

Concept Search

Approximate matching with addition of regular expressions ability

Regular expressions for non-fuzzy (exact) matching

Metaphone

Soundex

Agrep

Plagiarism detection

[login to view URL]

Ideally the developer will already understand and have used some of these before and able to pick the best algorithm(s) for this application.

Algoritmos Programación en C# Matemáticas

Nº del proyecto: #4645513

Sobre el proyecto

19 propuestas Proyecto remoto Activo Jul 27, 2013

19 freelancers están ofertando un promedio de £190 por este trabajo

samitXI

Please check your inbox. Thanks

£250 GBP en 7 días
(154 comentarios)
6.8
feninsoftech

Hi, I have done similar task to map different securities. It is more easy to match the mapping in SQL Server database instead of looping in C#. I have 8.5 years experience in C# and SQL Server programming.

£244 GBP en 15 días
(30 comentarios)
6.5
ANeeK181

I can help you.

£166 GBP en 6 días
(21 comentarios)
4.4
murtaza1981

Hi, I (Myself a senior ASP.NET programmer using C# and VB) have a team having experience of more than 5 years in developing web and mobile applications. I have a team of expert programmers who write robust and bug-fre Más

£200 GBP en 5 días
(8 comentarios)
4.0
studioprog

Hi, I have 5+ years experience in C# and algorithms. I can do it.

£111 GBP en 3 días
(12 comentarios)
4.2
diegofpr

Hello, I am a professional developer with more than 14 years in IT and more than 9 years in c#. I also have a bachelor's degree in computer analysis, with only 3 courses pending to the master's degree, in the Buenos Más

£370 GBP en 20 días
(11 comentarios)
4.0
eried

Hi, I can help

£222 GBP en 20 días
(10 comentarios)
4.0
thanhhungqb

Dear sir, please refer to pmb for more details. Thanks.

£263 GBP en 7 días
(10 comentarios)
3.1
Jraml

Please read my private message. Thanks.

£111 GBP en 5 días
(4 comentarios)
3.2
shehzadwork

I have done work in C#, SQL and applied the concepts in different projects and can do this project for you.

£100 GBP en 4 días
(1 comentario)
2.0
anshul31

Hi, I can do this. Please check PMB. Thanks.

£88 GBP en 2 días
(1 comentario)
1.7
mauricioferreira

I have 3+ years experience working with c# and NLU algorithms.

£177 GBP en 3 días
(0 comentarios)
0.0
SharpEcho

Please check my PM.

£388 GBP en 9 días
(0 comentarios)
0.0
jorcrobe

Hi there, I have developed variations of the Levenshtein string matching algorithm, and I can send you examples of code (.cpp). I quite enjoy this algorithm and I would like to tailor it to your needs, and I can do so Más

£150 GBP en 3 días
(0 comentarios)
0.0
abelzmiretu

I have a good experience in developing algorithms for sofesticated projects.I have developed mathematical algorithms for solving a matrix,any polynomial function and ...

£150 GBP en 3 días
(0 comentarios)
0.0
psychicsoft

I have done programming for more than 4 years. I am an Algorithm and data structure specialist. I can provide you an accurate solution.

£150 GBP en 2 días
(0 comentarios)
0.0
irajhedayati

I did similar works. The last one just here and you can check it in my profile. I'm familiar with these kind of algorithms. Also, I have migrated all the document based data in my company to Dynamics CRM using similar Más

£263 GBP en 7 días
(1 comentario)
0.0
csiki

I believe I have already have made an accurate algorithm for your problem, combining some of the algorithms you listed. Please send me more examples, matches to detemine if I'm right.

£105 GBP en 1 día
(0 comentarios)
0.0