Create a Python script that standardizes scraped data from existing scripts before they're saved to a database

En curso Publicado hace 6 años Pagado a la entrega
En curso Pagado a la entrega

I have 10 different scraping scripts that I run through my VPS that each captures data from a website and stores that data to a database.

There is a slight problem, as the data that is being captured is inconsistent, and I want to display the data in a consistent format in my database and website. You must create a Python script that will 'standardize' the data.

FOR EXAMPLE: One of the fields that is captured on each website is 'Manufacturer'.

Website 1: 'Manufacturer' = {GE, TOSH, WST}

---> {General Electric, Toshiba, Westinghouse}

Website 2: 'Manufacturer' = {Westing., Toshiba, General elec.}

---> {Westinghouse, Toshiba, General Electric}

I want to insert a script within each of the scraping scripts that accomplishes this. Some of the filters will require Regular Expressions, so your script should be set up to be able to handle that.

** I can fill out the specifics of the arrays myself, for which words should be substituted for the terms. I just need someone with Python knowledge to construct the script and the 'template arrays' and tell me where to place it within my scripts. **

I will provide you with a sample of one of the scripts. They run Scrapy, and they are all similar enough that you will probably be able to create just one script and it will work for all of my scrapers.

The budget for this project is $50.

Python Extracción de datos web

Nº del proyecto: #13768792

Sobre el proyecto

9 propuestas Proyecto remoto Activo hace 6 años

9 freelancers están ofertando un promedio de $116 por este trabajo

mantislin

Hi sir, This is kimi and I am scraping expert, I have did too many scraping projects, please check my profile page then you will know. https://www.freelancer.com/u/mantislin.html Can you tell me Más

$250 USD en 5 días
(425 comentarios)
8.2
vlayausa

Hello, I have a lot of experience with web scraping and a lot of scripting experience in Python. I would love to help you with this. Please contact me for more information.

$50 USD en 1 día
(49 comentarios)
5.0
deaswang

hello I have read your requirement. I can help you to finish this work. Can you provide more information about this project? Thank you

$50 USD en 3 días
(20 comentarios)
4.9
DrDri

Hello Sir, How are you ? I read your description and I see that you have the array ready, if that's the case, then let's just start working on the project! please contact me, and thank you!

$55 USD en 3 días
(8 comentarios)
4.6
i333

A Python and web scrapping developer here ready to discuss this further and create this script (regex) to standardize this scraped data. Could you send me the sample script so I can understand better what you exactly w Más

$100 USD en 1 día
(9 comentarios)
3.9
privatecaptain

Hey i have a few questions and suggestions about the project, if we could talk in detail and make this work!

$77 USD en 1 día
(9 comentarios)
4.3