Data scraping from 3 Dutch websites

Completado Publicado May 9, 2011 Pagado a la entrega
Completado Pagado a la entrega

Hi!

For marketing research we need to collect public data form three websites.

REQUIREMENTS MUST HAVE

* PHP as programming language

* Use CURL library for HTTP requests

* PHP script should be run in commandline

* Efficient crawler to minimize number of requests

* Store data in MySQL database

* Configuration part in the script which sets database user/pass/dbname and other settings in one place

REQUIREMETS NICE TO HAVE

* Possibility to extend with new websites

* Possibility to stop the script and to continue where is left off. Do not crawl already crawled data more than once.

FIELDS TO SCRAPE AND STORE

source_website = name of the website

scrape_url = url where data form scraped from

contact_name

visit_adress1

visit_zipcode

visit_city

visit_country

post_address1

post_zipcode

post_city

post_country

phone1

fax1

email

website

company_name

company_vat_number

company_coc_number

company_description

company_url_logo

company_size

company_start

member_from

member_status

category

DELIVERABLES

* Database structure for MySQL tables.

* One PHP script, which can be run in command line. This script fills the database with data from the websites.

PAYMENT

* 2 milestones; 1) 50% when script is running and working and 2) 50% when everything is checked to be working properly.

WEBSITES

== 1 ==

Index page:

[login to view URL]

Data page:

[login to view URL]

Data example:

* URL to company logo

* URL: [login to view URL]

== 2 ==

Index page:

[login to view URL]

Data page:

[login to view URL]

Data example:

* URL to company logo

* [login to view URL]

== 3 ==

Index page (use the # A B C D … subpages for all entries):

[login to view URL]

Data page:

[login to view URL],CompanyId

Data example:

* [login to view URL]

* [login to view URL]

MySQL PHP Extracción de datos web

Nº del proyecto: #1053510

Sobre el proyecto

14 propuestas Proyecto remoto Activo May 14, 2011

Adjudicado a:

FMShinobi

Hello! i read your project details, i`m familiar with CURL and done similar projects, check PM please for more info.

€30 EUR en 2 días
(3 comentarios)
2.7

14 freelancers están ofertando un promedio de €131 por este trabajo

SigmaVisual

We can help in your project, please check PMB and our ratings/reviews to get idea of our experience.

€200 EUR en 7 días
(288 comentarios)
8.2
srinichal

Expert in scrapping and look forward to deliver the project

€200 EUR en 4 días
(167 comentarios)
7.6
phpXpertbd

Please see pm. Thanks

€140 EUR en 10 días
(84 comentarios)
7.3
wildlily980

I can do it.

€90 EUR en 5 días
(66 comentarios)
6.9
allhen

Ready to work !!

€420 EUR en 9 días
(69 comentarios)
5.9
alexander2007

Please check PM, Thanks.

€200 EUR en 7 días
(30 comentarios)
5.9
vunv

Check your PMB, Thanks!

€100 EUR en 3 días
(28 comentarios)
4.5
z0mbie

Hi again, please check PM and attachment.

€50 EUR en 2 días
(13 comentarios)
4.6
a2infotech

please check PMB. Thanks a2infotech

€70 EUR en 3 días
(4 comentarios)
2.7
kayasystems

Please see PM

€100 EUR en 2 días
(0 comentarios)
0.0
PutraCoder

Hi sir, Please check your PMB...

€30 EUR en 0 días
(0 comentarios)
0.0