web page parsing and download

Completed. Posted Mar 21, 2009. Paid on delivery.
I require a solution that will parse a webpage and extract all of the data associated with its multi-level list box.

When you first arrive at the webpage ([login to view URL]), there is a list box containing about 20 categories. When you click on a category, the same list box shows a list of sub-categories. This continues through sub-categories within sub-categories, 4 or 5 levels deep.

I think the proper term for the structure is a tree with nodes, or similar. From the page source I can see that it is driven by JavaScript (some have suggested Ajax is used), and although the entries behave like links, the link targets do not appear in the page source. I need the keyword data associated with each category in the tree structure.

The process is simple enough when done manually, but very time-consuming.
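To make the tree structure described above concrete, here is a minimal sketch of how the manual click-through could be automated once there is some way to ask for the sub-categories of a given category. It is written in Python for illustration only. The `fetch_children` callable is a hypothetical stand-in for whatever request the page's JavaScript actually makes, and `SAMPLE_TREE` is made-up data used only to exercise the traversal offline; neither comes from the real page.

```python
from collections import deque
from typing import Callable, Dict, Iterator, List, Tuple

Path = Tuple[str, ...]
# Hypothetical: given a category path, return the names of its sub-categories.
FetchChildren = Callable[[Path], List[str]]


def walk_categories(fetch_children: FetchChildren, max_depth: int = 5) -> Iterator[Path]:
    """Yield every category path breadth-first, at most max_depth levels deep."""
    queue = deque([()])  # the empty path stands for the initial list box
    while queue:
        path = queue.popleft()
        if len(path) >= max_depth:
            continue  # do not expand below the depth limit
        for name in fetch_children(path):
            child = path + (name,)
            yield child
            queue.append(child)


# Made-up sample data (assumption), standing in for the real categories.
SAMPLE_TREE: Dict[Path, List[str]] = {
    (): ["Automotive", "Travel"],
    ("Automotive",): ["Parts", "Dealers"],
    ("Automotive", "Parts"): ["Tires"],
    ("Travel",): ["Hotels"],
}


def sample_fetch(path: Path) -> List[str]:
    """Offline replacement for the real lookup; leaves have no children."""
    return SAMPLE_TREE.get(path, [])


if __name__ == "__main__":
    for path in walk_categories(sample_fetch):
        print(" > ".join(path))
```

Keeping the lookup behind a single callable means the traversal does not care whether the children come from an Ajax endpoint, an API, or a saved file. The depth limit reflects the "4 or 5 levels" estimate and guards against an unexpectedly deep or cyclic structure.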

## Deliverables

The webpage I need parsed is [login to view URL].

I have attached a sample of the "tree" showing several levels.

* * *

This broadcast message was sent to all bidders on Saturday, Mar 21, 2009, 4:08:43 PM:

Note to all who have replied so far: now that several bidders have asked questions, I find myself stumped by the style Google uses on that webpage. I am starting to wonder if the only easy way to do this task is via the Google API.

My ultimate goal is to get all the keyword data. I had thought to do it myself, as I already have some generic webpage parsing tools in PHP. However, that list box threw me a curve, and I could not see how to get at the links programmatically. To add to my problems, links extracted from the tree structure do NOT work in a standard browser's address bar.

So I will need to:

(a) find one of you who can extract the actual keyword data by sub-category (I think there are actually over 500 distinct categories), or
(b) get an API licence and have it developed for me that way, or
(c) cancel the idea and purchase the raw data from someone.

I apologize for the confusion. I have done many extractions before today (some even from Google), but this one is defying me. If you think you can do this expanded version of the job, let me know; otherwise I will have to cancel the request.

Richard

PHP

Project ID: #3746704

About the project

6 proposals. Remote project. Active Mar 24, 2009.

Awarded to:

surfingtonio

See private message.

$127.5 USD in 14 days
(95 reviews)
5.5

6 freelancers are bidding an average of $81 for this job

webexpert78

See private message.

$68 USD in 14 days
(96 reviews)
6.1

hoesoftware

See private message.

$80.75 USD in 14 days
(62 reviews)
5.9

keavw

See private message.

$42.5 USD in 14 days
(30 reviews)
5.2

arun008vw

See private message.

$85 USD in 14 days
(2 reviews)
1.3

hemangrana

See private message.

$85 USD in 14 days
(0 reviews)
0.0