Find images from within text (repost)

Cancelled · Posted Oct 14, 2009 · Paid on delivery

We have 3 million articles in a local database. Many of these articles have associated articles in other databases (which we also have on our system, so there is no need to scrape). The data records which articles are associated with which.

## Deliverables

# Description

We would like you to download images within these associated articles. Most articles don't have images, but when they do, download them and reduce them to thumbnails of maximum size 150x150, maintaining aspect ratio.
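As a rough illustration of the 150x150 constraint, the target dimensions can be computed from the original size alone. This is only a sketch: the function name `thumbnail_size` is made up for this example, and it assumes that images already smaller than 150x150 are left at their original size, which the brief does not state.

```python
def thumbnail_size(width: int, height: int, max_side: int = 150) -> tuple[int, int]:
    """Return (w, h) that fits inside max_side x max_side, keeping aspect ratio."""
    if width <= 0 or height <= 0:
        raise ValueError("image dimensions must be positive")
    # Assumption: never upscale, so the scale factor is capped at 1.0.
    scale = min(max_side / width, max_side / height, 1.0)
    # Round to the nearest pixel, but never let a side collapse to zero.
    return max(1, round(width * scale)), max(1, round(height * scale))


print(thumbnail_size(600, 400))  # (150, 100)
print(thumbnail_size(400, 600))  # (100, 150)
print(thumbnail_size(100, 50))   # (100, 50)
```

The longer side ends up at 150 pixels and the shorter side is scaled by the same factor, so the image is never stretched.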

Create a simple frontend that displays these images in descending order of the number of associated articles each image appeared on. Ensure that no duplicate images are shown, and that no images are shown that aren't part of the article (this will become clear when you communicate with me in private). If the user clicks on an image, it will take them to an external page not created by you (see 3e).
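One possible way to handle the ordering and the duplicate check together is sketched below. It assumes that "duplicate" means byte-identical files, detected with a SHA-256 hash; resized or re-encoded copies of the same picture would need a different technique. The function name `rank_images` and the input shape are hypothetical.

```python
import hashlib
from collections import defaultdict


def rank_images(occurrences):
    """occurrences: iterable of (associated_article_id, image_bytes) pairs.

    Returns [(sha256_hex, article_count), ...], most widely used image first.
    """
    articles_by_image = defaultdict(set)
    for article_id, image_bytes in occurrences:
        digest = hashlib.sha256(image_bytes).hexdigest()
        # A set means an image repeated inside one article is counted once.
        articles_by_image[digest].add(article_id)
    counts = [(digest, len(ids)) for digest, ids in articles_by_image.items()]
    # Descending by count; ties broken by digest so reruns give the same order.
    return sorted(counts, key=lambda item: (-item[1], item[0]))


seen = [("a1", b"cat"), ("a2", b"cat"), ("a2", b"cat"), ("a1", b"dog")]
for digest, count in rank_images(seen):
    print(digest[:8], count)
```

Keying on the content hash rather than the URL means the same file served from two different addresses still collapses into one entry.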

You can use any language you like to import the data. The simple frontend should be in PHP.

The page must be served within 0.3 seconds (not counting the time it takes the user's browser to download the images).
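One way to approach the response-time target is to precompute the ranking at import time, so that the page only performs a single indexed lookup. The sketch below uses Python's built-in `sqlite3` purely as a stand-in for MySQL; the table name `article_image_rank`, its columns, and both function names are assumptions made for this example, not part of the brief.

```python
import sqlite3


def build_rank_table(conn, rows):
    """rows: iterable of (article_name, thumb_path, article_count)."""
    conn.execute(
        """CREATE TABLE IF NOT EXISTS article_image_rank (
               article_name  TEXT NOT NULL,
               thumb_path    TEXT NOT NULL,
               article_count INTEGER NOT NULL,
               PRIMARY KEY (article_name, thumb_path)
           )"""
    )
    # The index matches the page query: filter by name, read in ranked order.
    conn.execute(
        """CREATE INDEX IF NOT EXISTS idx_rank
           ON article_image_rank (article_name, article_count DESC)"""
    )
    conn.executemany(
        "INSERT OR REPLACE INTO article_image_rank VALUES (?, ?, ?)", rows
    )
    conn.commit()


def images_for(conn, article_name):
    """Return [(thumb_path, article_count), ...] in descending order of count."""
    cur = conn.execute(
        "SELECT thumb_path, article_count FROM article_image_rank "
        "WHERE article_name = ? ORDER BY article_count DESC",
        (article_name,),
    )
    return cur.fetchall()


conn = sqlite3.connect(":memory:")
build_rank_table(conn, [("Paris", "a.jpg", 3), ("Paris", "b.jpg", 7), ("Rome", "c.jpg", 1)])
print(images_for(conn, "Paris"))  # [('b.jpg', 7), ('a.jpg', 3)]
```

Whether this meets 0.3 seconds on the real dataset depends on the server, so it would still need to be measured there.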

**Deliverables:**

1. The code, which must be rerunnable without manual intervention.

2. Documentation on how to install and run this.

3. Database that includes:

3a. When the image was downloaded

3b. Each of the associated article sources the image comes from

3c. Original dimensions of the image

3d. URL where the image was retrieved from.

3e. URL of the page about the image from the original source. This is easily discoverable from the dataset we'll provide.

3f. Article name associated with the image.

4. Very simple frontend that takes the article name as input and shows the images in descending order. No need to do any design, navigation or anything.
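The database fields in 3a to 3f, together with the rerunnable requirement in item 1, could be laid out roughly as follows. This is a sketch under stated assumptions: `sqlite3` stands in for MySQL, all table, column, and function names are invented for illustration, and the image URL is assumed to be a usable unique key.

```python
import sqlite3
from datetime import datetime, timezone

SCHEMA = """
CREATE TABLE IF NOT EXISTS images (
    image_url       TEXT PRIMARY KEY,   -- 3d: URL the image was retrieved from
    source_page_url TEXT NOT NULL,      -- 3e: page about the image at the source
    downloaded_at   TEXT NOT NULL,      -- 3a: download time, ISO 8601 in UTC
    orig_width      INTEGER NOT NULL,   -- 3c: original dimensions
    orig_height     INTEGER NOT NULL
);
CREATE TABLE IF NOT EXISTS image_sources (
    image_url    TEXT NOT NULL REFERENCES images(image_url),
    article_name TEXT NOT NULL,         -- 3f: article associated with the image
    source_name  TEXT NOT NULL,         -- 3b: associated article source
    PRIMARY KEY (image_url, article_name, source_name)
);
"""


def record_image(conn, image_url, source_page_url, width, height,
                 article_name, source_name):
    """Store one sighting; repeating the same call leaves the data unchanged."""
    now = datetime.now(timezone.utc).isoformat()
    # INSERT OR IGNORE skips rows whose primary key already exists,
    # which is what makes a second import run safe.
    conn.execute(
        "INSERT OR IGNORE INTO images VALUES (?, ?, ?, ?, ?)",
        (image_url, source_page_url, now, width, height),
    )
    conn.execute(
        "INSERT OR IGNORE INTO image_sources VALUES (?, ?, ?)",
        (image_url, article_name, source_name),
    )
    conn.commit()


conn = sqlite3.connect(":memory:")
conn.executescript(SCHEMA)
record_image(conn, "http://example.org/x.jpg", "http://example.org/x",
             640, 480, "Paris", "source_a")
```

In MySQL the equivalent statement is `INSERT IGNORE`. Keeping the sources in a separate table allows one image to be linked to many associated articles without repeating its metadata.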

**Milestones**

M1. Submit everything for a list of 5,000 articles which we will provide (10% of bid value).

M2. Submit everything for a list of 50,000 articles which we will provide. (10% of bid value).

M3. Do the rest.

**In your bid please include:**

B1. What experience you have working with large datasets.

B2. How long it will take to complete.

B3. What server resources you'll need for the task.

1) Complete and fully-functional working program(s) in executable form as well as complete source code of all work done.

2) Deliverables must be in ready-to-run condition, as follows (depending on the nature of the deliverables):

a) For web sites or other server-side deliverables intended to only ever exist in one place in the Buyer's environment: Deliverables must be installed by the Seller in ready-to-run condition in the Buyer's environment.

b) For all others, including desktop software or software the Buyer intends to distribute: A software installation package that will install the software in ready-to-run condition on the platform(s) specified in this bid request.

3) All deliverables will be considered "work made for hire" under U.S. Copyright law. Buyer will receive exclusive and complete copyrights to all work purchased. (No GPL, GNU, 3rd party components, etc. unless all copyright ramifications are explained AND AGREED TO by the buyer on the site per the coder's Seller Legal Agreement).

## Platform

Linux, Apache, MySQL, PHP

Skills: MySQL, PHP, Software Architecture, Software Testing, Web Hosting, Website Management, Website Testing

Project ID: #2921027

About the project

5 bids · Remote project · Active Nov 2, 2009

5 freelancers are bidding an average of $884 for this job

elepsis

See private message.

$1020 USD in 35 days
(75 reviews)
5.4

terryfvw

See private message.

$850 USD in 35 days
(18 reviews)
4.9

prosolutionvw

See private message.

$1275 USD in 35 days
(8 reviews)
4.6

meamatacons

See private message.

$552.5 USD in 35 days
(2 reviews)
3.0

har3567

See private message.

$722.5 USD in 35 days
(0 reviews)
0.0