Delphi HTML parser

Closed Posted May 25, 2010 Paid on delivery

The goal of this project is to build an "intelligent" HTML parser that extracts data from HTML pages.

This parser should be able to automatically extract data such as:

companyName, address, email, fax, tel, website

The parser must be able to extract this data N times per page, since the HTML pages will contain tabular data (N records per page).

[url removed, login to view]();

while ([url removed, login to view]()) do begin;

data:=[url removed, login to view]();

// data should be an object or type like

// [url removed, login to view], [url removed, login to view], [url removed, login to view], [url removed, login to view], [url removed, login to view], [url removed, login to view]

end;
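The identifiers in the loop above were stripped from the posting, so the following is only a minimal sketch of what such an interface might look like, not the client's actual API. `TContactData`, `THtmlContactParser`, `Add`, `First`, `HasNext` and `Next` are all hypothetical names chosen for illustration; only the six field names come from the description. The sketch avoids generics and other newer language features, with Delphi 6 in mind.

```pascal
program ContactRecordSketch;
{$IFDEF FPC}{$MODE DELPHI}{$ENDIF}
{$APPTYPE CONSOLE}

uses
  SysUtils;

type
  // Hypothetical record holding the six fields named in the posting.
  TContactData = record
    CompanyName: string;
    Address: string;
    Email: string;
    Fax: string;
    Tel: string;
    Website: string;
  end;

  // Hypothetical parser facade: stores extracted records and iterates them.
  // The actual HTML extraction is left out of this sketch.
  THtmlContactParser = class
  private
    FItems: array of TContactData;
    FIndex: Integer;
  public
    procedure Add(const Item: TContactData);
    procedure First;
    function HasNext: Boolean;
    function Next: TContactData;
  end;

procedure THtmlContactParser.Add(const Item: TContactData);
begin
  SetLength(FItems, Length(FItems) + 1);
  FItems[High(FItems)] := Item;
end;

procedure THtmlContactParser.First;
begin
  FIndex := 0; // rewind to the first extracted record
end;

function THtmlContactParser.HasNext: Boolean;
begin
  Result := FIndex < Length(FItems);
end;

function THtmlContactParser.Next: TContactData;
begin
  Result := FItems[FIndex];
  Inc(FIndex);
end;

var
  Parser: THtmlContactParser;
  Data: TContactData;
begin
  Parser := THtmlContactParser.Create;
  try
    // Stand-in for a record that a real extraction step would produce.
    Data.CompanyName := 'Example Ltd';
    Data.Email := 'info@example.com';
    Parser.Add(Data);

    Parser.First;
    while Parser.HasNext do
    begin
      Data := Parser.Next;
      WriteLn(Data.CompanyName, ' <', Data.Email, '>');
    end;
  finally
    Parser.Free;
  end;
end.
```

A plain record keeps each result cheap to copy, which suits a loop that returns one value per iteration.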

I think a good knowledge of the DOM and of regex is necessary.

Of course it will not work on ALL websites, but it should be universal enough.

It should work with data from:

[url removed, login to view]

[url removed, login to view]

[url removed, login to view]

[url removed, login to view]

etc.

I think a good strategy would be:

1) Find a repetitive fragment in the DOM (when a page contains 20 results, it should extract 20 HTML blocks).

2) Apply a parser to each block that contains data to be extracted.
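The two steps above can be sketched roughly as follows. This is a deliberately simplified illustration, not a complete solution. It assumes the repeated fragment is already known to be a `<tr>...</tr>` row (a real implementation would have to detect the repeating element from the DOM), and it pulls out only the e-mail field with a hand-written character scan, since Delphi 6 ships without a regex library. `SplitBlocks`, `IsEmailChar`, `ExtractEmail` and the sample HTML are hypothetical names and data invented for this example.

```pascal
program RepeatedBlockSketch;
{$IFDEF FPC}{$MODE DELPHI}{$ENDIF}
{$APPTYPE CONSOLE}

uses
  SysUtils, Classes;

// Step 1 (simplified): collect every OpenTag ... CloseTag span as one block.
// Tags are matched case-insensitively; pass them in lowercase.
// Caveat: a prefix such as '<tr' would also match other tags starting with it.
procedure SplitBlocks(const Html, OpenTag, CloseTag: string; Blocks: TStrings);
var
  Rest: string;
  StartPos, EndPos, BlockLen: Integer;
begin
  Rest := Html;
  repeat
    StartPos := Pos(OpenTag, LowerCase(Rest));
    if StartPos = 0 then Break;
    // Drop everything before the opening tag, then look for the closing tag.
    Rest := Copy(Rest, StartPos, MaxInt);
    EndPos := Pos(CloseTag, LowerCase(Rest));
    if EndPos = 0 then Break;
    BlockLen := EndPos + Length(CloseTag) - 1;
    Blocks.Add(Copy(Rest, 1, BlockLen));
    Rest := Copy(Rest, BlockLen + 1, MaxInt);
  until Rest = '';
end;

function IsEmailChar(C: Char): Boolean;
begin
  Result := C in ['A'..'Z', 'a'..'z', '0'..'9', '.', '_', '-', '+'];
end;

// Step 2 (simplified): grow outwards from the first '@' in the block.
// Returns '' when no plausible address is found.
function ExtractEmail(const Block: string): string;
var
  AtPos, L, R: Integer;
begin
  Result := '';
  AtPos := Pos('@', Block);
  if AtPos = 0 then Exit;
  L := AtPos;
  while (L > 1) and IsEmailChar(Block[L - 1]) do Dec(L);
  R := AtPos;
  while (R < Length(Block)) and IsEmailChar(Block[R + 1]) do Inc(R);
  // Require at least one character on each side of the '@'.
  if (L < AtPos) and (R > AtPos) then
    Result := Copy(Block, L, R - L + 1);
end;

const
  // Made-up sample page with two result rows.
  SampleHtml =
    '<table>' +
    '<tr><td>Acme Ltd</td><td>sales@acme.example</td></tr>' +
    '<TR><td>Foo GmbH</td><td>mailto:info@foo.example</td></TR>' +
    '</table>';

var
  Blocks: TStringList;
  I: Integer;
begin
  Blocks := TStringList.Create;
  try
    SplitBlocks(SampleHtml, '<tr', '</tr>', Blocks);
    for I := 0 to Blocks.Count - 1 do
      WriteLn(I + 1, ': ', ExtractEmail(Blocks[I]));
  finally
    Blocks.Free;
  end;
end.
```

The other fields (tel, fax, website, address) would need their own heuristics per block. Keeping block detection separate from field extraction means each part can be tuned per site without touching the other.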

It should be Delphi 6 compatible.

Delphi Engineering Microsoft Project Management Software Architecture Software Testing Windows Desktop

Project ID: #3451768

About the project

11 bids Remote project Active Jun 16, 2010

11 freelancers are bidding an average of $425 for this job

IWSolutions

See private message.

$425 USD in 14 days
(101 reviews)
6.7
kraneware

See private message.

$425 USD in 14 days
(8 reviews)
5.9
PaulFarr

See private message.

$425 USD in 14 days
(33 reviews)
4.9
vw7437936vw

See private message.

$425 USD in 14 days
(19 reviews)
4.1
powzak

See private message.

$425 USD in 14 days
(28 reviews)
4.1
devdlrb

See private message.

$425 USD in 14 days
(1 review)
0.7
myimservices

See private message.

$425 USD in 14 days
(0 reviews)
0.0
heidelguest

See private message.

$425 USD in 14 days
(1 review)
0.0
secureenix

See private message.

$425 USD in 14 days
(0 reviews)
0.0
abeloqp

See private message.

$425 USD in 14 days
(0 reviews)
0.0
bluesoftcoders

See private message.

$425 USD in 14 days
(3 reviews)
2.2