Find Jobs
Hire Freelancers

A VB program to monitor Google for copies of web pages

$100-500 USD

Cerrado
Publicado hace más de 18 años

$100-500 USD

Pagado a la entrega
What I need is a program that, given a series of web pages, uses the Google API to monitor the web for possible copies of the text. The specs are not yet done on this project, so: 1) I am open to suggestions, proposing good ideas here might be a factor in selecting your bid 2) I realize that after the final specs are done, you might want to change your initial bid. I am ready to accept this What I am thinking about is something like the following: get the text of the web page extract a phrase (N words, where N is like 5-6 words) search it in Google if a site exists, it might be a copy of our text. Another approach that will probably be better to search parial copies: get the text of the web page extract a phrase (N words, where N is like 3-4 words) search it in Google if there are more than 100 pages, it was a too common phrase, try with another phrase remember the results repeat the process 5 times, if a page appears more than 3 times in the 5 results, it might be a copy of your text. There are a LOT of enhancements that can be applied to this, of course, and I would like you to be creative on this too. For example, options that might be included are: - automatically "spider" the web site to be checked (i.e. download all its pages) - white-listing of specific pages/domains: the user sees a page and decides that it's ok. He then marks the page as being whitelisted, and it will not be displayed any more in the possi - show the side-by-side the texts of the original and of the possible copy, highlighting the matching parts of text - set up an automatical check of pages every N days - in case of large lists of pages, where you might hit the 1.000 searches limit from Google, process N pages at a day ## Deliverables 1) Complete and fully-functional working program(s) in executable form as well as complete source code of all work done. 2) Deliverables must be in ready-to-run condition, as follows (depending on the nature of the deliverables): a) For web sites or other server-side deliverables intended to only ever exist in one place in the Buyer's environment--Deliverables must be installed by the Seller in ready-to-run condition in the Buyer's environment. b) For all others including desktop software or software the buyer intends to distribute: A software installation package that will install the software in ready-to-run condition on the platform(s) specified in this bid request. 3) All deliverables will be considered "work made for hire" under U.S. Copyright law. Buyer will receive exclusive and complete copyrights to all work purchased. (No GPL, GNU, 3rd party components, etc. unless all copyright ramifications are explained AND AGREED TO by the buyer on the site per the coder's Seller Legal Agreement). ## Platform VB6, Google API
ID del proyecto: 3113031

Información sobre el proyecto

2 propuestas
Proyecto remoto
Activo hace 18 años

¿Buscas ganar dinero?

Beneficios de presentar ofertas en Freelancer

Fija tu plazo y presupuesto
Cobra por tu trabajo
Describe tu propuesta
Es gratis registrarse y presentar ofertas en los trabajos
2 freelancers están ofertando un promedio de $310 USD por este trabajo
Avatar del usuario
See private message.
$510 USD en 30 días
4,7 (4 comentarios)
4,2
4,2
Avatar del usuario
See private message.
$110,50 USD en 30 días
5,0 (6 comentarios)
1,4
1,4

Sobre este cliente

Bandera de ITALY
Rome, Italy
5,0
234
Forma de pago verificada
Miembro desde may 29, 2001

Verificación del cliente

¡Gracias! Te hemos enviado un enlace para reclamar tu crédito gratuito.
Algo salió mal al enviar tu correo electrónico. Por favor, intenta de nuevo.
Usuarios registrados Total de empleos publicados
Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)
Cargando visualización previa
Permiso concedido para Geolocalización.
Tu sesión de acceso ha expirado y has sido desconectado. Por favor, inica sesión nuevamente.