Web scraper / Spider -content extractor software wanted
$30-250 USD
Cancelado
Publicado hace alrededor de 14 años
$30-250 USD
Pagado a la entrega
i am looking for programmer who can create a web scraper/spider/extractor software.
Primary objective for us is is to extract company name, person name, job-titles, country, email address.
The software must convert the data search into CSV or XLS format.
The software must have 2 part function.
1)The software must crawl from specific website defined by the user
2)The software must be able to crawl from search engine key word to be defined by user. Must also able to search for file such as xls, pdf, and doc file.
Key Features:
A Personal, Customizable Web crawler. Crawling rules.
Multithreaded technology Support for the robots exclusion protocol/standard ([login to view URL] file and Robots META tags);
Export the indexed data into Microsoft Access database, TEXT file, Excel file (CSV), HTML file, MySQL script file;
Start crawling from a list of the URLs specified by user;
Start crawling from a historical list of the URLs;
Start crawling using keywords and phrases;
Store web pages on your local disk;
Auto-resolve URL of redirected links;
Auto-remove duplicate or invalid syntax URLs;
Filter the indexed data;
Command line options;
Generate and export map of the visited links;
Very simple to use, quick learning curve and right to the point.
I will pick the one who is inexpensive with the proven track record of solving similar problems from other projects won in the past on GAF so you must have done this and won one or more jobs similar on GAF.
Attention to all bidders:
Please DO NOT PASTE your standard comments. If you understand the requirement please type "I UNDERSTAND" in the subject or else your bid will be deleted.
No Upfront - payment to be made once the job is 100% done - don't bid i you don't agree with this.
SUBJECT LINE: "I UNDERSTAND"
Hello!
Leading pioneers in scrapping expert wants to crack this great deal with you.
Please refer to the PMB to get yourself acquainted about the deal.
Regards
Shruti Srivastav