Here you can find all the categories of Italian eBay:
[login to view URL]
If you click on 'Altro' on the main categories, you'll be shown all the subcategories.
Basically I need a tool that returns the most 100 frequent products posted in the auctions of every subcategory.
I know of similar tools that read all the listing titles in a specific category, then pick out the most used combination of words. In big categories, like Auto Parts and Accessories, you can "seed" the search with a word and require a word.
For example, you can seed the search with the word "ammortizzatore" ('shocks' in English), then require the word "Range Rover". That should return keyword combinations for all eBay listings in that category for shocks fitting a Range Rover.
This is just an idea, and if you can come up with something more simple or more effective for big categories, don't hesitate to suggest it.
Keep in mind 2 things:
- I want this tool to have some sort of archive, so that if I'm going to scrape the same category one week later there will be no duplicate products name
- on italian eBay there should be AT LEAST 10 different auctions for every product name returned by the script (this means that if a category is very small and say that the 20th most frequent product name returns just 3 auctions, I don't care to have the other 80 product names)
Please bid on this project only if you are experienced with this kind of coding.
Dear,
I have experience with web data scraping, I will use java with HtmlUnit library for development. This library can help us parse the html element, auto click, it also can catch the ajax rendering...
The delivery is a command line app including a bash script (call java classes) and a configuration file (contains csv file path).
Thanks,
Huy