Web scraping of printer toner/ink website with import of data to Magento

Completado Publicado Apr 14, 2014 Pagado a la entrega
Completado Pagado a la entrega

We intend to scrape product data from a printer toner/ink website. For every product on the website we will create one SKU in our Magento Ecommerce store and import the associated data that has been scraped. The scraped data needs to be cleaned and modified to ensure it meets our Magento Ecommerce store data standards. We will use four CSV files to import the data.

This tasks require that you scrape the data, clean and modify the data to meet our data standards, create four CSV files for import and import the data to our non-production Magento store.

Navigation Structure:

The website we want scrape is organised into this navigation structure:

Brand > Category > Models > Products

Brand Page:

There are 16 brand categories currently for the following brands:

Brand

Canon

Dell

Epson

HP

Konica Minolta

Kyocera

Lanier

Lexmark

OKI

Panasonic

Ricoh

Samsung

Sharp

Toshiba

Xerox

Within each brand are one or more of the following categories:

Ink Cartridges

Toner Cartridges

Thermal Rolls

Category Page:

Within each category page is a list of printer models.

Models Page:

Within each model page is a list of products that are for use with that printer model broken into product segments such as below:

Compatible Brand Toner Value Pack

Genuine Brand Toner Value Pack

Compatible Brand Toner

Genuine Brand Toner

Compatible Brand Image Drums

Genuine Brand Image Drums

Product Page:

The final product page has the data that we intend to scrape. Each product page has a unique URL. The same product page will be linked from multiple printer model pages as one product is normally compatible with multiple printers.

We need four CSV files created with the data that is scraped. The four CSV files we need created are:

1) [login to view URL]

2) [login to view URL]

3) [login to view URL]

4) [login to view URL]

For each unique product URL we will create one product SKU in our Magento Ecommerce store. This is [login to view URL] and product_type.csv.

For each printer part/model we will cross reference the product SKU(s) that are compatible. For example, these parts/models:

SKU1: PartY, ModelZ

SKU2: PartY

SKU3: ModelZ

… we will cross reference in our database like:

PartY: SKU1, SKU2

ModelZ: SKU1, SKU3

This is [login to view URL] and [login to view URL]

*** Images need to be downloaded and will be referenced in the [login to view URL] import file for import to our Magento Ecommerce store ***

The scraped data needs to be cleaned and modified to ensure it meets our Magento Ecommerce store data standards.

Once complete each CSV file is to be imported to our non-production Magento Ecommerce store.

Existing Products

Our Magento Ecommerce store currently has products from the website we intend to scrape. We will provide a list of product URLs (The unique product URLs from the website that is being scraped) that you do not need to import. After scraping the data from the website you can remove the product URLs that we provide so the same data is not imported twice.

Data Cleaning and Modification

An example of data cleaning and modification is:

“HP LaserJet 1000” would be imported to our Magento Ecommerce store as “HP”,”LaserJet”,”1000”:

The brand and series are both placed into their own separate column. Again, by looking at our existing store data you can clearly see that Laserjet is a series of HP and should be placed into a separate column.

The product attribute “3,500 pages at 5% coverage” would be imported to our Magento Ecommerce store as “Approx. 3,500 pages at 5% coverage”:

By looking at our existing attributes you can see that we use “Approx.” in our data.

Important:

It is your responsibility to test each file using our non-production Magento Ecommerce store to ensure all data is able to import successfully.

Entrada de datos Excel Extracción de datos web Búsqueda en la web

Nº del proyecto: #5806088

Sobre el proyecto

30 propuestas Proyecto remoto Activo Apr 16, 2014

Adjudicado a:

ghazalpasha

Please send me the login credentials to the non-production shop so that I can take a look and make sure I understand all the requirements.

$684 AUD en 10 días
(40 comentarios)
6.0

30 freelancers están ofertando un promedio de $392 por este trabajo

jeweljitu

Hi i will do it manually team base. "Dear employer, I am a highly trained Data Entry,Research, Web search, Web scrape, Product Add Any Type Of Card, Expert with great knowledge of Excel and all Social Networks. Pleas Más

$526 AUD en 10 días
(1298 comentarios)
8.7
seaanddream

Hi, thank you for the invitation....your 5-star data extracting & database building expert is ready to help your project. pls check my profile and feedbacks first to have some idea about the quality of my work... I had Más

$315 AUD en 5 días
(372 comentarios)
8.1
mantislin

Hi sir, I am scraping expert, I have did too many similar projects, please check my feedback then you will know. Can you tell me more details? then I will provide demo data for you. Thanks, Kimi

$250 AUD en 5 días
(311 comentarios)
7.6
HongGiang

Hello sir! I'm expert on web scraping and have more experience in Magento. I have done a lot projects for the importing products on Magento and very familiar with the template of csv files. Please take a look at my pro Más

$250 AUD en 5 días
(414 comentarios)
7.5
uumairkhalid

Hi.. Expert Web Scraper & Data Minor here. I have done too many similar project in past. Having best scraping tools and experience i assure you 100% accurate and good quality work. I have too too scraping experience. Más

$526 AUD en 10 días
(188 comentarios)
7.1
uumarkhalid31

hi, i am expert in web scraping and interested in this project, let me do this work with perfection, accuracy and according to your requirements, plz contact with me so we can discuss further about the project thanks

$315 AUD en 10 días
(261 comentarios)
7.2
FirstChoiceWeb

Hello, Read and understood project details. We have a lot of experience of data scarping and data entry. We can do this job very well also we are ready to do sample entries. Hope to getting consider for this. Regards

$250 AUD en 10 días
(112 comentarios)
6.4
Kamalkishover

Hi We have done similar project, We have expert team. WE provide you quality work as per your requirement. Ready to start Now. Thanks

$333 AUD en 10 días
(256 comentarios)
6.7
BrothersTeam

Hello Still now doing the project for adding projects to magento store manually. I am able to import your products. Pls look at my profile picture. Kind Regards Kamal

$333 AUD en 7 días
(94 comentarios)
5.9
nazmulcb

Respected Sir, I have done a lot of similar project done successfully in freelancer.com. This is my only income to live with my small family. I take the project seriously and finished it sincerely and provided 100% sat Más

$250 AUD en 10 días
(82 comentarios)
6.2
vineet370

Hi, I've read the entire description. I have sound knowledge of data entry, Google search and Internet Research jobs. I am a professional user of Office (Word, Excel, Power Point) and other programs. I have 6 years of Más

$280 AUD en 10 días
(74 comentarios)
5.8
rsoftsl

If you choose my bid, you won't only get that data but also the program that I used to generate it and its source code, so you'll be able to get their latest data again in the future. 1. What website do you want to Más

$650 AUD en 7 días
(11 comentarios)
4.3
AndrwProjects

Hi caphitickens. I\'m experienced with both scraping and Magento importing. I did many magento importing projects so know what to do even in complex case. I feel that i able to work on this project. Regards.

$500 AUD en 15 días
(8 comentarios)
4.1
vcare77

Dear Hiring Manager, I can do this job efficiently and effectively Have a great day to you, I am interested to do this job and please see my elance profile link [Removed b Más

$250 AUD en 5 días
(37 comentarios)
4.4
aszenwts

I have been doing web scraping all my life. I love it and I have recently finished making a software for one of the clients on freelancer although i have joined it recently. When I take a job I ensure you that it will Más

$250 AUD en 4 días
(4 comentarios)
3.1
Preetisaini2013

Hi, I can do this job efficiently. I have 5 years experience in this field. I am interested to handle this work . I am waiting for your approval and anticipating long term relationship. Hope to have a positive resp Más

$305 AUD en 8 días
(5 comentarios)
0.8
DharmaSoft

Very good experience on VBA macro and positive to the project on time. Did several vba projects with connectivity to websites for data.. Recent project includes web data in xls for stock market real time prices Más

$555 AUD en 10 días
(0 comentarios)
0.0
creekside

Thank you for the opportunity to bid on your project. We have a team of technical specialists with degrees through master's level who can easily deliver for you. Please contact me with any questions and further details Más

$444 AUD en 10 días
(0 comentarios)
0.0
dennisochei

Hi! I'm a Duke University graduate with degrees in Computer Science and Neuroscience. I can have this project done in no time! Thanks for your consideration

$250 AUD en 0 días
(0 comentarios)
0.0
paulclarke3

I'm based in the UK. I've worked on Magento stores for years now. All that stuff you mention is easy. All you need is to scrape their xml sitemap and then just concatenate the fields so you can upload it to Ma Más

$333 AUD en 5 días
(0 comentarios)
0.0