Create a scraper / script / crawler to extract product data from an online shop - go through all products - export in csv or excel
$30-250 USD
Cancelled
Posted about 9 years ago
Paid on delivery
Dear freelancers,
We need an efficient web scraper that we can run on one of our own servers. WE ARE LOOKING FOR AN EXPERIENCED DEVELOPER - the work must be flawless!
The following should be done:
The website to scrape/crawl is: [login to view URL]
--> It is an online shop with almost 80k products. The scraper should start with these main top-level categories:
[login to view URL]
[login to view URL]
[login to view URL]
[login to view URL]
[login to view URL]
[login to view URL]
[login to view URL]
It should then scrape EVERY SINGLE product within these categories (as noted, around 80,000 items).
The following information should be exported for EACH product - please use this URL to understand the items explained below: [login to view URL]
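To illustrate the crawl described above, here is a minimal sketch in Python. It assumes that product pages can be reached by following links from the top-level category pages, and that product and listing URLs can be told apart by their pattern. The names `collect_product_urls`, `fetch`, `is_product` and `is_listing` are hypothetical and would have to be adapted to the real shop; `fetch` is any function that returns the HTML of a URL.

```python
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urljoin


class LinkParser(HTMLParser):
    """Collects the href of every <a> tag on a page."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href")
            if href:
                self.links.append(href)


def collect_product_urls(start_urls, fetch, is_product, is_listing):
    """Breadth-first walk from the category pages.

    fetch(url) returns the page HTML; is_product(url) and is_listing(url)
    are assumed, shop-specific URL tests. Returns product URLs in the
    order they were discovered, each exactly once.
    """
    queue = deque(start_urls)
    seen = set(start_urls)
    products = []
    while queue:
        page_url = queue.popleft()
        parser = LinkParser()
        parser.feed(fetch(page_url))
        for href in parser.links:
            url = urljoin(page_url, href)  # resolve relative links
            if url in seen:
                continue
            seen.add(url)
            if is_product(url):
                products.append(url)
            elif is_listing(url):
                queue.append(url)  # sub-category or next results page
    return products
```

The `seen` set is what keeps a product that is listed in several categories from being scraped more than once.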
1) URL (e.g. "[login to view URL]")
2) Breadcrumbs (e.g. "Início > Masculino > Esporte Masculino > Calçados > Tenis")
3) Brand Name - located above product name (e.g. "Puma")
4) Product Name (e.g. "Tênis Puma Axis 2 Branco")
5) Image URLs --> ALL Images in product page --> USE default resolution (not zoom image) of ~275px × 400px (e.g. "[login to view URL] ; [login to view URL] ........ etc etc")
6) Current Price (e.g. "99,90")
7) Old Price - if applicable (e.g. "199,90")
8) Installments (payable rates) - if applicable (e.g. "5 x 19,98")
9) Available sizes: (e.g. "38, 39, 40, 41, 42, 43")
10) ALL Available Data in the tab "Detalhes do produto" --> Data here is:
--> A) a short text description AND
--> B) a list with multiple different entries (NOTE: products do not always have all these entries --> compare [login to view URL] versus [login to view URL]):
--> List items could be:
- Description (plain text above actual list)
- SKU (e.g. "RA870APM16PQL")
- Modelo (e.g. "POLO RALPH LAUREN 89460PRL")
- Material (e.g. "Algodão")
- Composição (e.g. "100% Algodão")
- Cor (e.g. "Preto")
- Lavagem (e.g. "Lavar a mão")
- Medidas (e.g. "Ombro: 17cm/ Manga: 23cm/ Tórax: 116cm/ Comprimento: 76cm")
- Categoria (e.g. "Premium Masculino > Roupas > Pólos > Pólo Manga Curta")
--> That is all data we need for EACH product
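As an illustration of the export, here is a minimal Python sketch that writes one CSV row per product with one column per item listed above. The column names in `FIELDS` and the helpers `to_row` and `write_csv` are hypothetical, as is the assumption that each scraped product arrives as a dictionary. Entries a product does not have are left as empty cells, and multiple values are joined the same way as in the examples above (" ; " for image URLs, ", " for sizes).

```python
import csv
import io

# Column order follows the numbered list above; the header names are assumptions.
FIELDS = [
    "url", "breadcrumbs", "brand", "name", "image_urls", "price",
    "old_price", "installments", "sizes", "description",
    "SKU", "Modelo", "Material", "Composição", "Cor", "Lavagem",
    "Medidas", "Categoria",
]


def to_row(product):
    """Map a scraped product dict onto the fixed column set."""
    row = {field: "" for field in FIELDS}  # missing entries stay blank
    for key, value in product.items():
        if key not in row:
            continue  # ignore anything outside the agreed columns
        if isinstance(value, (list, tuple)):
            separator = " ; " if key == "image_urls" else ", "
            value = separator.join(value)
        row[key] = "" if value is None else str(value)
    return row


def write_csv(products, stream):
    """Write a header line plus one row per product to an open text stream."""
    writer = csv.DictWriter(stream, fieldnames=FIELDS)
    writer.writeheader()
    for product in products:
        writer.writerow(to_row(product))


if __name__ == "__main__":
    demo = [{
        "brand": "Puma",
        "name": "Tênis Puma Axis 2 Branco",
        "price": "99,90",
        "old_price": "199,90",
        "sizes": ["38", "39", "40"],
    }]
    buffer = io.StringIO()
    write_csv(demo, buffer)
    print(buffer.getvalue())
```

When writing to a file rather than a buffer, opening it with `open(path, "w", newline="", encoding="utf-8-sig")` adds a byte order mark, which helps Excel display accented characters such as "Tênis" correctly. Prices like "99,90" contain a comma, so the `csv` module quotes them automatically.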
***NOTE*** --> We will need to run the script MULTIPLE times per week, so the script MUST be efficient and FAST. The data should be extracted and then saved on the server (in CSV or any other Excel-importable format). It must be possible to run the script on OUR server.
***NOTE*** --> We are looking for a long-term developer - we will not need just ONE script, BUT similar scripts for 10 different online shops. So we are looking for somebody who can then develop the other scripts as well.
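One common way to keep a run over roughly 80,000 pages fast is to fetch several pages in parallel. The following is a minimal Python sketch of that idea, not a requirement of the job: `scrape_all` and `scrape_one` are hypothetical names, and the worker count of 8 is an arbitrary assumption that would need tuning against what the shop's server tolerates.

```python
from concurrent.futures import ThreadPoolExecutor, as_completed


def scrape_all(urls, scrape_one, max_workers=8):
    """Run scrape_one(url) over all urls in a thread pool.

    Returns (results, failures): results maps url -> scraped data,
    failures maps url -> the exception raised, so one broken page
    does not abort the whole run and can be retried later.
    """
    results, failures = {}, {}
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        futures = {pool.submit(scrape_one, url): url for url in urls}
        for future in as_completed(futures):
            url = futures[future]
            try:
                results[url] = future.result()
            except Exception as exc:  # record the failure and keep going
                failures[url] = exc
    return results, failures
```

Threads fit here because the work is dominated by waiting on the network; the URLs collected in `failures` can be passed to a second call for a retry.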
Please get in touch if you have any questions.
Thank you very much,
Dan
Hi sir,
I am a scraping expert and have done many similar projects; please check my feedback to see for yourself.
Can you tell me more details? Then I will provide demo data for you.
Thanks,
Kimi
Hello!
I can do this task for you very quickly.
I have wide experience in writing such utilities in PHP/C++/C# (including client-server scripts, web scrapers, parsers for extracting information, grabbers, and so on) for sites with or without scraping protection.
I would like to discuss the details of this project with you and carry it out.
I am almost always online and waiting for your answer.
Thank you.