Web scraping / captcha solving / output csv report

In Progress Posted Feb 19, 2014 Paid on delivery
In Progress Paid on delivery

** PLEASE READ BEFORE BIDDING **

This project is to build a standalone piece of software which will scrape a particular website for particular data, and save the results to a .csv file. I will specify exactly how the site should be scraped, and how the results should be saved.

To do the web scraping, the software will need to answer captchas. I don't believe these are particularly difficult on the site I'm interested in, and I can provide you with samples if necessary. I am open to various methods for solving them -- pure software preferred, but human solver networks are also possible if the cost is low enough. ***In your bid, please tell me specifically how you plan to answer the captchas.**

Features of the needed program:

- Performs an initial scrape to populate search options, which the user then decides among.

- Performs a "full" scrape based on the chosen options (there are about 10 steps to be performed repeatedly, including solving captchas)

- Runs as an application, or from the command line (must be very simple to run on a new machine, and any preparation such as downloading libraries should be executed in the program itself)

- Mac app or Bash command line program preferred, but Windows OK too (let me know which you can do)

- Can run on user's command, and also has scheduling ability (can set dates & times, or pre-specified schedules like daily at a particular time)

- I require the source code of the program (in case I want to make changes/additions in the future in an area outside your expertise)

- It's a plus if the program can detect changes from the last scrape and notify the user of these.

- It's a plus if the program can email the output .csv and the changes (from above) to a specified email address automatically

If you need more information to give an accurate bid, please message me.

Thanks for reading!

Software Architecture Web Scraping

Project ID: #5460536

About the project

12 proposals Remote project Active Feb 26, 2014

Awarded to:

shivamsemwal

Hi, I have 3 years of experience in developing web-automation projects in java, have worked on various scrapping and posting projects for different websites. Can handle javascript heavy, ajax sites. Have utilized More

$210 USD in 7 days
(2 Reviews)
1.6

12 freelancers are bidding on average $434 for this job

mhmhz

Hi I can provide a desktop application in C# that do the scraping. About captcha, we can use service, deathbycaptcha. Thanks

$515 USD in 3 days
(155 Reviews)
7.3
mantislin

Hi sir, I am scraping expert, I have did too many similar projects, please check my feedback then you will know. Can you tell me more details? then I will provide demo data for you. Thanks, Kimi

$444 USD in 6 days
(105 Reviews)
6.6
uumairkhalid

Hi.. Expert Web Scraper & Data Minor here. I have done too many similar project in past. Having best scraping tools and experience i assure you 100% accurate and good quality work. I have too too scraping experience. L More

$473 USD in 3 days
(112 Reviews)
6.5
ghazalpasha

I've done many scraping jobs here and a couple of them required solving a captcha. Please let me know the website you want to scrap and the captcha that needs to be solved.

$444 USD in 10 days
(41 Reviews)
6.0
diamond247

Hello Sir, We are a well built set up with excellent skilled operator with lot of experience in this segment/skill,have complete more than 200 similar job, i have gone through your project description, its really a More

$144 USD in 3 days
(49 Reviews)
6.0
Mezh

Hello, can you provide samples of data and websites to scrape, and samples of captcha (which could be the most difficult thing to do). Thanks, Alex

$300 USD in 7 days
(3 Reviews)
3.0
indilo53

Dear employer, I can do this job the right way. I have a very good understanding on http protocol and I know how to use my tools (Charles debugging proxy, Burp) do dissect a webapp. I use DOM, CSS selctors, XPATH More

$300 USD in 3 days
(0 Reviews)
0.0