The following are the requirements for a crawler program. On completion, deliver the EXE program, the source code, and the generated ACCESS data to us.
1. Crawl all deal data on the groupon sites and groupon navigation sites (33 sites in total).
2. The program must distinguish "Today deal" from "history deal" information, and must cover all deals in all districts.
3. After the first crawl of a groupon site, each time I run the program it should crawl only the new, updated, and latest deal info. The program must also provide a function to re-crawl all data.
4. The crawled info includes 20 metadata fields (20 columns).
5. The information from each groupon site goes into its own sheet, with the data written to an ACCESS file.
6. Log every failed crawl case.
7. The program can be modified up to 5 times to crawl other sites and data.
8. I can run the program at any time to update the data; each run generates the latest groupon deal information.
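
To make requirements 3 and 6 concrete, here is a minimal Python sketch of one way the incremental crawl, the full re-crawl, and the failure log could fit together. It is only an illustration under assumptions: the names open_store, crawl_site, fingerprint, and fetch_deals are hypothetical, each deal is assumed to be a dict with a string "id" field, and SQLite stands in for the ACCESS file so the example runs on its own.

```python
import hashlib
import logging
import sqlite3

logging.basicConfig(level=logging.INFO)
log = logging.getLogger("crawler")


def fingerprint(deal):
    """Hash all fields of a deal so a changed deal can be detected."""
    payload = repr(sorted(deal.items())).encode("utf-8")
    return hashlib.sha256(payload).hexdigest()


def open_store(path=":memory:"):
    """Open the store of already-crawled deals (SQLite as a stand-in for ACCESS)."""
    conn = sqlite3.connect(path)
    conn.execute(
        "CREATE TABLE IF NOT EXISTS deals ("
        "site TEXT, deal_id TEXT, digest TEXT, "
        "PRIMARY KEY (site, deal_id))"
    )
    return conn


def crawl_site(conn, site, fetch_deals, full=False):
    """Crawl one site and return counts of new, updated, unchanged, and failed.

    fetch_deals is a hypothetical callable that returns the deals of a site.
    With full=True, the stored state for the site is discarded first, so
    every deal is treated as new (the "re-crawl all data" function).
    """
    counts = {"new": 0, "updated": 0, "unchanged": 0, "failed": 0}
    try:
        deals = list(fetch_deals(site))
    except Exception:
        # Requirement 6: record the failed crawl case, then move on.
        log.exception("crawl failed for site %s", site)
        counts["failed"] += 1
        return counts
    if full:
        conn.execute("DELETE FROM deals WHERE site = ?", (site,))
    for deal in deals:
        digest = fingerprint(deal)
        row = conn.execute(
            "SELECT digest FROM deals WHERE site = ? AND deal_id = ?",
            (site, deal["id"]),
        ).fetchone()
        if row is None:
            counts["new"] += 1
        elif row[0] != digest:
            counts["updated"] += 1
        else:
            counts["unchanged"] += 1
            continue  # nothing changed, so nothing to write
        conn.execute(
            "INSERT OR REPLACE INTO deals VALUES (?, ?, ?)",
            (site, deal["id"], digest),
        )
    conn.commit()
    return counts
```

In this sketch a normal run calls crawl_site(conn, site, fetch_deals) for each of the 33 sites, and the re-crawl function is the same call with full=True. Storing a digest per deal is one possible design choice: it lets an updated deal be told apart from an unchanged one without comparing all 20 columns individually.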