Need a scraper for Google Scholar.
Need a bug-free, clean, well-commented script. Functions should be written simply. Focus should be on readability than showing off your programming skills.
Practical Example of what I want:
Script 1:
1. Input: Given URL to Google Scholar Page for Einstein's paper: "Can quantum-mechanical description of physical reality be considered complete?"
2. What the script does:
a. Goes to 'Cited By..'
b. Output 1: Downloads 100 (user specified number) publicly available papers (pdfs only for now) that cite the paper. Put them in the same directory (again user specified).
c. Output 2: Creates a small csv that tracks basic characteristics of each of the downloaded paper - title, url, whatever else Google scholar presents - so author names, journal etc.
Script 2:
a. Iterates through the pdfs folder.
b. Based on regex, gets the text and puts it in the same csv. (If multiple regex are matched, everything is concatenated with a line space).
Hello,
I'm a novice freelancer with great experience in the development, I want to make the most quickly and efficiently.
Any question welcome!
Best regards,
Vasiliy
(as a research assistant in a university lab) I'm familiar with web scraping and have used it so much for my work currently. I guarantee to deliver high quality work for you.
Currently i am working in python in the same field. Did a lot of work in web scraping . Would like to take the opportunity to do it as my first freelancer project.