Data Mining Programmer - Desktop "Test" Application for Aashish
$250-750 USD
Cancelled
Posted over 8 years ago
$250-750 USD
Paid on delivery
Data Mining Program Details
---------------------------
Minimum of three tabs on the tool:
----------------------------------
An Input TAB includes a form to import a list of keywords
A Search Results TAB will display scraped data and be able to Export a CSV of data
A Settings TAB for adding proxies and setting Region and Language
Functions and features of the tool
----------------------------------
1. Google Advanced search with Phrase Match, Region, and Language settings
• all keywords queries (used for analysis) must be searched by Language, Region (country) and in Phrase Match only (keyword in quotes) for results to be accurate, related, and relevant.
• Broad matched results must not be used to determine any comparison operation or used in any analysis.
• Broad search results are only provided for a single purpose: to give a general "Global" search count. Nothing else. Broad match is useless for determining close values.
**Please ensure that the data we use has search volume monthly and competing pages in phrase match. Broad match data cannot be used as they will skew the results.
2. The tool needs to run multiple, parallel, anonymous proxy searches
• searches performed in complete stealth mode using anonymous proxies and browser identity strings to camouflage real identity
• search operations programmed to be non-robotic, human-like in nature: spontaneous, interrupted for random periods / continued whimsically, to reinforce human-like appearance to Google servers
• massively multi-threaded, high-performance, but resource-conscious operation: support for unlimited number of proxies and concurrent search / download threads, concurrent network activity only limited by number of anonymous proxies configured and enabled
• proxies used as separate "personalities, with pre-configured browser identities - each proxy needs to have a browser id set up to be enabled
• built-in list of 100+ browser ids of recent versions of the most popular browsers out there for several Windows, Mac OS X, and Linux versions, including some tablet devices like iPads, iPod Touch, Google Nexus
• detailed logging of every background operation step performed, any error/failure with clear identification, timestamped
• modern, information-packed (WPF-type) user interface: multi-tabbed UI
• Search Results tab displays the response for the number of searches (also called competing pages)... then, data mines the top 10 results in natural search for: Title, page URL, and the description Google has assembled for search results.
• options to export data in results grid to CSV file, or to copy to Clipboard
• program settings remembered between sessions
• entering new proxies is a simple copy-paste operation - accounts provided in the expected format are parsed automatically
• proxy account format: proxy_IP:proxy_port:login_id:password:........any_text_here......
• when checking proxies, the program contacts [login to view URL] via each proxy and checks whether the proxy is
• "anonymous", and in what country it is located - this information is printed in the proxy list
For Additional details, please see the attached text documents.
Please ask any questions or state your concerns, as you have them, to help with your success.
Thank you!