Find Jobs
Hire Freelancers

Data extraction

$30-100 USD

Cancelled
Posted almost 12 years ago

$30-100 USD

Paid on delivery
I need a Lucene or SOLR-based implementation that does the following: 1) A web-based UI to fetch a web document or a PDF from a defined source (eg. a URL) 2) Allows the user to define what to look for (i.e. keywords, phrases) 3) Scan the document for specific information, eg. <Field A>, <Field B>, etc. 4) Extracts the information and outputs it to a CSV file I have attached a sample PDF file. The script would: a) Go to the Apple website (<[login to view URL]>). It would click on each link: - [[login to view URL]][1] - <[login to view URL]> etc. etc. Each link has a downloadable PDF document, from which I would like to extract info. b) It would allow me to define what to look for. For Apple, I want to extract: - iPhone and Related Products and Services - Units, Revenue for the various periods (Q2 2012, Q32011 and Q32012) - iPad and Related Products and Services - Units, Revenue for the various periods (Q2 2012, Q32011 and Q32012) c) The output of the search will be a CSV file
Project ID: 2764342

About the project

4 proposals
Remote project
Active 12 yrs ago

Looking to make some money?

Benefits of bidding on Freelancer

Set your budget and timeframe
Get paid for your work
Outline your proposal
It's free to sign up and bid on jobs
4 freelancers are bidding on average $68 USD for this job
User Avatar
See private message.
$60.35 USD in 14 days
5.0 (14 reviews)
3.3
3.3
User Avatar
See private message.
$72.25 USD in 14 days
5.0 (8 reviews)
3.3
3.3
User Avatar
See private message.
$70.55 USD in 14 days
0.0 (0 reviews)
0.0
0.0
User Avatar
See private message.
$70.55 USD in 14 days
0.0 (0 reviews)
0.0
0.0

About the client

Flag of UNITED STATES
United States
5.0
43
Member since Aug 3, 2005

Client Verification

Thanks! We’ve emailed you a link to claim your free credit.
Something went wrong while sending your email. Please try again.
Registered Users Total Jobs Posted
Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)
Loading preview
Permission granted for Geolocation.
Your login session has expired and you have been logged out. Please log in again.