resume parser

Cancelled Posted Dec 20, 2011 Paid on delivery
Cancelled Paid on delivery

Resume Parser or Parsing

We need a resume parsing application. It needs to run on linux and we would prefer in written in perl or php. If you want to use something else please let us know.

This parser will be used to parse millions of existing resumes in html, word, rtf, text and pdf formats. Most of our resumes are in unstructured html but we have thousands in word, rtf, text and pdf as well. We also have many in the body of emails so if we can parse those as well it is an added bonus but this is not a firm requirement.

The parser needs to be able to extract the following data from the resumes:

------

1. candidate first name

2. candidate last name

3. candidate address

4. candidate city

5. candidate state

6. candidate zip code

7. candidate country

8. candidate email address

9. resume job category (accounting or sales or legal or insurance, or etc.) - we will supply a list of possible categories. It is possible that a resume may fit more than one category so the parser should make a best guess on the correct category

10. resume title

11. candidate career objective

12. years of professional experience

13. employment history

14. education history

15. licenses and certifications

16. military history

17. foreign languages

18. security clearances

19. references

20. skills keywords

21. complete resume in text format. Parser needs to remove all html tags and non-resume information (such as headers, footers, side bars, etc.) in an intelligent way to produce a clean and readable resume in text format.

------

Output of the parser should be an xml tagged file, one xml file for each parsed resume, output file name to be the same as the input file name with extension changing from [login to view URL] to [login to view URL] or [login to view URL] to [login to view URL], etc.

All of the parsed fields will be used to upload into a mysql database. Parser may be asked to do the database insertion as part of the parsing process.

We will supply a sample set of resumes, as many as you need to be successful.

Resumes are unstructured so formats and content vary widely. The ability to score the parsing performance would be beneficial. It would be helpful to be able to look at a parsing report that indicates which resumes the parser thinks it did poorly on so we can manually revisit those parsed resumes that have the highest probabilty of having parsing errors.

Parsing will be done in a batch on all our resumes (millions) and will also need to be able to parse resumes that are added to our system every day. So we would need to be able to integrate the parser with our existing perl and php website applications.

Passing acceptance testing with several thousand resumes will be required at project completion.

Thanks!

P.S. Our budget is somewhat flexible so please submit a bid even if it exceeds the posted budget. We are looking for the best most robust solution possible. Thanks.

HI,

Some samples are attached in the zip file. Please PM for the password. Please keep in mind that 99+% of our resumes are in html format. The ability to accurately parse html resumes is mission critical, being able to parse in other formats (word, pdf, rtf, text, email, etc.) is a nice to have but not a requirement for success.
Thank you.

HI,

Some samples are attached in the zip file. Please PM for the password. Please keep in mind that 99+% of our resumes are in html format. The ability to accurately parse html resumes is mission critical, being able to parse in other formats (word, pdf, rtf, text, email, etc.) is a nice to have but not a requirement for success.
Thank you.

Data Mining MySQL Perl PHP

Project ID: #1350154

About the project

39 proposals Remote project Active Feb 7, 2012

39 freelancers are bidding on average $1425 for this job

gangabass

I can do this for you. See PM for details.

$1500 USD in 15 days
(739 Reviews)
8.1
sainathkohta

We have gone through your requirements. Please check pmb.

$1750 USD in 20 days
(68 Reviews)
7.8
nazmulbh

Hi Sir, I've read and understood the requirements perfectly. I wont say that its an easy task. But definitely possible. It'll be very helpfull if u provide some sample resumes thus I can study on them. Thanks. Na More

$1500 USD in 40 days
(8 Reviews)
5.6
phpplay

Please see inbox. Thanks

$1450 USD in 10 days
(20 Reviews)
5.9
reco233

Let an expert help you with this, please send me the password to my PMB.

$1500 USD in 1 day
(23 Reviews)
5.5
MiguelLam

Hi I can help you with your project. Kind regards.

$800 USD in 8 days
(22 Reviews)
5.5
obodozue

Perl expert in parsing all kinds of unstructured data. Have used perl since 1999.

$780 USD in 5 days
(3 Reviews)
5.4
websoft2009

Hi, I am looking forward to work for you.

$1500 USD in 30 days
(19 Reviews)
4.7
anjeko

Hello, Professional developer with similar expertise (PHP/mySQL) in Australia. I am posting my bid as an expression of interest and appreciate further discussion in private message board. I am waiting for your m More

$1600 USD in 20 days
(3 Reviews)
4.5
sunsriinfosys

Hi, I have over 13 years of Experience in software design, development and implementation of various commercial applications in Client/Server environment, Web and ERP applications using C# 1.1/2.0/3.5, ASP.Net, VB More

$1000 USD in 15 days
(19 Reviews)
4.3
tsendee

It's a whole text mining system. I'm a text/data mining and machine learning researcher. I can develop a scalable text mining system for you.

$3000 USD in 30 days
(3 Reviews)
3.7
amsak

Placing bid

$1400 USD in 30 days
(10 Reviews)
3.8
virajds

Please refer my Profile for more info.

$1000 USD in 10 days
(1 Review)
2.5
vasundhar

I have very good experience in this area I did work on email data extraction

$3200 USD in 30 days
(1 Review)
2.0
gogo1

Hello, please check your PM.

$1500 USD in 15 days
(7 Reviews)
2.1
arunragini

Please see PMB

$1700 USD in 17 days
(3 Reviews)
2.2
cristiansece

I should be able to deliver complete working code even if i don't have any rating to show for it ...

$1000 USD in 14 days
(1 Review)
1.4
WITJerry

Dear Sir, We are having a team of technologies expert working in different technologies like php,Joomla, Smarty,.net, C with our company. Kindly check your PMB for more details.

$1500 USD in 30 days
(0 Reviews)
0.0
lenmartin

Hello Sir, We can confidentially complete the project.. Please check PMB for listing.. Warm Regards

$1000 USD in 10 days
(0 Reviews)
0.0
bluepill

Hello. I'm a perl expert coder. I have very good experience in many perl aplications. I have already developed a parser in perl.

$1200 USD in 7 days
(0 Reviews)
0.0