building profiles by extracting a feature vector from text

In Progress Posted 7 years ago Paid on delivery
In Progress Paid on delivery

Hi everyone,

My project revolves around extracting a feature vector from text and later doing some operations on this vector

I have to extract the following data:

1). Character occurrence

Given a set of characters C and an text M, we define the character occurrence of C in M as the number of times that any of the characters in C occur in M, divided by the length of M.

2). functional word occurrence

Given a word Wr and a set of words W in a text,

we calculate the word occurrence X in W as the number of times Wr occurs in the text, divided by the size of W.

3). special words occurrence

Given a regular expression Rsw representing the special word, a text M, and a set Wm containing the words in M, we calculate the special word occurrence X of Rsw as the number of matches in M for Rsw, divided by the size of Wm

4). Generic style characteristics

5) Style metrics

there are additional features that we require that are unique to our data type, for example:

has HTML?

indented lines?

has signature?

Time characteristics

the data categories will be explained to serious bidders in much more detail

please bid only if you have experience with complex feature extraction from text, future work might be proposed - dependable on the success level of this project

C Programming C++ Programming Data Mining Machine Learning (ML) Regular Expressions

Project ID: #11050721

About the project

4 proposals Remote project Active 7 years ago

Awarded to:

$444 USD in 7 days
(0 Reviews)
0.0

4 freelancers are bidding on average $582 for this job

wesoft21

i have gone through your requirement we done similar kind of job before looking forward your earliest Reply on this for a project discussion Awaiting for your earliest reply

$583 USD in 10 days
(0 Reviews)
0.0
balacevw

A proposal has not yet been provided

$500 USD in 15 days
(0 Reviews)
0.0