Database management / entry / macro / script/ application csv files

Completed Posted Mar 28, 2012 Paid on delivery
Completed Paid on delivery

see advanced summary

## Deliverables

Hello,

I have 55 CSV files, with millions of entries in them, one of the fields is a reference.

I want the 55 CSV files, all split up into different csv files, categorized by the "start" bit of the reference field.

I want an experienced programmer to use code/application/macro to do this.

There is 2849 different possible "start" parts of the code. I will send you all of the 2849 possible reference "start bits"

So when the work is completed I will have 2849 csv files named after the individual reference starts, containing entries which have a reference starting with whatever that file is called. So for example file DL15 will contain all of the entries which has a reference starting DL15.

(I want the 2849 files delivered in 3 different formats.)

There is 5 types of reference:

2 letters and 2 numbers at the start always followed by a number e.g. DL15 7QZ

2 letters and 1 number at the start always followed by a number e.g. NE3 4RR

1 Letter and 1 number at the start always followed by a number e.g. B1 7DE

1 Letter and 2 numbers at the start always followed by a number e.g. B15 9FX

2 Letters and 1 Number followed by a Letter, then always followed by a number e.g. WC1A 1DR

I think the program/code/application will have to function like this:

1) Pick reference start from 2849 options.

2) Search all 55 CSV files for entries which have that reference start.

3) Put all data together in one place/temporary CSV file.

4) Remove duplicates. ( I then want a copy of the file at this stage saved)

5) Then I want to remove some of the fields and reformat one of the fields(ill go into more detail on this when job accepted). It will leave one field and reference.

6) Remove duplicates again at this point ( I want a copy of the file at this stage)

7) Remove one of the fields leaving only 1 field ( I want a copy of the file at this final stage)

You should know exactly how to do this and process millions of entries - I do not want to have to provide any help or advice with this.

----------

I nearly forgot to add - there is 122 different reference starts when categorized by ONLY THE LETTERS - so i also want delivering 122 csv files which combines all of the csv files into each reference start by only the letter. e.g. all of the entries beginning with B or DL.

Project ID: #2726322

About the project

17 proposals Remote project Active Mar 31, 2012

Awarded to:

mcm69sl

See private message.

$60.35 USD in 2 days
(16 Reviews)
3.1

17 freelancers are bidding on average $72 for this job

Lotusdeveloper

See private message.

$68 USD in 2 days
(97 Reviews)
2.6
dpune

See private message.

$85 USD in 2 days
(330 Reviews)
2.0
sergeybarseghyan

See private message.

$70.55 USD in 2 days
(27 Reviews)
2.1
shaarawi

See private message.

$85 USD in 2 days
(43 Reviews)
1.8
ctgr

See private message.

$85 USD in 2 days
(16 Reviews)
0.7
misebal

See private message.

$45.05 USD in 2 days
(1 Review)
0.0
szuge

See private message.

$70 USD in 2 days
(8 Reviews)
0.0
raj1242

See private message.

$70 USD in 2 days
(0 Reviews)
0.0
asheze

See private message.

$75 USD in 2 days
(1 Review)
0.0
evensteven01

See private message.

$65.45 USD in 2 days
(0 Reviews)
0.0
idealprogrammer

See private message.

$80.75 USD in 2 days
(10 Reviews)
0.0
razib

See private message.

$85 USD in 2 days
(1 Review)
0.0
hoesoftware

See private message.

$85 USD in 2 days
(102 Reviews)
0.0
dcarch

See private message.

$50.15 USD in 2 days
(3 Reviews)
0.0
vrsoftza

See private message.

$70.55 USD in 2 days
(0 Reviews)
0.0
solworks

See private message.

$70 USD in 2 days
(3 Reviews)
0.0