File parser to extract information and create report

En curso Publicado Mar 4, 2013 Pagado a la entrega
En curso Pagado a la entrega

Below is the description of the program I want to be implemented.

############### The algorithm ##########################

Initialization phase: Load the attributes from the “[login to view URL]” to the memory.

Repeat every “parsing_interval” seconds

Copy the next “oldest” file (say “[login to view URL]”) from the “inbox_dir” directory to the “work_dir” directory (the oldest file can be determined from the file name; the file name convention is described below)

Unzip the [login to view URL] file (located in the “work_dir” directory). It will result in a text file, say “[login to view URL]”

Parse the “[login to view URL]” as described in the section “Parsing Procedure below” (and add parsing results to the report file).

Delete both “[login to view URL]” and “[login to view URL]” from the “work” directory.

############## The Parsing Procedure #####################

While not EOF do

Read next line from the file “[login to view URL]”

If the value of the “Primary attribute” is equal to ANY of the possible values listed in the “[login to view URL]” then do the following

If the value of of each secondary attribute is equal to ANY of the possible values listed in the "[login to view URL]" then

Get ALL the values associated with the secondary attributes (loaded from the [login to view URL] )

Append the primary attribute name, primary attribute value, and all secondary attribute names and values to the “report_file” (If the report_file is not present, then create a new one)

############## The Config File ##########################

# time interval for parsing the next file in seconds

parsing_interval: 60

# Report lifespan in days

Report_life: 30

# Directory paths

Inbox_dir: C:\Users\jsmith\Documents\inbox_dir

Work_dir: C:\Users\jsmith\Documents\work_dir

Report_dir: C:\Users\jsmith\Documents\report_dir

# Primary Attribute

Attribute2: value2_1, value2_2, value2_3

# Secondary Attribute Lists

Attribute1: Value1_1, Value1_2

Attribute3: Value3_1, Value3_2, Value3_3

############### Important notes ##########################

1) The “gzip” files are very huge (It could be 400MB) so the code should be scalable

2) The code should run on both Windows and Unix/Linus environments.

3) The files in the “inbox_dir” will not be deleted or moved.

4) The files in the “inbox_dir” have the following naming convention: YYYYMMDD-SEQNO_*.gzip.

############ Sample contents of the “[login to view URL]” file ########################

timestamp attribute1:value1 attribute2:value2 attribute3:value3 attribute4:value4 attribute5:value5 attribute6:value6 ..... attributen:valuen

mestamp attribute1:value1 attribute2:value2 attribute4:value4 attribute5:value5 attribute6:value6 ..... attributen:valuen

timestamp attribute1:value1 attribute2:value2 attribute3:value3 attribute4:value4 attribute5:value5 attribute6:value6 ..... attributen:valuen

timestamp attribute1:value1 attribute2:value2 attribute4:value4 attribute5:value5 attribute6:value6 ..... attributen:valuen

timestamp attribute1:value1 attribute2:value2 attribute3:value3 attribute4:value4 attribute5:value5 attribute6:value6 ..... attributen:valuen

timestamp attribute1:value1 attribute2:value2 attribute3:value3 attribute5:value5 attribute6:value6 ..... attributen:valuen

timestamp attribute1:value1 attribute2:value2 attribute3:value3 attribute5:value5 attribute6:value6 ..... attributen:valuen

timestamp attribute1:value1 attribute2:value2 attribute3:value3 attribute4:value4 attribute5:value5 attribute6:value6 ..... attributen:valuen

##########################################################

1) Please let me know if you have any questions. I’ll send you a sample gzip file via private message so that you can test the application on real data.

2) If you respond to this request, please tell me the language you'll use and the approximate timeframe you'll need to do this and the approximate cost and I'll send you more details.

Please let me know if you have any questions.

Programación en C Procesamiento de datos Java Perl Python

Nº del proyecto: #4302168

Sobre el proyecto

34 propuestas Proyecto remoto Activo Mar 8, 2013

Adjudicado a:

ShawarmaMan

Hi, I have over 10 years of experience with Perl and I'd be happy to help you with this. Please see my PM for more details.

$200 USD en 3 días
(0 comentarios)
0.0

34 freelancers están ofertando un promedio de $186 por este trabajo

gangabass

I can do this for you. See PMB for details.

$100 USD en 2 días
(568 comentarios)
7.8
dobreiiita

Hi, I am JAVA expert and can surely help you here, Please share sample gzip file. Thanks

$400 USD en 10 días
(460 comentarios)
7.6
mantislin

Hi sir, please check PM, thx Kimi.

$317 USD en 6 días
(120 comentarios)
7.0
hbxfnzwpf

I am c++ expert, I can handle this, please check you private message.

$125 USD en 2 días
(176 comentarios)
7.1
samitXI

Please check your inbox. Thanks

$315 USD en 7 días
(184 comentarios)
7.3
wbslivera

hello, I can help you. Thanks

$150 USD en 7 días
(261 comentarios)
6.5
idleswell

Hello, I am a Perl scripting expert interested in your project. I will provide details of my bid in a PMB. A IDLER

$189 USD en 6 días
(185 comentarios)
6.3
buzzcoder

Hi I can help you, please check your pm,thanks.

$400 USD en 4 días
(56 comentarios)
6.3
atchayagandhi

Hi, The project is easily realisable with java. We have expertise with the same. Request you to kindly refer your private message board for further details.

$60 USD en 2 días
(46 comentarios)
6.4
fejs

Hi Sir. I'm expert in Python programming. I've done similar project recently, so this task shouldn't be problem. Can You send me zip file to test application? Best regards, Fejs.

$50 USD en 2 días
(139 comentarios)
6.3
shupantha

I have over 10 years of experience in applications development in C++. Please check your PM for my comments on your project.

$250 USD en 7 días
(42 comentarios)
6.2
chinbochen

Hi sir, I'm senior java developer, I'm interested. Thanks.

$350 USD en 20 días
(20 comentarios)
6.0
dipakjadhav08

I have 4 years experience in Java. Also I am Oracle certified for java 6. I can help you.

$100 USD en 1 día
(99 comentarios)
5.5
mjef

I have more than 7 years of professional experience in cross-platform C++ programming. I can help you.

$200 USD en 7 días
(16 comentarios)
5.3
j2eesoafree

Hi I can deliver this ensuring high quality and flexibility using Java. Please let me know any questions, and share a sample file. regards, kD

$400 USD en 20 días
(26 comentarios)
5.5
russianitgroup

Hello, Will be glad to discuss the project. Regards, IT-Group

$170 USD en 5 días
(23 comentarios)
4.6
DenialWang

can do in both java and c#, while c# code can't be used in unix/linux; we may discuss this later. thanks/denial

$100 USD en 3 días
(6 comentarios)
4.3
charleslewisjose

I can help you out with this project.

$250 USD en 7 días
(12 comentarios)
4.2
compositesoft

Hi, I have over 15 years of experience in the software industry and have worked on a wide variety of applications, technologies, databases and languages. I am an expert in java having worked in java for around 10 Más

$350 USD en 10 días
(2 comentarios)
4.0
cahbou

This job could easily be done in Python or Java, but given the size of your dataset I would consider Java a better choice. I would most likely complete the project over night (US time), especially if you were to cut bi Más

$80 USD en 2 días
(3 comentarios)
3.7