hadoop-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Joshi, Rekha" <Rekha_Jo...@intuit.com>
Subject Re: Hadoop HDFS and Mapreducer question
Date Tue, 18 Sep 2012 05:46:07 GMT
Refer hadoop put, get syntax for placing input files on hdfs (automate script) and pig dump,
store after mapreduce to have your output directory - http://pig.apache.org/docs/r0.9.2/start.html#Pig+Tutorial+Files

Thanks
Rekha

From: A Geek <dw.90@live.com<mailto:dw.90@live.com>>
Reply-To: <user@hadoop.apache.org<mailto:user@hadoop.apache.org>>
Date: Tue, 18 Sep 2012 05:04:05 +0000
To: <user@hadoop.apache.org<mailto:user@hadoop.apache.org>>
Subject: Hadoop HDFS and Mapreducer question

Hello All,
I'm learning hadoop, hdfs etc and currently tying to solve one issue. Can someone help me
how to start attacking the following problem:

I'm trying to come up with some sample code  to store the files  \YEAR\Month\Date\account
structure using Hadoop technique s.

Example: The file will submit to the program as below
Test_20120917_ACC1.csv and Test_20120916_ACC2.csv

HDFS has to create structure as below

HDFS_HOME\2012\09\17\ACC1\Test_20120917_ACC1.csv
HDFS_HOME\2012\09\16\ACC1\Test_20120916_ACC2.csv

Can someone give me pointers on how to start on this. Highly Appreciated. Thanks for reading
the question.

Thanks,
DW

Mime
View raw message