hadoop-mapreduce-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Namikaze Minato <lloydsen...@gmail.com>
Subject Re: What is the best way to locate the offset and length of all fields in a Hadoop sequential text file?
Date Fri, 22 Jan 2016 09:58:37 GMT
Hello. We don't have any information about your data.

I don't think we can help you with this. Also, I cannot understand what you
are trying to achieve. Please also tell us why you are using hadoop
streaming instead of hive to do your operations.


On 22 January 2016 at 06:30, Rex X <dnsring@gmail.com> wrote:

> The given sequential files correspond to an external Hive table.
> They are stored in
> /tableName/part-00000
> /tableName/part-00001
> ...
> There are about 2000 attributes in the table. Now I want to process the
> data using Hadoop streaming and mapReduce. The first step is to find the
> offset and length for each attribute.
> What is the best way to get this information?

View raw message