hadoop-common-dev mailing list archives

From Allen Wittenauer <awittena...@linkedin.com>
Subject Re: why DFSClient.BlockReader doesnot support seek(long) can I use skip() instead?
Date Mon, 20 Sep 2010 17:10:13 GMT

On Sep 19, 2010, at 7:57 PM, steven zhuang wrote:

> Hi all,
>       I sent this mail to the common-user list before; I am duplicating
> it here to seek more help from the experts.

You'll likely have more luck on hdfs-dev.

>       I am wondering why seek(long) is disabled in HDFS.BlockReader?
> Can I use skip(long) to replace this seek(long)?
> 
>       I have a bunch of small files, each less than a block in size.

In other words, using Hadoop against recommended best practices. :)


> In
> my program, given the file/block information, I will try to start a
> process on each datanode and try to read from the HDFS directly
> through a socket connection to the datanode.

Again, using Hadoop against best practices.

>      The read requires a seek operation on the file, because the file I
> use is a TFile, which requires the underlying stream to be seekable.
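
On the skip() question, here is a minimal sketch of how a forward-only seek can be emulated with InputStream.skip(). It uses only java.io, not the DFSClient classes. The class name ForwardSeek and the helper seekForward are hypothetical, and the caller is assumed to track the current stream position itself. skip() may skip fewer bytes than requested, so the helper loops, and it cannot move backward, so this is not a full replacement for seek(long).

```java
import java.io.ByteArrayInputStream;
import java.io.EOFException;
import java.io.IOException;
import java.io.InputStream;

// Hypothetical helper class, for illustration only.
final class ForwardSeek {

    // Advances `in` from currentPos to targetPos using skip().
    // Returns the new position. Backward moves are rejected because
    // skip() can only discard bytes that lie ahead in the stream.
    static long seekForward(InputStream in, long currentPos, long targetPos)
            throws IOException {
        if (targetPos < currentPos) {
            throw new IOException("cannot seek backward with skip(): "
                    + targetPos + " < " + currentPos);
        }
        long remaining = targetPos - currentPos;
        while (remaining > 0) {
            long skipped = in.skip(remaining);
            if (skipped <= 0) {
                // skip() may return 0 without being at end of stream,
                // so probe with a single-byte read before giving up.
                if (in.read() < 0) {
                    throw new EOFException("end of stream before position "
                            + targetPos);
                }
                skipped = 1;
            }
            remaining -= skipped;
        }
        return targetPos;
    }

    public static void main(String[] args) throws IOException {
        InputStream in = new ByteArrayInputStream(new byte[] {10, 20, 30, 40, 50});
        long pos = seekForward(in, 0, 3);
        System.out.println(pos);       // prints 3
        System.out.println(in.read()); // prints 40
    }
}
```

A reader that needs to jump backward would have to reopen the stream and skip forward from the start, which is why a skip-based substitute only fits strictly sequential access patterns.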

