hadoop-common-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Venkates .P.B." <venkates...@gmail.com>
Subject Re: Loading data into HDFS
Date Fri, 03 Aug 2007 06:41:30 GMT
Am I missing something very fundamental ? Can someone comment on these
queries ?

Thanks,
Venkates P B

On 8/1/07, Venkates .P.B. <venkates.pb@gmail.com> wrote:
>
>
> Few queries regarding the way data is loaded into HDFS.
>
> -Is it a common practice to load the data into HDFS only through the
> master node ? We are able to copy only around 35 logs (64K each) per minute
> in a 2 slave configuration.
>
> -We are concerned about time it would take to update filenames and block
> maps in the master node when data is loaded from few/all the slave nodes.
> Can anyone let me know how long generally it takes for this update to
> happen.
>
> And one more question, what if the node crashes soon after the data is
> copied into one it. How is data consistency maintained here ?
>
> Thanks in advance,
> Venkates P B
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message