If your query (scan) needs a region on the failed region server, the client
will fail and silently retry about 10 times. The sleep time increase as
each retry fails and can reach 10min. On the server side, the master takes
3min to realize the RS failed thus issue a region move, which might take a
few minutes, which together explain the 10min query failure period
On Monday, October 28, 2013, Sandeep L wrote:
> Hi,
> We are using HBase 0.94.1.
> We are facing a strange issue form long time not at got any proper
> solution.
> Here is the issue:Whenever one region server in cluster goes down, whole
> HBase wont respond for some time (around 5 to 10 minutes).We are unable to
> get any clue about the issue.
>
> Can some help us to resolve this issue.
>
> Thanks,Sandeep.
|