Return-Path: X-Original-To: apmail-hbase-user-archive@www.apache.org Delivered-To: apmail-hbase-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 2171A11A55 for ; Tue, 20 May 2014 16:47:43 +0000 (UTC) Received: (qmail 49354 invoked by uid 500); 20 May 2014 16:47:41 -0000 Delivered-To: apmail-hbase-user-archive@hbase.apache.org Received: (qmail 49285 invoked by uid 500); 20 May 2014 16:47:41 -0000 Mailing-List: contact user-help@hbase.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hbase.apache.org Delivered-To: mailing list user@hbase.apache.org Received: (qmail 49277 invoked by uid 99); 20 May 2014 16:47:41 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 20 May 2014 16:47:41 +0000 X-ASF-Spam-Status: No, hits=2.8 required=5.0 tests=HTML_MESSAGE,NORMAL_HTTP_TO_IP,RCVD_IN_DNSWL_LOW,SPF_PASS,URI_HEX,WEIRD_PORT X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of yuzhihong@gmail.com designates 209.85.160.174 as permitted sender) Received: from [209.85.160.174] (HELO mail-yk0-f174.google.com) (209.85.160.174) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 20 May 2014 16:47:38 +0000 Received: by mail-yk0-f174.google.com with SMTP id 9so567274ykp.19 for ; Tue, 20 May 2014 09:47:14 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type; bh=EwEfWQC25j4Q5uZsOhi3OjJaEgKwdQn/Kz59ayUbQAs=; b=LfXX2hXqYCMFlDc4zJKtBthzaPZs1UlDwWaWx/uf6NrGxY3KYdZM1+askkBZBbyBCi DYv/s/SLzQeprB08pGOnKBW+zaQPBGrpBct2w9ynG6tMpSqoje2WS5WDem1+SgFq3eqx MmeIK3s+FV6a7ygFO7BJ4oxwW0S8rtz7bDVsio21EjlMT+e+zmncnoD6WEQi3GjH6mDR ab1GlOYYD0Mc7rzzYZjyz8G90z4A5kgnLDCk2Nm+uDopsHKw8ck+hTQLRCP5pMqr5Dpw EQupO96vDvPWmwzL3nUlIPW4tM7Zs+1bu/FiRSVWqX3Op7MbCw6+U9RiFioBDiGdWFSV NPXA== MIME-Version: 1.0 X-Received: by 10.236.14.2 with SMTP id c2mr63848195yhc.73.1400604434622; Tue, 20 May 2014 09:47:14 -0700 (PDT) Received: by 10.170.37.144 with HTTP; Tue, 20 May 2014 09:47:14 -0700 (PDT) In-Reply-To: <1400595028508-4059419.post@n3.nabble.com> References: <1400595028508-4059419.post@n3.nabble.com> Date: Tue, 20 May 2014 09:47:14 -0700 Message-ID: Subject: Re: DataXceiver java.io.InterruptedIOException error on scannning Hbase table From: Ted Yu To: "user@hbase.apache.org" Content-Type: multipart/alternative; boundary=089e0139fc84cbee6104f9d7a169 X-Virus-Checked: Checked by ClamAV on apache.org --089e0139fc84cbee6104f9d7a169 Content-Type: text/plain; charset=UTF-8 Looks like you're using hadoop-1.1.1 Have you looked at Data node log ? Would be helpful if you pastebin the portion of Data node log when it shutdown. Cheers On Tue, May 20, 2014 at 7:10 AM, AnushaGuntaka wrote: > Hi , > > Thanks in advance. Please help me out in figuring cause of the follwing > error and fixing it. > > Am facing the below error while scanning a hbase table with partial RowKey > filter through MapReduce program. > > Error: org.apache.hadoop.horg.apache.hadoop.hdfs.server.datanode.DataNode: > DatanodeRegistration():DataXceiver java.io.InterruptedIOException: > Interruped while waiting for IO on channel > java.nio.channels.SocketChannel[closed] > > Data node on the slave node is getting shutdown on this error. > > My Map reduce program is running maptsks till 95% and then failing with > this > error. > > I have a hadoop cluster with two mechines , > > Table Size : 652 GB (223 GB in master Node and 514GB in slave node) > > System Disc details: > > Node space available > --------------------------------- > master ---- 22 GB > slave ---- 210 GB > > ------------------------------- core-site.xml ----------------------- > > > > fs.tmp.dir > /home/e521596/hadoop-1.1.1/full > > > > fs.default.name > hdfs://172.20.193.234:9000 > > > > io.sort.factor > 15 > More streams merged at once while sorting > files. > > > > io.sort.mb > 1000 > Higher memory-limit while sorting data. > > > > io.sort.record.percent > 0.207 > Higher memory-limit while sorting data. > > > > io.sort.spill.percent > 1 > Higher memory-limit while sorting data. > > > > ------------------------------- mapred-site.xml ----------------------- > > > > mapred.job.tracker > fedora3:9001 > > > mapred.reduce.tasks > 6 > > > mapred.tasktracker.map.tasks.maximum > 6 > > > mapred.tasktracker.reduce.tasks.maximum > 6 > > > mapred.textoutputformat.separator > # > > > > mapred.compress.map.output > true > > > > mapred.child.java.opts > -Xms1024M -Xmx2048M > > > > > ---------------------------------------- hdfs-site.xml-------------------- > > > > dfs.name.dir > /home/e521596/hadoop-1.1.1/full/dfs/name > > > dfs.data.dir > /home/e521596/hadoop-1.1.1/full/dfs/data > > > dfs.replication > 1 > > > dfs.datanode.max.xcievers > 5096 > > > > dfs.datanode.handler.count > 200 > > > dfs.datanode.socket.write.timeout > 0 > > > > > --------------------------------------------------------------------- > > > > > > > -- > View this message in context: > http://apache-hbase.679495.n3.nabble.com/DataXceiver-java-io-InterruptedIOException-error-on-scannning-Hbase-table-tp4059419.html > Sent from the HBase User mailing list archive at Nabble.com. > --089e0139fc84cbee6104f9d7a169--