Return-Path: X-Original-To: apmail-hadoop-mapreduce-user-archive@minotaur.apache.org Delivered-To: apmail-hadoop-mapreduce-user-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 17AFC108A9 for ; Mon, 26 Aug 2013 22:52:28 +0000 (UTC) Received: (qmail 14125 invoked by uid 500); 26 Aug 2013 22:52:23 -0000 Delivered-To: apmail-hadoop-mapreduce-user-archive@hadoop.apache.org Received: (qmail 13939 invoked by uid 500); 26 Aug 2013 22:52:23 -0000 Mailing-List: contact user-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hadoop.apache.org Delivered-To: mailing list user@hadoop.apache.org Received: (qmail 13927 invoked by uid 99); 26 Aug 2013 22:52:23 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 26 Aug 2013 22:52:23 +0000 X-ASF-Spam-Status: No, hits=-0.7 required=5.0 tests=RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of harsh@cloudera.com designates 209.85.223.174 as permitted sender) Received: from [209.85.223.174] (HELO mail-ie0-f174.google.com) (209.85.223.174) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 26 Aug 2013 22:52:18 +0000 Received: by mail-ie0-f174.google.com with SMTP id k14so5908157iea.5 for ; Mon, 26 Aug 2013 15:51:58 -0700 (PDT) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=20120113; h=x-gm-message-state:mime-version:in-reply-to:references:from:date :message-id:subject:to:content-type; bh=AocMST2tmk07juBs6W6e1bUp3Td7MuM2kIpJXdnZYRg=; b=CpdWtYyqm4Ee6EAUn6oBqcTzBzdYORwiBZnsdzIYJMtp2bUm4WRfsWo4rA5kQGPaB3 LIeeOe+A5GgDmmIR2TbQxTX0xfq1qsTYQdWbTkB2Qo/xAyWP/d9xF81HgWW+aB+XVrAt wtxo3LQomUm7sm1f5lTqMJRqp5BCpFy/3p0rt7Y5Kxp/JrUpNkuQJM/d6zNh1BpPYuYZ aOux9gg3rWNvyz49qaQs8naNe7efcaBQ0pj8LgoHxbvgbEn86TLd6fFr8UzeiuQrcdjx 8JPoxOTnxwmaGYDwFyxIw4KbXE1Lkf6/EJ5SXaL3FLQ8MoUyaGS8CxP7PRjyiEyRndV0 Z1Hg== X-Gm-Message-State: ALoCoQlTHNF/in+TXseoezcusV9lHO4Hv24DKzh9tgsecqd3mY50haJEeSWmVlpIasL8Nc1bWKWS X-Received: by 10.50.120.6 with SMTP id ky6mr8006377igb.58.1377557518429; Mon, 26 Aug 2013 15:51:58 -0700 (PDT) MIME-Version: 1.0 Received: by 10.50.101.202 with HTTP; Mon, 26 Aug 2013 15:51:38 -0700 (PDT) In-Reply-To: <521BD7B9.90801@plutoz.com> References: <521BABF7.7040702@plutoz.com> <521BD7B9.90801@plutoz.com> From: Harsh J Date: Tue, 27 Aug 2013 04:21:38 +0530 Message-ID: Subject: Re: 1.2.1 ouput, what is bufvoid To: "" Content-Type: text/plain; charset=ISO-8859-1 X-Virus-Checked: Checked by ClamAV on apache.org The length printed there is the # of elements in an internal array, not a data size length. If you wish to raise that you can raise up io.sort.spill.percent (default 0.80, or 80%). On Tue, Aug 27, 2013 at 4:03 AM, kaveh minooie wrote: > Thaks Harsh, but my io.sort.mb is 320 while, as you can see, length is > only 1mb (1048576) > > > On 08/26/2013 03:14 PM, Harsh J wrote: >> >> You need to raise io.sort.mb to increase the sort buffer capacity. >> >> On Tue, Aug 27, 2013 at 12:56 AM, kaveh minooie wrote: >>> >>> I am getting these in job output in the 1.2.1 >>> >>> Spilling map output: record full = true >>> 2013-08-26 11:40:33,413 INFO org.apache.hadoop.mapred.MapTask: bufstart = >>> 111149676; bufend = 122264640; bufvoid = 318767104 >>> 2013-08-26 11:40:33,413 INFO org.apache.hadoop.mapred.MapTask: kvstart = >>> 1048561; kvend = 838844; length = 1048576 >>> 2013-08-26 11:40:35,879 INFO org.apache.hadoop.mapred.MapTask: Finished >>> spill 10 >>> >>> >>> my question is what is bufvoid? and how can I increase the amount of the >>> length? >>> >>> thanks, >>> -- >>> Kaveh Minooie >> >> >> >> > > -- > Kaveh Minooie -- Harsh J