Return-Path: X-Original-To: apmail-hive-user-archive@www.apache.org Delivered-To: apmail-hive-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 8A128D88A for ; Wed, 15 Aug 2012 17:14:58 +0000 (UTC) Received: (qmail 98964 invoked by uid 500); 15 Aug 2012 17:14:57 -0000 Delivered-To: apmail-hive-user-archive@hive.apache.org Received: (qmail 98918 invoked by uid 500); 15 Aug 2012 17:14:57 -0000 Mailing-List: contact user-help@hive.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hive.apache.org Delivered-To: mailing list user@hive.apache.org Received: (qmail 98908 invoked by uid 99); 15 Aug 2012 17:14:57 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 15 Aug 2012 17:14:57 +0000 X-ASF-Spam-Status: No, hits=-0.5 required=5.0 tests=FREEMAIL_ENVFROM_END_DIGIT,FSL_RCVD_USER,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of nitinpawar432@gmail.com designates 209.85.217.176 as permitted sender) Received: from [209.85.217.176] (HELO mail-lb0-f176.google.com) (209.85.217.176) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 15 Aug 2012 17:14:49 +0000 Received: by lboi15 with SMTP id i15so1132359lbo.35 for ; Wed, 15 Aug 2012 10:14:28 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type; bh=cBn4mPMTCGq+n3ziKIa69xJkSTGP1X67ekJz/2Q3mXk=; b=yA0EfrvB7sGIXEi/B/6iHaIW5UgVfyyGyH6LT5Nze+z2ESiOP4qtS14fyR/JF61K7m i4x3D9Wzf0l39LeIrW8W1myxRX2/AMTwktrtCzn1/qJhjVI+DWtD/9C+xz/V95Ejd4Pw Q5KZt1geg9oUaTmJRqsd7rCEGpRA2HxudPAhOyH0WzeYqtXGNNfqcRiGTGep73octZqy mPbC153WYXTncF8Dinc6AsygYTc3+p5UIJrL5vZ9vbJwHFLL7Vcy3S+ibu8/ZRXC23tP NpoiwZuaclB5mtOowdj7zyNAp5BFte9oUHEeZlZunCMM+PwC8ivKQtavLZVl4Ck29O4E /0OQ== MIME-Version: 1.0 Received: by 10.152.111.71 with SMTP id ig7mr19894971lab.28.1345050868804; Wed, 15 Aug 2012 10:14:28 -0700 (PDT) Received: by 10.112.127.39 with HTTP; Wed, 15 Aug 2012 10:14:28 -0700 (PDT) In-Reply-To: References: Date: Wed, 15 Aug 2012 22:44:28 +0530 Message-ID: Subject: Re: Reducer throwing warning during join operations.Defaulting int columns to 0 From: Nitin Pawar To: user@hive.apache.org Content-Type: text/plain; charset=ISO-8859-1 it might be the case that there are few empty spaces at the end of each row which are being handled when you are reading and writing from disc but when you set autoconvert then looks like one of these tables is really small and it is converted into mapside join which means the entire table is loaded into map memory and there is no need of reduce On Wed, Aug 15, 2012 at 9:13 PM, Himanish Kushary wrote: > Hi, > > I have uploaded few csv files from windows into hive and configured few > external tables using them. When I am trying to run a join on two tables one > of the int columns > get changed to 0. The structure of the tables are as follows: > > > Table-1 Table-2 > ------------ ----------- > > Id(int) id(int) datetime > eid(int) > -- ---- ------------ > ----- > 1 1 2011-02-01 3 > 2 1 2011-03-01 4 > 3 2 2011-04-01 5 > 4 2011-05-01 6 > 6 2011-06-01 7 > > > The join query is - select a.* from Table-2 a join Table-1 b on (a.id=b.id); > > The output is: > > 1 2011-02-01 0 > 1 2011-03-01 0 > 2 2011-04-01 0 > > > I checked the logs and noticed the following warning : WARN > org.apache.hadoop.hive.serde2.lazybinary.LazyBinaryStruct: Extra bytes > detected at the end of the row! Ignoring similar problems.Could this be > causing it ? > > When I turn on hive.auto.convert.join=true , the error goes away as there is > no reduce phase.The output is: > > 1 2011-02-01 3 > 1 2011-03-01 4 > 2 2011-04-01 5 > > Could somebody please help me figure out why we get the wrong results when > running through the reducer. > -- > Thanks -- Nitin Pawar