Subject: Re: Hive orc use case
From: Alan Gates
Date: Mon, 26 Sep 2016 09:41:38 -0700
To: user@hive.apache.org

ORC does not store data row by row. It decomposes the rows into columns, and then stores pointers to those columns, as well as a number of indices and statistics, in a footer of the file. Because of the footer, in the simple case you cannot read the file before you close it, and you cannot append to it. We did address both of these issues to support Hive streaming, but it's a low-level interface. If you want to take a look at how Hive streaming handles this, you could use it as your guide. The starting point for that is HiveEndPoint in org.apache.hive.hcatalog.streaming.

Alan.

> On Sep 26, 2016, at 01:18, Amey Barve wrote:
>
> Hi All,
>
> I have a use case where I need to append either 1 or many rows to an orcFile as well as read 1 or many rows from it.
>
> I observed that I cannot read rows from the OrcFile unless I close the OrcFile's writer. Is this correct?
>
> Why doesn't write actually flush the rows to the orcFile? Is there any alternative where I can write the rows as well as read them without closing the orcFile's writer?
>
> Thanks and Regards,
> Amey
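
To make the footer point concrete, here is a minimal sketch in Python of a toy columnar layout. It is not ORC's actual on-disk format: the JSON encoding, the `TOYF` trailer, and the function names `write_data`, `close_file`, and `read_column` are all assumptions made up for illustration. It only shows why a reader that locates columns through a footer has nothing to work with until the writer closes the file.

```python
import json
import struct

MAGIC = b"TOYF"  # hypothetical trailer that marks a finished (closed) file


def write_data(rows, names):
    """Decompose rows into columns and lay the columns out back to back.

    Returns the data section and the footer describing it. The footer is
    NOT part of the returned bytes yet; that only happens in close_file().
    """
    body = b""
    footer = {"rows": len(rows), "columns": {}}
    for i, name in enumerate(names):
        values = [row[i] for row in rows]
        blob = json.dumps(values).encode("utf-8")
        footer["columns"][name] = {
            "offset": len(body),   # pointer to where this column starts
            "length": len(blob),
            "min": min(values),    # simple per-column statistics
            "max": max(values),
        }
        body += blob
    return body, footer


def close_file(body, footer):
    """Append the footer, its length, and the trailer: the 'close' step."""
    footer_bytes = json.dumps(footer).encode("utf-8")
    return body + footer_bytes + struct.pack(">I", len(footer_bytes)) + MAGIC


def read_column(data, name):
    """Read one column by following the pointers stored in the footer."""
    if data[-4:] != MAGIC:
        raise ValueError("no footer found: the file was never closed")
    (footer_len,) = struct.unpack(">I", data[-8:-4])
    footer = json.loads(data[-8 - footer_len:-8])
    entry = footer["columns"][name]
    start = entry["offset"]
    return json.loads(data[start:start + entry["length"]])


if __name__ == "__main__":
    rows = [(1, "a"), (2, "b"), (3, "c")]
    body, footer = write_data(rows, ["id", "name"])

    # Before close: the column bytes exist, but nothing says where they are.
    try:
        read_column(body, "id")
    except ValueError as err:
        print("unclosed file:", err)

    # After close: any single column can be read without scanning the others.
    closed = close_file(body, footer)
    print(read_column(closed, "name"))
```

Appending hits the same wall in this sketch: new column data would have to go where the footer already sits, so the footer has to be rewritten. This says nothing about how Hive streaming actually solves the problem, only why the simple case behaves the way described above.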