Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id 54E27200BEB for ; Wed, 28 Dec 2016 10:24:00 +0100 (CET) Received: by cust-asf.ponee.io (Postfix) id 53383160B2E; Wed, 28 Dec 2016 09:24:00 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id 98A1A160B19 for ; Wed, 28 Dec 2016 10:23:59 +0100 (CET) Received: (qmail 18582 invoked by uid 500); 28 Dec 2016 09:23:58 -0000 Mailing-List: contact dev-help@phoenix.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@phoenix.apache.org Delivered-To: mailing list dev@phoenix.apache.org Received: (qmail 18553 invoked by uid 99); 28 Dec 2016 09:23:58 -0000 Received: from arcas.apache.org (HELO arcas) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 28 Dec 2016 09:23:58 +0000 Received: from arcas.apache.org (localhost [127.0.0.1]) by arcas (Postfix) with ESMTP id 894242C03DC for ; Wed, 28 Dec 2016 09:23:58 +0000 (UTC) Date: Wed, 28 Dec 2016 09:23:58 +0000 (UTC) From: "Ankit Singhal (JIRA)" To: dev@phoenix.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Commented] (PHOENIX-2565) Store data for immutable tables in single KeyValue MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 archived-at: Wed, 28 Dec 2016 09:24:00 -0000 [ https://issues.apache.org/jira/browse/PHOENIX-2565?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15782480#comment-15782480 ] Ankit Singhal commented on PHOENIX-2565: ---------------------------------------- [~tdsilva]/[~jamestaylor], are we considering any other encoding for storing data in single keyValue apart from using VARBINARY array serialisation format? Because in our case, when there are 100s of columns and each column is not null, then VARBINARY array serialisation is taking 2x space(as offset being stored along with data) when compared to RowKey format used for serialising all the columns in a single byte array. should we look into optimising the encoding here, storing columns in byte array separated by delimiter like VAR array encoding but without offset, OR avro serialization or any other option? +[~ndimiduk]/[~enis]/[~sergey.soldatov] > Store data for immutable tables in single KeyValue > -------------------------------------------------- > > Key: PHOENIX-2565 > URL: https://issues.apache.org/jira/browse/PHOENIX-2565 > Project: Phoenix > Issue Type: Improvement > Reporter: James Taylor > Assignee: Thomas D'Silva > Attachments: PHOENIX-2565-v2.patch, PHOENIX-2565-wip.patch, PHOENIX-2565.patch > > > Since an immutable table (i.e. declared with IMMUTABLE_ROWS=true) will never update a column value, it'd be more efficient to store all column values for a row in a single KeyValue. We could use the existing format we have for variable length arrays. > For backward compatibility, we'd need to support the current mechanism. Also, you'd no longer be allowed to transition an existing table to/from being immutable. I think the best approach would be to introduce a new IMMUTABLE keyword and use it like this: > {code} > CREATE IMMUTABLE TABLE ... > {code} -- This message was sent by Atlassian JIRA (v6.3.4#6332)