Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id DA01B200BAA for ; Thu, 27 Oct 2016 10:45:00 +0200 (CEST) Received: by cust-asf.ponee.io (Postfix) id D7008160B01; Thu, 27 Oct 2016 08:45:00 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id 2A0D1160AE4 for ; Thu, 27 Oct 2016 10:45:00 +0200 (CEST) Received: (qmail 89069 invoked by uid 500); 27 Oct 2016 08:44:59 -0000 Mailing-List: contact issues-help@hbase.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Delivered-To: mailing list issues@hbase.apache.org Received: (qmail 89053 invoked by uid 99); 27 Oct 2016 08:44:59 -0000 Received: from arcas.apache.org (HELO arcas) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 27 Oct 2016 08:44:59 +0000 Received: from arcas.apache.org (localhost [127.0.0.1]) by arcas (Postfix) with ESMTP id 25A1A2C1F54 for ; Thu, 27 Oct 2016 08:44:59 +0000 (UTC) Date: Thu, 27 Oct 2016 08:44:59 +0000 (UTC) From: "Anoop Sam John (JIRA)" To: issues@hbase.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Commented] (HBASE-16438) Create a cell type so that chunk id is embedded in it MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 archived-at: Thu, 27 Oct 2016 08:45:01 -0000 [ https://issues.apache.org/jira/browse/HBASE-16438?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15611207#comment-15611207 ] Anoop Sam John commented on HBASE-16438: ---------------------------------------- In CellChunkMap we write the index also to off heap chunks. So its ok to write the seqId also along with chunkId+offset+length. So totally 20 bytes per cell but all can be in offheap area. So we are in better place. The other way of keeping the ref means we will need a ref array + the chunk. 2 data structures keep the index data. Per cell one entry to ref array (that is on heap) and remaining 16 bytes can go to offheap chunk. Only worry of using chunk from pool for the index is the wastage it can make. May be we should share one chunk (One BB of 2 MB size) btw more than one segment's index. Elsewhere [~saint.ack@gmail.com] was mentioning abt upping the chunk size from 2 MB too. Then the concern is more. With 2 MB we can keep 131072 cells index data in it. > Create a cell type so that chunk id is embedded in it > ----------------------------------------------------- > > Key: HBASE-16438 > URL: https://issues.apache.org/jira/browse/HBASE-16438 > Project: HBase > Issue Type: Sub-task > Affects Versions: 2.0.0 > Reporter: ramkrishna.s.vasudevan > Assignee: ramkrishna.s.vasudevan > > For CellChunkMap we may need a cell such that the chunk out of which it was created, the id of the chunk be embedded in it so that when doing flattening we can use the chunk id as a meta data. More details will follow once the initial tasks are completed. > Why we need to embed the chunkid in the Cell is described by [~anastas] in this remark over in parent issue https://issues.apache.org/jira/browse/HBASE-14921?focusedCommentId=15244119&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-15244119 -- This message was sent by Atlassian JIRA (v6.3.4#6332)