From: "Matt Kopit (JIRA)"
To: commits@cassandra.apache.org
Date: Fri, 27 May 2016 19:05:12 +0000 (UTC)
Subject: [jira] [Created] (CASSANDRA-11911) CQLSSTableWriter should allow for unset fields

Matt Kopit created CASSANDRA-11911:
--------------------------------------

             Summary: CQLSSTableWriter should allow for unset fields
                 Key: CASSANDRA-11911
                 URL: https://issues.apache.org/jira/browse/CASSANDRA-11911
             Project: Cassandra
          Issue Type: Improvement
          Components: Core
         Environment: Cassandra 3.0.6
            Reporter: Matt Kopit

If you are using CQLSSTableWriter to bulk load data into sstables, the only way to handle fields without values is to set them to NULL, which writes a tombstone for that field into the resulting sstable. For a large dataset this can produce a large number of tombstones.

CQLSSTableWriter is currently instantiated with a single INSERT statement, so it is not possible to modify the INSERT statement to specify different fields on a per-row basis.

Here are three potential solutions to this problem:

1. Change the default behavior of how NULLs are handled so that those fields are not written to the sstable.
2. Create a configuration option for CQLSSTableWriter that governs whether NULLs should be ignored.
3. Invent a new constant that represents an UNSET value, which can be used in place of NULL.
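To make the difference between the three options concrete, here is a minimal toy sketch in plain Java. It does not call Cassandra or CQLSSTableWriter at all; the class UnsetSketch, the UNSET sentinel, the NullPolicy enum and the cellsFor method are all hypothetical names invented for illustration. It only models which cells a writer would emit for one row under each proposal.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Toy model only: none of these names exist in Cassandra.
class UnsetSketch {
    // Hypothetical sentinel for option 3: "no value supplied", distinct from null.
    static final Object UNSET = new Object() {
        @Override
        public String toString() {
            return "UNSET";
        }
    };

    // Hypothetical switch for option 2. Making IGNORE the default would model option 1.
    enum NullPolicy { WRITE_TOMBSTONE, IGNORE }

    // Returns a description of the cells a toy writer would emit for one row.
    static List<String> cellsFor(Map<String, Object> row, NullPolicy policy) {
        List<String> cells = new ArrayList<>();
        for (Map.Entry<String, Object> entry : row.entrySet()) {
            Object value = entry.getValue();
            if (value == UNSET) {
                continue; // option 3: an unset column writes nothing
            }
            if (value == null) {
                if (policy == NullPolicy.IGNORE) {
                    continue; // options 1 and 2: NULL is skipped
                }
                cells.add(entry.getKey() + "=<tombstone>"); // behavior described in the report
            } else {
                cells.add(entry.getKey() + "=" + value);
            }
        }
        return cells;
    }

    public static void main(String[] args) {
        Map<String, Object> row = new LinkedHashMap<>();
        row.put("id", 1);
        row.put("name", null);   // explicit NULL
        row.put("email", UNSET); // no value supplied

        System.out.println(cellsFor(row, NullPolicy.WRITE_TOMBSTONE)); // [id=1, name=<tombstone>]
        System.out.println(cellsFor(row, NullPolicy.IGNORE));          // [id=1]
    }
}
```

One point the sketch brings out: options 1 and 2 remove the ability to write an intentional tombstone through the same writer (at least while NULLs are ignored), whereas a separate UNSET constant lets a caller still use NULL when a deletion is actually wanted.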