Return-Path: X-Original-To: apmail-cassandra-user-archive@www.apache.org Delivered-To: apmail-cassandra-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 4D4DE4FAD for ; Mon, 6 Jun 2011 11:25:29 +0000 (UTC) Received: (qmail 8501 invoked by uid 500); 6 Jun 2011 11:25:27 -0000 Delivered-To: apmail-cassandra-user-archive@cassandra.apache.org Received: (qmail 8474 invoked by uid 500); 6 Jun 2011 11:25:27 -0000 Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@cassandra.apache.org Delivered-To: mailing list user@cassandra.apache.org Received: (qmail 8466 invoked by uid 99); 6 Jun 2011 11:25:26 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 06 Jun 2011 11:25:26 +0000 X-ASF-Spam-Status: No, hits=-0.0 required=5.0 tests=SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: local policy) Received: from [202.38.128.6] (HELO ihep.ac.cn) (202.38.128.6) by apache.org (qpsmtpd/0.29) with SMTP; Mon, 06 Jun 2011 11:25:19 +0000 Received: from [128.141.36.217] (unknown [128.141.36.217]) by mail.ihep.ac.cn (Coremail) with SMTP id fwD__pBrrzECuexNgEEgBQ--.11709S2; Mon, 06 Jun 2011 19:24:53 +0800 (CST) Message-ID: <4DECB9CA.6060506@ihep.ac.cn> Date: Mon, 06 Jun 2011 13:28:10 +0200 From: Donal Zang Reply-To: zangds@ihep.ac.cn Organization: IHEP User-Agent: Mozilla/5.0 (Windows; U; Windows NT 6.1; en-US; rv:1.9.2.17) Gecko/20110414 Thunderbird/3.1.10 MIME-Version: 1.0 To: user@cassandra.apache.org Subject: Re: [SPAM] Re: slow insertion rate with secondary index References: <4DEC077A.5030208@ihep.ac.cn> <4DEC8A3A.7010107@ihep.ac.cn> In-Reply-To: Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit X-CM-TRANSID: fwD__pBrrzECuexNgEEgBQ--.11709S2 X-Coremail-Antispam: 1Uf129KBjvJXoWruFyxtF4rurWrWFykKrWfKrg_yoW8Jr1Dpr WIk348tr1kt3WrZr13J3W8WFy7uwn7Gay3tw4UKryUArWDWF10qrWSqryY9F9FvFs5u34F qF4kWrZ8AFn0va7anT9S1TB71UUUUUUv73VFW2AGmfu7bjvjm3AaLaJ3UjIYCTnIWjp_UU Uea7k042IE4IxYO2xFxVAqjxCEw4Av424lb7IF0VCFvT92vfvE14ylb7IF0VCIc40E14yl b7Iv0xC_Cr1lb4IE77IF4wAFF20E14v26r1j6r4UM7C26xCjj4IEI4klw4CSwwAFxVCaYx vI4VCIwcAKzIAtM7CIcVAFz4kK6r1j6r18M28EF7xvwVC0I7IYx2IY67AKxVW8JVW5JwA2 z4x0Y4vE2Ix0cI8IcVCY1x0267AKxVWxJVW8Jr1l84ACjcxK6I8E87Iv67AKxVW8Jr0_Cr 1UM28EF7xvwVC2z280aVCY1x0267AKxVW0oVCq3wAawVAYYI1S6c8GOVWUur45Jryln4vE F7Iv6F18KVAqrcv_GVWUtr1rJF1ln4vE4IxY62xKV4CY8xCE548m6r4UJryUGwAa7VCY0V AaVVAqrcv_Jw1UWr13Mc02F40EFcxC0VAKzVAqx4xG6I80ewAv7VC0I7IYx2IY67AKxVWU JVWUGwAv7VC2z280aVAFwI0_Jr0_Gr1lF7xvr2IY64vIr41lc7I2V7IY0VAS07AlzVAYIc xG8wCF04k20xvY0x0EwIxGrwCFI7vE0wC2zVAF1VAY17CE14v26r1j6r15MIIYrxkI7VAK I48JYxBIdaVFxhVjvjDU0xZFpf9x07jVXd8UUUUU= X-CM-SenderInfo: p2dqwvvv6lxvnsoduhdfq/ X-Virus-Checked: Checked by ClamAV on apache.org On 06/06/2011 10:15, David Boxenhorn wrote: > Is there really a 10x difference between indexed CFs and non-indexed CFs? Well, as for my test, it is! I'm using 0.7.6-2, 9 nodes, 3 replicas, write_consistency_level QUORUM, about 90,000,000 rows (~ 1K per row) I use 20 process, 20rows for each insertion. the insertion time for the whole row is about 0.02 seconds without index and then I add a secondary index, and update every row with the indexed column, the insertion time is about 2 seconds and if I remove the index, and update the column, the time is about 0.002 Another thing I noticed is : if you first do insertion, and then build the secondary index use "update column family ...", and then do select based on the index, the result is not right (seems the index is still being built though the "update" commands returns quickly). And after a while, the get_indexed_slices() goes time out from time to time (with pycassa.ConnectionPool('keyspace1', ['host1','host2'], timeout=600, pool_size=1) ). Does some one else have some same experiences using the secondary indexes? -- Donal Zang Computing Center, IHEP 19B YuquanLu, Shijingshan District,Beijing, 100049 zangds@ihep.ac.cn 86 010 8823 6018