Return-Path: Delivered-To: apmail-cassandra-commits-archive@www.apache.org Received: (qmail 46971 invoked from network); 4 Jun 2010 20:43:17 -0000 Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 4 Jun 2010 20:43:17 -0000 Received: (qmail 61715 invoked by uid 500); 4 Jun 2010 20:43:17 -0000 Delivered-To: apmail-cassandra-commits-archive@cassandra.apache.org Received: (qmail 61694 invoked by uid 500); 4 Jun 2010 20:43:17 -0000 Mailing-List: contact commits-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@cassandra.apache.org Delivered-To: mailing list commits@cassandra.apache.org Received: (qmail 61686 invoked by uid 99); 4 Jun 2010 20:43:17 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 04 Jun 2010 20:43:17 +0000 X-ASF-Spam-Status: No, hits=-1498.6 required=10.0 tests=ALL_TRUSTED,AWL X-Spam-Check-By: apache.org Received: from [140.211.11.22] (HELO thor.apache.org) (140.211.11.22) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 04 Jun 2010 20:43:16 +0000 Received: from thor (localhost [127.0.0.1]) by thor.apache.org (8.13.8+Sun/8.13.8) with ESMTP id o54Kgu4r016513 for ; Fri, 4 Jun 2010 20:42:56 GMT Message-ID: <15475355.188681275684176513.JavaMail.jira@thor> Date: Fri, 4 Jun 2010 16:42:56 -0400 (EDT) From: "Jonathan Ellis (JIRA)" To: commits@cassandra.apache.org Subject: [jira] Commented: (CASSANDRA-1046) optimize Memtable.getSliceIterator In-Reply-To: <18552479.15201272901136599.JavaMail.jira@thor> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/CASSANDRA-1046?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12875739#action_12875739 ] Jonathan Ellis commented on CASSANDRA-1046: ------------------------------------------- to clarify, the "these changes" that made it fast is 0001-trunk-cassandra-1046.patch ? or some combination of the patch, and improved client code? > optimize Memtable.getSliceIterator > ---------------------------------- > > Key: CASSANDRA-1046 > URL: https://issues.apache.org/jira/browse/CASSANDRA-1046 > Project: Cassandra > Issue Type: Improvement > Reporter: Jonathan Ellis > Assignee: Matthew F. Dennis > Fix For: 0.7 > > Attachments: 0001-trunk-cassandra-1046.patch, insertarator.py, readarator.py > > > As reported by James Golick, about 30% of the time in a read is spent in SliceQueryFilter.getMemColumnIterator, virtually all of which is in ConcurrentSkipListMap$Values.toArrray(). > I wrote on the ML: > Besides the UUID optimization you posted, we should do an audit of ColumnFamily.getSortedColumns and replace with iteration where possible (in this case, we'd be left with one copy of most of the columns, but that's better than two). > We can get rid of the other copy by fixing the logic in Memtable.getSliceIterator, which says "copy all the columns, so we can do a binary search on them to find where to start," but since columns are natively in sorted order we could just use an iterator and a while loo -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.