Return-Path: X-Original-To: apmail-cassandra-commits-archive@www.apache.org Delivered-To: apmail-cassandra-commits-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 1FE2A10ECE for ; Fri, 12 Jul 2013 13:21:50 +0000 (UTC) Received: (qmail 26981 invoked by uid 500); 12 Jul 2013 13:21:49 -0000 Delivered-To: apmail-cassandra-commits-archive@cassandra.apache.org Received: (qmail 26847 invoked by uid 500); 12 Jul 2013 13:21:49 -0000 Mailing-List: contact commits-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@cassandra.apache.org Delivered-To: mailing list commits@cassandra.apache.org Received: (qmail 26815 invoked by uid 99); 12 Jul 2013 13:21:49 -0000 Received: from arcas.apache.org (HELO arcas.apache.org) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 12 Jul 2013 13:21:49 +0000 Date: Fri, 12 Jul 2013 13:21:49 +0000 (UTC) From: "Fabien Rousseau (JIRA)" To: commits@cassandra.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Commented] (CASSANDRA-5677) Performance improvements of RangeTombstones/IntervalTree MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/CASSANDRA-5677?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13706930#comment-13706930 ] Fabien Rousseau commented on CASSANDRA-5677: -------------------------------------------- bq. Well, it's not abnormal and this is consistent with what we do in other places for such min/max calculation (minTimestamp in SSTableWriter.appendFromStream for instance). Ok bq. Are we good with that fixed? Yep, it's ok for me > Performance improvements of RangeTombstones/IntervalTree > -------------------------------------------------------- > > Key: CASSANDRA-5677 > URL: https://issues.apache.org/jira/browse/CASSANDRA-5677 > Project: Cassandra > Issue Type: Improvement > Components: Core > Affects Versions: 1.2.0 > Reporter: Fabien Rousseau > Assignee: Sylvain Lebresne > Priority: Minor > Fix For: 1.2.7 > > Attachments: 5677-1.2.overlappingfix.txt, 5677-1.2.txt, 5677-new-IntervalTree-implementation.patch > > > Using massively range tombstones leads to bad response time (ie 100-500 ranges tombstones per row). > After investigation, it seems that the culprit is how the DeletionInfo are merged. Each time a RangeTombstone is added into the DeletionInfo, the whole IntervalTree is rebuilt (thus, if you have 100 tombstones in one row, then 100 instances of IntervalTree are created, the first one having one interval, the second one 2 intervals,... the 100th one : 100 intervals...) > It seems that once the IntervalTree is built, it is not possible to add a new Interval. Idea is to change the implementation of the IntervalTree by another one which support "insert interval". > Attached is a proposed patch which : > - renames the IntervalTree implementation to IntervalTreeCentered (the renaming is inspired from : http://en.wikipedia.org/wiki/Interval_tree) > - adds a new implementation IntervalTreeAvl (which is described here : http://en.wikipedia.org/wiki/Interval_tree#Augmented_tree and here : http://en.wikipedia.org/wiki/AVL_tree ) > - adds a new interface IIntervalTree to abstract the implementation > - adds a new configuration option (interval_tree_provider) which allows to choose between the two implementations (defaults to previous IntervalTreeCentered) > - updates IntervalTreeTest unit tests to test both implementations > - creates a mini benchmark between the two implementations (tree creation, point lookup, interval lookup) > - creates a mini benchmark between the two implementations when merging DeletionInfo (which shows a big performance improvement when using 500 tombstones for a row) > This patch applies for 1.2 branch... -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira