Return-Path: X-Original-To: apmail-incubator-kafka-dev-archive@minotaur.apache.org Delivered-To: apmail-incubator-kafka-dev-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 0EC72DAB5 for ; Mon, 5 Nov 2012 21:46:13 +0000 (UTC) Received: (qmail 51492 invoked by uid 500); 5 Nov 2012 21:46:13 -0000 Delivered-To: apmail-incubator-kafka-dev-archive@incubator.apache.org Received: (qmail 51439 invoked by uid 500); 5 Nov 2012 21:46:12 -0000 Mailing-List: contact kafka-dev-help@incubator.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: kafka-dev@incubator.apache.org Delivered-To: mailing list kafka-dev@incubator.apache.org Received: (qmail 51429 invoked by uid 99); 5 Nov 2012 21:46:12 -0000 Received: from arcas.apache.org (HELO arcas.apache.org) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 05 Nov 2012 21:46:12 +0000 Date: Mon, 5 Nov 2012 21:46:12 +0000 (UTC) From: "Jay Kreps (JIRA)" To: kafka-dev@incubator.apache.org Message-ID: <451473677.70823.1352151972898.JavaMail.jiratomcat@arcas> In-Reply-To: <1279397415.1476.1349379467361.JavaMail.jiratomcat@arcas> Subject: [jira] [Updated] (KAFKA-545) Add a Performance Suite for the Log subsystem MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/KAFKA-545?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jay Kreps updated KAFKA-545: ---------------------------- Attachment: KAFKA-545-v3.patch One more tweak. > Add a Performance Suite for the Log subsystem > --------------------------------------------- > > Key: KAFKA-545 > URL: https://issues.apache.org/jira/browse/KAFKA-545 > Project: Kafka > Issue Type: New Feature > Affects Versions: 0.8 > Reporter: Jay Kreps > Priority: Blocker > Labels: features > Attachments: KAFKA-545-draft.patch, KAFKA-545.patch, KAFKA-545-v2.patch, KAFKA-545-v3.patch > > > We have had several performance concerns or potential improvements for the logging subsystem. To conduct these in a data-driven way, it would be good to have a single-machine performance test that isolated the performance of the log. > The performance optimizations we would like to evaluate include > - Special casing appends in a follower which already have the correct offset to avoid decompression and recompression > - Memory mapping either all or some of the segment files to improve the performance of small appends and lookups > - Supporting multiple data directories and avoiding RAID > Having a standalone tool is nice to isolate the component and makes profiling more intelligible. > This test would drive load against Log/LogManager controlled by a set of command line options. These command line program could then be scripted up into a suite of tests that covered variations in message size, message set size, compression, number of partitions, etc. > Here is a proposed usage for the tool: > ./bin/kafka-log-perf-test.sh > Option Description > ------ ----------- > --partitions The number of partitions to write to > --dir The directory in which to write the log > --message-size The size of the messages > --set-size The number of messages per write > --compression Compression alg > --messages The number of messages to write > --readers The number of reader threads reading the data > The tool would capture latency and throughput for the append() and read() operations. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira