Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id E37AF200CD1 for ; Tue, 11 Jul 2017 21:53:39 +0200 (CEST) Received: by cust-asf.ponee.io (Postfix) id E2AC11671A3; Tue, 11 Jul 2017 19:53:39 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id 3B56B1671AD for ; Tue, 11 Jul 2017 21:53:39 +0200 (CEST) Received: (qmail 53293 invoked by uid 500); 11 Jul 2017 19:53:38 -0000 Mailing-List: contact jira-help@kafka.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: jira@kafka.apache.org Delivered-To: mailing list jira@kafka.apache.org Received: (qmail 53282 invoked by uid 99); 11 Jul 2017 19:53:38 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd3-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 11 Jul 2017 19:53:38 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd3-us-west.apache.org (ASF Mail Server at spamd3-us-west.apache.org) with ESMTP id F3340195BB9 for ; Tue, 11 Jul 2017 19:53:37 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd3-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: -100.002 X-Spam-Level: X-Spam-Status: No, score=-100.002 tagged_above=-999 required=6.31 tests=[RP_MATCHES_RCVD=-0.001, SPF_PASS=-0.001, USER_IN_WHITELIST=-100] autolearn=disabled Received: from mx1-lw-eu.apache.org ([10.40.0.8]) by localhost (spamd3-us-west.apache.org [10.40.0.10]) (amavisd-new, port 10024) with ESMTP id YHQll5P6NDNv for ; Tue, 11 Jul 2017 19:53:37 +0000 (UTC) Received: from mailrelay1-us-west.apache.org (mailrelay1-us-west.apache.org [209.188.14.139]) by mx1-lw-eu.apache.org (ASF Mail Server at mx1-lw-eu.apache.org) with ESMTP id A452962630 for ; Tue, 11 Jul 2017 19:38:02 +0000 (UTC) Received: from jira-lw-us.apache.org (unknown [207.244.88.139]) by mailrelay1-us-west.apache.org (ASF Mail Server at mailrelay1-us-west.apache.org) with ESMTP id 6AB2CE0059 for ; Tue, 11 Jul 2017 19:38:01 +0000 (UTC) Received: from jira-lw-us.apache.org (localhost [127.0.0.1]) by jira-lw-us.apache.org (ASF Mail Server at jira-lw-us.apache.org) with ESMTP id 363A6246DA for ; Tue, 11 Jul 2017 19:38:00 +0000 (UTC) Date: Tue, 11 Jul 2017 19:38:00 +0000 (UTC) From: "Jason Aliyetti (JIRA)" To: jira@kafka.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Created] (KAFKA-5582) Log compaction with preallocation enabled does not trim segments MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 archived-at: Tue, 11 Jul 2017 19:53:40 -0000 Jason Aliyetti created KAFKA-5582: ------------------------------------- Summary: Log compaction with preallocation enabled does not trim segments Key: KAFKA-5582 URL: https://issues.apache.org/jira/browse/KAFKA-5582 Project: Kafka Issue Type: Bug Affects Versions: 0.10.1.1 Environment: Linux, Windows Reporter: Jason Aliyetti Unexpected behavior occurs when a topic is configured to preallocate files and has a retention policy of compact. When log compaction runs, the cleaner attempts to gather groups of segments to consolidate based on the max segment size. When preallocation is enabled all segments are that size and thus each individual segment is considered for compaction. When compaction does occur, the resulting cleaned file is sized based on that same configuration. This means that you can have very large files on disk that contain little or no data which partly defeats the point of compacting. The log cleaner should trim these segments to free up disk space. That way they would free up disk space and be able to be further compacted on subsequent runs. -- This message was sent by Atlassian JIRA (v6.4.14#64029)