Return-Path: X-Original-To: apmail-crunch-dev-archive@www.apache.org Delivered-To: apmail-crunch-dev-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 3B0AF10C24 for ; Thu, 23 Jan 2014 20:53:41 +0000 (UTC) Received: (qmail 24879 invoked by uid 500); 23 Jan 2014 20:53:39 -0000 Delivered-To: apmail-crunch-dev-archive@crunch.apache.org Received: (qmail 24857 invoked by uid 500); 23 Jan 2014 20:53:39 -0000 Mailing-List: contact dev-help@crunch.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@crunch.apache.org Delivered-To: mailing list dev@crunch.apache.org Received: (qmail 24758 invoked by uid 500); 23 Jan 2014 20:53:38 -0000 Delivered-To: apmail-incubator-crunch-dev@incubator.apache.org Received: (qmail 24743 invoked by uid 99); 23 Jan 2014 20:53:38 -0000 Received: from arcas.apache.org (HELO arcas.apache.org) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 23 Jan 2014 20:53:38 +0000 Date: Thu, 23 Jan 2014 20:53:38 +0000 (UTC) From: "Josh Wills (JIRA)" To: crunch-dev@incubator.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Created] (CRUNCH-331) Change default settings for CombineFileInputFormat MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 Josh Wills created CRUNCH-331: --------------------------------- Summary: Change default settings for CombineFileInputFormat Key: CRUNCH-331 URL: https://issues.apache.org/jira/browse/CRUNCH-331 Project: Crunch Issue Type: Bug Components: IO Affects Versions: 0.8.2, 0.9.0 Reporter: Josh Wills Currently, we default to enabling the CombineFileInputFormat settings for any extensions of FileSourceImpl b/c it tends to improve performance for common file formats like text, sequence files, and Avro files. However, this default has caused problems for formats like Parquet and for custom file formats that have complex split logic. This JIRA is to track modifying the default combine file settings in at least some contexts, such as with From.formattedFile for custom input formats. -- This message was sent by Atlassian JIRA (v6.1.5#6160)