Date: Wed, 5 Jul 2017 14:08:00 +0000 (UTC)
From: "Apache Spark (JIRA)"
To: issues@spark.apache.org
Subject: [jira] [Commented] (SPARK-21317) Avoid unnecessary sort in FileFormatWriter if data is already bucketed

    [ https://issues.apache.org/jira/browse/SPARK-21317?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16074813#comment-16074813 ]

Apache Spark commented on SPARK-21317:
--------------------------------------

User 'pwoody' has created a pull request for this issue:
https://github.com/apache/spark/pull/18542

> Avoid unnecessary sort in FileFormatWriter if data is already bucketed
> ----------------------------------------------------------------------
>
>                 Key: SPARK-21317
>                 URL: https://issues.apache.org/jira/browse/SPARK-21317
>             Project: Spark
>          Issue Type: Bug
>          Components: SQL
>    Affects Versions: 2.1.1
>            Reporter: Patrick Woody
>
> When bucketing in FileFormatWriter, the partition is always sorted on bucketIdExpression, the partition id produced by the hash bucketing. If the data is already bucketed in that format, then this expression will be constant so there is no need to sort.
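
The observation in the description can be illustrated with a small, Spark-free sketch in plain Python. Everything in it is hypothetical: `bucket_id` is a stand-in for bucketIdExpression (a simple modulo on an integer key, not Spark's actual hash), and `rows_for_writer` is an invented helper, not part of FileFormatWriter. It makes no claim about how the linked pull request implements the change; it only shows that when every row in a partition maps to the same bucket id, sorting on that id cannot change the row order and can be skipped.

```python
def bucket_id(key, num_buckets):
    """Hypothetical stand-in for bucketIdExpression: modulo on an integer key."""
    return key % num_buckets


def rows_for_writer(rows, key_of, num_buckets):
    """Return (rows in write order, whether a sort was actually performed).

    rows        -- the rows of one partition, as a list
    key_of      -- function extracting the integer bucketing key from a row
    num_buckets -- number of buckets (positive integer)
    """
    ids = [bucket_id(key_of(row), num_buckets) for row in rows]

    # If the bucket id is constant across the partition (or the partition is
    # empty), a sort on it is a no-op, so the rows are passed through unchanged.
    if len(set(ids)) <= 1:
        return list(rows), False

    # Otherwise, stable-sort the rows by bucket id so that rows belonging to
    # the same bucket are contiguous for the writer.
    order = sorted(range(len(rows)), key=ids.__getitem__)
    return [rows[i] for i in order], True
```

This sketch scans the rows to detect a constant bucket id, which is only for illustration; a query engine would more plausibly decide this from what it already knows about how the incoming data is partitioned, without inspecting rows.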