Return-Path: X-Original-To: apmail-spark-reviews-archive@minotaur.apache.org Delivered-To: apmail-spark-reviews-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 0CB9417623 for ; Thu, 30 Oct 2014 20:57:45 +0000 (UTC) Received: (qmail 29615 invoked by uid 500); 30 Oct 2014 20:57:44 -0000 Delivered-To: apmail-spark-reviews-archive@spark.apache.org Received: (qmail 29592 invoked by uid 500); 30 Oct 2014 20:57:44 -0000 Mailing-List: contact reviews-help@spark.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Delivered-To: mailing list reviews@spark.apache.org Received: (qmail 29580 invoked by uid 99); 30 Oct 2014 20:57:44 -0000 Received: from tyr.zones.apache.org (HELO tyr.zones.apache.org) (140.211.11.114) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 30 Oct 2014 20:57:44 +0000 Received: by tyr.zones.apache.org (Postfix, from userid 65534) id 363B39842E9; Thu, 30 Oct 2014 20:57:44 +0000 (UTC) From: kayousterhout To: reviews@spark.apache.org Reply-To: reviews@spark.apache.org References: In-Reply-To: Subject: [GitHub] spark pull request: [SPARK-3466] Limit size of results that a driv... Content-Type: text/plain Message-Id: <20141030205744.363B39842E9@tyr.zones.apache.org> Date: Thu, 30 Oct 2014 20:57:44 +0000 (UTC) Github user kayousterhout commented on a diff in the pull request: https://github.com/apache/spark/pull/3003#discussion_r19635887 --- Diff: docs/configuration.md --- @@ -112,6 +112,18 @@ of the most common options to set are: + spark.driver.maxResultSize + 1g + + Limit of total size of serialized bytes of all partitions for each Spark action (e.g. collect), + it should be at least 1M or 0 (means unlimited). The stage will be aborted if the total size + go above this limit. + Having high limit may cause out-of-memory errors in driver (depends on spark.driver.memory + and memory overhead of objects in JVM). Set a proper limit can protect driver from --- End diff -- "Set a proper limit can protect driver" --> "Setting a proper limit can protect the driver" --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastructure@apache.org or file a JIRA ticket with INFRA. --- --------------------------------------------------------------------- To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org For additional commands, e-mail: reviews-help@spark.apache.org