Return-Path: X-Original-To: apmail-spark-commits-archive@minotaur.apache.org Delivered-To: apmail-spark-commits-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 375F510178 for ; Thu, 28 Nov 2013 02:25:15 +0000 (UTC) Received: (qmail 30347 invoked by uid 500); 28 Nov 2013 02:25:15 -0000 Delivered-To: apmail-spark-commits-archive@spark.apache.org Received: (qmail 30284 invoked by uid 500); 28 Nov 2013 02:25:15 -0000 Mailing-List: contact commits-help@spark.incubator.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@spark.incubator.apache.org Delivered-To: mailing list commits@spark.incubator.apache.org Received: (qmail 30271 invoked by uid 99); 28 Nov 2013 02:25:14 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 28 Nov 2013 02:25:14 +0000 X-ASF-Spam-Status: No, hits=-2000.0 required=5.0 tests=ALL_TRUSTED,RP_MATCHES_RCVD X-Spam-Check-By: apache.org Received: from [140.211.11.3] (HELO mail.apache.org) (140.211.11.3) by apache.org (qpsmtpd/0.29) with SMTP; Thu, 28 Nov 2013 02:25:11 +0000 Received: (qmail 30148 invoked by uid 99); 28 Nov 2013 02:24:50 -0000 Received: from tyr.zones.apache.org (HELO tyr.zones.apache.org) (140.211.11.114) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 28 Nov 2013 02:24:50 +0000 Received: by tyr.zones.apache.org (Postfix, from userid 65534) id F1D2690F380; Thu, 28 Nov 2013 02:24:49 +0000 (UTC) Content-Type: text/plain; charset="us-ascii" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit From: matei@apache.org To: commits@spark.incubator.apache.org Date: Thu, 28 Nov 2013 02:24:50 -0000 Message-Id: In-Reply-To: <0cf89ac2ed8346ef93802c866714af2e@git.apache.org> References: <0cf89ac2ed8346ef93802c866714af2e@git.apache.org> X-Mailer: ASF-Git Admin Mailer Subject: [2/2] git commit: Merge pull request #210 from haitaoyao/http-timeout X-Virus-Checked: Checked by ClamAV on apache.org Merge pull request #210 from haitaoyao/http-timeout add http timeout for httpbroadcast While pulling task bytecode from HttpBroadcast server, there's no timeout value set. This may cause spark executor code hang and other task in the same executor process wait for the lock. I have encountered the issue in my cluster. Here's the stacktrace I captured : https://gist.github.com/haitaoyao/7655830 So add a time out value to ensure the task fail fast. Project: http://git-wip-us.apache.org/repos/asf/incubator-spark/repo Commit: http://git-wip-us.apache.org/repos/asf/incubator-spark/commit/743a31a7 Tree: http://git-wip-us.apache.org/repos/asf/incubator-spark/tree/743a31a7 Diff: http://git-wip-us.apache.org/repos/asf/incubator-spark/diff/743a31a7 Branch: refs/heads/master Commit: 743a31a7ca4421cbbd5b615b773997a06a7ab4ee Parents: fb6875d db998a6 Author: Matei Zaharia Authored: Wed Nov 27 18:24:39 2013 -0800 Committer: Matei Zaharia Committed: Wed Nov 27 18:24:39 2013 -0800 ---------------------------------------------------------------------- .../scala/org/apache/spark/broadcast/HttpBroadcast.scala | 10 ++++++++-- 1 file changed, 8 insertions(+), 2 deletions(-) ----------------------------------------------------------------------