Return-Path: X-Original-To: apmail-hama-dev-archive@www.apache.org Delivered-To: apmail-hama-dev-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id AB37DD3EE for ; Mon, 24 Sep 2012 05:40:18 +0000 (UTC) Received: (qmail 93133 invoked by uid 500); 24 Sep 2012 05:40:18 -0000 Delivered-To: apmail-hama-dev-archive@hama.apache.org Received: (qmail 92891 invoked by uid 500); 24 Sep 2012 05:40:11 -0000 Mailing-List: contact dev-help@hama.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@hama.apache.org Delivered-To: mailing list dev@hama.apache.org Received: (qmail 92795 invoked by uid 99); 24 Sep 2012 05:40:07 -0000 Received: from arcas.apache.org (HELO arcas.apache.org) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 24 Sep 2012 05:40:07 +0000 Date: Mon, 24 Sep 2012 16:40:07 +1100 (NCT) From: "Yuesheng Hu (JIRA)" To: dev@hama.apache.org Message-ID: <334484078.114982.1348465207429.JavaMail.jiratomcat@arcas> In-Reply-To: <1050018736.114829.1348462447645.JavaMail.jiratomcat@arcas> Subject: [jira] [Commented] (HAMA-647) Make the input spliter robustly MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/HAMA-647?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13461608#comment-13461608 ] Yuesheng Hu commented on HAMA-647: ---------------------------------- mvn build successfully, but still need test on cluster environment. I will keep going with this. > Make the input spliter robustly > -------------------------------- > > Key: HAMA-647 > URL: https://issues.apache.org/jira/browse/HAMA-647 > Project: Hama > Issue Type: Improvement > Components: bsp core > Affects Versions: 0.5.0, 0.6.0 > Reporter: Yuesheng Hu > Assignee: Yuesheng Hu > Priority: Critical > Fix For: 0.6.0 > > Attachments: HAMA-647.patch > > > Currently, the spliter in FileInputFormat is based on the Mapreduce's spliter. But, Hama is different from Mapreduce, Hama's task can not be pended until the slot becomes free. So, the current spliter is not suitable for Hama. When input file is small, it may be ok, but when input is very large, the number of splits will be very large too, even our cluster is powerful enough to handle the input. More details, please see the comments. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira