Date: Fri, 28 Sep 2012 12:33:07 +1100 (NCT)
From: "Edward J. Yoon (JIRA)"
To: dev@hama.apache.org
Message-ID: <1729132871.137122.1348795987445.JavaMail.jiratomcat@arcas>
In-Reply-To: <1050018736.114829.1348462447645.JavaMail.jiratomcat@arcas>
Subject: [jira] [Commented] (HAMA-647) Make the input spliter robustly

[ https://issues.apache.org/jira/browse/HAMA-647?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13465274#comment-13465274 ]

Edward J. Yoon commented on HAMA-647:
-------------------------------------

By adding unit tests testComputeGoalSize() and testComputeSplitSize(), you can check whether your formula works correctly in every case.
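
For illustration only, here is a minimal sketch of the kind of checks such tests could make. It is not the code from the HAMA-647 patches. The class name SplitSizeSketch, the method bodies, and the numbers are all assumptions. computeSplitSizeMapReduceStyle follows the familiar MapReduce rule max(minSize, min(goalSize, blockSize)); computeSplitSizeBounded is a hypothetical alternative that never goes below the goal size, so the number of splits cannot exceed the number of requested tasks.

```java
// Hypothetical sketch: names, formulas, and numbers are assumptions,
// not the implementation attached to HAMA-647.
final class SplitSizeSketch {

    // Target bytes per split when the job asks for numTasks splits.
    // Ceiling division, so numTasks splits of this size cover the whole input.
    static long computeGoalSize(long totalSize, int numTasks) {
        if (numTasks <= 0) {
            throw new IllegalArgumentException("numTasks must be positive");
        }
        return (totalSize + numTasks - 1) / numTasks;
    }

    // MapReduce-style rule: the split size is capped at the block size,
    // so the split count grows with the input size.
    static long computeSplitSizeMapReduceStyle(long goalSize, long minSize, long blockSize) {
        return Math.max(minSize, Math.min(goalSize, blockSize));
    }

    // Assumed alternative: never go below the goal size, which keeps the
    // split count at or below the requested number of tasks.
    static long computeSplitSizeBounded(long goalSize, long minSize) {
        return Math.max(minSize, goalSize);
    }

    // Number of splits needed to cover totalSize bytes.
    static long countSplits(long totalSize, long splitSize) {
        if (splitSize <= 0) {
            throw new IllegalArgumentException("splitSize must be positive");
        }
        return (totalSize + splitSize - 1) / splitSize;
    }

    public static void main(String[] args) {
        long blockSize = 64L * 1024 * 1024;   // 64 MiB block (assumed)
        long totalSize = 1024L * blockSize;   // 64 GiB input (assumed)
        int numTasks = 100;                   // available task slots (assumed)

        long goal = computeGoalSize(totalSize, numTasks);
        long mrStyle = computeSplitSizeMapReduceStyle(goal, 1L, blockSize);
        long bounded = computeSplitSizeBounded(goal, 1L);

        System.out.println("MapReduce-style splits: " + countSplits(totalSize, mrStyle));
        System.out.println("Bounded splits:         " + countSplits(totalSize, bounded));
    }
}
```

With these assumed numbers the MapReduce-style rule yields 1024 splits for 100 slots, while the bounded rule yields 100. Tests named testComputeGoalSize() and testComputeSplitSize() could assert exactly such boundary cases: an input that divides evenly, one that does not, an empty input, and a minimum size larger than the goal size.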
> Make the input spliter robustly
> --------------------------------
>
>                 Key: HAMA-647
>                 URL: https://issues.apache.org/jira/browse/HAMA-647
>             Project: Hama
>          Issue Type: Improvement
>          Components: bsp core
>    Affects Versions: 0.5.0, 0.6.0
>            Reporter: Yuesheng Hu
>            Assignee: Yuesheng Hu
>            Priority: Critical
>             Fix For: 0.6.0
>
>         Attachments: HAMA-647-2.patch, HAMA-647_3.patch, HAMA-647.patch
>
>
> Currently, the splitter in FileInputFormat is based on MapReduce's splitter. But Hama differs from MapReduce: a Hama task cannot be left pending until a slot becomes free. So the current splitter is not suitable for Hama. When the input file is small this may be fine, but when the input is very large the number of splits becomes very large too, even when our cluster is powerful enough to handle the input. For more details, please see the comments.