hama-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Hudson (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HAMA-557) Implement Checkpointing service in Hama
Date Sun, 05 Aug 2012 23:05:03 GMT

    [ https://issues.apache.org/jira/browse/HAMA-557?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13428931#comment-13428931
] 

Hudson commented on HAMA-557:
-----------------------------

Integrated in Hama-Nightly #633 (See [https://builds.apache.org/job/Hama-Nightly/633/])
    Committing the merge from HAMA-505-branch. Contains changes for HAMA-557 HAMA-587 HAMA-610
HAMA-611 (Revision 1369575)

     Result = FAILURE
surajsmenon : 
Files : 
* /hama/trunk
* /hama/trunk/conf/hama-default.xml
* /hama/trunk/core/src/main/java/org/apache/hama/Constants.java
* /hama/trunk/core/src/main/java/org/apache/hama/bsp/BSPJobClient.java
* /hama/trunk/core/src/main/java/org/apache/hama/bsp/BSPMaster.java
* /hama/trunk/core/src/main/java/org/apache/hama/bsp/BSPPeerImpl.java
* /hama/trunk/core/src/main/java/org/apache/hama/bsp/BSPTask.java
* /hama/trunk/core/src/main/java/org/apache/hama/bsp/GroomServer.java
* /hama/trunk/core/src/main/java/org/apache/hama/bsp/GroomServerAction.java
* /hama/trunk/core/src/main/java/org/apache/hama/bsp/JobInProgress.java
* /hama/trunk/core/src/main/java/org/apache/hama/bsp/JobInProgressListener.java
* /hama/trunk/core/src/main/java/org/apache/hama/bsp/JobStatus.java
* /hama/trunk/core/src/main/java/org/apache/hama/bsp/LaunchTaskAction.java
* /hama/trunk/core/src/main/java/org/apache/hama/bsp/LocalBSPRunner.java
* /hama/trunk/core/src/main/java/org/apache/hama/bsp/RecoverTaskAction.java
* /hama/trunk/core/src/main/java/org/apache/hama/bsp/SimpleTaskScheduler.java
* /hama/trunk/core/src/main/java/org/apache/hama/bsp/TaskInProgress.java
* /hama/trunk/core/src/main/java/org/apache/hama/bsp/TaskRunner.java
* /hama/trunk/core/src/main/java/org/apache/hama/bsp/TaskStatus.java
* /hama/trunk/core/src/main/java/org/apache/hama/bsp/UpdatePeerAction.java
* /hama/trunk/core/src/main/java/org/apache/hama/bsp/ft
* /hama/trunk/core/src/main/java/org/apache/hama/bsp/ft/AsyncRcvdMsgCheckpointImpl.java
* /hama/trunk/core/src/main/java/org/apache/hama/bsp/ft/BSPFaultTolerantService.java
* /hama/trunk/core/src/main/java/org/apache/hama/bsp/ft/FaultTolerantMasterService.java
* /hama/trunk/core/src/main/java/org/apache/hama/bsp/ft/FaultTolerantPeerService.java
* /hama/trunk/core/src/main/java/org/apache/hama/bsp/message/AbstractMessageManager.java
* /hama/trunk/core/src/main/java/org/apache/hama/bsp/message/AvroMessageManagerImpl.java
* /hama/trunk/core/src/main/java/org/apache/hama/bsp/message/HadoopMessageManager.java
* /hama/trunk/core/src/main/java/org/apache/hama/bsp/message/HadoopMessageManagerImpl.java
* /hama/trunk/core/src/main/java/org/apache/hama/bsp/message/MessageEventListener.java
* /hama/trunk/core/src/main/java/org/apache/hama/bsp/message/MessageManager.java
* /hama/trunk/core/src/main/java/org/apache/hama/bsp/sync/BSPMasterSyncClient.java
* /hama/trunk/core/src/main/java/org/apache/hama/bsp/sync/BSPPeerSyncClient.java
* /hama/trunk/core/src/main/java/org/apache/hama/bsp/sync/MasterSyncClient.java
* /hama/trunk/core/src/main/java/org/apache/hama/bsp/sync/PeerSyncClient.java
* /hama/trunk/core/src/main/java/org/apache/hama/bsp/sync/SyncClient.java
* /hama/trunk/core/src/main/java/org/apache/hama/bsp/sync/SyncEvent.java
* /hama/trunk/core/src/main/java/org/apache/hama/bsp/sync/SyncEventListener.java
* /hama/trunk/core/src/main/java/org/apache/hama/bsp/sync/SyncServiceFactory.java
* /hama/trunk/core/src/main/java/org/apache/hama/bsp/sync/ZKSyncBSPMasterClient.java
* /hama/trunk/core/src/main/java/org/apache/hama/bsp/sync/ZKSyncClient.java
* /hama/trunk/core/src/main/java/org/apache/hama/bsp/sync/ZKSyncEventFactory.java
* /hama/trunk/core/src/main/java/org/apache/hama/bsp/sync/ZKSyncEventListener.java
* /hama/trunk/core/src/main/java/org/apache/hama/bsp/sync/ZooKeeperSyncClientImpl.java
* /hama/trunk/core/src/main/java/org/apache/hama/bsp/taskallocation
* /hama/trunk/core/src/main/java/org/apache/hama/bsp/taskallocation/BSPResource.java
* /hama/trunk/core/src/main/java/org/apache/hama/bsp/taskallocation/BestEffortDataLocalTaskAllocator.java
* /hama/trunk/core/src/main/java/org/apache/hama/bsp/taskallocation/RawSplitResource.java
* /hama/trunk/core/src/main/java/org/apache/hama/bsp/taskallocation/TaskAllocationStrategy.java
* /hama/trunk/core/src/test/java/org/apache/hama/bsp/TestBSPTaskFaults.java
* /hama/trunk/core/src/test/java/org/apache/hama/bsp/TestCheckpoint.java
* /hama/trunk/core/src/test/java/org/apache/hama/bsp/TestTaskAllocation.java
* /hama/trunk/core/src/test/java/org/apache/hama/bsp/TestZooKeeper.java
* /hama/trunk/core/src/test/java/org/apache/hama/bsp/sync/TestSyncServiceFactory.java

                
> Implement Checkpointing service in Hama
> ---------------------------------------
>
>                 Key: HAMA-557
>                 URL: https://issues.apache.org/jira/browse/HAMA-557
>             Project: Hama
>          Issue Type: Sub-task
>          Components: bsp core
>    Affects Versions: 0.6.0
>            Reporter: Suraj Menon
>            Assignee: Suraj Menon
>             Fix For: 0.6.0
>
>         Attachments: HAMA-505-557-610-611-v1.patch, HAMA-505-557-610-611-v2.patch, HAMA-557-ft-framework.patch
>
>
> Implement checkpointing service in Apache Hama. My patches for HAMA-533 and HAMA-534
are blocked on this.
> - Checkpointing should be done as messages are either sent or received. I prefer while
receiving messages, as we can achieve some parallelism with asynchronous messages. Please
comment if you differ.
> - BSPMaster should hold the checkpoint status for each task. Checkpoint status includes
superstep count and file information for which checkpointing is complete
> - MessageManager should notify Checkpointer of a new message at BSPPeer.
> - Implement/Reuse MessageBundle class as splitClass in BSPPeerImpl for recovery in initInput.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Mime
View raw message