Return-Path: X-Original-To: apmail-accumulo-notifications-archive@minotaur.apache.org Delivered-To: apmail-accumulo-notifications-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 14B6C113DC for ; Fri, 23 May 2014 20:09:02 +0000 (UTC) Received: (qmail 7843 invoked by uid 500); 23 May 2014 20:09:02 -0000 Delivered-To: apmail-accumulo-notifications-archive@accumulo.apache.org Received: (qmail 7802 invoked by uid 500); 23 May 2014 20:09:02 -0000 Mailing-List: contact notifications-help@accumulo.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: jira@apache.org Delivered-To: mailing list notifications@accumulo.apache.org Received: (qmail 7791 invoked by uid 99); 23 May 2014 20:09:02 -0000 Received: from arcas.apache.org (HELO arcas.apache.org) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 23 May 2014 20:09:01 +0000 Date: Fri, 23 May 2014 20:09:01 +0000 (UTC) From: "ASF subversion and git services (JIRA)" To: notifications@accumulo.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Commented] (ACCUMULO-2842) Allow for use of hflush instead of hsync MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/ACCUMULO-2842?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14007636#comment-14007636 ] ASF subversion and git services commented on ACCUMULO-2842: ----------------------------------------------------------- Commit f654e7faadb9e94e2923087af334a707a4d2c81a in accumulo's branch refs/heads/1.5.2-SNAPSHOT from [~elserj] [ https://git-wip-us.apache.org/repos/asf?p=accumulo.git;h=f654e7f ] ACCUMULO-2842 Allow configuration of FSDataOutputStream method used to sync WALs. A critical piece of the bigtable design is using write-ahead logs to ensure recovery after node failure. The append only file structure is meant to provide a durable log of events so that the correct state can be reconstructed after a failure when key-values have not been yet serialized to disk. Hadoop 2 offers two levels of durability in regards to writes to files in HDFS. HFlush ensures that all others clients who try to read the file you just wrote data to will see that new data that was written. HSync will ensure that the data you wrote for that file is actually sent to the underlying disk. As such, HFlush does not provide durability when the node crashes without that data being flushed to the drive, while HSync does. HSync does, however, increase the amount of time needed for sync/flush to complete (as hitting disk consistently is slower than not). > Allow for use of hflush instead of hsync > ---------------------------------------- > > Key: ACCUMULO-2842 > URL: https://issues.apache.org/jira/browse/ACCUMULO-2842 > Project: Accumulo > Issue Type: Improvement > Components: logger > Reporter: Josh Elser > Assignee: Josh Elser > Fix For: 1.5.2, 1.6.1, 1.7.0 > > > We keep getting onto the discussion on hsync versus hflush performance; however, we lack a way to actually test this. > Make the invocation of hflush or hsync configurable. It should not change the default, so users on existing major releases should not be affected unless they choose to change their configuration. -- This message was sent by Atlassian JIRA (v6.2#6252)