Return-Path: X-Original-To: apmail-accumulo-dev-archive@www.apache.org Delivered-To: apmail-accumulo-dev-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id ECE1311CF1 for ; Fri, 27 Jun 2014 02:49:20 +0000 (UTC) Received: (qmail 81171 invoked by uid 500); 27 Jun 2014 02:49:15 -0000 Delivered-To: apmail-accumulo-dev-archive@accumulo.apache.org Received: (qmail 81125 invoked by uid 500); 27 Jun 2014 02:49:15 -0000 Mailing-List: contact dev-help@accumulo.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@accumulo.apache.org Delivered-To: mailing list dev@accumulo.apache.org Received: (qmail 81108 invoked by uid 99); 27 Jun 2014 02:49:15 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 27 Jun 2014 02:49:15 +0000 X-ASF-Spam-Status: No, hits=-0.7 required=5.0 tests=RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of josh.elser@gmail.com designates 209.85.216.176 as permitted sender) Received: from [209.85.216.176] (HELO mail-qc0-f176.google.com) (209.85.216.176) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 27 Jun 2014 02:49:12 +0000 Received: by mail-qc0-f176.google.com with SMTP id w7so4015481qcr.35 for ; Thu, 26 Jun 2014 19:48:48 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=message-id:date:from:user-agent:mime-version:to:subject:references :in-reply-to:content-type:content-transfer-encoding; bh=7WR1TnmhnCo3JCG+YqtDhx6VjTKWV5idlkWaueUgO50=; b=Q6I+AZRv9ZZMzGHFkncNwXbdMquIjn5umTs79+tQombJUrztc1Mog0lGPbtOvnivf4 KVBDHM59mUUD78OINxpSyQHPAtUyNzYc8Tl7Yo/H2mdEMBJS/VqBvIJJBjwWH9KouKFy FJFxY6iD/epQFAbw8NQABq5PnMNVV1DIA9Ifo8aL5wq0Ix76YPsrl4g12gxUVc86TROz 1vn5uLP3zePOfqFKW70ccoraYmC0dwSfw+UlDiD7DSq1niAoL7BfSLRr13OvRa2U8iYD 1af0OQp3oO2u9T+D5zMPfLOc+Vm4w32VxdnxCzPiMF4Njfrom9yIiC2W6Tr1X4UkIYO6 kYJQ== X-Received: by 10.224.72.13 with SMTP id k13mr28599397qaj.54.1403837328189; Thu, 26 Jun 2014 19:48:48 -0700 (PDT) Received: from HW10447.local (pool-71-166-48-47.bltmmd.fios.verizon.net. [71.166.48.47]) by mx.google.com with ESMTPSA id x1sm14258166qaj.19.2014.06.26.19.48.47 for (version=TLSv1 cipher=ECDHE-RSA-RC4-SHA bits=128/128); Thu, 26 Jun 2014 19:48:47 -0700 (PDT) Message-ID: <53ACDB90.3080907@gmail.com> Date: Thu, 26 Jun 2014 22:48:48 -0400 From: Josh Elser User-Agent: Mozilla/5.0 (Macintosh; Intel Mac OS X 10.9; rv:24.0) Gecko/20100101 Thunderbird/24.6.0 MIME-Version: 1.0 To: dev@accumulo.apache.org Subject: Re: Accumulo 1.7 and Data Center Replication References: In-Reply-To: Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: 7bit X-Virus-Checked: Checked by ClamAV on apache.org Hi Joe, I'm the guy to ask if you'd like more information about the replication feature. You already found the parent ticket, so that has a bunch of technical "what's been done". At a high level, replication was implemented as a framework in Accumulo to copy data that was written to a table to another "location". The provided initial implementation is to replicate the data as-is to another Accumulo table (usually some other Accumulo instance). You'll also find a new page in the monitor some basic administration tools in the code via Instance#replicationOperations. I've published a recent version of the user manual[1] which goes into some more detail on the feature, as well as how to configure it. You can also check the replication component on JIRA [2] to see what I have lined up. Automatically replicating bulk-loaded files will be a bit of work. There are some other minor things that could be improved. We can delve into the more technical implementation difficulties if you'd like. I've written a basic test to evaluate equivalence by generating a Merkle tree for two tables. This has been promising so far, but it currently is living in my Github[3]. I need to figure out where/how best to include it in Apache. Finally, having resources to do a larger-scale test would be great, and testing failure conditions over multiple nodes is probably the biggest area that needs to be tested more. I can simulate this on a small scale, but I don't have the resources to do an appropriate larger test with injected failure. If you have something specific you'd like to help out with, I'd be happy to work with you. This feature will also be included in the next version of Accumulo shipped in HDP - Josh [1] http://people.apache.org/~elserj/accumulo_user_manual.html#_replication [2] https://issues.apache.org/jira/issues/?jql=project%20%3D%20ACCUMULO%20AND%20component%20%3D%20replication%20AND%20resolution%20%3D%20Unresolved%20ORDER%20BY%20priority%20DESC%2C%20key%20DESC [3] https://github.com/joshelser/merkle On 6/26/14, 8:50 PM, Joe Stein wrote: > Hi, I was hoping to get some more info around the 1.7 release and what are > the to-be-dos and plans around it? > > Is there any help that is needed from a contribution perspective in > anyways? Testing? Documentation? Pending coding or such? > > We are going to be rolling trunk into two of our lab environments > specifically for https://issues.apache.org/jira/browse/ACCUMULO-378 as it > is a requirement for one of my projects at Bloomberg for Accumulo to have > data center replication before we go live. This works is going to be over > the next month(s) with lots of cycles dedicated to Accumulo 1.7 in the next > few sprints. > > Also, I wanted to reach out if folks are looking for full time, contract or > even side work with Accumulo. We have projects right now going on and are > looking for more hands on keyboards. > > Anyways, thanks for all the great work!!!! I am looking forward to more > continued success with the system, more integrations and to be able to > become more active in the community. > > /******************************************* > Joe Stein > Founder, Principal Consultant > Big Data Open Source Security LLC > http://www.stealth.ly > Twitter: @allthingshadoop > ********************************************/ >