Return-Path: X-Original-To: apmail-accumulo-dev-archive@www.apache.org Delivered-To: apmail-accumulo-dev-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 4FBBE173DF for ; Wed, 8 Oct 2014 05:14:17 +0000 (UTC) Received: (qmail 11504 invoked by uid 500); 8 Oct 2014 05:14:12 -0000 Delivered-To: apmail-accumulo-dev-archive@accumulo.apache.org Received: (qmail 11460 invoked by uid 500); 8 Oct 2014 05:14:12 -0000 Mailing-List: contact dev-help@accumulo.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@accumulo.apache.org Delivered-To: mailing list dev@accumulo.apache.org Received: (qmail 11444 invoked by uid 99); 8 Oct 2014 05:14:11 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 08 Oct 2014 05:14:11 +0000 X-ASF-Spam-Status: No, hits=-0.7 required=5.0 tests=RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of josh.elser@gmail.com designates 209.85.216.54 as permitted sender) Received: from [209.85.216.54] (HELO mail-qa0-f54.google.com) (209.85.216.54) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 08 Oct 2014 05:14:06 +0000 Received: by mail-qa0-f54.google.com with SMTP id i13so5339686qae.27 for ; Tue, 07 Oct 2014 22:13:46 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=message-id:date:from:user-agent:mime-version:to:subject:references :in-reply-to:content-type:content-transfer-encoding; bh=cevZaGX9WrAvHjFUPdBHpw/SUvwJQtO57qlVHPOHPIA=; b=YPhm3XhjytOq91Epw0CMRvByaJQNCFbeeyZO4Ez1tHozGUbYgpRXE1C3LUvtCQ5FlI CFBo2w2srz1yDA0OHuQRRyQ8oOhu+DZZ4Dcw4sh1e47pldcDSuzl3YCOzWmQdVPYJo/4 oioDo/kucEJvhuMfY39I9fB4skwKNb3x0X96je34Zc1aZvTMOAYTHqogI5MyLtiHpW1d 1PMfyyTQYXbOg6X897j+uZQ/8knlfSKXmRu/xGGCb1xOHtVPhxhOkQvAzQY3nuv/SXQo otUCzkj48PIWJiydypALG7zjx8epVBlct1fKzSCICQzmu1jN2PO+fZwke83gq4RrgOgl uuJg== X-Received: by 10.224.65.9 with SMTP id g9mr10004613qai.59.1412745225998; Tue, 07 Oct 2014 22:13:45 -0700 (PDT) Received: from HW10447.local (pool-71-166-48-47.bltmmd.fios.verizon.net. [71.166.48.47]) by mx.google.com with ESMTPSA id b12sm3849237qaj.4.2014.10.07.22.13.44 for (version=TLSv1 cipher=ECDHE-RSA-RC4-SHA bits=128/128); Tue, 07 Oct 2014 22:13:44 -0700 (PDT) Message-ID: <5434C808.6070608@gmail.com> Date: Wed, 08 Oct 2014 01:13:44 -0400 From: Josh Elser User-Agent: Postbox 3.0.11 (Macintosh/20140602) MIME-Version: 1.0 To: dev@accumulo.apache.org Subject: Re: 1.7 release timeline References: <5433360C.3020601@gmail.com> <5434AA1C.10303@gmail.com> In-Reply-To: <5434AA1C.10303@gmail.com> Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: 7bit X-Virus-Checked: Checked by ClamAV on apache.org Forgot one: *Drop Hadoop 1 support* - We would no longer care about maintaining Hadoop 1 APIs (get rid of crappy reflection) - 2.2.0 (Hadoop 2 "stable") came out just under 1 year ago - Can be done for 1.7 or reconsidered for 2.0 Josh Elser wrote: > Some more information on the subject. A few of us got together to > co-work today and had an informal discussion on our individual interests > for 1.7. Summary incoming: > > *Monitor re-write* > - I was pushing this one, I think the monitor still has merit despite > the goal of the desire of other to just integrate with external systems > - I have some code in place, but still needs more work. > - Is a unified/stable "metrics" API necessary for integration w/ > external tools? (or is JMX enough?) > - An API would probably be a more usable interface than JMX > - Such an API should be stateless (no log aggregation nor statistics > over time) > - Monitor still has uses for standalone/small deployments > - If still being used, MVC approach would ease testing and addition of > new data and views > - Not necessary to hold up 1.7.0 from happening > > *Revisit performance* > - Eric mentioned that he wants to spend some time running some Accumulo > benchmarks, specifically YCSB. > - Lots of related topics were mentioned that might be relevant > * Other HDFS block cache implementations (HBase has lots of nice > benchmarks, could learn from them) > * A WIP patch for metadata updates have some promise (ACCUMULO-2889) > * Collapse iterator stack (ACCUMULO-3079) > * Possible improvements to Scanner for single-batch cases (reduce a few > RPCs to one RPC) > - Actual changes made likely to be found via investigation > - Changing default conf values where relevant also mentioned > > *Distributed Tracing* > - Billie has been spending some time working w/ some people on replacing > Cloudtrace with HTrace > - Mentioned that HTrace shares a remarkable amount of similarity with > our existing tracing library > - Upstream efforts in Hadoop-3 to integrate htrace to DN/NN calls > - Some consideration given to replace traceserver with zipkin however > not required for the first implementation > > *Decouple MiniAccumuloCluster from ITs* > - Another one I've started working on > - ITs are really great, we have a lot for really good cases > - Running them against a real instance in infeasible right now > - Would be good to express as many as possible in terms of only using > Instance+Connector > - Christopher mentioned possible benefit outside of tests to using the > accumulo-maven-plugin as the "shim" between a real instance and a > MiniAccumuloCluster > - Some tests are written explicitly for MAC and must be ignored or run > against a MAC when a real instance is available. > > *Upgrade test script* > - Keith mentioned there's some code from John McNamee that might help > testing upgrade paths > > *Hadoop Metrics2* > - Metrics2 is the current library in use by Hadoop > - Integration gives us a lot more flexibility, notably good integration > with Ganglia provided (ACCUMULO-1817) > - No one expressed interest in working on this directly (potential to slip) > > *Deprecate MockAccumulo?* > - Talked about this for 1.6, decided against > - It's now 1.7. Is it time? > - Remember, deprecate != removal > > There are some outstanding things we need to investigate more: > - Is improved JMX or metrics2 impl sufficient for integration with > external monitoring tools? (considerations: nagios, ganglia, statsd, > collectd, carbon, riemann... others?) > - BatchWriter has some weird cases around error handling. Is intended > that it survives failures, but that's very much not the case. Should > probably be fixed around a major release, but need to figure out how > exactly to fix it (needs someone to get behind it) > > If people want to continue discussion on these, let's break off > individual topics into their own thread for clarity (and my sanity). > > Also, anyone have a desire to be "release manager"? > > - Josh > > Josh Elser wrote: >> Thanks, John. >> >> I was thinking about trying to gun for January time-frame for a release. >> I'd love to say before 2014 is over, but that probably just won't happen >> for a major release with the holidays. >> >> For 1.7 right now, I see the following "bigger" items (correct me where >> I'm wrong): >> >> * Replication (done) >> * Upgrade rules/guarantees (proposed) >> * Replace cloudtrace (in-progress) >> * Rewrite monitor, include REST service (in-progress) >> * Drop Hadoop 1 support (proposed) >> * Decouple MiniAccumulo from ITs (in-progress) >> * Other minicluster types: in-process, shim to real instance >> (in-progress) >> * Support Hadoop metrics2 (proposed) >> * A few WAL/metadata related performance improvements (in-progress) >> >> Also, would be good to check the In-Progress state issues on JIRA. What >> do people think? >> >> John Vines wrote: >>> Moving this to it's own thread... >>> >>> On Mon, Oct 6, 2014 at 5:54 PM, Mike Drob wrote: >>> >>>> Related: Do we have a release timeline for 1.7? >>>>