Subject: Re: Growing project involvement
From: Keith Turner
To: Accumulo Dev List
Date: Thu, 15 Jan 2015 10:47:02 -0500

On Tue, Jan 13, 2015 at 5:49 PM, Joe Stein wrote:

> I have had a lot of feedback in the marketplace on Accumulo. This feedback
> was 100% from folks that didn't have Accumulo as a requirement to run and
> feel that it is very relevant to broader adoption. All of the below
> comments are a combination of my own opinions and what I have heard from
> others in the market in discussion about Accumulo.
>
> 1) Iterators are awesome from a software architecture perspective. From a
> development perspective if you have worked with them you have an experience
> or two to share on how to improve them. Anything that can be done to
> improve this experience for developers will be welcomed by new and
> existing users.
>
> 2) Lots of little cosmetic surface things in lots of places and attention
> to detail. e.g. https://github.com/apache/accumulo the branch is not the
> latest and even the latest branch (master?) README isn't really welcoming
> or appealing from a "my first time visiting the project" perspective. For

This is all really good feedback, thanks. I am going to take a whack at
improving the README today or tomorrow and put it up for review. I am going
to look at improving the content and converting it to Markdown so that it
looks good on GitHub.

> new users you only get 1 impression for a first impression; this is
> important under the "technical marketing umbrella". Some Vagrant and/or
> Docker will make getting up and running quickly fantastic for folks that
> have to (or want to) interact with Accumulo.
>
> 3) The project should/could have more out of the box integrations and
> support from the core project release cycles. e.g. Accumulo Framework for
> Apache Mesos. I don't think the drive for this (Mesos support) is lacking
> but having spoken to other Accumulo users there is no clear path how folks
> can help to make this happen. The ecosystem just isn't big enough for
> these types of projects to exist successfully outside the core project on
> some GitHub URL.
>
> 4) Some ecosystem page or place where "all things accumulo" can be sought
> after... planet accumulo, something like that (no reason to reinvent this
> wheel). This is probably a combined issue of lack of aggregatable things
> (which we should try to improve) and the ability to have them seen in one
> place. One of the coolest things I have seen Accumulo release since
> following the project has been
> https://blogs.apache.org/accumulo/entry/scaling_accumulo_with_multi_volume
> but I haven't seen anything else since this posting. Is it that the
> information isn't bubbling up or that people aren't posting more about cool
> things in place? Are people even using it?
>
> 5) Not; just; Java; please; => how about more Scala (maybe Iterator
> examples) and/or Go with some ProtoBuf interface? From an implementation
> perspective Java; just; kills; things; in; their; tracks; ! and Thrift has
> a way to do that too...
>
> 6) Operations is almost an opaque box. Getting something up and running for
> development is important but so is pushing it into production and
> sustaining it at scale. The more information about how this is done and
> where things work and do not work will be a *HUGE* driver for the
> community (IMHO).
> Again, maybe all this stuff is out there and #4 is really
> how to solve this for folks to not spend their nights and weekends
> googling.
>
> 7) Apache Spark support. While arguably this goes under #3 I think it has
> to be called out as another (better?) option for MapReduce. It is really
> easy to get Spark to use AccumuloInputFormat which is wonderful and a
> fantastic opportunity for making Accumulo shine with Spark. A few samples
> people can run with Spark and Accumulo together that do something more than
> word count will go a long way to attracting an audience too.
>
> 8) More ways to highlight the workloads that Accumulo was built for and
> what it does now and how it is not about website or social or ads is
> important to organizations in verticals that care differently about their
> data.
>
> 9) Better call out features and highlight them with more examples
> explicitly. I might be repeating myself at this point but wanted to bring
> up "Tracing" as another good example of a REALLY cool feature that folks,
> when they see it, don't entirely understand what to do with or how. Whether
> you Google for "accumulo trace" or go through the documentation, it is
> impossible to figure out how to use it and make it work without late nights
> and tender loving care.
>
> None of these things are easy, and they are very demanding for open source
> projects and communities. I think this is a great discussion and hope to
> continue to contribute moving forward.
>
> /*******************************************
> Joe Stein
> Founder, Principal Consultant
> Big Data Open Source Security LLC
> http://www.stealth.ly
> Twitter: @allthingshadoop
> ********************************************/
>
> On Tue, Jan 13, 2015 at 4:37 PM, Keith Turner wrote:
>
> > I think a minimal getting started guide is needed on the web site.
> > Something that will take a user from download to running on a cluster in as
> > few steps as possible.
> > This info is buried in the README, but there is too
> > much other stuff in the README.
> >
> > On Tue, Jan 13, 2015 at 4:09 PM, Josh Elser wrote:
> >
> > > I meant to send this out closer to the new year (to ride on the new year
> > > resolution stereotype), but I slacked. Forgive me.
> > >
> > > As those paying attention should be aware, we have had very little
> > > growth within the project over the past 6-9 months. We've had our normal
> > > smattering of contributions, a few from some repeat people, but I don't
> > > think we've grown as much as we could.
> > >
> > > I wanted to see if anyone has any suggestions on what we could try to do
> > > better in the coming year to help more people get involved with the
> > > project. I don't want this to turn into a "we do X wrong" discussion, so
> > > please try to stay positive and include suggestion(s) for every problem
> > > presented when possible.
> > >
> > > Also, everyone should feel welcome to participate in the discussion here.
> > > If you fall into the "bucket" described, I'd love to hear from you. If
> > > anyone doesn't want to publicly respond, please feel free to email me
> > > privately and I'll anonymously post to the list on your behalf.
> > >
> > > Some ideas to start off discussion:
> > >
> > > * Help reduce barrier to entry for new developers
> > >   - Ensure simple/easy-to-process instructions for getting and building
> > >     code in common environments
> > >   - Instructions on running tests and reporting issues
> > >
> > > * More high-level examples
> > >   - Maybe we start too deep in distributed-systems land and we scare away
> > >     devs who think they "don't know enough to help"
> > >   - Recording "newbie" tickets and providing adequate information for
> > >     anyone to come along and try to take it on
> > >   - Encourage/help/promote "concrete" ideas/code in the project.
> > >     Something that is more tangible for devs to wrap their head around
> > >     (also can help with adoption from new users)
> > >
> > > * Better documentation and "marketing"
> > >   - We do "ok" with the occasional blog post, and the user manual is
> > >     usually thorough, but we can obviously do better.
> > >   - Can we create more "literature" to encourage more users and devs to
> > >     get involved, trying to lower the barrier to entry?
> > >
> > > Thanks all.