Return-Path: X-Original-To: apmail-hbase-dev-archive@www.apache.org Delivered-To: apmail-hbase-dev-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id BF76418700 for ; Thu, 20 Aug 2015 19:01:01 +0000 (UTC) Received: (qmail 15620 invoked by uid 500); 20 Aug 2015 19:00:59 -0000 Delivered-To: apmail-hbase-dev-archive@hbase.apache.org Received: (qmail 15528 invoked by uid 500); 20 Aug 2015 19:00:59 -0000 Mailing-List: contact dev-help@hbase.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@hbase.apache.org Delivered-To: mailing list dev@hbase.apache.org Received: (qmail 15208 invoked by uid 99); 20 Aug 2015 19:00:58 -0000 Received: from Unknown (HELO spamd4-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 20 Aug 2015 19:00:58 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd4-us-west.apache.org (ASF Mail Server at spamd4-us-west.apache.org) with ESMTP id 01777C019C for ; Thu, 20 Aug 2015 19:00:58 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd4-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 2.901 X-Spam-Level: ** X-Spam-Status: No, score=2.901 tagged_above=-999 required=6.31 tests=[DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, HTML_MESSAGE=3, URIBL_BLOCKED=0.001] autolearn=disabled Authentication-Results: spamd4-us-west.apache.org (amavisd-new); dkim=pass (2048-bit key) header.d=gmail.com Received: from mx1-us-east.apache.org ([10.40.0.8]) by localhost (spamd4-us-west.apache.org [10.40.0.11]) (amavisd-new, port 10024) with ESMTP id etSB-F2eiwO5 for ; Thu, 20 Aug 2015 19:00:43 +0000 (UTC) Received: from mail-ig0-f181.google.com (mail-ig0-f181.google.com [209.85.213.181]) by mx1-us-east.apache.org (ASF Mail Server at mx1-us-east.apache.org) with ESMTPS id E9AE442984 for ; Thu, 20 Aug 2015 19:00:42 +0000 (UTC) Received: by igxp17 with SMTP id p17so136461555igx.1 for ; Thu, 20 Aug 2015 12:00:42 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :cc:content-type; bh=9FMpySLkLNhvUwUBvsILCTv0ifoIuA4Nr1mKVpozZ/0=; b=iJrZxeYsdhfb/vLXRKchzog9LLIHYKCol3Qsg4swsn40/49J5xUOpz4t6CHxfwI3aN C7+5RfjhnKBsK5MEZltrBPvHBh70H9Zf5VUCSXTzYDuiys5SW7m4/TVBTUvdXohLEuKW +Z3CctKKoqBa11NMQ5TVZKv60oE6JvArTG8fQESgftr09FbP1rMHqWKyqaudJGg1XQQW 24BvWGEl+vfKGnDQ/3SBPh4ytZjFeKA7EuNOk0TMJNRQ4uS7MQE6srtuR1B9l8xb94iv 9LJndyLBE2QTXvOIQtQDhhI7lf/kn7UdkJPYpPKuiNj5pOPY3l1zz0Yp3e/n8HbzB7r8 yuQw== MIME-Version: 1.0 X-Received: by 10.50.17.9 with SMTP id k9mr8936575igd.93.1440097242518; Thu, 20 Aug 2015 12:00:42 -0700 (PDT) Received: by 10.79.107.142 with HTTP; Thu, 20 Aug 2015 12:00:42 -0700 (PDT) In-Reply-To: References: Date: Thu, 20 Aug 2015 15:00:42 -0400 Message-ID: Subject: Re: DISCUSSION: lets do a developer workshop on near-term work From: Biju N To: dev@hbase.apache.org Cc: Stephen Jiang Content-Type: multipart/alternative; boundary=089e01182a349518d9051dc2c49a --089e01182a349518d9051dc2c49a Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable Thanks a lot Stack. On Thu, Aug 20, 2015 at 2:37 PM, Stack wrote: > On Thu, Aug 20, 2015 at 11:13 AM, Biju N wrote: > > > Is there a way to participate remotely or at least listen in to this > > meet-up? There will be at least a few who will be interested to dial in > > from the east coast. > > > > > > Should be able to get you at least audio. Will post something here on thi= s > thread and to the meetup page just before the meeting starts. > St.Ack > > > > > On Wed, Aug 12, 2015 at 3:29 PM, Stack wrote: > > > > > I posted this meetup notice: > > > http://www.meetup.com/hackathon/events/224589819/ > > > St.Ack > > > > > > On Wed, Aug 12, 2015 at 1:34 AM, Enis S=C3=B6ztutar > wrote: > > > > > > > Agreed, too many fat topics, but all important. I guess we can spen= d > > > first > > > > 10-20 mins on the agenda based on who is in the room and come up > with a > > > > shorter list and go from there. > > > > > > > > Enis > > > > > > > > On Tue, Aug 11, 2015 at 9:23 PM, Stack wrote: > > > > > > > > > On Mon, Jul 20, 2015 at 1:04 PM, Stephen Jiang < > > > syuanjiangdev@gmail.com> > > > > > wrote: > > > > > > > > > > > [Let us move back to the main topic - a meeting to talk about t= he > > > next > > > > > > direction on HBASE development] > > > > > > > > > > > > Are we firm on the *August 26th* meeting date? > > > > > > > > > > > > Given the long list of topics from St.Ack, even a one day meeti= ng > > > might > > > > > > not cover all of them (in depth). We need to either trim the > topic > > > > list > > > > > or > > > > > > limit the time to discuss a single topic (30 min for one topic > > > > enough?). > > > > > > > > > > > > > > > > > Thanks for bringing us back to topic Stephen. > > > > > > > > > > Yes, lets do 26th. Speak up if this does not suit. I will file a > > meetup > > > > > page in an hour or so. Where should we do it? Enis offered his ni= ce > > > > place. > > > > > Could try and get space at ours too... in Palo Alto (less 'deep > > > south', a > > > > > little easier for the SFers). > > > > > > > > > > As to too many topics, in my experience, a bunch of smelly > engineers > > > all > > > > in > > > > > a room starts to fall apart after a couple of hours especially wh= en > > > > ranging > > > > > discussion. Suggest we cut the time-per-topic and list of topics = so > > can > > > > do > > > > > in an afternoon. If some topics are too fat, can do break out or > > > put-off > > > > to > > > > > another day and smaller, interested group. > > > > > > > > > > St.Ack > > > > > > > > > > > > > > > > > > > > > > > > > > Thanks > > > > > > Stephen > > > > > > > > > > > > > > > > > > On Mon, Jul 20, 2015 at 9:50 AM, Anoop John < > anoop.hbase@gmail.com > > > > > > > > wrote: > > > > > > > > > > > >> We will be doing some more large data tests in coming week > Andy.. > > > > Will > > > > > >> report back more. Also will do a write up , in what all ways > the > > > work > > > > > >> might help us. As Sean said, we will continue in another thre= ad > > if > > > > any > > > > > >> thing further.. Will soon write back on the test result. > Thanks. > > > > > >> > > > > > >> -Anoop- > > > > > >> > > > > > >> On Mon, Jul 20, 2015 at 9:59 PM, Andrew Purtell < > > > > > andrew.purtell@gmail.com > > > > > >> > > > > > > >> wrote: > > > > > >> > > > > > >> > Cool, thanks. > > > > > >> > > > > > > >> > Is a 20% latency reduction the most we can expect or do you > > think > > > > > there > > > > > >> is > > > > > >> > room for more improvement? Just curious. > > > > > >> > > > > > > >> > Is latency reduction the only goal? Anything here about > > supporting > > > > > >> larger > > > > > >> > heaps? Is there something we can measure in that regard? > > > > > >> > > > > > > >> > Hope you see my point and there's enough here to prime a goa= ls > > and > > > > > >> metrics > > > > > >> > discussion at the pow wow or on the relevant JIRAs. > > > > > >> > > > > > > >> > > On Jul 20, 2015, at 4:43 AM, ramkrishna vasudevan < > > > > > >> > ramkrishna.s.vasudevan@gmail.com> wrote: > > > > > >> > > > > > > > >> > > Hi Andy > > > > > >> > > > > > > > >> > > Based on our POCs done, we expect around 20% improvement i= n > > > > latency. > > > > > >> For > > > > > >> > > scans it will be little lesser than 20%. > > > > > >> > > > > > > > >> > > Regards > > > > > >> > > Ram > > > > > >> > > > > > > > >> > > > > > > > >> > > On Sun, Jul 19, 2015 at 10:20 AM, Andrew Purtell < > > > > > >> > andrew.purtell@gmail.com> > > > > > >> > > wrote: > > > > > >> > > > > > > > >> > >> Hi Ram, > > > > > >> > >> > > > > > >> > >> Do you have any targets for what you are measuring? What > are > > > the > > > > > >> goals > > > > > >> > you > > > > > >> > >> guys are working toward with the off heaping changes? > > > > > >> > >> > > > > > >> > >> > > > > > >> > >>>> On Jul 18, 2015, at 9:16 PM, ramkrishna vasudevan < > > > > > >> > >>> ramkrishna.s.vasudevan@gmail.com> wrote: > > > > > >> > >>> > > > > > >> > >>> Thanks Vladimir. > > > > > >> > >>> Yeah, the reports that were attached specifically captur= ed > > the > > > > > >> 95/99th > > > > > >> > >>> percentile. > > > > > >> > >>> The reason for checking the server side perf was to > > > specifically > > > > > see > > > > > >> > the > > > > > >> > >>> improvement in the server side and also the client was > > sending > > > > > large > > > > > >> > >>> results in multiple threads. So wanted to avoid the n/w > > > > > >> interference. I > > > > > >> > >>> think it was a general practice that we were following. > > > > > >> > >>> We Wil do some more tests and get some latest readings > with > > > > bigger > > > > > >> data > > > > > >> > >>> sets. > > > > > >> > >>> Sent from mobile. > > > > > >> > >>>> On Jul 19, 2015 1:05 AM, "Andrew Purtell" < > > > > > >> andrew.purtell@gmail.com> > > > > > >> > >> wrote: > > > > > >> > >>>> > > > > > >> > >>>> +1 > > > > > >> > >>>> > > > > > >> > >>>> Yeah, something like that, with aspirational targets fo= r > > > > > >> improvement > > > > > >> > >> from > > > > > >> > >>>> current releases. Then what to measure, the tests to ru= n, > > and > > > > > >> criteria > > > > > >> > >> for > > > > > >> > >>>> evaluation are clear and organized and we're able to > better > > > > > assess > > > > > >> how > > > > > >> > >> the > > > > > >> > >>>> work in progress is meeting its goals (or not) > > > > > >> > >>>> > > > > > >> > >>>> > > > > > >> > >>>> > > > > > >> > >>>> On Jul 18, 2015, at 12:05 PM, Vladimir Rodionov < > > > > > >> > vladrodionov@gmail.com > > > > > >> > >>> > > > > > >> > >>>> wrote: > > > > > >> > >>>> > > > > > >> > >>>>>>> Umbrella jira to make sure we can have blocks cached > in > > > > > offheap > > > > > >> > >> backed > > > > > >> > >>>>> cache. In the entire read path, we can refer to this > > offheap > > > > > >> buffer > > > > > >> > and > > > > > >> > >>>>> avoid onheap copying. > > > > > >> > >>>>> > > > > > >> > >>>>> I think, on a read path, the most important improvemen= t > we > > > > could > > > > > >> > >> imagine > > > > > >> > >>>> is > > > > > >> > >>>>> elimination or reducing of object creations (KVs, > > iterators > > > > > etc). > > > > > >> > >>>>> object reuse, byte buffers reuse or offheap buffers > reuse, > > > API > > > > > >> change > > > > > >> > >>>> etc. > > > > > >> > >>>>> If this is a part of this JIRA, then I would easily > > define a > > > > > goal: > > > > > >> > >>>>> improving 95/99% latency of a read operations. Not > > > > performance, > > > > > >> but > > > > > >> > >>>> latency > > > > > >> > >>>>> matters > > > > > >> > >>>>> > > > > > >> > >>>>> -Vlad > > > > > >> > >>>>> > > > > > >> > >>>>> > > > > > >> > >>>>> > > > > > >> > >>>>> On Sat, Jul 18, 2015 at 11:24 AM, Andrew Purtell < > > > > > >> > >>>> andrew.purtell@gmail.com> > > > > > >> > >>>>> wrote: > > > > > >> > >>>>> > > > > > >> > >>>>>> That's not a realistic or useful test scenario, unles= s > > the > > > > goal > > > > > >> is > > > > > >> > to > > > > > >> > >>>>>> accelerate queries where all cells are filtered at th= e > > > > server. > > > > > >> > >>>>>> > > > > > >> > >>>>>> > > > > > >> > >>>>>> > > > > > >> > >>>>>>> On Jul 18, 2015, at 11:02 AM, Anoop John < > > > > > anoop.hbase@gmail.com > > > > > >> > > > > > > >> > >>>> wrote: > > > > > >> > >>>>>>> > > > > > >> > >>>>>>> No Andy. 11425 having doc attached to it. At the end > of > > > it, > > > > we > > > > > >> have > > > > > >> > >>>> added > > > > > >> > >>>>>>> perf numbers in a cluster testing. This was done > using > > PE > > > > get > > > > > >> and > > > > > >> > >> scan > > > > > >> > >>>>>>> tests with filtering all cells at server (to not > > consider > > > > n/w > > > > > >> > >> bandwidth > > > > > >> > >>>>>>> constraints) > > > > > >> > >>>>>>> > > > > > >> > >>>>>>> -Anoop- > > > > > >> > >>>>>>> > > > > > >> > >>>>>>> On Sat, Jul 18, 2015 at 9:30 PM, Andrew Purtell < > > > > > >> > >>>>>> andrew.purtell@gmail.com> > > > > > >> > >>>>>>> wrote: > > > > > >> > >>>>>>> > > > > > >> > >>>>>>>> We have some microbenchmarks, not evidence of > > differences > > > > > seen > > > > > >> > from > > > > > >> > >> a > > > > > >> > >>>>>>>> client application. I'm not saying that > microbenchmarks > > > are > > > > > not > > > > > >> > >>>> totally > > > > > >> > >>>>>>>> necessary and a great start - they are - but that > they > > > > don't > > > > > >> > measure > > > > > >> > >>>> an > > > > > >> > >>>>>> end > > > > > >> > >>>>>>>> goal. Furthermore unless I've missed one somewhere = we > > > don't > > > > > >> have a > > > > > >> > >>>> JIRA > > > > > >> > >>>>>> or > > > > > >> > >>>>>>>> design doc that states a clear end goal metric like > the > > > > > >> strawman I > > > > > >> > >>>> threw > > > > > >> > >>>>>>>> together in my previous mail. A measurable system > level > > > > goal > > > > > >> and > > > > > >> > >> some > > > > > >> > >>>>>> data > > > > > >> > >>>>>>>> from full cluster testing would go a lot further > toward > > > > > letting > > > > > >> > all > > > > > >> > >> of > > > > > >> > >>>>>> us > > > > > >> > >>>>>>>> evaluate the potential and payoff of the work. In t= he > > > > > meantime > > > > > >> we > > > > > >> > >>>> should > > > > > >> > >>>>>>>> probably be assembling these changes on a branch > > instead > > > of > > > > > in > > > > > >> > >> trunk, > > > > > >> > >>>>>> for > > > > > >> > >>>>>>>> as long as the goal is not clearly defined and the > > payoff > > > > and > > > > > >> > >>>> potential > > > > > >> > >>>>>> for > > > > > >> > >>>>>>>> perf regressions is untested and unknown. > > > > > >> > >>>>>>>> > > > > > >> > >>>>>>>> > > > > > >> > >>>>>>>>> On Jul 18, 2015, at 8:05 AM, Anoop John < > > > > > >> anoop.hbase@gmail.com> > > > > > >> > >>>> wrote: > > > > > >> > >>>>>>>>> > > > > > >> > >>>>>>>>> Thanks Andy and Lars. The parent jira has doc > > attached > > > > > which > > > > > >> > >>>> contains > > > > > >> > >>>>>>>> some > > > > > >> > >>>>>>>>> perf gain numbers.. We will be doing more tests i= n > > > next 2 > > > > > >> weeks > > > > > >> > >>>>>> (before > > > > > >> > >>>>>>>>> end of this month) and will publish them. Yes it > > will > > > be > > > > > >> great > > > > > >> > if > > > > > >> > >>>> it > > > > > >> > >>>>>> is > > > > > >> > >>>>>>>>> more IST friendly time :-) > > > > > >> > >>>>>>>>> > > > > > >> > >>>>>>>>> -Anoop- > > > > > >> > >>>>>>>>> > > > > > >> > >>>>>>>>> On Fri, Jul 17, 2015 at 9:44 PM, Andrew Purtell < > > > > > >> > >>>>>>>> andrew.purtell@gmail.com> > > > > > >> > >>>>>>>>> wrote: > > > > > >> > >>>>>>>>> > > > > > >> > >>>>>>>>>>> I can represent your side Ram (and Anoop). I've > been > > > > known > > > > > >> > always > > > > > >> > >>>>>> argue > > > > > >> > >>>>>>>>>> both side of a discussion and to never take sides > > > easily > > > > > >> (drives > > > > > >> > >>>> some > > > > > >> > >>>>>>>> folks > > > > > >> > >>>>>>>>>> crazy). > > > > > >> > >>>>>>>>>> > > > > > >> > >>>>>>>>>> I can vouch for this (smile) > > > > > >> > >>>>>>>>>> > > > > > >> > >>>>>>>>>> I also can offer support for off heaping there. A= t > > the > > > > same > > > > > >> time > > > > > >> > >> we > > > > > >> > >>>> do > > > > > >> > >>>>>>>>>> have a gap where we can't point to a timeline of > > > > > improvements > > > > > >> > >> (yet, > > > > > >> > >>>>>>>> anyway) > > > > > >> > >>>>>>>>>> with benchmarks showing gains where your goals ne= ed > > > them. > > > > > For > > > > > >> > >>>> example, > > > > > >> > >>>>>>>>>> stock HBase in one JVM can address max N GB for > > > response > > > > > time > > > > > >> > >>>>>>>> distribution > > > > > >> > >>>>>>>>>> D; dev version of HBase in off heap branch can > > address > > > > max > > > > > >> N' GB > > > > > >> > >> for > > > > > >> > >>>>>>>>>> distribution D', where N' > N and D > D' > > (distribution > > > D' > > > > > >> > >>>>>> statistically > > > > > >> > >>>>>>>>>> shows better/lower response times). > > > > > >> > >>>>>>>>>> > > > > > >> > >>>>>>>>>> > > > > > >> > >>>>>>>>>> > > > > > >> > >>>>>>>>>>> On Jul 17, 2015, at 6:56 AM, lars hofhansl < > > > > > >> larsh@apache.org> > > > > > >> > >>>> wrote: > > > > > >> > >>>>>>>>>>> > > > > > >> > >>>>>>>>>>> I'm in favor of anything that improves performan= ce > > > (and > > > > > >> > >> preferably > > > > > >> > >>>>>>>>>> doesn't set us back into a world that's worse tha= n > C > > > due > > > > to > > > > > >> the > > > > > >> > >> lack > > > > > >> > >>>>>> of > > > > > >> > >>>>>>>>>> pointers in Java).Never said "I don't like it", > it's > > > just > > > > > >> that > > > > > >> > I'm > > > > > >> > >>>>>>>> perhaps > > > > > >> > >>>>>>>>>> asking for more numbers and justification in > weighing > > > the > > > > > >> pros > > > > > >> > and > > > > > >> > >>>>>> cons. > > > > > >> > >>>>>>>>>>> I can represent your side Ram (and Anoop). I've > been > > > > known > > > > > >> > always > > > > > >> > >>>>>> argue > > > > > >> > >>>>>>>>>> both side of a discussion and to never take sides > > > easily > > > > > >> (drives > > > > > >> > >>>> some > > > > > >> > >>>>>>>> folks > > > > > >> > >>>>>>>>>> crazy). And Stack's there too, he yell at me wher= e > > > needed > > > > > :) > > > > > >> > >>>>>>>>>>> > > > > > >> > >>>>>>>>>>> Perhaps we can do it a bit later in the evening = so > > > there > > > > > is > > > > > >> a > > > > > >> > >>>>>> fighting > > > > > >> > >>>>>>>>>> chance that folks on IST can participate. I know > that > > > > some > > > > > of > > > > > >> > our > > > > > >> > >>>>>> folks > > > > > >> > >>>>>>>> on > > > > > >> > >>>>>>>>>> IST would love to participate in the backup > > > discussion). > > > > > >> > >>>>>>>>>>> > > > > > >> > >>>>>>>>>>> Like Enis, I'm also happy to host. We're in > Downtown > > > SF. > > > > > I'd > > > > > >> > just > > > > > >> > >>>>>> need > > > > > >> > >>>>>>>>>> an approx. number of folks. > > > > > >> > >>>>>>>>>>> > > > > > >> > >>>>>>>>>>> -- Lars > > > > > >> > >>>>>>>>>>> > > > > > >> > >>>>>>>>>>> From: ramkrishna vasudevan < > > > > > >> ramkrishna.s.vasudevan@gmail.com> > > > > > >> > >>>>>>>>>>> To: "dev@hbase.apache.org" >; > > > lars > > > > > >> > >> hofhansl < > > > > > >> > >>>>>>>>>> larsh@apache.org> > > > > > >> > >>>>>>>>>>> Sent: Wednesday, July 15, 2015 10:10 AM > > > > > >> > >>>>>>>>>>> Subject: Re: DISCUSSION: lets do a developer > > workshop > > > on > > > > > >> > >> near-term > > > > > >> > >>>>>> work > > > > > >> > >>>>>>>>>>> > > > > > >> > >>>>>>>>>>> Hi > > > > > >> > >>>>>>>>>>> What time will it be on August 26th? > > > > > >> > >>>>>>>>>>> @LarsYa. I know that you are not generally in > favour > > > of > > > > > this > > > > > >> > >>>>>> offheaping > > > > > >> > >>>>>>>>>> stuff. May be if we (from India) can attend this > > > meeting > > > > > >> > remotely > > > > > >> > >>>>>> your > > > > > >> > >>>>>>>>>> thoughts can be discussed and also the current > state > > of > > > > > this > > > > > >> > work. > > > > > >> > >>>>>>>>>>> RegardsRam > > > > > >> > >>>>>>>>>>> > > > > > >> > >>>>>>>>>>> > > > > > >> > >>>>>>>>>>> On Wed, Jul 15, 2015 at 9:28 PM, lars hofhansl < > > > > > >> > larsh@apache.org > > > > > >> > >>> > > > > > >> > >>>>>>>> wrote: > > > > > >> > >>>>>>>>>>> > > > > > >> > >>>>>>>>>>> Works for me. I'll be back in the Bay Area the > week > > of > > > > > >> August > > > > > >> > >> 9th. > > > > > >> > >>>>>>>>>>> We have done a _lot_ of work on backups as well = - > > ours > > > > are > > > > > >> more > > > > > >> > >>>>>>>>>> complicated as we wanted fast per-tenant restores= , > so > > > > data > > > > > is > > > > > >> > >>>>>> "grouped" > > > > > >> > >>>>>>>> by > > > > > >> > >>>>>>>>>> tenant. Would like to sync up on that (hopefully > some > > > of > > > > > the > > > > > >> > folks > > > > > >> > >>>> who > > > > > >> > >>>>>>>>>> wrote most of the code will be in town, I'll > check). > > > > > >> > >>>>>>>>>>> > > > > > >> > >>>>>>>>>>> Also interested in the "Time" and "offheap" part= s > > > > > (although > > > > > >> you > > > > > >> > >>>> folks > > > > > >> > >>>>>>>>>> usually do not like what I think about the offhea= p > > > > efforts > > > > > >> :) ). > > > > > >> > >>>>>>>>>>> Would like to add the following topics: > > > > > >> > >>>>>>>>>>> > > > > > >> > >>>>>>>>>>> > > > > > >> > >>>>>>>>>>> - "Timestamp Resolution". Or making space for mo= re > > > bits > > > > in > > > > > >> the > > > > > >> > >>>>>>>>>> timestamps (happy to cover that, unless it's part > of > > > the > > > > > >> "Time" > > > > > >> > >>>> topic) > > > > > >> > >>>>>>>>>>> > > > > > >> > >>>>>>>>>>> > > > > > >> > >>>>>>>>>>> - "Replication". We found that replication canno= t > > keep > > > > up > > > > > >> with > > > > > >> > >> high > > > > > >> > >>>>>>>>>> write loads, due to the fact that replicated is > > > strictly > > > > > >> single > > > > > >> > >>>>>> threaded > > > > > >> > >>>>>>>>>> per regionserver (even though we have multiple > region > > > > > >> servers on > > > > > >> > >> the > > > > > >> > >>>>>>>> sink > > > > > >> > >>>>>>>>>> side) > > > > > >> > >>>>>>>>>>> > > > > > >> > >>>>>>>>>>> > > > > > >> > >>>>>>>>>>> - "Spark integration" (Ted Malaska?) > > > > > >> > >>>>>>>>>>> > > > > > >> > >>>>>>>>>>> > > > > > >> > >>>>>>>>>>> OK... Out now to make a "bullshit hat". > > > > > >> > >>>>>>>>>>> > > > > > >> > >>>>>>>>>>> -- Lars > > > > > >> > >>>>>>>>>>> > > > > > >> > >>>>>>>>>>> ________________________________ > > > > > >> > >>>>>>>>>>> From: Sean Busbey > > > > > >> > >>>>>>>>>>> To: dev > > > > > >> > >>>>>>>>>>> Sent: Tuesday, July 14, 2015 7:11 PM > > > > > >> > >>>>>>>>>>> Subject: Re: DISCUSSION: lets do a developer > > workshop > > > on > > > > > >> > >> near-term > > > > > >> > >>>>>> work > > > > > >> > >>>>>>>>>>> > > > > > >> > >>>>>>>>>>> > > > > > >> > >>>>>>>>>>> I'm planning to be in the Bay area the week of t= he > > > 24th > > > > of > > > > > >> > >> August. > > > > > >> > >>>>>>>>>>> > > > > > >> > >>>>>>>>>>> -- > > > > > >> > >>>>>>>>>>> Sean > > > > > >> > >>>>>>>>>>> > > > > > >> > >>>>>>>>>>> > > > > > >> > >>>>>>>>>>> > > > > > >> > >>>>>>>>>>>> On Jul 14, 2015 7:53 PM, "Andrew Purtell" < > > > > > >> > apurtell@apache.org> > > > > > >> > >>>>>>>> wrote: > > > > > >> > >>>>>>>>>>>> > > > > > >> > >>>>>>>>>>>> I can be up in your area in August. > > > > > >> > >>>>>>>>>>>> > > > > > >> > >>>>>>>>>>>>>> On Tue, Jul 14, 2015 at 5:31 PM, Stack < > > > > > stack@duboce.net > > > > > >> > > > > > > >> > >>>> wrote: > > > > > >> > >>>>>>>>>>>>>> > > > > > >> > >>>>>>>>>>>>>> On Tue, Jul 14, 2015 at 3:39 PM, Enis S=C3=B6= ztutar > < > > > > > >> > >>>>>> enis.soz@gmail.com> > > > > > >> > >>>>>>>>>>>>> wrote: > > > > > >> > >>>>>>>>>>>>> > > > > > >> > >>>>>>>>>>>>>> Sounds good. It has been a while we did the > > > > talk-aton. > > > > > >> > >>>>>>>>>>>>>> > > > > > >> > >>>>>>>>>>>>>> I'll be off starting 25 of July, so I prefer > > > > something > > > > > >> next > > > > > >> > >> week > > > > > >> > >>>>>> if > > > > > >> > >>>>>>>>>>>>>> possible. > > > > > >> > >>>>>>>>>>>>>> > > > > > >> > >>>>>>>>>>>>>> You ever coming back? If so, when? I'm back o= n > > 10th > > > > of > > > > > >> > August > > > > > >> > >>>>>>>> (Mikhail > > > > > >> > >>>>>>>>>>>> on > > > > > >> > >>>>>>>>>>>>> the 20th). > > > > > >> > >>>>>>>>>>>>> St.Ack > > > > > >> > >>>>>>>>>>>>> > > > > > >> > >>>>>>>>>>>>> > > > > > >> > >>>>>>>>>>>>> > > > > > >> > >>>>>>>>>>>>> > > > > > >> > >>>>>>>>>>>>>> Enis > > > > > >> > >>>>>>>>>>>>>> > > > > > >> > >>>>>>>>>>>>>>> On Tue, Jul 14, 2015 at 3:18 PM, Stack < > > > > > >> stack@duboce.net> > > > > > >> > >>>> wrote: > > > > > >> > >>>>>>>>>>>>>>> > > > > > >> > >>>>>>>>>>>>>>> Matteo and I were thinking it time devs got > > > together > > > > > >> for a > > > > > >> > >>>>>> pow-wow. > > > > > >> > >>>>>>>>>>>>> There > > > > > >> > >>>>>>>>>>>>>>> is a bunch of stuff in flight at the moment > (see > > > > below > > > > > >> > list) > > > > > >> > >>>> and > > > > > >> > >>>>>> it > > > > > >> > >>>>>>>>>>>>> would > > > > > >> > >>>>>>>>>>>>>>> be good to meet and whiteboard, surface good= o > > > ideas > > > > > that > > > > > >> > have > > > > > >> > >>>>>> gone > > > > > >> > >>>>>>>>>>>>>> dormant > > > > > >> > >>>>>>>>>>>>>>> in JIRA, or revisit designs/proposals out in > > > > > >> JIRA-attached > > > > > >> > >>>> google > > > > > >> > >>>>>>>> doc > > > > > >> > >>>>>>>>>>>>>> that > > > > > >> > >>>>>>>>>>>>>>> need socializing. > > > > > >> > >>>>>>>>>>>>>>> > > > > > >> > >>>>>>>>>>>>>>> You can only come if you are wearing your > > bullshit > > > > > hat. > > > > > >> > >>>>>>>>>>>>>>> > > > > > >> > >>>>>>>>>>>>>>> Topics we'd go over could include: > > > > > >> > >>>>>>>>>>>>>>> > > > > > >> > >>>>>>>>>>>>>>> + Our filesystem layout will not work if 1M > > > regions > > > > > >> > >>>>>> (Matteo/Stack) > > > > > >> > >>>>>>>>>>>>>>> + Current state of the offheaping of read pa= th > > and > > > > > >> > alternate > > > > > >> > >>>>>>>> KeyValue > > > > > >> > >>>>>>>>>>>>>>> implementation (Anoop/Ram) > > > > > >> > >>>>>>>>>>>>>>> + Append rejigger (Elliott) > > > > > >> > >>>>>>>>>>>>>>> + A Pv2-based Assign (Matteo/Steven) > > > > > >> > >>>>>>>>>>>>>>> + Splitting meta/1M regions > > > > > >> > >>>>>>>>>>>>>>> + The revived Backup (Vladimir) > > > > > >> > >>>>>>>>>>>>>>> + Time (Enis) > > > > > >> > >>>>>>>>>>>>>>> + The overloaded SequenceId (Stack) > > > > > >> > >>>>>>>>>>>>>>> + Upstreaming IT testing (Dima/Sean) > > > > > >> > >>>>>>>>>>>>>>> + hbase-2.0.0 > > > > > >> > >>>>>>>>>>>>>>> > > > > > >> > >>>>>>>>>>>>>>> I put names by folks I know could talk to th= e > > > topic. > > > > > If > > > > > >> you > > > > > >> > >>>> want > > > > > >> > >>>>>> to > > > > > >> > >>>>>>>>>>>>> take > > > > > >> > >>>>>>>>>>>>>>> over a topic or put your name by one, just > say. > > > > > Suggest > > > > > >> > that > > > > > >> > >>>>>>>>>>>>> discussion > > > > > >> > >>>>>>>>>>>>>>> lead off with a 5-10minute on current state = of > > > > > >> > >>>>>>>>>>>>>>> thought/design/implementation. > > > > > >> > >>>>>>>>>>>>>>> > > > > > >> > >>>>>>>>>>>>>>> What do others think? > > > > > >> > >>>>>>>>>>>>>>> > > > > > >> > >>>>>>>>>>>>>>> What date would suit folks? > > > > > >> > >>>>>>>>>>>>>>> > > > > > >> > >>>>>>>>>>>>>>> Anyone want to host? > > > > > >> > >>>>>>>>>>>>>>> > > > > > >> > >>>>>>>>>>>>>>> Thanks, > > > > > >> > >>>>>>>>>>>>>>> Matteo and St.Ack > > > > > >> > >>>>>>>>>>>> > > > > > >> > >>>>>>>>>>>> > > > > > >> > >>>>>>>>>>>> > > > > > >> > >>>>>>>>>>>> -- > > > > > >> > >>>>>>>>>>>> Best regards, > > > > > >> > >>>>>>>>>>>> > > > > > >> > >>>>>>>>>>>> - Andy > > > > > >> > >>>>>>>>>>>> > > > > > >> > >>>>>>>>>>>> Problems worthy of attack prove their worth by > > > hitting > > > > > >> back. - > > > > > >> > >>>> Piet > > > > > >> > >>>>>>>> Hein > > > > > >> > >>>>>>>>>>>> (via Tom White) > > > > > >> > >> > > > > > >> > > > > > > >> > > > > > > > > > > > > > > > > > > > > > > > > > > > --089e01182a349518d9051dc2c49a--