Subject: Re: [DISCUSS] Multi-Cluster HBase Client
From: Michael Segel <michael_segel@hotmail.com>
To: dev@hbase.apache.org
Date: Tue, 30 Jun 2015 09:51:21 -0700

And how do you address the eventual consistency?

> On Jun 30, 2015, at 9:39 AM, Ted Malaska <ted.malaska@cloudera.com> wrote:
>
> Agreed, scope creep is bad. That is why this solution is so great: it is isolated, optional, and can be transparent to users wishing to go from single-cluster to multi-cluster setups.
>
> Also, I want to correct something you said about the design. You asked "how do I know when a cluster has failed?"
>
> So I thought about it more. I won't know in all cases that the primary has failed, but that isn't my goal.
> My goal is to give a constant response rate to my client, based on configs and rules they have defined with the parameters defined in the doc, like:
>
> hbase.failover.mode
> hbase.wait.time.before.accepting.failover.result
> hbase.wait.time.before.request.failover
> hbase.wait.time.before.mutating.failover
> hbase.multi.cluster.allow.check.and.mutate
> hbase.wait.time.before.mutating.failover.with.primary.exception
> hbase.wait.time.before.trying.primary.after.failure
>
> Ted Malaska
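For concreteness, here is a minimal sketch of how a client might wire up these knobs, assuming they are read off an ordinary HBase Configuration object. The property names are the ones Ted lists above; the value types and example values are illustrative guesses, not prescriptions from the design doc.

    import org.apache.hadoop.conf.Configuration;
    import org.apache.hadoop.hbase.HBaseConfiguration;

    public class MccFailoverConfig {
      public static Configuration build() {
        Configuration conf = HBaseConfiguration.create();
        // Turn the multi-cluster failover behavior on (assumed boolean).
        conf.setBoolean("hbase.failover.mode", true);
        // How long a get may sit unanswered before the same request is
        // also sent to a failover cluster (assumed milliseconds).
        conf.setLong("hbase.wait.time.before.request.failover", 100);
        // How long to wait before a failover cluster's (possibly stale)
        // answer is accepted instead of the primary's.
        conf.setLong("hbase.wait.time.before.accepting.failover.result", 10);
        // How long to wait before mutations are redirected to a failover
        // cluster, with and without a primary-side exception.
        conf.setLong("hbase.wait.time.before.mutating.failover", 200);
        conf.setLong("hbase.wait.time.before.mutating.failover.with.primary.exception", 0);
        // Check-and-mutate is unsafe under eventual consistency, so it
        // can be disallowed outright.
        conf.setBoolean("hbase.multi.cluster.allow.check.and.mutate", false);
        // Back-off before probing a failed primary again.
        conf.setLong("hbase.wait.time.before.trying.primary.after.failure", 30000);
        return conf;
      }
    }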
> On Tue, Jun 30, 2015 at 12:32 PM, Michael Segel <michael_segel@hotmail.com> wrote:
>
>> Just to clarify something…
>>
>> I don't want to be the negative voice of reason for not doing something. Too many times, an idea that may sound good in theory but doesn't work in practice gets put into code because no one stopped to think that maybe it's not that good an idea.
>>
>> As a developer / architect / product owner (manager), you need to address scope creep and bloat. You have to ask "Do we really need to do this…", and that's a tougher question to ask and discuss.
>>
>> Design by committee and you end up with a duck-billed platypus. Or a Pontiac Aztek.
>>
>>> On Jun 30, 2015, at 9:11 AM, Michael Segel <michael_segel@hotmail.com> wrote:
>>>
>>> Todd,
>>>
>>> You said: "As far as I'm aware, this has been a very common deployment strategy at Google for close to a decade, for applications that do not require strict consistency."
>>>
>>> And for those applications that do? (Require 'strict' consistency.) And the overhead of running your scans in parallel?
>>>
>>> How many of Cloudera's customers are running clusters that are at FB, Google, or Yahoo! scale? Sometimes you need a voice of reason like Aaron Kimball's. ;-)
>>>
>>> Again… you really need to think this through before putting finger to keyboard.
>>>
>>> Now, if you're talking about a client being able to manage multiple cluster connections… that's a different story, and that's not what Ted was suggesting.
>>>
>>> Oh, and I want to be clear: I think that Ted's thinking about potential problems is a good thing. I'm suggesting that there be a bit more thought about it before you actually try to tackle it as a worthy problem.
>>>
>>> And to also be clear: I'm saying it's not a good idea, not for *my* applications, but in terms of a general concept. If you look at the solution, applications that do not require strict consistency are a shrinking subset of applications. That's not to say you can't find a use case, but as a general approach it fails for a large subset of use cases, and it can cause confusion when someone tries to use it for a use case where you do have a consistency issue and you don't understand why your result sets don't match.
>>>
>>>> On Jun 30, 2015, at 8:46 AM, Todd Lipcon wrote:
>>>>
>>>> Michael -- the experiences of Google before us, and of many Cassandra users, indicate that there are many valid use cases for multi-datacenter clients such as the one Ted has built.
>>>>
>>>> Go read the "Tail at Scale" paper from CACM for an example of how a multi-cluster client can drop tail latencies by an order of magnitude. As far as I'm aware, this has been a very common deployment strategy at Google for close to a decade, for applications that do not require strict consistency.
>>>>
>>>> It's certainly valid and constructive to point out downsides and limitations of the design, and to explain why it might not work for your applications. Perhaps you need something closer to Megastore. But claiming that all applications look just like your applications, in the presence of evidence to the contrary, doesn't benefit anyone.
>>>>
>>>> Todd
>>>>
>>>> On Jun 30, 2015 8:37 AM, "Michael Segel" <michael_segel@hotmail.com> wrote:
>>>>
>>>>> Guys,
>>>>>
>>>>> You really don't want to do this. (Fault tolerance across a single cluster pair…)
>>>>>
>>>>> What I didn't say in my other email is that you're not being specific as to what constitutes a failure. Read: how does your client know that it has lost the connection to its primary cluster?
>>>>>
>>>>> What you might as well do is create a load-balancing server that manages the connection to one of N clusters in your replication group. And even then you'll want to make this redundant. Really?
>>>>>
>>>>> How often do you have a problem with your client connection?
>>>>>
>>>>> If so… get a new HBase admin, or switch to MapR-DB, because you have a stability problem…
>>>>>
>>>>> In terms of a generic client that wants to manage multiple connections… yeah, that's a pretty straightforward problem to solve. But keep in mind that then your cluster isn't spread across multiple data centers; rather, you have multiple clusters.
>>>>>
>>>>> Of course… maybe you're all on a single cluster and you're using Slider… ;-)
>>>>>
>>>>> Again, please think before you pound code.
>>>>>
>>>>> But hey! What do I know? I don't own my IP :-(
>>>>>
>>>>> ;-P
>>>>>
>>>>>> On Jun 30, 2015, at 6:24 AM, Ted Malaska <ted.malaska@cloudera.com> wrote:
>>>>>>
>>>>>> Cool, let me know. If we position HBase.MCC correctly, maybe we can hit two birds with one stone -- at least the client part. It would be nice to have a client that was configurable, and in the core, that would support use cases like this.
>>>>>>
>>>>>> On Tue, Jun 30, 2015 at 9:18 AM, ramkrishna vasudevan <ramkrishna.s.vasudevan@gmail.com> wrote:
>>>>>>
>>>>>>> Thanks, Ted.
>>>>>>>
>>>>>>> Yes, as you said, the idea is to solve a bigger use case where there is a globally distributed cluster but the data is local to each cluster -- i.e., the data that we write and read is local to that geography or cluster. The Cross-Site Big Table will help you read from and write to such a cluster transparently, just by differentiating the clusters with a cluster id.
>>>>>>>
>>>>>>> But the other subset of the problem, the one HBase.MCC solves, can also be achieved, because the failover switching during writes/reads happens based on the replication setup that is available in that local cluster.
>>>>>>>
>>>>>>> As for the state of CSBT: I need to check the latest update, but it was earlier discussed that CSBT cannot be part of the hbase package and would instead be a standalone tool. I can get the update on that.
>>>>>>>
>>>>>>> Regards
>>>>>>> Ram
>>>>>>>
>>>>>>> On Tue, Jun 30, 2015 at 5:05 PM, Ted Malaska <ted.malaska@cloudera.com> wrote:
>>>>>>>
>>>>>>>> Hey Ramkrishna,
>>>>>>>>
>>>>>>>> I think you're right that there are some things that are the same. The difference is the problem they are trying to solve, and the scope.
>>>>>>>>
>>>>>>>> The HBase.MCC design is only about cluster failover and keeping 100% uptime in the case of a single-site failure. The Cross-Site Big Table looks to have some of that too, but it is also more complex, because it has the requirement that data be local to a single cluster. So you need to see all the clusters to get all the data.
>>>>>>>>
>>>>>>>> Maybe I'm wrong, but they are not solving the same problem. Also, because of HBase.MCC's limited scope, it is far easier to implement and maintain.
>>>>>>>>
>>>>>>>> Now, although I agree that the Cross-Site Big Table has a valid use case, the use case for HBase.MCC is more about leveling the ground with Cassandra in the marketplace: allowing us to offer eventual consistency in the case of single-site failure, with configs to determine what thresholds must be passed before accepting those eventually consistent records.
>>>>>>>>
>>>>>>>> This will allow HBase to better compete for use cases that involve near-real-time streaming. This matters because the hot new trend in the market is to move batch workloads to near real time. I think HBase is the best solution out there today for this, except for the fact that on a site or region server failure we lose functionality (reads and writes on site failure, and writes on RS failure).
>>>>>>>>
>>>>>>>> In the end, HBase.MCC's scope is what hopefully should make it exciting. All we need to do is write a new client and update the connection factory to hand back that multi-cluster client when it is requested through the configs (a sketch of this dispatch follows this exchange). Nothing in ZK or HBase core would have to be touched.
>>>>>>>>
>>>>>>>> Side note: because of the flexibility in the HBase.MCC configs, there is a way to reach a good majority of the Cross-Site Big Table goals with just HBase.MCC.
>>>>>>>>
>>>>>>>> Last question: what became of Cross-Site Big Table?
>>>>>>>>
>>>>>>>> Let me know if you find this correct.
>>>>>>>> Thanks
>>>>>>>> Ted Malaska
>>>>>>>>
>>>>>>>> On Tue, Jun 30, 2015 at 12:42 AM, ramkrishna vasudevan <ramkrishna.s.vasudevan@gmail.com> wrote:
>>>>>>>>
>>>>>>>>> Hi Ted
>>>>>>>>>
>>>>>>>>> I think the idea here is very similar to the Cross-Site Big Table project that was presented at HBaseCon 2014.
>>>>>>>>>
>>>>>>>>> Please find the slide link below: http://www.slideshare.net/HBaseCon/ecosystem-session-3
>>>>>>>>>
>>>>>>>>> That project also adds client-side wrappers so that the client can internally do a failover in case of a cluster going down, automatically switching over to the replicated clusters based on the configuration. Let us know if you find this interesting.
>>>>>>>>>
>>>>>>>>> Regards
>>>>>>>>> Ram
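Ted's "update the connection factory" remark above is essentially the whole integration surface. A minimal sketch of that dispatch, under the assumption that a config flag selects the implementation; the multi-cluster branch is stubbed here precisely because no such class exists yet:

    import java.io.IOException;
    import org.apache.hadoop.conf.Configuration;
    import org.apache.hadoop.hbase.client.Connection;
    import org.apache.hadoop.hbase.client.ConnectionFactory;

    public final class McDispatch {
      // Hypothetical factory shim: the single-cluster path is untouched;
      // the other branch is where HBase.MCC's client would plug in.
      public static Connection create(Configuration conf) throws IOException {
        if (conf.getBoolean("hbase.failover.mode", false)) {
          // In the proposal this would return the multi-cluster
          // Connection implementation; stubbed because it is not built.
          throw new UnsupportedOperationException("HBase.MCC client goes here");
        }
        return ConnectionFactory.createConnection(conf);
      }
    }

Because the dispatch keys off configuration alone, existing applications keep calling the same factory and never name a multi-cluster type.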
>>>>>>>>>
>>>>>>>>> On Tue, Jun 30, 2015 at 4:01 AM, Ted Malaska <ted.malaska@cloudera.com> wrote:
>>>>>>>>>
>>>>>>>>>> lol, I did, sorry. This is the right doc:
>>>>>>>>>>
>>>>>>>>>> https://github.com/tmalaska/HBase.MCC/blob/master/MultiHBaseClientDesignDoc.docx.pdf
>>>>>>>>>>
>>>>>>>>>> On Mon, Jun 29, 2015 at 6:30 PM, Andrew Purtell <apurtell@apache.org> wrote:
>>>>>>>>>>
>>>>>>>>>>> I think you may have put up the wrong document? That link goes to a product doc.
>>>>>>>>>>>
>>>>>>>>>>> On Mon, Jun 29, 2015 at 3:24 PM, Ted Malaska <ted.malaska@cloudera.com> wrote:
>>>>>>>>>>>
>>>>>>>>>>>> Here is the PDF link.
>>>>>>>>>>>>
>>>>>>>>>>>> https://github.com/tmalaska/HBase.MCC/blob/master/MultiClusterAndEDH_Latest.docx.pdf
>>>>>>>>>>>>
>>>>>>>>>>>> On Mon, Jun 29, 2015 at 6:09 PM, Sean Busbey <busbey@cloudera.com> wrote:
>>>>>>>>>>>>
>>>>>>>>>>>>> Michael,
>>>>>>>>>>>>>
>>>>>>>>>>>>> This is the dev list; no sound-bite pitch is needed. We have plenty of features whose nuances take time to explain. Please either engage with the complexity of the topic or wait for the feature to land and get user-accessible documentation. We all get busy from time to time, but that's no reason to push a higher burden onto those who are currently engaged with a particular effort, especially this early in development.
>>>>>>>>>>>>>
>>>>>>>>>>>>> That said, the first paragraph gives a suitable brief motivation (slightly rephrased below):
>>>>>>>>>>>>>
>>>>>>>>>>>>>> Some applications require response and availability SLAs that a single HBase cluster cannot meet alone. Particularly at high percentiles, queries to a single cluster can be delayed by e.g. GC pauses, individual server process failure, or maintenance activity. By providing clients with a transparent multi-cluster configuration option, we can mask these outlier conditions from applications that are tolerant of weaker consistency guarantees than HBase provides out of the box.
>>>>>>>>>>>>>
>>>>>>>>>>>>> Ted,
>>>>>>>>>>>>>
>>>>>>>>>>>>> Thanks for writing this up! We'd prefer to keep discussion of it on the mailing list, so please avoid moving to private webexes.
>>>>>>>>>>>>>
>>>>>>>>>>>>> Would you mind if I or one of the other community members converted the design doc to PDF so that it's more accessible?
>>>>>>>>>>>>>
>>>>>>>>>>>>> On Mon, Jun 29, 2015 at 4:52 PM, Ted Malaska <ted.malaska@cloudera.com> wrote:
>>>>>>>>>>>>>>
>>>>>>>>>>>>>> Why don't we set up a webex to talk out the details? What times are you open to talk this week?
>>>>>>>>>>>>>>
>>>>>>>>>>>>>> But to answer your questions: this is for active-active and active-failover clusters. There is a primary and N number of failovers per client. This is for gets and puts.
>>>>>>>>>>>>>>
>>>>>>>>>>>>>> There are a number of configs in the doc to define how to fail over. The options allow a couple of different use cases. There is a lot of detail in the doc, and I just didn't want to put it all in the email.
>>>>>>>>>>>>>>
>>>>>>>>>>>>>> But honestly, I put a lot of time into the doc. I would love to know what you think.
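The "primary and N failovers per client" flow for gets amounts to a speculative request. Here is a minimal sketch for one primary and one failover Connection, assuming the wait-time semantics of the configs listed earlier in the thread; it illustrates the pattern, it is not the HBase.MCC code:

    import java.util.concurrent.CompletionService;
    import java.util.concurrent.ExecutorCompletionService;
    import java.util.concurrent.ExecutorService;
    import java.util.concurrent.Executors;
    import java.util.concurrent.Future;
    import java.util.concurrent.TimeUnit;
    import org.apache.hadoop.hbase.TableName;
    import org.apache.hadoop.hbase.client.Connection;
    import org.apache.hadoop.hbase.client.Get;
    import org.apache.hadoop.hbase.client.Result;
    import org.apache.hadoop.hbase.client.Table;

    public final class SpeculativeGet {
      public static Result get(Connection primary, Connection failover,
                               TableName table, Get get,
                               long waitBeforeRequestFailoverMs) throws Exception {
        ExecutorService pool = Executors.newFixedThreadPool(2);
        try {
          CompletionService<Result> done = new ExecutorCompletionService<>(pool);
          // Always ask the primary first.
          done.submit(() -> { try (Table t = primary.getTable(table)) { return t.get(get); } });
          // Give the primary a head start before going speculative.
          Future<Result> first = done.poll(waitBeforeRequestFailoverMs, TimeUnit.MILLISECONDS);
          if (first != null) {
            return first.get(); // primary answered within the threshold
          }
          // Primary is slow or down: race the failover cluster against it.
          done.submit(() -> { try (Table t = failover.getTable(table)) { return t.get(get); } });
          return done.take().get(); // first answer wins; it may be stale
        } finally {
          pool.shutdownNow();
        }
      }
    }

Whichever answer arrives first is returned, which is exactly why a failover answer can be stale, and why the eventual-consistency question at the top of this thread matters.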
>>>>>>>>>>>>>>
>>>>>>>>>>>>>> On Jun 29, 2015 5:46 PM, "Michael Segel" <michael_segel@hotmail.com> wrote:
>>>>>>>>>>>>>>
>>>>>>>>>>>>>>> Ted,
>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>> If you can't do a 30-second pitch, then it's not worth the effort. ;-)
>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>> Look, when someone says that they want to have a single client talk to multiple HBase clusters, that could mean two very different things.
>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>> First, you could mean that you want a single client to connect to an active/active pair of HBase clusters that replicate to each other. (Active/passive would also be implied, but then you have the issue of when the passive cluster goes active.)
>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>> Then you have the case of someone wanting to talk to multiple different clusters so that they can query the data and create local data sets which they wish to join, combining data from various sources.
>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>> The second is a different problem from the first.
>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>> -Mike
>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>> On Jun 29, 2015, at 3:38 PM, Ted Malaska <ted.malaska@cloudera.com> wrote:
>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>> Hey Michael,
>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>> Read the doc, please. It goes through everything at a low level.
>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>> Thanks
>>>>>>>>>>>>>>>> Ted Malaska
>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>> On Mon, Jun 29, 2015 at 4:36 PM, Michael Segel <michael_segel@hotmail.com> wrote:
>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>> No downtime?
>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>> So you want a client to go against a pair of active/active HBase instances on tied clusters?
>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>>> On Jun 29, 2015, at 3:20 PM, Ted Malaska <ted.malaska@cloudera.com> wrote:
>>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>>> Hey Michael,
>>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>>> The use case is simply "no-downtime use cases," even in the case of site failure.
>>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>>> Now, on this statement: "Why not simply manage each connection/context via a threaded child?"
>>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>>> That is the point: to make that simple, tested, easy, and transparent for HBase users.
>>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>>> Ted Malaska
>>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>>> On Mon, Jun 29, 2015 at 4:11 PM, Michael Segel <michael_segel@hotmail.com> wrote:
>>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>>>> So if I understand your goal, you want a client that can connect to one or more HBase clusters at the same time…
>>>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>>>> OK, so let's walk through the use case and help me understand a couple of use cases for this…
>>>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>>>> Why not simply manage each connection/context via a threaded child?
>>>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>>>>> On Jun 29, 2015, at 1:48 PM, Ted Malaska <ted.malaska@cloudera.com> wrote:
>>>>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>>>>> Hey Dev List,
>>>>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>>>>> My name is Ted Malaska, long-time lover and user of HBase. I would like to discuss adding a multi-cluster client to HBase. Here is the link for the design doc (https://github.com/tmalaska/HBase.MCC/blob/master/MultiHBaseClientDesignDoc.docx%20(1).docx), but I have pulled some parts into this main e-mail to give you a high-level understanding of its scope.
>>>>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>>>>> *Goals*
>>>>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>>>>> The proposed solution is a multi-cluster HBase client that relies on the existing HBase replication functionality to provide an eventually consistent solution in cases of primary-cluster downtime.
>>>>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>>>>> https://github.com/tmalaska/HBase.MCC/blob/master/FailoverImage.png
>>>>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>>>>> Additional goals are:
>>>>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>>>>> - Be able to switch from a single HBase cluster to the Multi-HBase Client with limited or no code changes. This means using the HConnectionManager, Connection, and Table interfaces to hide complexities from the developer (Connection and Table are the new classes for HConnection and HTableInterface in HBase version 0.99); a sketch of what this looks like follows this list.
>>>>>>>>>>>>>>>>>>>> - Offer thresholds that allow developers to decide between degrees of strong consistency and eventual consistency.
>>>>>>>>>>>>>>>>>>>> - Support N number of linked HBase clusters.
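The first goal in the list above is the crux: application code that touches only ConnectionFactory, Connection, and Table can go from single-cluster to multi-cluster purely through configuration. A hedged sketch of what that looks like from the application side; the table name is arbitrary, and the premise that the factory would return a multi-cluster implementation when the flag is set is the proposal, not current HBase behavior:

    import org.apache.hadoop.conf.Configuration;
    import org.apache.hadoop.hbase.HBaseConfiguration;
    import org.apache.hadoop.hbase.TableName;
    import org.apache.hadoop.hbase.client.Connection;
    import org.apache.hadoop.hbase.client.ConnectionFactory;
    import org.apache.hadoop.hbase.client.Get;
    import org.apache.hadoop.hbase.client.Put;
    import org.apache.hadoop.hbase.client.Result;
    import org.apache.hadoop.hbase.client.Table;
    import org.apache.hadoop.hbase.util.Bytes;

    public class MccApplication {
      public static void main(String[] args) throws Exception {
        Configuration conf = HBaseConfiguration.create();
        // With the flag off (or absent) this is a plain single-cluster
        // client; under the proposal, flipping the configs is the only
        // change needed to go multi-cluster.
        conf.setBoolean("hbase.failover.mode", true);

        // No multi-cluster types appear below: just Connection and Table.
        try (Connection connection = ConnectionFactory.createConnection(conf);
             Table table = connection.getTable(TableName.valueOf("test_table"))) {
          table.put(new Put(Bytes.toBytes("row1"))
              .addColumn(Bytes.toBytes("f"), Bytes.toBytes("q"), Bytes.toBytes("v1")));
          Result result = table.get(new Get(Bytes.toBytes("row1")));
          System.out.println(Bytes.toString(
              result.getValue(Bytes.toBytes("f"), Bytes.toBytes("q"))));
        }
      }
    }

Everything cluster-specific (quorum addresses, failover thresholds) stays in configuration, which is what "only a client change" means in practice.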
>>>>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>>>>> *Read-Replicas*
>>>>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>>>>> Also note that this is in alignment with read replicas and can work with them. This client is multi-cluster, where read replicas help us be multi-region-server.
>>>>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>>>>> *Replication*
>>>>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>>>>> You will also see in the document that this works with current replication and requires no changes to it.
>>>>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>>>>> *Only a client change*
>>>>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>>>>> You will also see in the doc that this is only a new client, which means no extra code for the end developer -- only additional configs to set it up.
>>>>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>>>>> *Github*
>>>>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>>>>> There is a GitHub project that shows that this works: https://github.com/tmalaska/HBase.MCC. Note that this is only a prototype. When adding it to HBase we will use it as a starting point, but there will be changes.
>>>>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>>>>> *Initial Results*
>>>>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>>>>> Red is where our primary cluster has failed, and you will see from the bottom two graphs that our puts, deletes, and gets are not interrupted.
>>>>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>>>>> https://github.com/tmalaska/HBase.MCC/blob/master/AveragePutTimeWithMultiRestartsAndShutDowns.png
>>>>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>>>>> Thanks
>>>>>>>>>>>>>>>>>>>> Ted Malaska
>>>>>>>>>>>>>
>>>>>>>>>>>>> --
>>>>>>>>>>>>> Sean
>>>>>>>>>>>
>>>>>>>>>>> --
>>>>>>>>>>> Best regards,
>>>>>>>>>>>
>>>>>>>>>>> - Andy
>>>>>>>>>>>
>>>>>>>>>>> Problems worthy of attack prove their worth by hitting back. - Piet Hein (via Tom White)

The opinions expressed here are mine; while they may reflect a cognitive thought, that is purely accidental.
Use at your own risk.
Michael Segel
michael_segel (AT) hotmail.com