From: Mike Percy
Date: Tue, 6 Mar 2018 20:42:06 -0800
Subject: Re: Spark Streaming + Kudu
To: user@kudu.apache.org

Hmm, could you try in Spark local mode? i.e.
https://jaceklaskowski.gitbooks.io/mastering-apache-spark/content/spark-local.html

Mike

On Tue, Mar 6, 2018 at 7:14 PM, Ravi Kanth wrote:

Mike,

Can you clarify a bit on grabbing the jstack for the process? I launched my
Spark application and tried to get the pid so that I could grab a jstack
trace during the hang. Unfortunately, I am not able to figure out how to get
the pid for the Spark application.

Thanks,
Ravi
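For reference, one way to locate the right JVM to attach jstack to is to log
the pid from inside the application itself. The sketch below is illustrative
only and is not code from this thread; it assumes a HotSpot-style JVM where
the runtime name conventionally looks like "pid@hostname".

    import java.lang.management.ManagementFactory;

    public final class PidLogger {
        // Illustrative helper (not from the thread): report the current JVM's pid
        // so jstack can be attached to the right executor process during a hang.
        public static long currentPid() {
            // On HotSpot JVMs the runtime name is conventionally "<pid>@<hostname>".
            String jvmName = ManagementFactory.getRuntimeMXBean().getName();
            return Long.parseLong(jvmName.split("@")[0]);
        }

        public static void main(String[] args) {
            System.out.println("JVM pid: " + currentPid());
        }
    }

Alternatively, running jps -lm on the executor host lists the JVM pids along
with their main classes.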
On 6 March 2018 at 18:36, Mike Percy wrote:

Thanks Ravi. Would you mind attaching the output of jstack on the process
during this hang? That would show what the Kudu client threads are doing, as
what we are seeing here is just the netty boss thread.

Mike

On Tue, Mar 6, 2018 at 8:52 AM, Ravi Kanth wrote:

Yes, I have debugged to find the root cause. Every logger statement before
"table = client.openTable(tableName);" executes fine, and exactly at the
point of opening the table the exception below is thrown; nothing executes
after that. The Spark batches keep being processed while opening the table
keeps failing. I tried catching the exception with no luck. Please find the
exception below.

18/02/23 00:16:30 ERROR client.TabletClient: [Peer bd91f34d456a4eccaae50003c90f0fb2] Unexpected exception from downstream on [id: 0x6e13b01f]
java.net.ConnectException: Connection refused: kudu102.dev.sac.int.threatmetrix.com/10.112.3.12:7050
    at sun.nio.ch.SocketChannelImpl.checkConnect(Native Method)
    at sun.nio.ch.SocketChannelImpl.finishConnect(SocketChannelImpl.java:717)
    at org.apache.kudu.client.shaded.org.jboss.netty.channel.socket.nio.NioClientBoss.connect(NioClientBoss.java:152)
    at org.apache.kudu.client.shaded.org.jboss.netty.channel.socket.nio.NioClientBoss.processSelectedKeys(NioClientBoss.java:105)
    at org.apache.kudu.client.shaded.org.jboss.netty.channel.socket.nio.NioClientBoss.process(NioClientBoss.java:79)
    at org.apache.kudu.client.shaded.org.jboss.netty.channel.socket.nio.AbstractNioSelector.run(AbstractNioSelector.java:337)
    at org.apache.kudu.client.shaded.org.jboss.netty.channel.socket.nio.NioClientBoss.run(NioClientBoss.java:42)
    at org.apache.kudu.client.shaded.org.jboss.netty.util.ThreadRenamingRunnable.run(ThreadRenamingRunnable.java:108)
    at org.apache.kudu.client.shaded.org.jboss.netty.util.internal.DeadLockProofWorker$1.run(DeadLockProofWorker.java:42)
    at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
    at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
    at java.lang.Thread.run(Thread.java:745)

Thanks,
Ravi

On 5 March 2018 at 23:52, Mike Percy wrote:

Have you considered checking your session error count or pending errors in
your while loop every so often? Can you identify where your code is hanging
when the connection is lost (what line)?

Mike
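A minimal sketch of the periodic check suggested above, against the Kudu Java
client's session API; the class name and the decision to throw are
placeholders, not code from this thread:

    import org.apache.kudu.client.KuduSession;
    import org.apache.kudu.client.RowError;
    import org.apache.kudu.client.RowErrorsAndOverflowStatus;

    // Illustrative sketch (not from the thread): drain and report the session's
    // accumulated row errors, so the check can run every N applied operations
    // inside the upsert loop rather than only once after the iterator is exhausted.
    final class PendingErrorCheck {
        static void failOnPendingErrors(KuduSession session) {
            if (session.countPendingErrors() == 0) {
                return;                     // nothing buffered so far
            }
            RowErrorsAndOverflowStatus pending = session.getPendingErrors();
            for (RowError error : pending.getRowErrors()) {
                System.err.println("Kudu row error: " + error);
            }
            // Failing the task lets Spark retry the partition instead of
            // silently dropping rows.
            throw new RuntimeException(pending.getRowErrors().length
                    + " pending Kudu row errors");
        }
    }

Calling something like failOnPendingErrors(session) every few thousand
applied operations inside the while loop would surface write failures much
earlier than a single check at the end of the partition.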
On Mon, Mar 5, 2018 at 9:08 PM, Ravi Kanth wrote:

In addition to my previous comment, I raised a support ticket for this issue
with Cloudera and one of the support engineers mentioned the following:

"Thank you for clarifying. The exceptions are logged but not re-thrown to an
upper layer, so that explains why the Spark application is not aware of the
underlying error."

On 5 March 2018 at 21:02, Ravi Kanth wrote:

Mike,

Thanks for the information. But once the connection to any of the Kudu
servers is lost, there is no way I can get control of the KuduSession
object, and the same goes for getPendingErrors(). The KuduClient in this
case becomes a zombie and never returns until the connection is properly
re-established. I tried everything you suggested with no luck. Attaching my
KuduClient code.

package org.dwh.streaming.kudu.sparkkudustreaming;

import java.util.HashMap;
import java.util.Iterator;
import java.util.Map;

import org.apache.hadoop.util.ShutdownHookManager;
import org.apache.kudu.client.*;
import org.apache.spark.api.java.JavaRDD;
import org.slf4j.Logger;
import org.slf4j.LoggerFactory;

import org.dwh.streaming.kudu.sparkkudustreaming.constants.SpecialNullConstants;

public class KuduProcess {
    private static Logger logger = LoggerFactory.getLogger(KuduProcess.class);
    private static KuduTable table;
    private static KuduSession session;

    public static void upsertKudu(JavaRDD<Map<String, Object>> rdd, String host, String tableName) {
        rdd.foreachPartition(iterator -> {
            RowErrorsAndOverflowStatus errors = upsertOpIterator(iterator, tableName, host);
            int errorCount = errors.getRowErrors().length;
            if (errorCount > 0) {
                throw new RuntimeException("Failed to write " + errorCount + " messages into Kudu");
            }
        });
    }

    private static RowErrorsAndOverflowStatus upsertOpIterator(Iterator<Map<String, Object>> iter, String tableName, String host) {
        try {
            AsyncKuduClient asyncClient = KuduConnection.getAsyncClient(host);
            KuduClient client = asyncClient.syncClient();
            table = client.openTable(tableName);
            session = client.newSession();
            session.setFlushMode(SessionConfiguration.FlushMode.AUTO_FLUSH_BACKGROUND);
            while (iter.hasNext()) {
                upsertOp(iter.next());
            }
        } catch (KuduException e) {
            logger.error("Exception in upsertOpIterator method", e);
        } finally {
            try {
                session.close();
            } catch (KuduException e) {
                logger.error("Exception in Connection close", e);
            }
        }
        // ---------------------> Once the connection is lost, this part of the code never
        // gets called; the Spark job keeps running and processing records while the
        // KuduClient is trying to reconnect to Kudu. Meanwhile, we are losing all the records.
        return session.getPendingErrors();
    }
    public static void upsertOp(Map<String, Object> formattedMap) {
        if (formattedMap.size() != 0) {
            try {
                Upsert upsert = table.newUpsert();
                PartialRow row = upsert.getRow();
                for (Map.Entry<String, Object> entry : formattedMap.entrySet()) {
                    if (entry.getValue().getClass().equals(String.class)) {
                        if (entry.getValue().equals(SpecialNullConstants.specialStringNull))
                            row.setNull(entry.getKey());
                        else
                            row.addString(entry.getKey(), (String) entry.getValue());
                    } else if (entry.getValue().getClass().equals(Long.class)) {
                        if (entry.getValue().equals(SpecialNullConstants.specialLongNull))
                            row.setNull(entry.getKey());
                        else
                            row.addLong(entry.getKey(), (Long) entry.getValue());
                    } else if (entry.getValue().getClass().equals(Integer.class)) {
                        if (entry.getValue().equals(SpecialNullConstants.specialIntNull))
                            row.setNull(entry.getKey());
                        else
                            row.addInt(entry.getKey(), (Integer) entry.getValue());
                    }
                }

                session.apply(upsert);
            } catch (Exception e) {
                logger.error("Exception during upsert:", e);
            }
        }
    }
}

class KuduConnection {
    private static Logger logger = LoggerFactory.getLogger(KuduConnection.class);
    private static Map<String, AsyncKuduClient> asyncCache = new HashMap<>();
    private static int ShutdownHookPriority = 100;

    static AsyncKuduClient getAsyncClient(String kuduMaster) {
        if (!asyncCache.containsKey(kuduMaster)) {
            AsyncKuduClient asyncClient = new AsyncKuduClient.AsyncKuduClientBuilder(kuduMaster).build();
            ShutdownHookManager.get().addShutdownHook(new Runnable() {
                @Override
                public void run() {
                    try {
                        asyncClient.close();
                    } catch (Exception e) {
                        logger.error("Exception closing async client", e);
                    }
                }
            }, ShutdownHookPriority);
            asyncCache.put(kuduMaster, asyncClient);
        }
        return asyncCache.get(kuduMaster);
    }
}

Thanks,
Ravi

On 5 March 2018 at 16:20, Mike Percy wrote:

Hi Ravi, it would be helpful if you could attach what you are getting back
from getPendingErrors() -- perhaps from dumping RowError.toString() from
items in the returned array -- and indicate what you were hoping to get
back. Note that a RowError can also return to you the Operation that you
used to generate the write. From the Operation, you can get the original
PartialRow object, which should be able to identify the affected row that
the write failed for. Does that help?

Since you are using the Kudu client directly, Spark is not involved from the
Kudu perspective, so you will need to deal with Spark on your own in that
case.

Mike
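A sketch of what dumping that information could look like against the 1.x
Java client; the class name is a placeholder and this is not code from the
thread:

    import org.apache.kudu.client.Operation;
    import org.apache.kudu.client.PartialRow;
    import org.apache.kudu.client.RowError;
    import org.apache.kudu.client.RowErrorsAndOverflowStatus;

    // Illustrative helper (not from the thread): report each pending row error
    // along with the row of the operation that produced it.
    final class RowErrorDumper {
        static void dump(RowErrorsAndOverflowStatus pending) {
            if (pending.isOverflowed()) {
                System.err.println("Row error buffer overflowed; some errors were discarded");
            }
            for (RowError error : pending.getRowErrors()) {
                Operation op = error.getOperation();   // the write that failed
                PartialRow row = op.getRow();          // the row it was trying to write
                System.err.println("Row error: " + error + " for row: " + row);
            }
        }
    }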
On Mon, Mar 5, 2018 at 1:59 PM, Ravi Kanth wrote:

Hi Mike,

Thanks for the reply. Yes, I am using AUTO_FLUSH_BACKGROUND.

I am using the Kudu client API to perform UPSERTs into Kudu and I have
integrated this with Spark. I am trying to test the case where a Kudu server
fails. In that case, if there is any problem writing, getPendingErrors()
should give me a way to handle the errors so that I can cleanly terminate my
Spark job. This is what I am trying to do.

But I am not able to get hold of the exceptions being thrown from within the
KuduClient when it retries connecting to the tablet server. My
getPendingErrors() is not catching these exceptions.

Let me know if you need more clarification. I can post some snippets.

Thanks,
Ravi

On 5 March 2018 at 13:18, Mike Percy wrote:

Hi Ravi, are you using AUTO_FLUSH_BACKGROUND? You mention that you are
trying to use getPendingErrors() but it sounds like it's not working for you
-- can you be more specific about what you expect and what you are
observing?

Thanks,
Mike

On Mon, Feb 26, 2018 at 8:04 PM, Ravi Kanth wrote:

Thanks Clifford. We are running Kudu 1.4. To date we haven't seen any issues
in production and we are not losing tablet servers. But as part of testing I
have to generate a few unforeseen cases to analyse the application's
behaviour. One of those is bringing down the tablet server or master server
intentionally, during which I observed the loss of records. I just wanted to
test cases outside the happy path here. Once again, thanks for taking the
time to respond.

- Ravi

On 26 February 2018 at 19:58, Clifford Resnick wrote:

I'll have to get back to you on the code bits, but I'm pretty sure we're
doing simple sync batching. We're not in production yet, but after some
months of development I haven't seen any failures, even when pushing load
doing multiple years' backfill. I think the real question is why you are
losing tablet servers. The only instability we ever had with Kudu was when
it had that weird NTP sync issue that was fixed, I think, in 1.6. What
version are you running?

Anyway, I would think that infinite loop should be catchable somewhere. Our
pipeline is set to fail/retry with Flink snapshots. I imagine there is
something similar with Spark. Sorry I can't be of more help!

On Feb 26, 2018 9:10 PM, Ravi Kanth wrote:

Cliff,

Thanks for the response. Well, I do agree that it's simple and seamless. In
my case, I am able to upsert ~25,000 events/sec into Kudu. But I am facing a
problem when any of the Kudu tablet or master servers is down: I am not able
to get hold of the exception from the client. The client goes into an
infinite loop trying to connect to Kudu, and meanwhile I am losing my
records. I tried handling the errors through getPendingErrors(), but it
didn't help. I am using AsyncKuduClient to establish the connection and
retrieving the syncClient from the async client to open the session and
table. Any help?

Thanks,
Ravi
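One knob that bounds how long client calls may block is the builder's
timeout settings. The sketch below is illustrative only and is not code from
this thread; the host, table name, and timeout values are placeholders, and
whether this avoids the hang in a given deployment would need to be
verified.

    import org.apache.kudu.client.AsyncKuduClient;
    import org.apache.kudu.client.KuduClient;
    import org.apache.kudu.client.KuduException;
    import org.apache.kudu.client.KuduTable;

    // Illustrative sketch (not from the thread): bound how long client operations
    // may block, so an unreachable master or tablet server is more likely to
    // surface as a KuduException than as an apparently indefinite hang.
    final class BoundedKuduClientFactory {
        static KuduClient newClient(String kuduMaster) {
            AsyncKuduClient asyncClient = new AsyncKuduClient.AsyncKuduClientBuilder(kuduMaster)
                    .defaultAdminOperationTimeoutMs(30_000)   // metadata/admin operations
                    .defaultOperationTimeoutMs(30_000)        // read/write operations
                    .build();
            return asyncClient.syncClient();
        }

        public static void main(String[] args) throws KuduException {
            KuduClient client = newClient("kudu-master-host:7051");  // hypothetical master
            KuduTable table = client.openTable("example_table");     // hypothetical table
            System.out.println("Opened table: " + table.getName());
        }
    }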
On 26 February 2018 at 18:00, Cliff Resnick wrote:

While I can't speak for Spark, we do use the client API from Flink streaming
and it's simple and seamless. It's especially nice if you require an upsert
semantic.

On Feb 26, 2018 7:51 PM, "Ravi Kanth" wrote:

Hi,

Is anyone using Spark Streaming to ingest data into Kudu with the Kudu
client API rather than the traditional KuduContext API? I am stuck at a
point and couldn't find a solution.

Thanks,
Ravi