From: Denis Magda <dmagda@apache.org>
Subject: Re: CacheStore's Performance Drops Dramatically - Why?
Date: Thu, 4 May 2017 12:43:54 -0700
To: user@ignite.apache.org, dev@ignite.apache.org

Looks like the naming of the 'getWriteBehindFlushSize' method is totally wrong; it confuses so many people. However, if we refer to the documentation of this method or look into the source code, we will find that it sets the maximum size of the write-behind queue/buffer on a single node. Once this size is reached, data is flushed to the storage in sync mode.

So, you need to set the flush size (the maximum queue/buffer size) to a bigger value if you can't keep up with updates and keep switching to the sync mode.

In any case, I've created a ticket to address both issues discussed here: https://issues.apache.org/jira/browse/IGNITE-5173

Thanks for your patience.

--
Denis

On May 3, 2017, at 10:10 AM, Jessie Lin <jessie.jianwei.lin@gmail.com> wrote:

I thought the reason flushSize can be set several times higher than the batch size is that, in a cluster, data nodes would flush in parallel. For example, in a cluster with 10 nodes, flushSize 10240, thread count = 2, and batch size = 512, each node would flush in 2 threads, and each thread would flush in batches of 512.

Could someone confirm or clarify this understanding? Thank you!
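Jessie's reading can be put into numbers with a back-of-envelope sketch (the class and method names below are mine, purely illustrative; the model assumes each node drains its own buffer independently in writeAll() batches, spread across its flush threads):

```java
// Toy arithmetic model of parallel write-behind flushing across a cluster.
// Assumption (not from Ignite source): a full node buffer of flushSize
// entries is drained in writeAll() batches of batchSize, split over
// flushThreadCount threads, and every node flushes independently.
public class WriteBehindMath {
    // How many writeAll() calls it takes to drain one full node buffer.
    static int batchesToDrain(int flushSize, int batchSize) {
        return (flushSize + batchSize - 1) / batchSize; // ceiling division
    }

    public static void main(String[] args) {
        int nodes = 10, flushSize = 10_240, threads = 2, batchSize = 512;
        int batches = batchesToDrain(flushSize, batchSize);
        System.out.println("writeAll() calls per full node buffer: " + batches);          // 20
        System.out.println("batches handled by each flush thread: " + batches / threads); // 10
        System.out.println("flush threads working cluster-wide: " + nodes * threads);     // 20
    }
}
```

Under those assumptions the two parameters are complementary rather than redundant: flushSize bounds how much can pile up per node, batchSize bounds how much lands in a single writeAll() call.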

On Wed, May 3, 2017 at 12:16 AM, Matt <dromitlabs@gmail.com> wrote:
In fact, I don't see why you would need both batchSize and flushSize. If I got it right, only the min of them is used by Ignite to know when to flush, so why do we have both in the first place?

In case they're both necessary for a reason I'm not seeing, I still wonder whether the default values should be batchSize > flushSize, as I suspect, or not.

On Wed, May 3, 2017 at 3:26 AM, Matt <dromitlabs@gmail.com> wrote:
I'm writing to confirm I managed to fix my problem by fine-tuning the config params for the write-behind cache until the performance was acceptable. I still see single-element inserts from time to time, but just a few every now and then, not like before. You should definitely avoid synchronous single-element insertions; I hope that changes in future versions.

Regarding writeBehindBatchSize and writeBehindFlushSize, I don't see the point of setting both values when batchSize < flushSize (the default values are 512 and 10240, respectively). If I'm not wrong, the cache is flushed whenever its size is equal to min(batchSize, flushSize). Since batchSize is less than flushSize, flushSize is never really used, and the size of the flush is controlled only by the size of the cache itself.

That is how it works by default. On the other hand, if we swap their values (i.e., batchSize=10240 and flushSize=512), the behavior would be the same (Ignite would call writeAll() with 512 elements each time), but the number of elements flushed would be controlled by the correct variable (i.e., flushSize).

Were the default values supposed to be the other way around, or am I missing something?

On Tue, May 2, 2017 at 9:13 PM, Denis Magda <dmagda@apache.org> wrote:
Matt,

Cross-posting to the dev list.

Yes, Ignite switches to the synchronous mode once the buffer is exhausted. However, I do agree that the right solution would be to flush multiple entries rather than one in the synchronous mode. *Igniters*, I was sure we had a ticket for that optimization but am unable to find it. Does anybody know the ticket name/number?

To avoid the performance degradation, you have to tweak the following parameters so that the write-behind store can keep up with your updates:
- setWriteBehindFlushThreadCount
- setWriteBehindFlushFrequency
- setWriteBehindBatchSize
- setWriteBehindFlushSize
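For reference, those knobs map onto cache configuration properties. A minimal Spring XML sketch, with the property names as I recall them from CacheConfiguration and the values as placeholders you would tune for your own workload:

```xml
<bean class="org.apache.ignite.configuration.CacheConfiguration">
    <property name="name" value="myCache"/>
    <property name="writeThrough" value="true"/>
    <property name="writeBehindEnabled" value="true"/>
    <!-- Drain the buffer with more threads if the store falls behind. -->
    <property name="writeBehindFlushThreadCount" value="4"/>
    <!-- Flush at least this often (in milliseconds), even if the buffer is not full. -->
    <property name="writeBehindFlushFrequency" value="5000"/>
    <!-- Max entries handed to a single writeAll() call. -->
    <property name="writeBehindBatchSize" value="512"/>
    <!-- Max buffered entries per node before flushing becomes mandatory. -->
    <property name="writeBehindFlushSize" value="10240"/>
    <!-- Plus a cacheStoreFactory pointing at your CacheStore implementation. -->
</bean>
```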

Tuning these has helped Apache Ignite users every time so far.

> QUESTION 2
>
> I've read in the docs that using ATOMIC mode (the default mode) is better for performance, but I'm not getting why. If I'm not wrong, using TRANSACTIONAL mode would cause the CacheStore to reuse connections (not call openConnection(autocommit=true) on each writeAll()).
>
> Shouldn't it be better to use transactional mode?

Transactional mode enables the two-phase commit (2PC) protocol: https://apacheignite.readme.io/docs/transactions#two-phase-commit-2pc

This is why atomic operations are swifter in general.

--
Denis

> On May 2, 2017, at 10:40 AM, Matt <dromitlabs@gmail.com> wrote:
>
> No, only with inserts. I haven't tried removing at this rate yet, but it may have the same problem.
>
> I'm debugging Ignite internal code and I may be onto something. The thing is, Ignite has a cacheMaxSize (aka WriteBehindFlushSize) and a cacheCriticalSize (which by default is cacheMaxSize*1.5). When the cache reaches that size, Ignite starts writing elements SYNCHRONOUSLY, as you can see in [1].
>
> I think this makes things worse: since only one single value is flushed at a time, writes become much slower, forcing Ignite to do even more synchronous writes.
>
> Anyway, I'm still not sure why the cache reaches that level when the database is clearly able to keep up with the insertions. I'll check whether it has to do with the number of open connections or something else.
>
> Any insight on this is very welcome!
>
> [1] https://github.com/apache/ignite/blob/master/modules/core/src/main/java/org/apache/ignite/internal/processors/cache/store/GridCacheWriteBehindStore.java#L620
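The degradation Matt describes can be captured in a toy, single-threaded model (my own sketch, not Ignite's actual code, and the class name is made up): entries are buffered until criticalSize = 1.5 * flushSize is reached, after which every further put degrades into a synchronous single-entry write.

```java
// Toy model of the write-behind degradation described above. Assumption
// (from the thread, not verified against Ignite source): once the buffer
// passes criticalSize = 1.5 * flushSize, each put() is written through
// synchronously, one element at a time.
public class CriticalSizeModel {
    final int criticalSize;
    int buffered = 0;
    int syncWrites = 0;

    CriticalSizeModel(int flushSize) {
        this.criticalSize = (int) (flushSize * 1.5); // default factor per the thread
    }

    /** Returns true if this put had to fall back to a synchronous write. */
    boolean put() {
        if (buffered >= criticalSize) {
            syncWrites++;   // flusher can't keep up: sync write of ONE entry
            return true;
        }
        buffered++;         // normal case: just buffer the entry
        return false;
    }

    public static void main(String[] args) {
        CriticalSizeModel m = new CriticalSizeModel(1024); // criticalSize = 1536
        for (int i = 0; i < 2000; i++)
            m.put();        // worst case: producer runs, flusher never drains
        System.out.println("synchronous single-entry writes: " + m.syncWrites); // 464
    }
}
```

In this worst case every entry past the 1536th is a synchronous single-entry write, which matches the writeAll()-with-ONE-element pattern reported earlier in the thread.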
>
> On Tue, May 2, 2017 at 2:17 PM, Jessie Lin <jessie.jianwei.lin@gmail.com> wrote:
> I noticed that behavior when any cache.remove operation is involved. Just putting stuff in the cache seems to work properly.
>
> Do you use remove operation?
>
> On Tue, May 2, 2017 at 9:57 AM, Matt <dromitlabs@gmail.com> wrote:
> I'm stuck with that. No matter what config I use (flush size, write threads, etc.), this is the behavior I always get. It's as if Ignite's internal buffer is full and it's trying to write and get rid of only the oldest (one) element.
>
> Any ideas, people? What is your CacheStore configuration to avoid this?
>
> On Tue, May 2, 2017 at 11:50 AM, Jessie Lin <jessie.jianwei.lin@gmail.com> wrote:
> Hello Matt, thank you for posting. I've noticed similar behavior.
>
> Would be curious to see the response from the engineering team.
>
> Best,
> Jessie
>
> On Tue, May 2, 2017 at 1:03 AM, Matt <dromitlabs@gmail.com> wrote:
> Hi all,
>
> I have two questions for you!
>
> QUESTION 1
>
> I'm following the example in [1] (a mix between "jdbc transactional" and "jdbc bulk operations") and I've enabled write-behind; however, after the first 10k-20k insertions the performance drops *dramatically*.
>
> Based on prints I've added to the CacheStore, I've noticed that what Ignite is doing is this:
>
> - writeAll called with 512 elements (Ignite buffers elements, that's good)
> - openConnection with autocommit=true is called each time inside writeAll (since the session is not stored in atomic mode)
> - writeAll is called with 512 elements a few dozen times, each time opening a new JDBC connection as mentioned above
> - ...
> - writeAll called with ONE element (for some reason Ignite stops buffering elements)
> - writeAll is called with ONE element from here on, each time opening a new JDBC connection as mentioned above
> - ...
>
> Things to note:
>
> - All config values are the default ones, except for write-through and write-behind, which are both enabled.
> - I'm running this as a server node (only one node in the cluster, the application itself).
> - I see the problem even with a big heap (i.e., Ignite is not nearly out of memory).
> - I'm using PostgreSQL for this test (it's fine ingesting around 40k rows per second on this computer, so that shouldn't be a problem).
>
> What is causing Ignite to stop buffering elements after calling writeAll() a few dozen times?
>
> QUESTION 2
>
> I've read in the docs that using ATOMIC mode (the default mode) is better for performance, but I'm not getting why. If I'm not wrong, using TRANSACTIONAL mode would cause the CacheStore to reuse connections (not call openConnection(autocommit=true) on each writeAll()).
>
> Shouldn't it be better to use transactional mode?
>
> Regards,
> Matt
>
> [1] https://apacheignite.readme.io/docs/persistent-store#section-cachestore-example
>