Return-Path: X-Original-To: apmail-accumulo-user-archive@www.apache.org Delivered-To: apmail-accumulo-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 8D8379894 for ; Thu, 3 Jan 2013 00:35:44 +0000 (UTC) Received: (qmail 38870 invoked by uid 500); 3 Jan 2013 00:35:44 -0000 Delivered-To: apmail-accumulo-user-archive@accumulo.apache.org Received: (qmail 38811 invoked by uid 500); 3 Jan 2013 00:35:44 -0000 Mailing-List: contact user-help@accumulo.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@accumulo.apache.org Delivered-To: mailing list user@accumulo.apache.org Received: (qmail 38794 invoked by uid 99); 3 Jan 2013 00:35:44 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 03 Jan 2013 00:35:44 +0000 X-ASF-Spam-Status: No, hits=-2.3 required=5.0 tests=RCVD_IN_DNSWL_MED,SPF_PASS,UNPARSEABLE_RELAY X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: local policy) Received: from [129.55.12.46] (HELO mx2.ll.mit.edu) (129.55.12.46) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 03 Jan 2013 00:35:36 +0000 Received: from LLE2K7-HUB01.mitll.ad.local (LLE2K7-HUB01.mitll.ad.local) by mx2.ll.mit.edu (unknown) with ESMTP id r030ZDaV008991; Wed, 2 Jan 2013 19:35:14 -0500 From: "Kepner, Jeremy - 0553 - MITLL" To: "user@accumulo.apache.org" , "dev@accumulo.apache.org" CC: "Kepner, Jeremy - 0553 - MITLL" Date: Wed, 2 Jan 2013 19:25:42 -0500 Subject: Re: ingest performance oscillations and Xceivers Thread-Topic: ingest performance oscillations and Xceivers Thread-Index: Ac3pSN6eKBKs/7LjSz6gpHnDXPVEDw== Message-ID: <7C21BEED-BDBD-40D4-97CF-EF60B776540A@ll.mit.edu> References: <20130102191131.GA12538@ll.mit.edu> In-Reply-To: Accept-Language: en-US Content-Language: en-US X-MS-Has-Attach: X-MS-TNEF-Correlator: acceptlanguage: en-US Content-Type: text/plain; charset="iso-8859-1" Content-Transfer-Encoding: quoted-printable MIME-Version: 1.0 X-Proofpoint-Virus-Version: vendor=fsecure engine=2.50.10432:5.9.8327,1.0.431,0.0.0000 definitions=2013-01-02_10:2012-12-31,2013-01-02,1970-01-01 signatures=0 X-Proofpoint-Spam-Details: rule=notspam policy=default score=0 spamscore=0 ipscore=0 suspectscore=7 phishscore=0 bulkscore=0 adultscore=0 classifier=spam adjust=0 reason=mlx scancount=1 engine=6.0.2-1211240000 definitions=main-1301020286 X-Virus-Checked: Checked by ClamAV on apache.org Hmmm, that's interesting, because in the past I didn't see this behavior. = It might be worth having someone look into because it seems to have a 2x im= pact on sustained ingest.=20 Regards. -Jeremy On Jan 2, 2013, at 2:23 PM, Keith Turner wrote: > On Wed, Jan 2, 2013 at 2:11 PM, Jeremy Kepner wrote: >> So what mechanism causes the number of Xceivers to increase? >=20 > Its been a while since I looked at the data node source code. When I > last look at it an Xceiver was just a thread created to handle a > datanode request. The thread went away after the request was > processed. So major and minor compactions running would cause more > Xceivers to be created to read and write data. >=20 > Newer datanode code may use a thread pool instead of creating a > thread/xceiver for each request. I am not sure. >=20 >> I am carefully controlling the number of ingestors and the data isn't va= rying too much. >> I would expect the number of Xceivers to remain consant. >>=20 >> Regards. -Jeremy >>=20 >> On Tue, Jan 01, 2013 at 09:45:20PM -0500, Eric Newton wrote: >>> Hey Jeremy, >>>=20 >>> Can you compare the ingest rate to the number of tablets, too? >>>=20 >>> I've found, that if I have 20-80 tablets per server (on similar hardwar= e) I >>> get the best performance. >>>=20 >>> # of Xceivers =3D=3D number of writers when ingest is the primary targe= t. >>>=20 >>> Also, is this 1.4 or trunk? >>>=20 >>> -Eric >>>=20 >>>=20 >>>=20 >>> On Tue, Jan 1, 2013 at 9:19 PM, Kepner, Jeremy - 1010 - MITLL < >>> kepner@ll.mit.edu> wrote: >>>=20 >>>> Accumulo Colleagues, >>>> I am trying to optimize my ingest into a single node Accumulo instanc= e >>>> running on a 32 core node with 96 GB of RAM. I am seeing the follow i= ngest >>>> variations as a I change the number of ingest processes (see attached)= : >>>>=20 >>>> ------------------------------------- >>>> Ingestors, Ingest rate >>>> ------------------------------------- >>>> 1, 60K inserts/sec (stable) >>>> 2, 120K inserts/sec (stable) >>>> 3, 60K to 180K inserts/sec >>>> 4, 90K to 220K inserts/sec >>>> 8, 80K to 280K inserts/sec >>>> 12, 80K to 280K inserts/sec >>>> ------------------------------------- >>>>=20 >>>> The only thing I can see that correlates with the ingest rate is the >>>> number of Xceivers. When the ingest rate is high the number of Xceive= rs is >>>> usually low. Likewise, when the ingest rate drops, the number of Xcei= vers >>>> usually increases significantly. >>>>=20 >>>> Question: What role to Xceivers play in ingest? >>>>=20 >>>> Request: It would be great to add a plot showing the number of Xceiver= s >>>> over time to the diagnostics. >>>>=20 >>>> Regards. -Jeremy >>>>=20 >>>>=20