Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id 930E2200C4B for ; Mon, 6 Mar 2017 00:41:23 +0100 (CET) Received: by cust-asf.ponee.io (Postfix) id 91931160B6B; Sun, 5 Mar 2017 23:41:23 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id 904B8160B7D for ; Mon, 6 Mar 2017 00:41:22 +0100 (CET) Received: (qmail 18293 invoked by uid 500); 5 Mar 2017 23:41:21 -0000 Mailing-List: contact dev-help@hbase.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@hbase.apache.org Delivered-To: mailing list dev@hbase.apache.org Received: (qmail 18259 invoked by uid 99); 5 Mar 2017 23:41:21 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd2-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Sun, 05 Mar 2017 23:41:21 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd2-us-west.apache.org (ASF Mail Server at spamd2-us-west.apache.org) with ESMTP id D24371A042C for ; Sun, 5 Mar 2017 23:41:20 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd2-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: -1.82 X-Spam-Level: X-Spam-Status: No, score=-1.82 tagged_above=-999 required=6.31 tests=[DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, HTML_MESSAGE=2, RCVD_IN_DNSWL_LOW=-0.7, RCVD_IN_MSPIKE_H3=-0.01, RCVD_IN_MSPIKE_WL=-0.01, RP_MATCHES_RCVD=-2.999, SPF_PASS=-0.001] autolearn=disabled Authentication-Results: spamd2-us-west.apache.org (amavisd-new); dkim=pass (2048-bit key) header.d=hotmail.com Received: from mx1-lw-eu.apache.org ([10.40.0.8]) by localhost (spamd2-us-west.apache.org [10.40.0.9]) (amavisd-new, port 10024) with ESMTP id cs5wHiW3IXXw for ; Sun, 5 Mar 2017 23:41:16 +0000 (UTC) Received: from BLU004-OMC1S35.hotmail.com (blu004-omc1s35.hotmail.com [65.55.116.46]) by mx1-lw-eu.apache.org (ASF Mail Server at mx1-lw-eu.apache.org) with ESMTPS id 7BE595F39E for ; Sun, 5 Mar 2017 23:41:15 +0000 (UTC) Received: from NAM01-SN1-obe.outbound.protection.outlook.com ([65.55.116.9]) by BLU004-OMC1S35.hotmail.com over TLS secured channel with Microsoft SMTPSVC(7.5.7601.23008); Sun, 5 Mar 2017 15:41:07 -0800 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=hotmail.com; s=selector1; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version; bh=n2ML583nQi9Pa6uB1nqwEfaj7lkNiFMyuQALtig00us=; b=HJ4Zz5Aq/hcssxnDQiL2Q2XviX8Y9XIXb9lApSethhk6m1oK+6GVDezeG2yusMqG2hI6dJEolWWc0HSnO3Ek84pgINOHyAR7m1CoNyYEU/ObxXdqalO3WsnpeGYYQEgRskewXpTcwG2k7+pzbW8kZZLZ4Fdm6J+k16eOK8BDcEhbrXg268hn6L+IIhp8O/JGrTt/M+SCsNqQFfXJRdG30BNQaLtqpAKeLfaymDAiB6r2+pmJAuFlWoSbV5XQ3vCx7J9tyPi1b8T/4wQPKnBweEdu6/D1VISSO8IDp/OQYKlKtqPYJyQ/utv5SkYKDhUO8zJcI5DCwHW0WO9fqBnfSg== Received: from BN3NAM01FT050.eop-nam01.prod.protection.outlook.com (10.152.66.54) by BN3NAM01HT034.eop-nam01.prod.protection.outlook.com (10.152.66.66) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_CBC_SHA384_P384) id 15.1.947.7; Sun, 5 Mar 2017 23:41:06 +0000 Received: from BY1PR14MB0167.namprd14.prod.outlook.com (10.152.66.51) by BN3NAM01FT050.mail.protection.outlook.com (10.152.66.164) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_CBC_SHA384_P384) id 15.1.947.7 via Frontend Transport; Sun, 5 Mar 2017 23:41:06 +0000 Received: from BY1PR14MB0167.namprd14.prod.outlook.com ([10.162.148.20]) by BY1PR14MB0167.namprd14.prod.outlook.com ([10.162.148.20]) with mapi id 15.01.0933.020; Sun, 5 Mar 2017 23:41:05 +0000 From: jeff saremi To: Yu Li , "dev@hbase.apache.org" Subject: Re: Need guidance on getting detailed elapsed times in every stage of processing a request Thread-Topic: Need guidance on getting detailed elapsed times in every stage of processing a request Thread-Index: AQHSkgGUT46wS2IARUmdi07eUSr7FaF+46QAgAABXN6AAjL91IAA6A+AgAAHzOuAABvRRIAAfVmAgAC23rGAABKFgIAACoIqgAALD3aAAA5QAIAC0daAgACPcPY= Date: Sun, 5 Mar 2017 23:41:05 +0000 Message-ID: References: , In-Reply-To: Accept-Language: en-US Content-Language: en-US X-MS-Has-Attach: X-MS-TNEF-Correlator: authentication-results: gmail.com; dkim=none (message not signed) header.d=none;gmail.com; dmarc=none action=none header.from=hotmail.com; x-incomingtopheadermarker: OriginalChecksum:5FD0BB3A166BE76883A33D73F321E1CC6C945DD2DFDFFCE207E2CED42996163D;UpperCasedChecksum:B3E8DDFA02F72F1EEFA1A1F19D0DCA764008BB48EA0CA50EBC3F13853F3DD006;SizeAsReceived:8998;Count:39 x-ms-exchange-messagesentrepresentingtype: 1 x-tmn: [TUVRa4uXmqG+G9LApjsQmbNmld088mpF] x-incomingheadercount: 39 x-eopattributedmessage: 0 x-microsoft-exchange-diagnostics: 1;BN3NAM01HT034;5:LJLsSmy/kizdcE7SSc5Cwm4z6QhQsSMWYICrKk4iMnADENS335iT6yzDaXLGAkfYWUGTShYiRwoId/CNZV0URZQAN7F8Cw9Zr64EVtg0L32TBwO+f8a7SJOqgfpbK34pZLko8iNLchcnysClvmTzFA==;24:cB7T7aSyJGKl59ZqJkCUwj/d/w1H1az81kbiB6BhTCwVyY5zOZjCci4s9p54EPDvSg7z51H3Sdz3z3mdf51wmuSnn+KVmVa81yqJPLNES/s=;7:QkV+RZTAHz0ddOCF7TtBauRPXIxCt6YzrtCehiUU+tlDgpbSh2nQ/T+6H2ChpeGHW9eUe4ZRIvbE9xxE3fg6eE/9zuLvVh9YYBTBfnBjWzY72cUSSTdo4cXA/OkBZWv9YFIleqgr1m6EcUzPs0ECpgurNUhkvz8Kj/U9KhMM74e5XY9+gykni6GMiT4UZOiAW1i9OTxAYwsigezNAsfyvxzXbm/aHkq5Rf8tVuxnMAaYkweeYXktEXYb6sxIoSMooOfuEiQ1W7i9mV6bXBa5ULWDEHX3sCUyKckpNBkNKG+U0HSWH2qWlvjHjH1Idl0K x-forefront-antispam-report: EFV:NLI;SFV:NSPM;SFS:(10019020)(98900016);DIR:OUT;SFP:1102;SCL:1;SRVR:BN3NAM01HT034;H:BY1PR14MB0167.namprd14.prod.outlook.com;FPR:;SPF:None;LANG:en; x-ms-office365-filtering-correlation-id: df183f1c-0987-4885-52f5-08d464211767 x-microsoft-antispam: UriScan:;BCL:0;PCL:0;RULEID:(22001)(201702061074)(5061506573)(5061507331)(1603103135)(1601125254)(1603101448)(1701031045);SRVR:BN3NAM01HT034; x-exchange-antispam-report-cfa-test: BCL:0;PCL:0;RULEID:(432015087)(444000031);SRVR:BN3NAM01HT034;BCL:0;PCL:0;RULEID:;SRVR:BN3NAM01HT034; x-forefront-prvs: 02379661A3 spamdiagnosticoutput: 1:99 spamdiagnosticmetadata: NSPM Content-Type: multipart/alternative; boundary="_000_BY1PR14MB01671E6C602ABCBC32D1FC40C12D0BY1PR14MB0167namp_" MIME-Version: 1.0 X-OriginatorOrg: hotmail.com X-MS-Exchange-CrossTenant-originalarrivaltime: 05 Mar 2017 23:41:05.4178 (UTC) X-MS-Exchange-CrossTenant-fromentityheader: Internet X-MS-Exchange-CrossTenant-id: 84df9e7f-e9f6-40af-b435-aaaaaaaaaaaa X-MS-Exchange-Transport-CrossTenantHeadersStamped: BN3NAM01HT034 X-OriginalArrivalTime: 05 Mar 2017 23:41:07.0548 (UTC) FILETIME=[F62249C0:01D29609] archived-at: Sun, 05 Mar 2017 23:41:23 -0000 --_000_BY1PR14MB01671E6C602ABCBC32D1FC40C12D0BY1PR14MB0167namp_ Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: quoted-printable HI Yu I applied your path (the one that didn't have any V number) to 1.2.2 manual= ly. I'm waiting to see the results. I should circle back if there are issue= s. ________________________________ From: Yu Li Sent: Sunday, March 5, 2017 7:06:27 AM To: dev@hbase.apache.org Cc: jeffsaremi@hotmail.com Subject: Re: Need guidance on getting detailed elapsed times in every stage= of processing a request @jeff normally it's recommended to try the latest one, say v5 in this case. btw, one thing to make clear: only #4 in my list is missing in current code= base, all others are already there for usage. Best Regards, Yu On 4 March 2017 at 04:02, Sean Busbey > wrote: -user@hbase to bcc +dev@hbase Jeff, These kinds of questions get into things that we only handle within the development community. I've copied in the appropriate dev@hbase mailing list, please ensure you are subscribed there so you can see any further discussion. On Fri, Mar 3, 2017 at 1:13 PM, jeff saremi > wrote: > Yu > Of the patches attached to HBASE-15160, do I need to apply all (v2, v3, .= ..) or just HBASE-15160.patch ? > Also how would I know against what version this patch was created? > thanks > > > > > ________________________________ > From: jeff saremi > > Sent: Friday, March 3, 2017 10:34:00 AM > To: Hbase-User > Subject: Re: Need guidance on getting detailed elapsed times in every sta= ge of processing a request > > Thanks a lot Yu > > These are truly the metrics we care about about at this point. It is sad = to see that such important metrics were removed from the code. > > I will try to apply your patch on my own to the version of HBase we have.= We definitely need these. > > Other solutions like HTrace are not as urgent as having these few metrics= you talked about here. So if we can get these merged with the code we shou= ld be happy. > > > ________________________________ > From: Yu Li > > Sent: Friday, March 3, 2017 9:54:29 AM > To: Hbase-User > Subject: Re: Need guidance on getting detailed elapsed times in every sta= ge of processing a request > > Hi Jeff, > > If the question is simply monitoring HDFS read/write latencies, please > refer to HBASE-15160 , > there's a patch but not committed yet, and probably cannot apply cleanly = on > current code base, but still some good reference IMHO, so JFYI. > > To get an overview of how quickly the system could respond and what might > be the root cause of the spikes, we only need to monitor the > average/p99/p999 latency of below metrics (stages): > 1. totalCallTime: time from request arriving at server to sending respons= e > 2. processCallTime: time for the server to process the call, regardless o= f > the time this call being queued > 3. queueCallTime: time the call has been queued > 4. HDFS read/pread/write time: time of HFile reading/writing, added in > HBASE-15160 > 5. WAL sync time: time of WAL sync to HDFS, critical path of writing requ= est > > However, for your original question, that to monitor the whole trace of a > single request, I'm afraid no mature solution for the time being just as > Stack mentioned. > > Hope my answer helps (smile). > > Best Regards, > Yu > > On 4 March 2017 at 00:48, jeff saremi > wrote: > >> anything would help. thanks >> >> ________________________________ >> From: saint.ack@gmail.com > on behalf of Stack < >> stack@duboce.net> >> Sent: Thursday, March 2, 2017 9:53:41 PM >> To: Hbase-User >> Subject: Re: Need guidance on getting detailed elapsed times in every >> stage of processing a request >> >> On Thu, Mar 2, 2017 at 10:26 PM, jeff saremi > >> wrote: >> >> > So i'd like to come back to my original question on how to get about >> > separating the latency of HDFS from HBase. >> > >> > >> That is a simple question to which we do not have an answer unfortunatel= y >> (we should). If interested, I could describe how you might do it. I don'= t >> think it would take much work. >> >> St.Ack >> >> >> >> > Is there a most appropriate log4j TRACE option that could print out th= is >> > information to the logs? >> > Thanks >> > >> > ________________________________ >> > From: jeff saremi > >> > Sent: Thursday, March 2, 2017 12:45:59 PM >> > To: Hbase-User >> > Subject: Re: Need guidance on getting detailed elapsed times in every >> > stage of processing a request >> > >> > Thanks so much for the advice! Looking forward to when Tracing gets >> picked >> > up again >> > >> > ________________________________ >> > From: saint.ack@gmail.com > on behalf of Stack < >> > stack@duboce.net> >> > Sent: Thursday, March 2, 2017 12:17:35 PM >> > To: Hbase-User >> > Subject: Re: Need guidance on getting detailed elapsed times in every >> > stage of processing a request >> > >> > HBase/HTrace integration once worked but has long since rotted. >> > Refactorings of internals without proper respect for trace connections= is >> > the main culprit. Updates in htrace and hdfs that need attention >> > reconnecting spans, etc., is another. On top of this, zipkin project h= as >> > seen a burst of effort of late that would seem to offer much promise i= f >> > someone of us spent some time rejiggering how HTrace and Zipkin relate= . >> > >> > I would not waste any time on trying to setup HTrace for HBase at leas= t >> > until after HBASE-14451 goes in, an issue that has been put aside with= a >> > while now. Sorry if you've burned time on this to date. >> > >> > Yours, >> > St.Ack >> > >> > On Thu, Mar 2, 2017 at 6:28 AM, jeff saremi > >> > wrote: >> > >> > > Where would i seek help for issues revolving around HTrace and zipki= n? >> > > Here? Because I have configured everything the way documentation sai= d >> > but i >> > > see nothing in the zipkin server or in the logs. nothing at all >> > > >> > > ________________________________ >> > > From: jeff saremi > >> > > Sent: Tuesday, February 28, 2017 12:52:32 PM >> > > To: user@hbase.apache.org >> > > Subject: Re: Need guidance on getting detailed elapsed times in ever= y >> > > stage of processing a request >> > > >> > > No I had not. but it looks like what i needed. Thanks Ted. >> > > >> > > I'll see if I have any more questions after reading this. >> > > >> > > ________________________________ >> > > From: Ted Yu > >> > > Sent: Tuesday, February 28, 2017 12:47:08 PM >> > > To: user@hbase.apache.org >> > > Subject: Re: Need guidance on getting detailed elapsed times in ever= y >> > > stage of processing a request >> > > >> > > Have you looked at: >> > > http://hbase.apache.org/book.html#tracing >> > > >> > > On Tue, Feb 28, 2017 at 12:37 PM, jeff saremi > >> > > wrote: >> > > >> > > > I think we need to get detailed information from HBase RegionServe= r >> > logs >> > > > on how a request (read or write) is processed. Specifically >> speaking, i >> > > > need to know of say 100 ms time spent in processing a write, how m= uch >> > of >> > > it >> > > > was spent waiting for the HDFS? >> > > > What is the most efficient way of enabling this in log4j propertie= s? >> > Are >> > > > there better mechanisms to get this information? >> > > > >> > > > If I can get this in the log, then I can process the logs offline = or >> in >> > > > neartime and mount some dashboards on the top. >> > > > >> > > > thanks >> > > > >> > > > >> > > > Jeff >> > > > >> > > >> > >> --_000_BY1PR14MB01671E6C602ABCBC32D1FC40C12D0BY1PR14MB0167namp_--