Return-Path: X-Original-To: apmail-accumulo-dev-archive@www.apache.org Delivered-To: apmail-accumulo-dev-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id CA867FE62 for ; Fri, 3 May 2013 14:02:16 +0000 (UTC) Received: (qmail 79662 invoked by uid 500); 3 May 2013 14:02:16 -0000 Delivered-To: apmail-accumulo-dev-archive@accumulo.apache.org Received: (qmail 79509 invoked by uid 500); 3 May 2013 14:02:13 -0000 Mailing-List: contact dev-help@accumulo.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@accumulo.apache.org Delivered-To: mailing list dev@accumulo.apache.org Received: (qmail 79180 invoked by uid 99); 3 May 2013 14:02:11 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 03 May 2013 14:02:11 +0000 X-ASF-Spam-Status: No, hits=1.7 required=5.0 tests=FREEMAIL_ENVFROM_END_DIGIT,HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of supun06@gmail.com designates 209.85.223.173 as permitted sender) Received: from [209.85.223.173] (HELO mail-ie0-f173.google.com) (209.85.223.173) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 03 May 2013 14:02:07 +0000 Received: by mail-ie0-f173.google.com with SMTP id k5so1849070iea.4 for ; Fri, 03 May 2013 07:01:47 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=x-received:mime-version:in-reply-to:references:from:date:message-id :subject:to:content-type; bh=K4GNGtlimSgRQTvL3lKKACbJTqrLFzI5vbEJ6KDIM30=; b=BZaTFnwx0j29DZyxcCmoza/hLfFr4nNF5lraIHz2qAXAdVNYO9n0G/dASK/t22IziJ 3UttnhhgmBGQcSrMU/fcNH12g0RPExc1a41YtkJodf4KF4kptuuKfaHXe/o8aoTWCmW0 9vJqUyJ0L/YWi7sYH/ftWSpmo+WLs1K8pOgorRjSPjQ+qdgloy0WFm2e0HDHWJQn0GvQ 15qk8899ZFdUMQxMk7+oR/ijIMj9XsRfPmQNi6xxdz/DEAB3rn1RALm+L+eRRw+i9lb7 esQkXY7wt2NX2wMNzgR8vw+lVwWuVKkw5bUFvLuLDpMdeQNjjKZRcE4ZJa1T03lcEQY/ JNgA== X-Received: by 10.43.65.10 with SMTP id xk10mr4576337icb.10.1367589707095; Fri, 03 May 2013 07:01:47 -0700 (PDT) MIME-Version: 1.0 Received: by 10.50.12.129 with HTTP; Fri, 3 May 2013 07:01:26 -0700 (PDT) In-Reply-To: References: <517F2BC4.1030000@gmail.com> From: Supun Kamburugamuva Date: Fri, 3 May 2013 10:01:26 -0400 Message-ID: Subject: Re: GSOC: Monitor improvements - draft proposal To: dev@accumulo.apache.org Content-Type: multipart/alternative; boundary=bcaec51b1c73b0a6a304dbd0ca10 X-Virus-Checked: Checked by ClamAV on apache.org --bcaec51b1c73b0a6a304dbd0ca10 Content-Type: text/plain; charset=ISO-8859-1 I've submitted the proposal to google. http://www.google-melange.com/gsoc/proposal/review/google/gsoc2013/supun06/1 Thanks, Supun.. On Tue, Apr 30, 2013 at 10:31 AM, Supun Kamburugamuva wrote: > Hi Josh, > > Thanks for the detailed feedback. I've integrated your suggestions to the > document. > > For a third party monitoring tool I would like to use Zabbix. But I may > have to do more research on this one. For now I'll leave it with Zabbix. > > I couldn't find a library that helps with JMX development. I guess most > people use the Java API directly. Anyways I'll do more research on this one > and try to find one if possible. > > Thanks for the Javascrip library suggestions. d3.js seems impressive. > > I've updated the timeline and added a deliverable section. Hope this helps > a bit. Let me know if it needs further improvements. For each phase my plan > is to create a patch with the changes. > > Thanks, > Supun.. > > > > > On Mon, Apr 29, 2013 at 10:26 PM, Josh Elser wrote: > >> Supun, >> >> Thanks for the draft! Some feedback -- hopefully it's useful for your >> proposal in addition to giving you a better understanding of how Accumulo >> is typically run. >> >> "These servers perform different functionalities" >> >> Actually, most serversin an Accumulo cluster are identical to one >> another: most are running a TabletServer, and in <1.5, a Logger. The >> exceptions are the Master, Monitor, Tracer and GarbageCollector. Master, >> monitor and gc are typically run on the same node (monitor and gc are >> rather lightweight). Running a tracer on every TabletServer is probably >> overkill, but, again, this is another lightweight process, so not outside >> the realm of possibilities. >> >> "Create a JMX API for Monitor to gather statistics" >> >> Any plans to include an example 3rd-party monitor that takes advantage of >> the internal change from Thrift to JMX? If so, which? I could see this >> being very useful for your own verification and validation, not to mention >> for 3rd parties (people other than yourself). >> >> "Table Graphs" >> >> I'd be rather interested to see how the amount of data being returned by >> a TabletServer correlates with query rate. It would be a neat plot to see >> how RFile index size and size of each key-value returned corresponds with >> query rate. Maybe it would be cool to have the ability to let users create >> composite graphs? >> >> "Trace Visualization" >> >> Not a lot to really see here. Currently you get some rudimentary >> information about how long it took to determine which files to delete, and >> how long deleting them took (I think). It would be nice to see this broken >> down by table, and include file size and other file metadata. >> >> "Server Status Information" >> >> I remember hearing that someone had done some work to actually pop a >> shell in the monitor when authenticated over HTTPS. Another cool feature >> might be to actually have some greater insight into a node (perhaps using >> JMX calls that we wouldn't want publicly available) when properly >> authenticated? I'm thinking about being able to view the list of running >> scans on a node... being able to introspect the actual scan options/data, >> ranges being run, etc. >> >> "Mock Stats Collector" >> >> I would put money that this will pay off in spades as you move forward >> testing things. >> >> Some more high-level things... >> >> * Any thought/preference on the JMX library you would want to use? >> * Re: Javascript, might want to look at DataTables (jQuery-based), d3.js, >> and/or nvd3. Lots of options here, but licensing can be a concern. Glad you >> thought about that already. >> >> "Deliverables and Timeline" >> >> I'd try to rethink your timeline a bit; it comes off very waterfall-y to >> me. The biggest red-flag to me is the "write documentation" as your last >> phase. Coming from experience, this doesn't work 95% of the time. Something >> else always comes up, takes longer, w/e and suddenly you have some code >> that you just got working and no documentation. I know it's difficult to >> create a development schedule when you're not completely familiar with what >> will be required of you, but trying to lay out the work in such a way that >> you have some concrete, measurable results after each phase will help you >> and, I believe, make a much more realistic schedule (not to mention make >> the advisor's job easier to see progress :P). >> >> I hope this helps in one way or another. >> >> - Josh >> >> >> On 4/29/2013 10:46 AM, Supun Kamburugamuva wrote: >> > Hi all, >> > >> > Here is the draft proposal for the Monitor Improvements project. >> > >> > https://docs.google.com/**document/d/**1j1YHZJXuzxIrB1udt1RnWZUgZLeo-** >> JX711gEv1l--r8/edit#heading=h.**2r66wv56fsz >> > >> > I would really appreciate your feedback. >> > >> > Cheers, >> > Supun.. >> > >> >> > > > -- > Supun Kamburugamuva > Member, Apache Software Foundation; http://www.apache.org > E-mail: supun06@gmail.com; Mobile: +1 812 369 6762 > Blog: http://supunk.blogspot.com > > -- Supun Kamburugamuva Member, Apache Software Foundation; http://www.apache.org E-mail: supun06@gmail.com; Mobile: +1 812 369 6762 Blog: http://supunk.blogspot.com --bcaec51b1c73b0a6a304dbd0ca10--