Return-Path: Delivered-To: apmail-incubator-chukwa-dev-archive@www.apache.org Received: (qmail 14045 invoked from network); 10 Dec 2010 23:59:25 -0000 Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 10 Dec 2010 23:59:25 -0000 Received: (qmail 78248 invoked by uid 500); 10 Dec 2010 23:59:25 -0000 Delivered-To: apmail-incubator-chukwa-dev-archive@incubator.apache.org Received: (qmail 78208 invoked by uid 500); 10 Dec 2010 23:59:25 -0000 Mailing-List: contact chukwa-dev-help@incubator.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: chukwa-dev@incubator.apache.org Delivered-To: mailing list chukwa-dev@incubator.apache.org Received: (qmail 78200 invoked by uid 99); 10 Dec 2010 23:59:25 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 10 Dec 2010 23:59:25 +0000 X-ASF-Spam-Status: No, hits=-2000.0 required=10.0 tests=ALL_TRUSTED X-Spam-Check-By: apache.org Received: from [140.211.11.22] (HELO thor.apache.org) (140.211.11.22) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 10 Dec 2010 23:59:23 +0000 Received: from thor (localhost [127.0.0.1]) by thor.apache.org (8.13.8+Sun/8.13.8) with ESMTP id oBANx0MS017949 for ; Fri, 10 Dec 2010 23:59:01 GMT Message-ID: <1171194.67621292025540790.JavaMail.jira@thor> Date: Fri, 10 Dec 2010 18:59:00 -0500 (EST) From: "Bill Graham (JIRA)" To: chukwa-dev@incubator.apache.org Subject: [jira] Commented: (CHUKWA-564) HBase output collector uses incorrect column family In-Reply-To: <24998714.48261291936140740.JavaMail.jira@thor> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 X-Virus-Checked: Checked by ClamAV on apache.org [ https://issues.apache.org/jira/browse/CHUKWA-564?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12970345#action_12970345 ] Bill Graham commented on CHUKWA-564: ------------------------------------ I agree that there are limitations in using annotations on the processors. I think that where the data is written should be decoupled from the processors. A processor knows how to process data, but it shouldn't also state where the data should be written. Generic processors like TsProcessors could be used repeatedly for different data types, all of which should be written to different table/column-families. Coupling the two with annotations makes this difficult. You end up with empty subclasses used only to configure different data types to table/cfs via overridden annotations. I suggest we externalize the table/cf mappings from the processors. Instead we could have something like an HBaseRouterFactory (or something perhaps named better) that the OutputCollector and the HBaseWriter interact with. HBaseRouterFactory has a method that takes in a dataType and probably also a ChukwaRecord and knows how to return the Table and ColumnFamily that the data should be written too. We could then configure that dataType 'foo' should use BarProcessor and write to table 'bat', column family 'biz'. I don't know how we'd configure 'foo's payload to be written to multiple cfs though. What's the use case for why we'd want to write the same data to two locations? There's still an unresolved separate problem of how to handle ORM-ish functionality as well, since reduxing the many parameters in the record body back to a single 'body' field can be sub-optimal. > HBase output collector uses incorrect column family > --------------------------------------------------- > > Key: CHUKWA-564 > URL: https://issues.apache.org/jira/browse/CHUKWA-564 > Project: Chukwa > Issue Type: Bug > Reporter: Bill Graham > Fix For: 0.5.0 > > > The HBase {{OutputCollector}} does this to obtain the column family from the data type: > {noformat} > cf = key.getReduceType().getBytes(); > {noformat} > The column family should instead be taken by the {{@Table.columnFamily}} annotation on the processor. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.