chukwa-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Schubert Zhang (JIRA)" <j...@apache.org>
Subject [jira] Commented: (CHUKWA-22) Need index for chukwa sequence files
Date Mon, 12 Oct 2009 20:22:31 GMT

    [ https://issues.apache.org/jira/browse/CHUKWA-22?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12764815#action_12764815
] 

Schubert Zhang commented on CHUKWA-22:
--------------------------------------

Yes Eric, we have the same experiences and opinions about utilizing HBase's implementation
for data indexing to avoid repeating work form hbase.

In fact, our dataset is very big (e.g. 60,000 records/second). So, it is a challenge to insert
such big dataset into hbase.

> Need index for chukwa sequence files
> ------------------------------------
>
>                 Key: CHUKWA-22
>                 URL: https://issues.apache.org/jira/browse/CHUKWA-22
>             Project: Hadoop Chukwa
>          Issue Type: New Feature
>          Components: Data Processors
>         Environment: Redhat EL 5.1 and Java 6
>            Reporter: Eric Yang
>            Assignee: Eric Yang
>
> Chukwa has ability to collect large volume of data, but the lack of index prevents Chukwa
front end to serve data straight from HDFS.  This jira is the place holder for designing a
indexing service for Chukwa.  The plan is to create indexing service base on available software
like lucene or katta.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message