hadoop-common-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Hadoop QA (JIRA)" <j...@apache.org>
Subject [jira] Commented: (HADOOP-1913) [HBase] Build a Lucene index on an HBase table
Date Mon, 24 Sep 2007 21:32:51 GMT

    [ https://issues.apache.org/jira/browse/HADOOP-1913?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12529990
] 

Hadoop QA commented on HADOOP-1913:
-----------------------------------

+1 overall.  Here are the results of testing the latest attachment 
http://issues.apache.org/jira/secure/attachment/12366485/build_table_index.take6.patch
against trunk revision r578879.

    @author +1.  The patch does not contain any @author tags.

    javadoc +1.  The javadoc tool did not generate any warning messages.

    javac +1.  The applied patch does not generate any new compiler warnings.

    findbugs +1.  The patch does not introduce any new Findbugs warnings.

    core tests +1.  The patch passed core unit tests.

    contrib tests +1.  The patch passed contrib unit tests.

Test results: http://lucene.zones.apache.org:8080/hudson/job/Hadoop-Patch/816/testReport/
Findbugs warnings: http://lucene.zones.apache.org:8080/hudson/job/Hadoop-Patch/816/artifact/trunk/build/test/findbugs/newPatchFindbugsWarnings.html
Checkstyle results: http://lucene.zones.apache.org:8080/hudson/job/Hadoop-Patch/816/artifact/trunk/build/test/checkstyle-errors.html
Console output: http://lucene.zones.apache.org:8080/hudson/job/Hadoop-Patch/816/console

This message is automatically generated.

> [HBase] Build a Lucene index on an HBase table
> ----------------------------------------------
>
>                 Key: HADOOP-1913
>                 URL: https://issues.apache.org/jira/browse/HADOOP-1913
>             Project: Hadoop
>          Issue Type: New Feature
>          Components: contrib/hbase
>            Reporter: Ning Li
>            Assignee: stack
>            Priority: Minor
>             Fix For: 0.15.0
>
>         Attachments: build_table_index.patch, build_table_index.take2.again.patch, build_table_index.take2.patch,
build_table_index.take3.patch, build_table_index.take4.patch, build_table_index.take5.patch,
build_table_index.take6.patch
>
>
> This patch provides a Reducer class and other related classes which help to build a Lucene
index on an HBase table. The index build part is similar to that of Nutch.
>   - Each row is modeled as a Lucene document: row key is indexed in its untokenized form,
column name-value pairs are Lucene field name-value pairs.
>   - IndexConf is used to configure various Lucene parameters, specify whether to optimize
an index and which columns to index and/or store, in tokenized or untokenized form, etc.
>   - The number of reduce tasks decides the number of indexes (partitions). The index(es)
is stored in the output path of job configuration.
>   - The index build process is done in the reduce phase. Users can use the map phase
to join rows from different tables or to pre-parse/analyze column content, etc.
>   - A junit test is added to test the build of an index on an HBase table with an identity
mapper. It also serves as an example on how to use the new classes.
>   - BuildTableIndex is provided to help building an index on an HBase table. It should
be moved to examples package if HBase decides to have one.
> This patch requires the inclusion of the Lucene library.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message