incubator-blur-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Aaron McCurry (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (BLUR-18) Rework the MapReduce Library to implement Input/OutputFromats
Date Thu, 11 Oct 2012 12:21:03 GMT

    [ https://issues.apache.org/jira/browse/BLUR-18?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13474079#comment-13474079
] 

Aaron McCurry commented on BLUR-18:
-----------------------------------

I do not believe anyone is actively working on this issue.

Basically the BlurReducer is responsible for indexing the data into Blur, both for bulk loads
and for MapReduce updates.  The functionality in BlurReducer needs to be moved into an OutputFormat
(both the new MR api and the old MR api).  Also an corresponding InputFormat would also be
good addition to read data from Blur via MR.  Once these two formats have been created and
tested, integrating with other technologies such as Hive and Pig will be a lot easier.
                
> Rework the MapReduce Library to implement Input/OutputFromats
> -------------------------------------------------------------
>
>                 Key: BLUR-18
>                 URL: https://issues.apache.org/jira/browse/BLUR-18
>             Project: Apache Blur
>          Issue Type: Improvement
>            Reporter: Aaron McCurry
>
> Currently the only way to implement indexing is to use the BlurReducer.  A better way
to implement this would be to support Hadoop input/outputformats in both the new and old api's.
 This would allow an easier integration with other Hadoop projects such as Hive and Pig.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Mime
View raw message