incubator-blur-dev mailing list archives

From "Gagan Deep Juneja (JIRA)" <>
Subject [jira] [Updated] (BLUR-18) Rework the MapReduce Library to implement Input/OutputFormats
Date Sat, 03 Nov 2012 11:12:12 GMT


Gagan Deep Juneja updated BLUR-18:

    Attachment: 0001-BLUR-ID-18-Created-New-Version-of-Files.patch

I have created a first draft as discussed. Some parts are not very efficient yet: for example,
I used a HashMap to store querySessions in the InputFormat, which I feel is not the best solution,
even though I plan to remove the entry from the map in the OutputCommitter once the job completes.
I am attaching the patch here. Please review it and give me your feedback.
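
A minimal sketch of the cleanup idea described above, not taken from the attached patch. The class name QuerySessionRegistry, its methods, and the use of a String as the session handle are all hypothetical; the sketch only shows a map keyed by job ID whose entry is removed when the job finishes. A ConcurrentHashMap is swapped in for the plain HashMap on the assumption that several threads may touch the map.

```java
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.ConcurrentMap;

// Hypothetical helper for illustration only; not part of Blur or the patch.
final class QuerySessionRegistry {
    // Job ID -> session handle. A String stands in for the real session type.
    private static final ConcurrentMap<String, String> SESSIONS =
            new ConcurrentHashMap<>();

    private QuerySessionRegistry() {}

    // Would be called where the InputFormat opens a session for a job.
    static void register(String jobId, String session) {
        SESSIONS.put(jobId, session);
    }

    // Returns the session for the job, or null if none is registered.
    static String lookup(String jobId) {
        return SESSIONS.get(jobId);
    }

    // Would be called from the commit (and abort) path so that entries
    // do not outlive the job. Returns the removed session, or null.
    static String release(String jobId) {
        return SESSIONS.remove(jobId);
    }

    // Number of sessions currently tracked.
    static int size() {
        return SESSIONS.size();
    }

    public static void main(String[] args) {
        register("job_0001", "session-a");
        System.out.println(lookup("job_0001"));
        release("job_0001");
        System.out.println(size());
    }
}
```

Note that a static map like this is local to one JVM, so it only helps where the code that registers the session and the code that releases it run in the same process.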
> Rework the MapReduce Library to implement Input/OutputFormats
> -------------------------------------------------------------
>                 Key: BLUR-18
>                 URL:
>             Project: Apache Blur
>          Issue Type: Improvement
>            Reporter: Aaron McCurry
>             Fix For: 0.2.0
>         Attachments: 0001-BLUR-ID-18-Created-New-Version-of-Files.patch
> Currently the only way to implement indexing is to use the BlurReducer. A better way
> to implement this would be to support Hadoop InputFormats/OutputFormats in both the new and old APIs.
> This would allow easier integration with other Hadoop projects such as Hive and Pig.
