accumulo-notifications mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Corey J. Nolet (JIRA)" <j...@apache.org>
Subject [jira] [Comment Edited] (ACCUMULO-2553) AccumuloFileOutputFormat should be able to support output for multiple tables.
Date Mon, 19 May 2014 17:06:38 GMT

    [ https://issues.apache.org/jira/browse/ACCUMULO-2553?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14001983#comment-14001983
] 

Corey J. Nolet edited comment on ACCUMULO-2553 at 5/19/14 5:05 PM:
-------------------------------------------------------------------

The way I'm doing it, the files are put in directories named to the group.
I'm using multiple-outputs to specify the output filename of the group. The only thing is,
I had to make the range partitioner add \u0000 and \uffff before and after the given ranges
respectively (unless they included that themselves) to make sure keys/values aren't written
to files not belonging to that group. 

I think this may be acceptable though it is scheduling reducers that don't have any values.






was (Author: cjnolet@gmail.com):
The way I'm doing it, the files are put in directories named to the group.
I'm using multiple-outputs to specify the output filename of the group.





> AccumuloFileOutputFormat should be able to support output for multiple tables.
> ------------------------------------------------------------------------------
>
>                 Key: ACCUMULO-2553
>                 URL: https://issues.apache.org/jira/browse/ACCUMULO-2553
>             Project: Accumulo
>          Issue Type: New Feature
>            Reporter: Corey J. Nolet
>            Assignee: Corey J. Nolet
>            Priority: Minor
>
> This may not necessarily be something that would require changes in the AccumuloFileOutputFormat
itself. Perhaps the ability to use it with Hadoop's MultipleOutputs is really the solution.
> It would be useful if the user could specify multiple directories where RFiles should
be placed and have a mechanism for populating the RFiles in the necessary directories based
on a table name or group name. 



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Mime
View raw message