accumulo-notifications mailing list archives

From "Corey J. Nolet (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (ACCUMULO-2553) AccumuloFileOutputFormat should be able to support output for multiple tables.
Date Sat, 13 Sep 2014 01:41:34 GMT

    [ https://issues.apache.org/jira/browse/ACCUMULO-2553?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14132451#comment-14132451 ]

Corey J. Nolet commented on ACCUMULO-2553:
------------------------------------------

So I've got this coded up. It's sitting in a different GitHub project at the moment until
I get time to port it over. I believe Bill Slacum may be using this on Koverse as well.

https://github.com/calrissian/accumulo-recipes/blob/master/commons/src/main/java/org/calrissian/accumulorecipes/commons/hadoop/GroupedKeyRangePartitioner.java


I have to get an example coded up as well.

> AccumuloFileOutputFormat should be able to support output for multiple tables.
> ------------------------------------------------------------------------------
>
>                 Key: ACCUMULO-2553
>                 URL: https://issues.apache.org/jira/browse/ACCUMULO-2553
>             Project: Accumulo
>          Issue Type: New Feature
>            Reporter: Corey J. Nolet
>            Assignee: Corey J. Nolet
>            Priority: Minor
>
> This may not necessarily be something that would require changes in the
> AccumuloFileOutputFormat itself. Perhaps the ability to use it with Hadoop's
> MultipleOutputs is really the solution.
> It would be useful if the user could specify multiple directories where RFiles
> should be placed and have a mechanism for populating the RFiles in the necessary
> directories based on a table name or group name.
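
The mechanism the issue asks for (routing output by table or group name so that
each group's RFiles land in their own directory) can be sketched roughly as
follows. This is a minimal illustration in plain Java, not the code of the linked
GroupedKeyRangePartitioner: the class name, method names, and the
"baseDir/group" directory layout are all assumptions made for the example, and a
real job would presumably wrap this logic in a Hadoop Partitioner subclass rather
than use it directly.

```java
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical sketch: every name here is illustrative, not an Accumulo API.
final class GroupedRangePartitionerSketch {

    // Group (or table) name -> sorted split points for that group.
    // Insertion order is kept so partition numbering stays stable.
    private final Map<String, String[]> splitsByGroup = new LinkedHashMap<>();

    // Register a group and its split points; a group with n splits owns n + 1 partitions.
    void addGroup(String group, String... splits) {
        String[] sorted = splits.clone();
        Arrays.sort(sorted);
        splitsByGroup.put(group, sorted);
    }

    // Total number of partitions (and therefore reducers) across all groups.
    int numPartitions() {
        int total = 0;
        for (String[] splits : splitsByGroup.values()) {
            total += splits.length + 1;
        }
        return total;
    }

    // Map a (group, row) pair to a global partition index. Each group gets a
    // contiguous block of indexes; within the block the row is placed by
    // binary search over that group's split points.
    int getPartition(String group, String row) {
        int offset = 0;
        for (Map.Entry<String, String[]> entry : splitsByGroup.entrySet()) {
            String[] splits = entry.getValue();
            if (entry.getKey().equals(group)) {
                int idx = Arrays.binarySearch(splits, row);
                // When the row is not a split point, binarySearch returns
                // -(insertionPoint) - 1, so recover the insertion point.
                int local = idx >= 0 ? idx : -(idx + 1);
                return offset + local;
            }
            offset += splits.length + 1;
        }
        throw new IllegalArgumentException("Unknown group: " + group);
    }

    // Assumed layout: one output directory per group under a common base directory.
    static String outputDir(String baseDir, String group) {
        return baseDir + "/" + group;
    }
}
```

With a group "tableA" split at "g" and "n" and a group "tableB" split at "m",
this yields five partitions in total: row "h" of tableA falls in partition 1 and
row "z" of tableB in partition 4. Because no partition ever mixes groups, each
reducer's sorted output could be written under a single group directory.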



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
