accumulo-user mailing list archives

From David Medinets <david.medin...@gmail.com>
Subject Re: Using AccumuloOutputFormat, All Records Stored In One Tablet (Node)
Date Mon, 16 Apr 2012 18:55:48 GMT
Argh. Just to be clear: are the splits essentially partitions of the row ID space?

Can I add splits after the data is ingested? If so, how can I redistribute?

On Mon, Apr 16, 2012 at 2:45 PM, Eric Newton <eric.newton@gmail.com> wrote:
> Create the table with splits, but this requires you to know something about
> the distribution of your data.
>
> -Eric
>
>
> On Mon, Apr 16, 2012 at 2:38 PM, David Medinets <david.medinets@gmail.com>
> wrote:
>>
>> Hopefully I am doing something wrong that can be easily rectified. I
>> have a Hadoop job that is sending well over 200M entries into
>> Accumulo. But every entry is being sent to a single node. The table
>> was created by the Hadoop job.
>>
>> How can I get the entries to be spread over several nodes?
>
>

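To illustrate the question of whether splits partition the row ID space, here is a minimal sketch in plain Python (it does not talk to Accumulo). It assumes row IDs are compared lexicographically and that each tablet covers the range (previous split, split], with the split point itself belonging to the lower tablet. The helper names `tablet_index` and `hex_splits`, and the hex-prefixed row IDs, are hypothetical and chosen only for the example. In a real deployment the split points would be handed to Accumulo, for example through `TableOperations.addSplits`, rather than computed and applied this way.

```python
from bisect import bisect_left
from collections import Counter


def tablet_index(row_id, splits):
    """Return the index of the tablet that would hold row_id.

    splits must be sorted. Tablet i covers (splits[i-1], splits[i]],
    so a row equal to a split point lands in the lower tablet.
    """
    return bisect_left(splits, row_id)


def hex_splits(n_tablets):
    """Evenly spaced two-hex-digit split points (hypothetical helper).

    Only sensible if row IDs start with uniformly distributed hex
    characters, e.g. a hash prefix. That is an assumption, not
    something stated in the thread.
    """
    return [format(i * 256 // n_tablets, "02x") for i in range(1, n_tablets)]


# Made-up row IDs standing in for the ingested data.
rows = ["00a1", "3fff", "40", "41b2", "9c00", "ffee"]

# With no splits there is a single tablet, so every row lands in it.
no_split_counts = Counter(tablet_index(r, []) for r in rows)

# With three split points the same rows spread over four tablets.
splits = hex_splits(4)
split_counts = Counter(tablet_index(r, splits) for r in rows)

print("splits:", splits)
print("without splits:", dict(no_split_counts))
print("with splits:", dict(split_counts))
```

A table with no split points behaves like the first case, which is consistent with every entry going to one node. Picking useful split points depends on knowing how the row IDs are distributed, as Eric notes.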