accumulo-notifications mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From GitBox <>
Subject [GitHub] milleruntime commented on a change in pull request #38: Continue tour
Date Thu, 01 Jan 1970 00:00:00 GMT
milleruntime commented on a change in pull request #38: Continue tour

 File path: tour/
 @@ -0,0 +1,36 @@
+title: Ranges and Splits
+A Range is a specified group of Keys. There are many different ways to create a Range.  Here
are a few examples:
+new Range(Key startKey, Key endKey)  // Creates a range from startKey inclusive to endKey
+new Range(CharSequence row)  // Creates a range that covers an entire row.
+new Range(CharSequence startRow, CharSequence endRow) // Creates a range from startRow inclusive
to endRow inclusive.
+A Scanner by default will scan all Keys in a table but this can be inefficient. It is a good
practice to set a range on a Scanner.
+scanner.setRange(new Range("id0000", "id0010"));  // returns rows from id0000 to id0010
+As your data grows larger, Accumulo will split tables across multiple servers called Tablet
+By default a table will get split on row boundaries, guaranteeing an entire row to be on
one Tablet Server.  We have the ability to 
+tell Accumulo were to split tables by setting split points. This is done using _addSplits_
in the [TableOperations] API.  The image below 
+demonstrates how Accumulo splits data.  
+![data distribution]({{ site.url }}/images/docs/data_distribution.png)
+There are many useful administrative methods in [TableOperations] so take a minute to look
through the API. Here are some terms specific to Accumulo:
 Review comment:
   I agree. I was trying to point out the more useful API classes and the best places to start.
 Maybe we could do that in another exercise

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
For queries about this service, please contact Infrastructure at:

With regards,
Apache Git Services

View raw message