From: GitBox
To: notifications@accumulo.apache.org
Subject: [GitHub] keith-turner commented on a change in pull request #38: Continue tour
Message-ID: <151068683864.21773.17958651334001403926.gitbox@gitbox.apache.org>
archived-at: Tue, 14 Nov 2017 19:14:00 -0000

keith-turner commented on a change in pull request #38: Continue tour
URL: https://github.com/apache/accumulo-website/pull/38#discussion_r150927221

##########
File path: tour/ranges-splits.md
##########
@@ -0,0 +1,36 @@
+---
+title: Ranges and Splits
+---
+
+A Range is a specified group of Keys. There are many different ways to create a Range. Here are a few examples:
+```java
+new Range(Key startKey, Key endKey) // Creates a range from startKey inclusive to endKey inclusive.
+new Range(CharSequence row) // Creates a range that covers an entire row.
+new Range(CharSequence startRow, CharSequence endRow) // Creates a range from startRow inclusive to endRow inclusive.
+```
+
+A Scanner by default will scan all Keys in a table but this can be inefficient. It is a good practice to set a range on a Scanner.
+```java
+scanner.setRange(new Range("id0000", "id0010")); // returns rows from id0000 to id0010
+```
+
+As your data grows larger, Accumulo will split tables across multiple servers called Tablet Servers.
+By default a table will get split on row boundaries, guaranteeing an entire row to be on one Tablet Server. We have the ability to
+tell Accumulo were to split tables by setting split points. This is done using _addSplits_ in the [TableOperations] API. The image below
+demonstrates how Accumulo splits data.

Review comment:
Should mention Tablets in this paragraph.
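
To make the point about Tablets concrete, here is a minimal sketch in plain Java of how split points divide a table's rows into tablets. It does not use the Accumulo API: `TabletSketch` and `tabletIndexFor` are hypothetical names introduced only for illustration, and the split points `id0003` and `id0007` are made up. It assumes each tablet covers the rows after the previous split point up to and including its own split point, with a final tablet holding everything after the last split.

```java
import java.util.TreeSet;

public class TabletSketch {
    // Hypothetical helper: returns the index of the tablet that would hold
    // the given row, assuming tablet i covers (split[i-1], split[i]] and the
    // last tablet covers every row after the final split point.
    public static int tabletIndexFor(TreeSet<String> splits, String row) {
        // headSet(row, false) contains the split points strictly less than
        // row; each one closes a tablet that ends before this row.
        return splits.headSet(row, false).size();
    }

    public static void main(String[] args) {
        // Illustrative split points (assumed, not taken from the tour page).
        TreeSet<String> splits = new TreeSet<>();
        splits.add("id0003");
        splits.add("id0007");

        String[] rows = {"id0000", "id0003", "id0004", "id0007", "id0010"};
        for (String row : rows) {
            System.out.println(row + " -> tablet " + tabletIndexFor(splits, row));
        }
    }
}
```

With two split points the sketch yields three tablets. A row equal to a split point falls in the tablet that ends at that split, so a whole row always lands in a single tablet.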