Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id 82F31200C7E for ; Tue, 23 May 2017 15:50:44 +0200 (CEST) Received: by cust-asf.ponee.io (Postfix) id 818D2160BC3; Tue, 23 May 2017 13:50:44 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id 77FD7160BB6 for ; Tue, 23 May 2017 15:50:43 +0200 (CEST) Received: (qmail 16844 invoked by uid 500); 23 May 2017 13:50:37 -0000 Mailing-List: contact commits-help@accumulo.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@accumulo.apache.org Delivered-To: mailing list commits@accumulo.apache.org Received: (qmail 16770 invoked by uid 99); 23 May 2017 13:50:37 -0000 Received: from git1-us-west.apache.org (HELO git1-us-west.apache.org) (140.211.11.23) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 23 May 2017 13:50:37 +0000 Received: by git1-us-west.apache.org (ASF Mail Server at git1-us-west.apache.org, from userid 33) id 831E8DFFB5; Tue, 23 May 2017 13:50:37 +0000 (UTC) Content-Type: text/plain; charset="us-ascii" MIME-Version: 1.0 Content-Transfer-Encoding: 8bit From: mwalch@apache.org To: commits@accumulo.apache.org Date: Tue, 23 May 2017 13:50:38 -0000 Message-Id: In-Reply-To: <97cfaf5c312d429b8a98092972e2c2c3@git.apache.org> References: <97cfaf5c312d429b8a98092972e2c2c3@git.apache.org> X-Mailer: ASF-Git Admin Mailer Subject: [2/2] accumulo-website git commit: Jekyll build from master:18942e8 archived-at: Tue, 23 May 2017 13:50:44 -0000 Jekyll build from master:18942e8 Improved linking to javadocs Project: http://git-wip-us.apache.org/repos/asf/accumulo-website/repo Commit: http://git-wip-us.apache.org/repos/asf/accumulo-website/commit/2d7cc3d5 Tree: http://git-wip-us.apache.org/repos/asf/accumulo-website/tree/2d7cc3d5 Diff: http://git-wip-us.apache.org/repos/asf/accumulo-website/diff/2d7cc3d5 Branch: refs/heads/asf-site Commit: 2d7cc3d5ac357abc59c64f0d79b41925af07863e Parents: 3e4443c Author: Mike Walch Authored: Tue May 23 09:50:13 2017 -0400 Committer: Mike Walch Committed: Tue May 23 09:50:13 2017 -0400 ---------------------------------------------------------------------- docs/unreleased/development/sampling.html | 44 ++++++++++---------- .../getting-started/table_configuration.html | 35 ++++++---------- feed.xml | 4 +- 3 files changed, 36 insertions(+), 47 deletions(-) ---------------------------------------------------------------------- http://git-wip-us.apache.org/repos/asf/accumulo-website/blob/2d7cc3d5/docs/unreleased/development/sampling.html ---------------------------------------------------------------------- diff --git a/docs/unreleased/development/sampling.html b/docs/unreleased/development/sampling.html index 4592c40..fff24b9 100644 --- a/docs/unreleased/development/sampling.html +++ b/docs/unreleased/development/sampling.html @@ -336,20 +336,19 @@ estimate, and then making decisions based on the estimate.

Configuring

-

Inorder to use sampling, an Accumulo table must be configured with a class that -implements org.apache.accumulo.core.sample.Sampler along with options for -that class. For guidance on implementing a Sampler see that interface’s -javadoc. Accumulo provides a few implementations out of the box. For -information on how to use the samplers that ship with Accumulo look in the -package org.apache.accumulo.core.sample and consult the javadoc of the -classes there. See the sampling example for examples of how to -configure a Sampler on a table.

- -

Once a table is configured with a sampler all writes after that point will -generate sample data. For data written before sampling was configured sample +

In order to use sampling, an Accumulo table must be configured with a class that +implements Sampler along with options for that class. For guidance on +implementing a Sampler, see the Sampler interface javadoc. Accumulo provides a few +implementations of Sampler out of the box. For information on how to use the samplers that +ship with Accumulo, look in the package org.apache.accumulo.core.client.sample +and consult the javadoc of the classes there. See the sampling example +for examples of how to configure a Sampler on a table.

+ +

Once a table is configured with a Sampler, all writes after that point will +generate sample data. For data written before sampling was configured, sample data will not be present. A compaction can be initiated that only compacts the -files in the table that do not have sample data. The example readme shows how -to do this.

+files in the table that do not have sample data. The sampling example +shows how to do this.

If the sampling configuration of a table is changed, then Accumulo will start generating new sample data with the new configuration. However old data will @@ -358,19 +357,18 @@ compaction can also be issued in this case to regenerate the sample data.

Scanning sample data

-

Inorder to scan sample data, use the setSamplerConfiguration(...) method on -Scanner or BatchScanner. Please consult this methods javadocs for more +

In order to scan sample data, use setSamplerConfiguration(...) method of +Scanner or BatchScanner. Please consult the javadoc of this method for more information.

-

Sample data can also be scanned from within an Accumulo SortedKeyValueIterator. +

Sample data can also be scanned from within an Accumulo SortedKeyValueIterator. To see how to do this, look at the example iterator referenced in the sampling example. -Also, consult the javadoc on org.apache.accumulo.core.iterators.IteratorEnvironment.cloneWithSamplingEnabled().

+Also, consult the javadoc on IteratorEnvironment.cloneWithSamplingEnabled().

-

Map reduce jobs using the AccumuloInputFormat can also read sample data. See -the javadoc for the setSamplerConfiguration() method on -AccumuloInputFormat.

+

Map reduce jobs using the AccumuloInputFormat can also read sample data. See +the javadoc for the setSamplerConfiguration() method of AccumuloInputFormat.

-

Scans over sample data will throw a SampleNotPresentException in the following cases :

+

Scans over sample data will throw a SampleNotPresentException in the following cases :

  1. sample data is not present,
  2. @@ -384,8 +382,8 @@ generated with the same configuration.

    Bulk import

    When generating rfiles to bulk import into Accumulo, those rfiles can contain -sample data. To use this feature, look at the javadoc on the -AccumuloFileOutputFormat.setSampler(...) method.

    +sample data. To use this feature, look at the javadoc of the setSampler(...) +method of AccumuloFileOutputFormat.

    http://git-wip-us.apache.org/repos/asf/accumulo-website/blob/2d7cc3d5/docs/unreleased/getting-started/table_configuration.html ---------------------------------------------------------------------- diff --git a/docs/unreleased/getting-started/table_configuration.html b/docs/unreleased/getting-started/table_configuration.html index 9c8fc2d..8a691ca 100644 --- a/docs/unreleased/getting-started/table_configuration.html +++ b/docs/unreleased/getting-started/table_configuration.html @@ -400,19 +400,14 @@ com.test.AnotherConstraint=2

    Currently there are no general-purpose constraints provided with the Accumulo distribution. New constraints can be created by writing a Java class that implements -the following interface:

    +the Constraint interface.

    -
     org.apache.accumulo.core.constraints.Constraint
    -
    -
    - -

    To deploy a new constraint, create a jar file containing the class implementing the -new constraint and place it in the lib directory of the Accumulo installation. New +

    To deploy a new constraint, create a jar file containing a class implementing Constraint +and place it in the lib/ directory of the Accumulo installation. New constraint jars can be added to Accumulo and enabled without restarting but any change to an existing constraint class requires Accumulo to be restarted.

    -

    See the contraints examples -for example code.

    +

    See the constraints examples for example code.

    Bloom Filters

    @@ -427,8 +422,7 @@ This can speed up lookups considerably.

    -

    The bloom filter examples -contains an extensive example of using Bloom Filters.

    +

    The bloom filter examples contains an extensive example of using Bloom Filters.

    Iterators

    @@ -437,7 +431,7 @@ TabletServers when scanning or compacting data. This allows users to efficiently summarize, filter, and aggregate data. In fact, the built-in features of cell-level security and column fetching are implemented using Iterators. Some useful Iterators are provided with Accumulo and can be found in the -org.apache.accumulo.core.iterators.user package. +org.apache.accumulo.core.iterators.user package. In each case, any custom Iterators must be included in Accumulo’s classpath, typically by including a jar in lib/ or lib/ext/, although the VFS classloader allows for classpath manipulation using a variety of schemes including URLs and HDFS URIs.

    @@ -445,7 +439,7 @@ allows for classpath manipulation using a variety of schemes including URLs and

    Setting Iterators via the Shell

    Iterators can be configured on a table at scan, minor compaction and/or major -compaction scopes. If the Iterator implements the OptionDescriber interface, the +compaction scopes. If the Iterator implements the OptionDescriber interface, the setiter command can be used which will interactively prompt the user to provide values for the given necessary options.

    @@ -458,7 +452,7 @@ user@myinstance mytable> setiter -t mytable -scan -p 15 -n myiter -class com.

    The config command can always be used to manually configure iterators which is useful -in cases where the Iterator does not implement the OptionDescriber interface.

    +in cases where the Iterator does not implement the OptionDescriber interface.

    config -t mytable -s table.iterator.scan.myiter=15,com.company.MyIterator
     config -t mytable -s table.iterator.minc.myiter=15,com.company.MyIterator
    @@ -560,11 +554,10 @@ are removed from disk as part of the regular garbage collection process.

    Filters

    When scanning over a set of key-value pairs it is possible to apply an arbitrary -filtering policy through the use of a Filter. Filters are types of iterators that return +filtering policy through the use of a Filter. Filters are types of iterators that return only key-value pairs that satisfy the filter logic. Accumulo has a few built-in filters that can be configured on any table: AgeOff, ColumnAgeOff, Timestamp, NoVis, and RegEx. More can be added -by writing a Java class that extends the -org.apache.accumulo.core.iterators.Filter class.

    +by writing a Java class that extends the Filter class.

    The AgeOff filter can be configured to remove data older than a certain date or a fixed amount of time from the present. The following example sets a table to delete @@ -671,14 +664,12 @@ foo day:20080103 [] 1

    Accumulo includes some useful Combiners out of the box. To find these look in -the org.apache.accumulo.core.iterators.user package.

    +the org.apache.accumulo.core.iterators.user package.

    Additional Combiners can be added by creating a Java class that extends -org.apache.accumulo.core.iterators.Combiner and adding a jar containing that -class to Accumulo’s lib/ext directory.

    +Combiner and adding a jar containing that class to Accumulo’s lib/ext directory.

    -

    See the combiner example -for example code.

    +

    See the combiner example for example code.

    Block Cache

    http://git-wip-us.apache.org/repos/asf/accumulo-website/blob/2d7cc3d5/feed.xml ---------------------------------------------------------------------- diff --git a/feed.xml b/feed.xml index c68428d..2bf2fe2 100644 --- a/feed.xml +++ b/feed.xml @@ -6,8 +6,8 @@ https://accumulo.apache.org/ - Mon, 22 May 2017 16:09:46 -0400 - Mon, 22 May 2017 16:09:46 -0400 + Tue, 23 May 2017 09:49:52 -0400 + Tue, 23 May 2017 09:49:52 -0400 Jekyll v3.3.1