Subject: svn commit: r803868 - /websites/staging/accumulo/trunk/content/accumulo/example/wikisearch.html
Date: Fri, 03 Feb 2012 15:47:27 -0000
To: accumulo-commits@incubator.apache.org
From: buildbot@apache.org
Reply-To: accumulo-dev@incubator.apache.org
Message-Id: <20120203154727.611132388A36@eris.apache.org>

Author: buildbot
Date: Fri Feb 3 15:47:26 2012
New Revision: 803868

Log:
Staging update by buildbot for accumulo

Modified:
    websites/staging/accumulo/trunk/content/accumulo/example/wikisearch.html

Modified: websites/staging/accumulo/trunk/content/accumulo/example/wikisearch.html
==============================================================================
--- websites/staging/accumulo/trunk/content/accumulo/example/wikisearch.html (original)
+++ websites/staging/accumulo/trunk/content/accumulo/example/wikisearch.html Fri Feb 3 15:47:26 2012
@@ -127,7 +127,7 @@ td {
 [Document 57, Document 220]
 Other
-172849
+172,849
 []
 Ostrich
@@ -230,8 +230,8 @@ td {
 3.65
 3.85
 3.67
-22956
-3830102
+22,956
+3,830,102
 “paris” and “in” and “the” and “spring”
 3.06
@@ -239,8 +239,8 @@ td {
 2.78
 3.02
 2.92
-10755
-1757293
+10,755
+1,757,293
 “rubber” and “duckie” and “ernie”
 0.08
@@ -257,8 +257,8 @@ td {
 1.3
 1.31
 1.31
-2973
-493800
+2,973
+493,800
 “slashdot” and “grok”
 0.06
@@ -267,7 +267,7 @@ td {
 0.06
 0.06
 14
-2371
+2,371
 “three” and “little” and “pigs”
 0.92
@@ -275,8 +275,8 @@ td {
 0.9
 1.08
 0.88
-2742
-481531
+2,742
+481,531

Because the terms are tested together within the region server, even fairly high-cardinality terms such as “old,” “man,” and “sea” can be tested efficiently, with no need to return to the client or to make distributed calls between servers to intersect the terms.
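
The idea of intersecting terms in one place can be illustrated with a minimal sketch. It is not the wikisearch iterator code: the function name `intersect_terms`, the in-memory `toy_index`, and the choice to start from the rarest term are all assumptions made for illustration, and the document ids are invented.

```python
from typing import Dict, List, Set


def intersect_terms(index: Dict[str, Set[int]], terms: List[str]) -> Set[int]:
    """Return the ids of documents that contain every term in `terms`.

    `index` stands in for the term index held by a single server, so the
    whole intersection runs locally with no calls to a client or peer.
    """
    if not terms:
        return set()
    # Assumption for this sketch: visit the lowest-cardinality term first
    # so the candidate set is as small as possible from the start.
    ordered = sorted(terms, key=lambda t: len(index.get(t, ())))
    result = set(index.get(ordered[0], ()))
    for term in ordered[1:]:
        result &= index.get(term, set())
        if not result:
            break  # no document can match once the candidate set is empty
    return result


# Hypothetical postings; the real cardinalities are in the hundreds of thousands.
toy_index = {
    "old": {1, 2, 3, 5, 8},
    "man": {2, 3, 5, 9},
    "sea": {3, 5, 7},
}
matches = intersect_terms(toy_index, ["old", "man", "sea"])
print(sorted(matches))  # [3, 5]
```

Only the final set of matching ids would need to leave the server in this model, which is why the cost does not grow with the number of terms exchanged between machines.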

@@ -284,22 +284,22 @@ td {
 Term Cardinality
 ducky 795
-ernie 13433
-fast 166813
-furious 10535
+ernie 13,433
+fast 166,813
+furious 10,535
 furriest 45
-grok 1168
-in 1884638
-little 320748
-man 548238
-old 720795
-paris 232464
-pigs 8356
-rubber 17235
-sea 247231
-slashdot 2343
-spring 125605
-three 718810
+grok 1,168
+in 1,884,638
+little 320,748
+man 548,238
+old 720,795
+paris 232,464
+pigs 8,356
+rubber 17,235
+sea 247,231
+slashdot 2,343
+spring 125,605
+three 718,810

Accumulo supports caching index information, which is turned on by default, and caching the non-index (data) blocks of a file, which is not. After turning on data block caching for the wiki table:

@@ -386,16 +386,16 @@ For comparison, these are the cold start
 Count
 41.97
-440743
+440,743
 41.61
-320522
+320,522
 42.11
-347969
+347,969
 38.32
-275655
+275,655

Query Load During Ingest