accumulo-user mailing list archives

From Bob.Thor...@l-3com.com
Subject RE: how to use CountingIterator to count records?
Date Thu, 07 Jun 2012 15:25:20 GMT
It's an adaptation of a feature table where the weight is the number of
occurrences found during ingest.  The row IDs are features that are
relevant to my queries/row counts (e.g. timespan, geo-space, document
partition ID, keywords).

Example:

ROWID     FAM       QUAL       VIS     VALUE
=====     ===       ====       ===     =====
White     KEYWORD   OTHER      public  123
14SU      GEO       MGRS       public  456
9223      TIMESPAN  EPOC       public  7890
DOCPART1  DOCUMENT  PARTITION  public  1234567


Because each count lives in a single row, one tablet server will know how
many rows exist across the cluster for any given ROWID.  So I can quickly
determine how many rows exist across all my tablet servers with one simple
scan.

Obviously you have to count them all on ingest and update the edge table.
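
A minimal sketch of that bookkeeping, to make the idea concrete. It is
in-memory only: a sorted map stands in for the table, and the class and
method names (FeatureCountSketch, recordOccurrence, count) are made up
for illustration, not taken from my code or from the Accumulo API. On a
real table you would presumably write one mutation per feature and let
something like a summing combiner add the values up, but that part is an
assumption and is not shown here.

```java
import java.util.TreeMap;

/**
 * In-memory sketch of a feature-count table.
 * A TreeMap stands in for the sorted table; nothing here touches Accumulo.
 */
final class FeatureCountSketch {

    // Key = rowId, family and qualifier joined by a NUL separator
    // (hypothetical encoding); value = occurrences seen during ingest.
    private final TreeMap<String, Long> counts = new TreeMap<>();

    private static String key(String rowId, String family, String qualifier) {
        return rowId + '\0' + family + '\0' + qualifier;
    }

    /** Call once for every feature of every ingested record. */
    void recordOccurrence(String rowId, String family, String qualifier) {
        counts.merge(key(rowId, family, qualifier), 1L, Long::sum);
    }

    /** Direct lookup of the occurrence count; 0 if the feature was never seen. */
    long count(String rowId, String family, String qualifier) {
        return counts.getOrDefault(key(rowId, family, qualifier), 0L);
    }
}
```

At query time the count for a feature such as White/KEYWORD/OTHER is a
single lookup on its row rather than a pass over the data rows.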


-----Original Message-----
From: David Medinets [mailto:david.medinets@gmail.com] 
Sent: Thursday, June 07, 2012 09:00
To: user@accumulo.apache.org
Subject: Re: how to use CountingIterator to count records?

Can you describe the Edge Table approach or provide a reference?

On Thu, Jun 7, 2012 at 8:55 AM,  <Bob.Thorman@l-3com.com> wrote:
> have moved to the Edge Table approach for a direct look up of
> occurrences.
