Mailing-List: contact user-help@hbase.apache.org; run by ezmlm
Precedence: bulk
Reply-To: user@hbase.apache.org
Received-SPF: pass (athena.apache.org: domain of dean.hiller@broadridge.com
 designates 64.18.2.159 as permitted sender)
From: "Hiller, Dean  x66079" <dean.hiller@broadridge.com>
To: "user@hbase.apache.org" <user@hbase.apache.org>
Date: Fri, 17 Jun 2011 16:21:31 -0400
Subject: RE: What's the best approach to search in HBase?
Thread-Topic: What's the best approach to search in HBase?
Thread-Index: Acwr20XGGOCQVaNgTFqN3AuAforZKQBULYKg
Message-ID: 
 <08230D4C8E666D479F6DE495A7684DCD0ABA957D@JSCPCWEXMAA1.bsg.ad.adp.com>
References: <BANLkTikruHfpfps8BW6O0tX7R2nVO=o1Aw@mail.gmail.com>
	<2D6136772A13B84E95DF6DA79E85A9F00142F40132F1@NSPEXMBX-A.the-lab.llnl.gov>
 <BANLkTinAoc8Z6qD18D+dyrpqdgaGUzitpw@mail.gmail.com>
In-Reply-To: <BANLkTinAoc8Z6qD18D+dyrpqdgaGUzitpw@mail.gmail.com>
Accept-Language: en-US
Content-Language: en-US
acceptlanguage: en-US
Content-Type: text/plain; charset="us-ascii"
Content-Transfer-Encoding: quoted-printable
MIME-Version: 1.0

What about using Hbasene....is it pretty good....looks just like a distribu=
ted Lucene and the same api and everything?

Later,
Dean

-----Original Message-----
From: Mark Kerzner [mailto:markkerzner@gmail.com]=20
Sent: Wednesday, June 15, 2011 10:10 PM
To: user@hbase.apache.org
Subject: Re: What's the best approach to search in HBase?

Thank you, everybody. I summarized your advice here,
http://shmsoft.blogspot.com/2011/06/search-in-ediscovery.html, because I
need it for my open source eDiscovery, and now just need to try it all :)

Sincerely,
Mark

On Mon, Jun 6, 2011 at 11:18 AM, Buttler, David <buttler1@llnl.gov> wrote:

> I store over 500M documents in HBase, and index using Solr with dynamic
> fields.  This gives you tremendous flexibility to do the type of queries =
you
> are looking for -- and to make them simple and intuitive via a faceted
> interface.
>
> However, there was quite a bit of software that we had to write to get
> things going, and I can neither release all of it open source, or support
> other people using it.  If I had to start again, I would seriously look a=
t
> solutions like elastic search and lily.
>
> Dave
>
> -----Original Message-----
> From: Mark Kerzner [mailto:markkerzner@gmail.com]
> Sent: Friday, June 03, 2011 5:57 PM
> To: HBase Discussion Group
> Subject: What's the best approach to search in HBase?
>
> Hi,
>
> I need to store, say, 10M-100M documents, with each document having say 1=
00
> fields, like author, creation date, access date, etc., and then I want to
> ask questions like
>
> give me all documents whose author is like abc**, and creation date any
> time
> in 2010 and access date in 2010-2011, and so on, perhaps 10-20 conditions=
,
> matching a list of some keywords.
>
> What's best, Lucene, Katta, HBase CF with secondary indices, or plain sca=
n
> and compare of every record?
>
> Thanks a bunch!
>
> Mark
>
This message and any attachments are intended only for the use of the add=
ressee and
may contain information that is privileged and confidential. If the reade=
r of the =

message is not the intended recipient or an authorized representative of =
the
intended recipient, you are hereby notified that any dissemination of thi=
s
communication is strictly prohibited. If you have received this communica=
tion in
error, please notify us immediately by e-mail and delete the message and =
any
attachments from your system.
=0D