jackrabbit-oak-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Ard Schrijvers <a.schrijv...@onehippo.com>
Subject Re: Re (OAK-36) Implement a query parser - what about indexing?
Date Mon, 26 Mar 2012 07:20:12 GMT
On Fri, Mar 23, 2012 at 6:57 PM, Justin Edelson
<justin@justinedelson.com> wrote:
> On Fri, Mar 23, 2012 at 5:40 AM, Ard Schrijvers
> <a.schrijvers@onehippo.com>wrote:
>> Although I am on thin ice here, I think there are hardly any noSQL
>> stores out there that actually include full text indexes.
> Yes, but the goal of Oak explicitly says "The implementation should provide
> more out-of-the-box functionality than typical NoSQL databases while
> achieving comparable levels of scalability and performance."

the 'more out-of-the-box functionality' does not imply a full-text
index is needed per se. For example hierarchy isn't part of most NoSQL
databases, that is already more functionality

I am just a bit skeptical about 'more x but the same y'. It seems to
me to be impossible to have a full text index and comparable levels of
scalability with NoSQL databases that do not concern about hierarchy
or full text indexes. Giving up some performance for a hierarchy makes
much sense, because it is true added value.

>> I think we
>> shouldn't try to address it in the repository, but rather provide some
>> tooling to easily setup a (external) full text index (like plain
>> Lucene, or use Solr/Elastic search) according someones exact needs
>> (like, which analyzer to use for which part of the content, which
>> properties should be stored, which properties should be analyzed in
>> which ways, which properties are meant for TrieRanges,  etc etc)
> I agree that for many use cases a separate index is appropriate. That
> doesn't obviate the need/appropriateness for an internal full-text search
> index.

I would/do agree only if the full-text search index doesn't imply much
performance constraints, higher memory consumption, and scalability
issues. Imho, the price for full text indexes is way too high, while I
still doubt usability in the end.

Regards Ard

> Justin
>> Regards Ard
>> >
>> > [1]:
>> >
>> http://wiki.apache.org/jackrabbit/Goals%20and%20non%20goals%20for%20Jackrab
>> > bit%203
>> >
>> > Regards,
>> > Thomas
>> >

Amsterdam - Oosteinde 11, 1017 WT Amsterdam
Boston - 1 Broadway, Cambridge, MA 02142

US +1 877 414 4776 (toll free)
Europe +31(0)20 522 4466

View raw message