lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Karl Wettin (JIRA)" <j...@apache.org>
Subject [jira] Commented: (LUCENE-550) InstantiatedIndex - faster but memory consuming index
Date Sun, 24 Feb 2008 20:29:14 GMT

    [ https://issues.apache.org/jira/browse/LUCENE-550?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12571948#action_12571948
] 

Karl Wettin commented on LUCENE-550:
------------------------------------

{quote}
Did zOlive ever post his code Jakarta Commons? Without him actually doing it, I don't know
that it is good enough legally to accept it.
{quote}

He did not. Should I ask him to post the code as an ASL-tagged attachment to this issue? Or
is commons a better place?

{quote}
Also, is your last comment such that you think there is a new patch?
{quote}

Not anytime soon. They are only ideas that could make it a bit less ad hoc. But I'm actually
quite happy with the way it works now. The code has sucessfully been used in a handful of
commercial projects.

> InstantiatedIndex - faster but memory consuming index
> -----------------------------------------------------
>
>                 Key: LUCENE-550
>                 URL: https://issues.apache.org/jira/browse/LUCENE-550
>             Project: Lucene - Java
>          Issue Type: New Feature
>          Components: Store
>    Affects Versions: 2.0.0
>            Reporter: Karl Wettin
>            Assignee: Grant Ingersoll
>         Attachments: HitCollectionBench.jpg, LUCENE-550_20071021_no_core_changes.txt,
test-reports.zip
>
>
> Represented as a coupled graph of class instances, this all-in-memory index store implementation
delivers search results up to a 100 times faster than the file-centric RAMDirectory at the
cost of greater RAM consumption.
> Performance seems to be a little bit better than log2n (binary search). No real data
on that, just my eyes.
> Populated with a single document InstantiatedIndex is almost, but not quite, as fast
as MemoryIndex.    
> At 20,000 document 10-50 characters long InstantiatedIndex outperforms RAMDirectory some
30x,
> 15x at 100 documents of 2000 charachters length,
> and is linear to RAMDirectory at 10,000 documents of 2000 characters length.
> Mileage may vary depending on term saturation.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-dev-help@lucene.apache.org


Mime
View raw message