lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Hoss Man (JIRA)" <>
Subject [jira] Commented: (LUCENE-477) Build an index which allows me to broswe by category.
Date Tue, 06 Dec 2005 23:11:08 GMT
    [ ] 

Hoss Man commented on LUCENE-477:

This isn't a "bug" or a "feature" or a "task" as much as it is a "question" about using lucene
in a particular way.  Questions generally recieve more comment on the java-user at lucene
dot apache dot org then they do when posted in JIRA.

In particular, you should search the mailing list archive for "facet" or "faceted" before
you ask teh question, previous discussions may give you enough info to solve your problem.

> Build an index which allows me to broswe by category.
> -----------------------------------------------------
>          Key: LUCENE-477
>          URL:
>      Project: Lucene - Java
>         Type: Task
>   Components: Index
>     Versions: 1.4
>  Environment: JDK 1.4, Windows 2003, Tomcat 5.0.28
>     Reporter: Mark Dos Santos

> Hello there,
> I have a collection of documents that I am using lucene to build an index for, and then
I have a jsp app to search my documents. This all works great. I believe lucene is such an
amazing product, but thats a whole other topic. Anyway, maybe it's my lack of experience in
building indexes, but I am have trouble coming up with an index that kind of mimics verity's
parametric index.  You see my documents all have a category path (I have over 50,000 docs).
 A document can be at any level of the category path, and that same path can have many different
documents. IE. Document x, has a category path USA//New Jersey//Trenton//09890 and Document
y has a category path USA//New Jersey//Trenton//09890.  
> Basically, I would like to build an index using lucene, where when I search, if my results
were to bring back those two documents, I would like to retrieve the distinct category path
for those two documents.  Of course I can loop through and build a vector with only the unique
paths that come in the search results, but that obviously would take to long when I get lets
say 10000 results from my search.
> So the question I guess is, how can I build an index that would facilitate this functionality
for me.  If anyone has any suggestions I would greatly appreciate it.
> Thanks,
> Mark

This message is automatically generated by JIRA.
If you think it was sent incorrectly contact one of the administrators:
For more information on JIRA, see:

To unsubscribe, e-mail:
For additional commands, e-mail:

View raw message