lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Shalin Shekhar Mangar <shalinman...@gmail.com>
Subject Re: Keepwords Schema
Date Fri, 02 Oct 2009 12:35:29 GMT
On Thu, Oct 1, 2009 at 7:37 PM, matrix_psj <matrix_psj@hotmail.com> wrote:

>
>
> An example:
> My schema is about web files. Part of the syntax is a text field of authors
> that have worked on each file, e.g.
> <file>
>    <filename>login.php</filename>
>   <lastModDate>2009-01-01</lastModDate>
>   <authors>alex, brian, carl carlington, dave alpha, eddie, dave
> beta</authors>
> </file>
>
> When I perform a search and get 20 web files back, I would like a facet of
> the individual authors, but only if there name appears in a
> public_authors.txt file.
>
> So if the public_authors.txt file contained:
> Anna,
> Bob,
> Carl Carlington,
> Dave Alpha,
> Elvis,
> Eddie,
>
> The facet returned would be:
> Carl Carlington
> Dave Alpha
> Eddie
>
>
>
> Not sure if that makes sense? If it does, could someone explain to me the
> schema fieldtype declarations that would bring back this sort of results.
>
>
If I'm understanding you correctly - You want to facet on a field (with
facet=true&facet.field=authors) but you want to show only certain
whitelisted facet values in the response.

If that is correct then, you can remove the authors which are not in the
whitelist during indexing time. You can do this by adding
KeepWordFilterFactory to your field type:

<filter class="solr.KeepWordFilterFactory" words="author_whitelist.txt"
ignoreCase="true" />

-- 
Regards,
Shalin Shekhar Mangar.

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message