lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Otis Gospodnetic <>
Subject Re: Retrieving Documents
Date Sat, 17 Dec 2011 04:51:38 GMT
Hi Dan,

1) Are you looking for ?

2) Hundreds of words in a field should not be a problem for highlighting.  But it sounds
like this long field may contain content that corresponds to N different pages in a publication
and you would like to inform the searcher which page the match was on, and not just that a
match was somewhere in that big piece of text.  One way to deal with that is to break your
document into N smaller documents - one document for each page.


Performance Monitoring SaaS for Solr -

> From: Dan McGinn-Combs <>
>Sent: Friday, December 16, 2011 4:33 PM
>Subject: Retrieving Documents
>I've been doing a fair amount of reading and experimenting with Solr
>lately. I find that it does a good job of indexing very structured
>documents. However, the application I have in mind is build around
>long EPUB documents.
>Of course, I found the Extract components useful for indexing the
>EPUBs. However, I would like to be able to
>* Size the "highlight" portion of text around the query parameters
>(i.e. show 20 or 30 words) and
>* Retrieve a location within the document so I can display that "page"
>from the EPUB.
>What is common practice for these? I notice that if I have a list of
>(short) text segments in fields, they are stored without too much fuss
>and are retrievable. However, I'm talking about a field of potentially
>hundreds of words.
>Thanks for any pointers,
>Dan McGinn-Combs
>Peachtree City, Georgia USA
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message