lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Amin Mohammed-Coleman <ami...@gmail.com>
Subject Re: Lucene Highlighting and Dynamic Summaries
Date Sat, 07 Mar 2009 11:37:06 GMT
hi
that's what i was thinking about.  i would need to get the file and extract
the text again and then pass through the highlighter.  The other option is
storing the content in the index the downside being index is going to be
large.  Which would be the recommended approach?

Cheers

Amin

On Sat, Mar 7, 2009 at 10:50 AM, Erik Hatcher <erik@ehatchersolutions.com>wrote:

> With the caveat that if you're not storing the text you want highlighted,
> you'll have to retrieve it somehow and send it into the Highlighter
> yourself.
>
>        Erik
>
>
> On Mar 7, 2009, at 5:40 AM, Michael McCandless wrote:
>
>
>> You should look at contrib/highlighter, which does exactly this.
>>
>> Mike
>>
>> Amin Mohammed-Coleman wrote:
>>
>>  Hi
>>> I am currently indexing documents (pdf, ms word, etc) that are uploaded,
>>> these documents can be searched and what the search returns to the user
>>> are
>>> summaries of the documents.  Currently the summaries are extracted when
>>> indexing the file (summary constructed by taking the first 10 lines of
>>> the
>>> document and stored in the index as field).  This is not ideal (static
>>> summary), and I was wondering if it would be possible to create a dynamic
>>> summary when a hit is found and highlight the terms found.  The content
>>> of
>>> the document is not stored in the index.
>>>
>>> So basically what I'm looking to do is:
>>>
>>> 1) PDF indexed
>>> 2) PDF body contains the word "search"
>>> 3) Do a search and return the hit
>>> 4) Construct a summary with the term "search" included.
>>>
>>> I'm not sure how to go about doing this (I presume it is possible).  I
>>> would
>>> be grateful for any advice.
>>>
>>>
>>> Cheers
>>> Amin
>>>
>>
>>
>> ---------------------------------------------------------------------
>> To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
>> For additional commands, e-mail: java-user-help@lucene.apache.org
>>
>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
> For additional commands, e-mail: java-user-help@lucene.apache.org
>
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message