lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Arcadius Ahouansou (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (SOLR-3110) Search result comes up with truncated words at the start of highlighted fragment
Date Thu, 27 Sep 2012 15:02:08 GMT

    [ https://issues.apache.org/jira/browse/SOLR-3110?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13464798#comment-13464798
] 

Arcadius Ahouansou commented on SOLR-3110:
------------------------------------------

This issue has been around for a while and seems to be related to LUCENE-1822
                
> Search result comes up with truncated words at the start of highlighted fragment
> --------------------------------------------------------------------------------
>
>                 Key: SOLR-3110
>                 URL: https://issues.apache.org/jira/browse/SOLR-3110
>             Project: Solr
>          Issue Type: Bug
>          Components: highlighter
>    Affects Versions: 4.0-ALPHA
>         Environment: java Tomcat Solaris
>            Reporter: Shyam Bhaskaran
>              Labels: FastVectorHighlighter, boundaryScanner, highlighting, solr
>
> It is being observed that words are getting truncated at the start of Highlighter fragment
displayed. 
> Following boundary scanner settings are introduced inside in the solrconfig.xml file
> <str name="hl.bs.chars">.,!?  &\#9;&\#10;&\#13;</str>  
> If I change the settings to 
> <str name="hl.bs.chars">.,!?</str>
> then it is seen that this issue goes away but another issues comes up where the highlighted
search fragment does not start from the beginning of the sentence.
> Below is the complete list of setting we are using for boundary scanner.
>    <boundaryScanner name="simple" class="solr.highlight.SimpleBoundaryScanner" default="true">
>      <lst name="defaults">
>        <str name="hl.bs.maxScan">200</str>
>        <str name="hl.bs.chars">.,!? &\#9;&\#10;&\#13;</str>
>      </lst>
>    </boundaryScanner>
>    <boundaryScanner name="breakIterator" class="solr.highlight.BreakIteratorBoundaryScanner">
>      <lst name="defaults">
>        <str name="hl.bs.type">SENTENCE</str>
>        <str name="hl.bs.language">en</str>
>        <str name="hl.bs.country">US</str>
>      </lst>
>    </boundaryScanner>

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


Mime
View raw message