lucene-solr-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Mike Klaas <mike.kl...@gmail.com>
Subject Re: [Solr Wiki] Update of "AnalyzersTokenizersTokenFilters" by MikeKlaas
Date Fri, 08 Jun 2007 22:23:52 GMT
On 8-Jun-07, at 2:44 PM, Chris Hostetter wrote:

>
> :    * '''generateWordParts="1"''' causes parts of words to be  
> generated:
> :      * `"PowerShot" => "Power" "Shot"`
> : +    * `"Power-Shot" => "Power" "Shot"`
>
> : +  * '''splitOnCaseChange="1"''' causes lowercase => uppercase  
> transitions to generate a new part [Solr 1.3]:
> : +    * `"PowerShot" => "Power" "Shot"`
> : +    * `"TransAM" => "Trans" "AM"`
> :
> :   These parameters may be combined in any way.
> :    * Example of generateWordParts="1" and  catenateWords="1":
>
> we should calrify when "PowerShot" becomes "Power" "Shot" ... is it
> only when generateWordParts=1 and splitOnCaseChange=1 ? ... will it  
> still
> happen if generateWordParts=0 and splitOnCaseChange=1 ?

It will, but it will be recombined assuming catenateWords=1, so there  
should be no externally visible effects

> (I ask because frankly i'm not clear on this myself)

I reorganized the parameters a bit to clarify this--let me know if  
that helps.

-Mike

Mime
View raw message