lucene-dev mailing list archives

From Erik Hatcher <e...@ehatchersolutions.com>
Subject Re: cvs commit: jakarta-lucene/src/java/org/apache/lucene/analysis/de GermanAnalyzer.java GermanStemmer.java
Date Thu, 09 Oct 2003 08:54:39 GMT
It seems to be the issue mentioned here as well:

	http://nagoya.apache.org/bugzilla/show_bug.cgi?id=18410
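
To illustrate the question raised in the commit comment quoted below, here is a minimal sketch of why filter order can matter. It does not use the Lucene classes. It assumes, hypothetically, that the stop table holds lowercase entries and that the lookup is an exact, case-sensitive match. The class name, stop words, and sample tokens are invented for the example.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Locale;
import java.util.Set;

public class StopOrderSketch {
    // Hypothetical stop table with lowercase entries only (an assumption,
    // not the actual GermanAnalyzer stop word list).
    static final Set<String> STOP_WORDS =
            new HashSet<>(Arrays.asList("der", "die", "das", "und"));

    // Drops tokens that match a stop word exactly (case-sensitive lookup).
    static List<String> removeStopWords(List<String> tokens) {
        List<String> kept = new ArrayList<>();
        for (String token : tokens) {
            if (!STOP_WORDS.contains(token)) {
                kept.add(token);
            }
        }
        return kept;
    }

    // Lowercases every token, standing in for a lowercasing filter.
    static List<String> lowerCase(List<String> tokens) {
        List<String> lowered = new ArrayList<>();
        for (String token : tokens) {
            lowered.add(token.toLowerCase(Locale.GERMAN));
        }
        return lowered;
    }

    public static void main(String[] args) {
        List<String> tokens = Arrays.asList("Die", "Katze", "und", "der", "Hund");

        // Without lowercasing, the sentence-initial "Die" slips through.
        System.out.println(removeStopWords(tokens));            // [Die, Katze, Hund]

        // Lowercasing first lets the stop table catch it, but the
        // capitalization that marks German nouns is lost as well.
        System.out.println(removeStopWords(lowerCase(tokens))); // [katze, hund]
    }
}
```

The second call shows the trade-off Otis guesses at: lowercasing before stop filtering catches capitalized stop words, at the cost of discarding noun capitalization before later stages see the tokens.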


On Wednesday, October 8, 2003, at 09:41  PM, Otis Gospodnetic wrote:
> Answer to question comment: possibly because nouns start with a capital
> letter in German, so lowercasing may not be the right thing to do.
> This is a bit of a guess.  Maybe the author will enlighten us. :)
>
> Otis
>
> --- ehatcher@apache.org wrote:
>> ehatcher    2003/10/08 17:08:52
>>
>>   Modified:    src/java/org/apache/lucene/analysis/de
>> GermanAnalyzer.java
>>                         GermanStemmer.java
>>   Log:
>>   minor javadoc fixup and add question comment
>>
>>   Revision  Changes    Path
>>   1.7       +3 -2      jakarta-lucene/src/java/org/apache/lucene/analysis/de/GermanAnalyzer.java
>>
>>   Index: GermanAnalyzer.java
>>   ===================================================================
>>   RCS file: /home/cvs/jakarta-lucene/src/java/org/apache/lucene/analysis/de/GermanAnalyzer.java,v
>>   retrieving revision 1.6
>>   retrieving revision 1.7
>>   diff -u -r1.6 -r1.7
>>   --- GermanAnalyzer.java	29 Jan 2003 17:18:53 -0000	1.6
>>   +++ GermanAnalyzer.java	9 Oct 2003 00:08:52 -0000	1.7
>>   @@ -169,7 +169,8 @@
>>        {
>>    	TokenStream result = new StandardTokenizer( reader );
>>    	result = new StandardFilter( result );
>>   -	result = new StopFilter( result, stoptable );
>>   +  // shouldn't there be a lowercaser before stop word filtering?
>>   +  result = new StopFilter( result, stoptable );
>>    	result = new GermanStemFilter( result, excltable );
>>    	return result;
>>        }
>>
>>
>>
>>   1.7       +1 -3      jakarta-lucene/src/java/org/apache/lucene/analysis/de/GermanStemmer.java
>>
>>   Index: GermanStemmer.java
>>   ===================================================================
>>   RCS file: /home/cvs/jakarta-lucene/src/java/org/apache/lucene/analysis/de/GermanStemmer.java,v
>>   retrieving revision 1.6
>>   retrieving revision 1.7
>>   diff -u -r1.6 -r1.7
>>   --- GermanStemmer.java	18 Aug 2002 17:33:16 -0000	1.6
>>   +++ GermanStemmer.java	9 Oct 2003 00:08:52 -0000	1.7
>>   @@ -165,8 +165,6 @@
>>        /**
>>         * Does some optimizations on the term. This optimisations are
>>         * contextual.
>>   -     *
>>   -     * @return  The term with the optimizations applied.
>>         */
>>        private void optimize( StringBuffer buffer )
>>        {
>>
>>
>>
>>
>> ---------------------------------------------------------------------
>> To unsubscribe, e-mail: lucene-dev-unsubscribe@jakarta.apache.org
>> For additional commands, e-mail: lucene-dev-help@jakarta.apache.org
>>

