lucene-java-user mailing list archives

From antonio roa <antonio.roa.valve...@gmail.com>
Subject Re: RemoveDuplicatesTokenFilter does not work for me
Date Tue, 13 Sep 2011 16:07:33 GMT
I meant I was considering

<filter class="solr.RemoveDuplicatesTokenFilterFactory"/>
<filter class="solr.PositionFilterFactory"/>

instead of

<filter class="solr.PositionFilterFactory"/>
<filter class="solr.RemoveDuplicatesTokenFilterFactory"/>


Antonio ;)
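
A minimal sketch of why the order of these two filters can matter, written as plain Python rather than Lucene code. It assumes RemoveDuplicatesTokenFilter only drops a token whose text was already seen at the same position (as the Javadoc quoted below says), and that PositionFilter, with its default setting, leaves the first token's position increment alone and sets every later increment to 0. The function names and the (term, position_increment) tuples are illustrative only, not Solr API.

```python
# Simplified model of the two filters; NOT the Lucene/Solr implementation.
# A token is a (term, position_increment) tuple. An increment of 0 means
# "same position as the previous token".

def position_filter(tokens, position_increment=0):
    """Assumed behaviour: keep the first token's increment and force
    every later token's increment to position_increment."""
    return [
        (term, inc if i == 0 else position_increment)
        for i, (term, inc) in enumerate(tokens)
    ]

def remove_duplicates(tokens):
    """Assumed behaviour: drop a token only if the same term text was
    already seen at the current position."""
    seen = set()
    out = []
    for term, inc in tokens:
        if inc > 0:
            # The stream moved to a new position: forget earlier terms.
            seen.clear()
        if term not in seen:
            out.append((term, inc))
        seen.add(term)
    return out

# "foo bar foo": three tokens, each at its own position.
tokens = [("foo", 1), ("bar", 1), ("foo", 1)]

# RemoveDuplicates first: no two tokens share a position, nothing is dropped.
dedupe_then_position = position_filter(remove_duplicates(tokens))

# Position first: all tokens collapse onto one position, so the second
# "foo" is now a duplicate at that position and is dropped.
position_then_dedupe = remove_duplicates(position_filter(tokens))

print(dedupe_then_position)
print(position_then_dedupe)
```

Under this model only the second ordering drops the repeated term, which matches the "same position" question asked in the reply quoted below.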


On 13 September 2011 18:06, antonio roa <antonio.roa.valverde@gmail.com> wrote:

> Thanks a lot Chris.
>
> I changed the order of my filters, as I was considering:
>
> <filter class="solr.RemoveDuplicatesTokenFilterFactory"/>
> <filter class="solr.PositionFilterFactory"/>
>
> instead of
>
> <filter class="solr.RemoveDuplicatesTokenFilterFactory"/>
> <filter class="solr.PositionFilterFactory"/>
>
> Now it is running perfectly ;)
>
> Regards,
> Antonio.
>
>
> On 13 September 2011 17:48, Chris Hostetter <hossman_lucene@fucit.org> wrote:
>
>>
>> : I am running an application using RemoveDuplicatesTokenFilter using
>> : solr-core-3.1 and after using the analysis interface this filter does just
>> : nothing. I have debugged the source code of this filter and seems it is not
>> : detecting duplicates tokens.
>>
>> Please note carefully the documentation...
>>
>>
>> https://lucene.apache.org/solr/api/org/apache/solr/analysis/RemoveDuplicatesTokenFilter.html
>> "A TokenFilter which filters out Tokens at the same position and Term text
>> as the previous token in the stream. "
>>
>> are you sure the "duplicates" you are seeing are exact duplicates?  at the
>> exact same position?
>>
>> BTW: if you have followup questions, please post them to the
>> solr-user@lucene mailing list.
>>
>>
>> -Hoss
>>
>> ---------------------------------------------------------------------
>> To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
>> For additional commands, e-mail: java-user-help@lucene.apache.org
>>
>>
>
