lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Daniel Shane <sha...@LEXUM.UMontreal.CA>
Subject Re: Is there a way to check for field "uniqueness" when indexing?
Date Wed, 19 Aug 2009 19:04:22 GMT
But in that case, I assume Solr does a commit per document added.

Lets say I wanted to index a collection of 1 million pages, would it 
take much longer if I comited at each insertion rather than comiting at 
the end?

Daniel Shane
 
Grant Ingersoll wrote:
>
>
> On Aug 13, 2009, at 10:33 AM, Daniel Shane wrote:
>>
>> Does anyone have an idea on how I could check an index that is in the 
>> process of being indexed (things added, things deleted) for the 
>> uniquess of a given field *at the time I index a document* ?
>
>
> Solr has de-duplication built-in at indexing time: 
> http://wiki.apache.org/solr/Deduplication
>
> --------------------------
> Grant Ingersoll
> http://www.lucidimagination.com/
>
> Search the Lucene ecosystem (Lucene/Solr/Nutch/Mahout/Tika/Droids) 
> using Solr/Lucene:
> http://www.lucidimagination.com/search
>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
> For additional commands, e-mail: java-user-help@lucene.apache.org
>


---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org


Mime
View raw message