lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Erick Erickson <erickerick...@gmail.com>
Subject Re: Re-Indexing 143 million rows
Date Mon, 21 Nov 2016 14:37:33 GMT
In a word, "no". Resending the same document will

1> delete the old version (based on ID)
2> index the document just sent.

When a document comes in, Solr can't assume that
"nothing's changed". What if you changed your schema?

So I'd expect the second run to take at least as long as the first.

Best,
Erick

On Mon, Nov 21, 2016 at 1:16 AM, subinalex <alexkutty19@gmail.com> wrote:
> Hi Team,
>
> I have indexed data with 143 rows(docs) into solr.It takes around 3 hours to
> index.I usde csvUpdateHandler and indexes the csv file by remote streaming.
> Now ,when i re-indexing the same csv data,it is still taking 3+ hours.
>
> Ideally,since there are no changes in _id values,it should have finished
> quickly right?.
>
> Please provide some insights on this..
>
> Regards,
> Subin
>
>
>
> --
> View this message in context: http://lucene.472066.n3.nabble.com/Re-Indexing-143-million-rows-tp4306622.html
> Sent from the Solr - User mailing list archive at Nabble.com.

Mime
View raw message