lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Alexandre Rafalovitch <arafa...@gmail.com>
Subject Re: What is the recommended way to import and update index records?
Date Tue, 27 Jan 2015 17:09:12 GMT
What do you mean by "update"? If you mean partial update, DIH does not
do it AFAIK. If you mean replace, it should.

If you are getting duplicate records, maybe your uniqueKey is not set correctly?

clean=false looks to me like the right approach for incremental updates.

Regards,
   Alex.
----
Sign up for my Solr resources newsletter at http://www.solr-start.com/


On 27 January 2015 at 11:43, Carl Roberts <carl.roberts.zapata@gmail.com> wrote:
> Also, if I try full-import and clean=false with the same XML file, I end up
> with more records each time the import runs.  How can I make SOLR just add
> the records that are new by id, and update the ones that have an id that
> matches the one in the existing index?
>
>
>
> On 1/27/15, 11:32 AM, Carl Roberts wrote:
>>
>> Hi,
>>
>> What is the recommended way to import and update index records?
>>
>> I've read the documentation and I've experimented with full-import and
>> delta-import and I am not seeing the desired results.
>>
>> Basically, I have 15 RSS feeds that I am importing through
>> rss-data-config.xml.
>>
>> The first RSS feed should be a full import and the ones that follow may
>> contain the same id, in which case the existing id in the index should be
>> updated from the record in the new RSS feed. Also there may be new records
>> in the RSS feeds that follow the first one, in which case I want them added
>> to the index.
>>
>> When I try full-import for each entity, the index is cleared and I just
>> end up with the records for the last import.
>>
>> When I try full-import for each entity, with the clean=false parameter,
>> all the records from each entity are added to the index and I end up with
>> duplicate records.
>>
>> When I try delta-import for the entities the follow the first one, I don't
>> get any new index records.
>>
>> How should I do this?
>>
>> Regards,
>>
>> Joe
>
>

Mime
View raw message