lucene-solr-user mailing list archives

From Pulkit Singhal <pulkitsing...@gmail.com>
Subject Re: Interesting DIH challenge
Date Mon, 10 Oct 2011 01:13:04 GMT
Oh, also: does DIH have any way, even an experimental one, to read data
from one Solr core, massage it, and import it into another core?
If not, would that be a good addition, or is it a waste of time for some
architectural reason?

On Sun, Oct 9, 2011 at 8:00 PM, Pulkit Singhal <pulkitsinghal@gmail.com> wrote:

> @Gora Thank You!
>
> I know that Solr accepts XML with Solr-specific elements that are commands
> that only it understands ... such as <add/>, <commit/> etc.
>
> Question: Is there some way to ask Solr to dump out whatever it has in its
> index already ... as a Solr XML document?
>
> Plan: I intend to massage that XML dump (add the field + value that I need
> in every doc's XML element) and then I should be able to push this dump back
> to Solr to get the data indexed again, I hope.
>
> Thanks!
> - Pulkit
>
>
> On Sun, Oct 9, 2011 at 2:57 PM, Gora Mohanty <gora@mimirtech.com> wrote:
>
>> On Mon, Oct 10, 2011 at 1:17 AM, Pulkit Singhal <pulkitsinghal@gmail.com> wrote:
>> > Hello Folks,
>> >
>> > I'm a big DIH fan but I'm fairly sure that now I've run into a scenario
>> > where it can't help me anymore ... but before I give up and roll my own
>> > solution, I just wanted to check with everyone else.
>> >
>> > The scenario:
>> > - already have 1M+ documents indexed
>> > - the schema.xml needs to have one more field added to it ...
>> > problem/do-able? yes? no? remove all the old data? or do the update
>> > per doc (add/delete)?
>>
>> This is independent of DIH. If you want to add a new field to the schema,
>> you should reindex. 1M documents should not take that long.
>>
>> > - need to populate data from a file that has a key and value per line,
>> > and I need to use the key to find the doc to update and then add the
>> > value to the new schema field
>>
>> It is best just to reindex, but it should be possible to write a script to
>> pull each doc from the existing Solr index, massage the return format into
>> Solr's XML format, adding a value for the new field in the process, and
>> then post the new file to Solr for indexing.
>>
>> Regards,
>> Gora
>>
>
>

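The script Gora describes could look roughly like the following. This is only
a minimal sketch of the "massage" step, not anything that ships with Solr or
DIH. It assumes the docs have already been fetched from the existing index as
plain dicts (for example by paging through /select with wt=json), that the
key/value file is tab-separated, and that the unique key field is called `id`.
The names `load_key_values`, `build_add_xml` and `new_field` are made up for
illustration. Note that Solr can only return stored fields, so anything
indexed but not stored would be lost in a round trip like this.

```python
import xml.etree.ElementTree as ET


def load_key_values(lines, sep="\t"):
    """Parse 'key<sep>value' lines into a dict, skipping lines without sep."""
    values = {}
    for line in lines:
        line = line.rstrip("\n")
        if sep not in line:
            continue  # blank or malformed line
        key, value = line.split(sep, 1)
        values[key.strip()] = value.strip()
    return values


def build_add_xml(docs, values, key_field="id", new_field="new_field"):
    """Build a Solr <add> message from docs (dicts of stored fields).

    key_field and new_field are assumptions; use the names from your schema.
    """
    add = ET.Element("add")
    for doc in docs:
        doc_el = ET.SubElement(add, "doc")
        for name, value in doc.items():
            # Multi-valued fields arrive as lists: emit one <field> per value.
            items = value if isinstance(value, list) else [value]
            for item in items:
                field = ET.SubElement(doc_el, "field", name=name)
                field.text = str(item)
        # Look up the doc's key in the file data and append the new field.
        key = str(doc.get(key_field))
        if key in values:
            field = ET.SubElement(doc_el, "field", name=new_field)
            field.text = values[key]
    return ET.tostring(add, encoding="unicode")


if __name__ == "__main__":
    values = load_key_values(["1\tred\n", "2\tblue\n"])
    docs = [{"id": "1", "title": "foo"}, {"id": "2", "tags": ["a", "b"]}]
    print(build_add_xml(docs, values))
```

The resulting string can then be posted to the /update handler, followed by a
<commit/>. ElementTree takes care of escaping characters such as & and < in
field values, which is easy to get wrong when concatenating strings by hand.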