lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Noble Paul നോബിള്‍ नोब्ळ्" <noble.p...@gmail.com>
Subject Re: Large Data Set Suggestions
Date Thu, 06 Nov 2008 13:41:49 GMT
On Thu, Nov 6, 2008 at 7:04 PM, Steven Anderson <sanderson@vsticorp.com> wrote:
>> The performance of DIH is likely to be faster than SolrJ.
>> Because , it does not have the overhead of an http request.
>
> Understood.  However, we may not have the option of co-locating the data
> to be injested with the Solr server.
In that case you may put the file in a mounted NFS directory or you
can serve it out with an apache server.
>
>> What is your data source? I am assuming it is xml.
>
> Yes. Incoming stream of xml documents to a directory.
>
>> SolrJ cannot directly index xml . You may need to read docs from xml
> before solrj can index it.
>
> Understood.  We'd like to compare the performance difference of DIH vs.
> custom xml parsing + SolrJ.
good. share your observations
>
> A. Steven Anderson
> 410-418-9908 VSTI
> 443-790-4269 cell
>
>
>



-- 
--Noble Paul

Mime
View raw message