lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Lance Norskog" <>
Subject RE: Large Data Set Suggestions
Date Thu, 06 Nov 2008 19:14:36 GMT
You can also do streaming XML upload for the XML-based indexing. This can
feed, say, 100k records in one XML file from a separate machine.

All of these options ignore the case where there is an error in your input
records v.s. the schema.  DIH gives up on an error. Streaming XML gives up
on an error.


-----Original Message-----
From: Steven Anderson [] 
Sent: Thursday, November 06, 2008 5:57 AM
Subject: RE: Large Data Set Suggestions

> In that case you may put the file in a mounted NFS directory or you 
> can serve it out with an apache server.

That's one option although someone else on the list mentioned that
performance was 10x slower in their NFS experience.

Another option is to serve up the files via Apache and pull them via DIH

Thankfully, there are lots of options, but we need to determine which one
will perform best.


A. Steven Anderson
410-418-9908 VSTI
443-790-4269 cell

View raw message