+1
Perhaps we want both a "schema.xml" and a "schema-kitchen-sink.xml".
While it is good for the default to be fast, we also want to make sure
every feature has a functioning example somewhere.
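
As a rough sketch of what the lean default might look like (purely
illustrative: the field names and types here are assumptions, not the
actual example schema), the costly options Yonik lists below could stay
in the file but be commented out, with working versions living in the
kitchen-sink schema:

```xml
<!-- schema.xml: lean default; field type definitions omitted for brevity -->
<schema name="example" version="1.1">
  <fields>
    <field name="id" type="string" indexed="true" stored="true" required="true"/>
    <field name="name" type="text" indexed="true" stored="true"/>

    <!-- Disabled by default for indexing speed; enable only if needed.
    <field name="includes" type="text" indexed="true" stored="true" termVectors="true"/>
    <field name="timestamp" type="date" indexed="true" stored="true" default="NOW"/>
    -->
  </fields>

  <uniqueKey>id</uniqueKey>

  <!-- Disabled by default; copies each value into a catch-all field.
  <copyField source="name" dest="text"/>
  -->
</schema>
```

That way the default still shows how each feature is configured without
paying for it on every document.
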
On Mar 7, 2009, at 10:12 AM, Yonik Seeley wrote:
> I've occasionally run across people going with another search engine
> because it was faster at indexing.
> The example schema that people may be using as a base to do their
> benchmarking (with perhaps minimal modifications) is slow.
> There are many people out there that check what's fastest first, and
> *then* check if it is satisfactory to meet their needs in other areas.
>
> With very simple synthetic test documents (just a few fields each) and
> the CSV loader, I've personally seen the indexing rate go from
> ~330/sec to ~3000/sec, when I removed the default field values, term
> vectors, copyFields, etc. The default example schema should still be
> able to show how something can be done, but that doesn't mean it needs
> to be enabled by default.
>
> So what do people think about speeding up the default/example schema
> before 1.4?
>
> -Yonik
> http://www.lucidimagination.com