lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Grant Ingersoll <>
Subject Re: indexing api wrt Analyzer
Date Thu, 13 Mar 2008 18:37:17 GMT

On Mar 13, 2008, at 11:03 AM, John Wang wrote:

> Yes, but usually it's a good idea to add documents in batch and not  
> having
> to reinstantiate the writer for every document and then closing it.
> It would be nice if one can specify to the writer which analyzer to  
> use.
> PerfieldAnalyzer wouldn't work because different analyzers may apply  
> on the
> same field depending on the doc, e.g.

Also, I don't know that it is wise to put different langs in the same  
field.  I can't prove it definitively, but it seems to me your corpus  
statistics could be skewed by terms that are spelled the same but have  
different meanings across languages.

To unsubscribe, e-mail:
For additional commands, e-mail:

View raw message