lucene-dev mailing list archives

From "Steven Parkes" <steven_par...@esseff.org>
Subject RE: Token termBuffer issues
Date Thu, 26 Jul 2007 19:24:13 GMT
	First I create a single large file that has one doc per line from
	Wikipedia content, using this alg

Anybody disagree that the 1-line-per-doc format is better (at least for
Wikipedia)? If so, I'll get rid of the intermediate one-file-per-doc
step.
