lucene-java-user mailing list archives

From Liaqat Ali <liaqatalim...@gmail.com>
Subject Corpus interpretation
Date Wed, 24 Oct 2007 13:02:13 GMT
I want to index an Urdu-language corpus (200 documents in CES XML DTD 
format). Is it necessary to break the XML file into 200 separate files, 
or can it be indexed in its original form using Lucene? Kindly guide me 
in this regard.




