lucene-java-user mailing list archives

From vineet yadav <vineet.yadav.i...@gmail.com>
Subject Fwd: Indexing Wikipedia with Solr/Lucene
Date Sun, 13 May 2012 18:55:06 GMT
Hi all,
I want to build a Lucene/Solr index of the Wikipedia XML dump. I used the Solr
example (http://wiki.apache.org/solr/DataImportHandler#Example:_Indexing_wikipedia)
to index the dump. However, in Wikipedia the categories and external
links are embedded in the article text, so with that setup I cannot
index them separately. I want to extract the categories, external
links, etc. and store each of them in its own field.
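
To make the goal concrete, here is a rough sketch of the kind of extraction I have in mind. It is not part of Solr or the DataImportHandler example; the class name WikiTextFields and both method names are made up for illustration. It assumes categories appear as [[Category:Name]] (optionally with a |sort key) and external links as bracketed [http://... label] entries, so bare URLs and links produced by templates would be missed.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper: pulls category names and external link URLs out of
// raw wikitext so they could be stored in separate index fields.
final class WikiTextFields {

    // Assumed form: [[Category:Name]] or [[Category:Name|sort key]]
    private static final Pattern CATEGORY = Pattern.compile(
            "\\[\\[\\s*Category\\s*:\\s*([^\\]|]+?)\\s*(?:\\|[^\\]]*)?\\]\\]",
            Pattern.CASE_INSENSITIVE);

    // Assumed form: [http://host/path optional label], single brackets only
    private static final Pattern EXTERNAL_LINK = Pattern.compile(
            "(?<!\\[)\\[(https?://[^\\s\\]]+)[^\\]]*\\]");

    // Returns the category names in the order they appear in the text.
    static List<String> extractCategories(String wikiText) {
        return collectFirstGroup(CATEGORY, wikiText);
    }

    // Returns the URLs of bracketed external links in order of appearance.
    static List<String> extractExternalLinks(String wikiText) {
        return collectFirstGroup(EXTERNAL_LINK, wikiText);
    }

    private static List<String> collectFirstGroup(Pattern pattern, String text) {
        List<String> values = new ArrayList<>();
        Matcher matcher = pattern.matcher(text);
        while (matcher.find()) {
            values.add(matcher.group(1));
        }
        return values;
    }
}
```

Each returned list would then go into its own multi-valued field (for example, hypothetical fields named "category" and "external_link") alongside the existing text field.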
Would anyone please be kind enough to give me a bit of advice?
Thanks
Vineet Yadav


