lucene-general mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From payo <>
Subject a single index
Date Fri, 07 Sep 2007 14:45:47 GMT

I am working with lucene and i am new 

I want to index documents HTML for this I do 

java org.w3c.tidy.Tidy - m * html

java org.apache.lucene.demo.IndexHTML - create - index index .\

all this generates index to me and when doing my search in the Web if it
shows to the documents and the summary to me.

despues I index pdf

org.pdfbox.searchengine.lucene.IndexFiles - create - index pdf \

this also generates index to me

but the index PDF replace index HTML

how I can make him to have single index and  when doing my search in the WEB
showme as HTML and PDF documents?

my directory base is


i use the lucene demo

View this message in context:
Sent from the Lucene - General mailing list archive at

View raw message