lucene-java-user mailing list archives

From Mrugendra <rahevar.mrugen...@gmail.com>
Subject query regarding Lucene Indexing and searching
Date Sun, 02 Mar 2014 10:15:18 GMT
Sir, I am a PG student. My research topic is optimizing the index file:
reducing index file size, RAM usage, and CPU utilization, and creating the
index with payloads to improve search speed.

My current working scope is a desktop search engine.

1. I am using Lucene to index PDF files (file name and content). After
applying the standard analyzer, the Lucene index file size is 11 MB for
1.77 GB, and the Windows 8 Windows.edb file size is 42 MB for 1.77 GB
(tested in a Windows desktop environment). So the space complexity part is
done.
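
To put those sizes in proportion, the ratios can be worked out with a small sketch like the one below. It assumes 1 GB = 1024 MB, and the class and method names are only illustrative, not part of Lucene or any tool mentioned here.

```java
public class IndexSizeRatio {

    // Returns the index size as a percentage of the corpus size.
    // Assumption: 1 GB = 1024 MB.
    static double percentOfCorpus(double indexMb, double corpusGb) {
        double corpusMb = corpusGb * 1024.0;
        return indexMb / corpusMb * 100.0;
    }

    public static void main(String[] args) {
        double corpusGb = 1.77;      // size of the indexed PDF collection
        double luceneIndexMb = 11.0; // Lucene index size reported above
        double windowsEdbMb = 42.0;  // Windows.edb size reported above

        System.out.printf("Lucene index: %.2f%% of corpus%n",
                percentOfCorpus(luceneIndexMb, corpusGb));
        System.out.printf("Windows.edb:  %.2f%% of corpus%n",
                percentOfCorpus(windowsEdbMb, corpusGb));
        System.out.printf("Windows.edb is %.1fx the Lucene index%n",
                windowsEdbMb / luceneIndexMb);
    }
}
```

Under that assumption the Lucene index comes to roughly 0.6% of the collection and Windows.edb to roughly 2.3%.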

How do I handle the time complexity?

2. How do I apply lemmatization with the standard analyzer to reduce the
index file size, and add payloads during indexing?

3. Where can I find a test benchmark?

-- 
Regards

Rahevar Mrugendrasinh
