lucene-java-user mailing list archives

From "Jack Krupansky" <j...@basetechnology.com>
Subject Re: query regarding Lucene Indexing and searching
Date Sun, 02 Mar 2014 16:56:34 GMT
Please elaborate on what you expect will be in this payload. Is it 
information derived from the indexing process itself or is it external 
information to be added to the indexed terms?

-- Jack Krupansky

-----Original Message----- 
From: Mrugendra
Sent: Sunday, March 2, 2014 5:15 AM
To: java-user@lucene.apache.org
Subject: query regarding Lucene Indexing and searching

Sir, I am a PG student. My research topic is optimizing the index file
[reducing index file size, RAM usage, and CPU utilization, and creating an
index with payloads to improve search speed].

My current working scope is a desktop search engine.

1. I am using Lucene to index PDF files [indexing file name and content].
After applying the standard analyzer, the Lucene index size is 11 MB for
1.77 GB of data, while the Windows 8 Windows.edb file is 42 MB for the same
1.77 GB [tested in a Windows desktop environment]. So the space complexity
part is done.

How should I measure time complexity?

2. How can I apply lemmatization with the standard analyzer to reduce the
index file size, and add payloads during indexing?

3. Where can I find a test benchmark?
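
For the time question in item 1, one possible starting point is a small
timing harness like the sketch below. It uses only the Java standard
library. The class name IndexTiming, its methods, and the synthetic
term-counting workload are all hypothetical stand-ins; the real indexing
or search call would take the place of the Runnable. Results from a
harness like this are wall-clock measurements, not a complexity analysis,
and the heap figure is only approximate.

```java
import java.util.Arrays;
import java.util.HashMap;
import java.util.Map;

public class IndexTiming {

    /** Returns the median of the samples (the upper middle value for even counts). */
    public static long median(long[] samples) {
        long[] sorted = samples.clone();
        Arrays.sort(sorted);
        return sorted[sorted.length / 2];
    }

    /**
     * Runs the task `warmup` times untimed (to let the JIT settle), then
     * returns the median elapsed nanoseconds over `runs` timed executions.
     */
    public static long medianNanos(Runnable task, int warmup, int runs) {
        for (int i = 0; i < warmup; i++) {
            task.run();
        }
        long[] samples = new long[runs];
        for (int i = 0; i < runs; i++) {
            long start = System.nanoTime();
            task.run();
            samples[i] = System.nanoTime() - start;
        }
        return median(samples);
    }

    /** Approximate heap currently in use by the JVM, in bytes. */
    public static long usedHeapBytes() {
        Runtime rt = Runtime.getRuntime();
        return rt.totalMemory() - rt.freeMemory();
    }

    public static void main(String[] args) {
        // Hypothetical stand-in workload: count terms of a synthetic document.
        // Replace with the actual indexing or search call being measured.
        Runnable fakeIndexing = () -> {
            Map<String, Integer> counts = new HashMap<>();
            for (int i = 0; i < 200_000; i++) {
                counts.merge("term" + (i % 5_000), 1, Integer::sum);
            }
        };

        long heapBefore = usedHeapBytes();
        long nanos = medianNanos(fakeIndexing, 3, 7);
        long heapAfter = usedHeapBytes();

        System.out.printf("median time: %.2f ms%n", nanos / 1e6);
        System.out.printf("approx. heap change: %d KB%n",
                (heapAfter - heapBefore) / 1024);
    }
}
```

Taking the median over several runs after a warm-up is a design choice to
reduce noise from JIT compilation and garbage collection. Repeating the
measurement at several input sizes would show how the time grows.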

-- 
Regards

Rahevar Mrugendrasinh 

