httpd-dev mailing list archives

From Paul Querna <c...@force-elite.com>
Subject [PATCH] clucene search for mod_mbox
Date Sun, 12 Nov 2006 09:04:16 GMT
This is a work-in-progress patch, integrating CLucene[1] as a full-text
search engine for mod_mbox.

For each directory full of mbox files, there is a .mbox_search_index
containing the CLucene index.  This index is created by mod-mbox-util,
when it is called with the -s argument.  This indexing is done
separately from the main mailing list cache updating.
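
To illustrate the layout described above, here is a minimal sketch (not
code from the patch) that walks an archive tree and reports where each
per-directory index would live.  The function names and the ".mbox"
file-name suffix are assumptions made for the example; only the
.mbox_search_index name comes from this message.

```python
import os

# Index name as given in this message.
INDEX_NAME = ".mbox_search_index"


def is_mbox_file(name):
    # Assumption: archive files carry a ".mbox" suffix (e.g. "200611.mbox").
    return name.endswith(".mbox")


def find_index_locations(root):
    """Map each directory under root that holds mbox files to its index path."""
    locations = {}
    for dirpath, dirnames, filenames in os.walk(root):
        # Skip an existing index so its contents are not scanned as archives.
        dirnames[:] = [d for d in dirnames if d != INDEX_NAME]
        if any(is_mbox_file(f) for f in filenames):
            locations[dirpath] = os.path.join(dirpath, INDEX_NAME)
    return locations
```

Calling find_index_locations() on the top of the archive tree returns one
entry per mailing list directory, which is the granularity at which the
indexer would be run.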

Searching the entire httpd archives takes only a couple of milliseconds
on my MacBook Pro once they are indexed.

TODOs:
- Split the patch into consumable bits/commits
- Make a proper Search Engine Result Page (SERP)
- Make Ajaxy Search on the left pane when reading a specific month
- Add OpenSearch support
- Clean up the search indexer to use a single shared utility function
for fetching the MIME-decoded version of an email.
- Lots of code cleanup

[1] - http://clucene.sourceforge.net/index.php/Main_Page
