lucene-commits mailing list archives

From mattm...@apache.org
Subject svn commit: r886813 - /lucene/board-reports/2009/board-report-dec-2009.txt
Date Thu, 03 Dec 2009 15:22:26 GMT
Author: mattmann
Date: Thu Dec  3 15:22:24 2009
New Revision: 886813

URL: http://svn.apache.org/viewvc?rev=886813&view=rev
Log:
Update December board report with more info on Nutch and Tika.

Modified:
    lucene/board-reports/2009/board-report-dec-2009.txt

Modified: lucene/board-reports/2009/board-report-dec-2009.txt
URL: http://svn.apache.org/viewvc/lucene/board-reports/2009/board-report-dec-2009.txt?rev=886813&r1=886812&r2=886813&view=diff
==============================================================================
--- lucene/board-reports/2009/board-report-dec-2009.txt (original)
+++ lucene/board-reports/2009/board-report-dec-2009.txt Thu Dec  3 15:22:24 2009
@@ -22,7 +22,12 @@
 
 NUTCH
 
-Nutch is a web-search engine: crawler, indexer and search runtime.
+Nutch is a web-search engine: crawler, indexer and search runtime. There has
+been a recent flurry of discussion about Nutch's future post-ApacheCon, 
+spearheaded by Andrzej Bialecki and others. In addition, there have been 
+efforts to more closely integrate Tika's parsing framework into Nutch,
+as well as efforts to update the (existing) mime detection work based on improvements
+to Tika's mime detector.
 
 
   
@@ -59,6 +64,10 @@
 
 Apache Tika is a toolkit for detecting and extracting metadata and
 structured text content from various documents using existing parser
-libraries.  Tika released version 0.5 this quarter.
+libraries.  Tika released version 0.5 this quarter. There have been
+recent development efforts to speed up Tika's mime detector, as well as
+efforts to provide a self-contained OSGi-based Tika bundle. There is a 
+strong desire to release these post 0.5 improvements, so we are planning
+to release Tika 0.6 in the next few weeks.
 
 