incubator-cvs mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Apache Wiki <>
Subject [Incubator Wiki] Update of "PDFBoxProposal" by BenLitchfield
Date Wed, 14 Nov 2007 22:25:53 GMT
Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Incubator Wiki" for change notification.

The following page has been changed by BenLitchfield:

  === Alignment ===
+ The ability to search PDF documents is a basic requirement for any enterprise search solution.
 PDFBox provides the basic content that is needed for content indexing.  This functionality
aligns with the those of Lucene, Nutch, Tika and UIMA and all users of these projects will
benefit from continued development of PDFBox.  
  == Known Risks ==
  === Orphaned products ===
+ PDFBox has been in active development for over 5 years.  The PDFBox community has grown
each year.  PDFBox implements the PDF specification, which is highly utilized by companies
across the world.  The need for a PDF library is strong and is unlikely to change in the near
  === Inexperience with Open Source ===
+ All developers have experience with Open Source projects.
  === Homogenous Developers ===
+ The initial set of committers is diverse and it is likely to attract new developers.
  === Reliance on Salaried Developers ===
+ PDFBox is not the primary job for any of the initial committers.
  === Relationships with Other Apache Products ===
+ PDFBox has relationships with the following Apache Products
+   * [ Apache Lucene] Lucene users typically integrate with
PDFBox to add PDF indexing capabilities.
+   * [ Lucene Nutch] Nutch currently utilizes PDFBox to index
PDF documents.
+   * [ Tika] Tika currently utilizes PDFBox for extracting
PDF content.
+   * [ Apache UIMA] UIMA analyzes unstructured content and
would benefit from PDF content.
  === A Excessive Fascination with the Apache Brand ===
+ Many existing Apache developers are already familiar with PDFBox.  PDFBox was initially
written to compliment the functionality of Lucene and has worked with it's developers over
the past several years.  PDFBox will benefit from closer cooperation with several existing
Apache projects.
  == Documentation ==

To unsubscribe, e-mail:
For additional commands, e-mail:

View raw message