incubator-cvs mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Apache Wiki <wikidi...@apache.org>
Subject [Incubator Wiki] Update of "DroidsProposal" by ThorstenScherler
Date Wed, 03 Sep 2008 11:49:11 GMT
Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Incubator Wiki" for change notification.

The following page has been changed by ThorstenScherler:
http://wiki.apache.org/incubator/DroidsProposal

The comment on the change is:
more text enhancements

------------------------------------------------------------------------------
  
  === Rationale ===
  
- There is ever more demand for tools that automatically do determinate tasks. Search engines
such as Nuts are normally very focused on a specific functionality and are not focused on
extensibility. While there are a number of existing crawler libraries for various task, each
of them comes with a custom API and there are no generic interface for automatically determining
which crawler (droids) to use for a specific task. The focus of Droids is not a single crawler
but more to offer different reusable components that custom droids can use to automate certain
tasks.
+ There is ever more demand for tools that automatically do determinate tasks. Search engines
such as Nuts are normally very focused on a specific functionality and are not focused on
extensibility. Furthermore there are manly focused on crawling, requesting certain pages and
extract links to other pages, which in our opinion is only one small area for automated robots.
While there are a number of existing crawler libraries for various task, each of them comes
with a custom API and there are no generic interface for automatically determining which crawler
(droids) to use for a specific task. 
  
- The Droids project attempts to remove this duplication of efforts. We believe that by pooling
the efforts of multiple projects we will be able to create a generic toolkit that exceeds
the capabilities and quality of the custom solutions of any single project. A generic toolkit
project will also provide common ground for the developers of crawler libraries and content
applications to interact.
+ The Droids project attempts to remove this duplication of efforts. We believe that by pooling
the efforts of multiple projects we will be able to create a generic robot framework that
exceeds the capabilities and quality of the custom solutions of any single project. The focus
of Droids is not a single crawler but more to offer different reusable components that custom
droids (robots) can use to automate certain tasks. An intelligent standalone robot framework
project will not only provide common ground for the developers of crawler but as well for
any other automated application (robots) libraries. 
  
  === Initial Goals ===
  
  The initial goals of the proposed project are:
  
   * Viable community around the Droids codebase
-  * Active relationships and possible cooperation with related projects and communities
+  * Active relationships and possible cooperation with related projects and communities (e.g.
reusing Tika for text extraction)
-  * Generic robot API for crawling, extracting structured text content and new task, filtering
task and handle the content
+  * Generic robot API for crawling, extracting structured text content and/or new task, filtering
task and handle the content
   * Flexible extension and plugin development to create a wide range of functionality
  
  == Current Status ==
  
+ === Meritocracy ===
+ 
+ All the initial committers are familiar with the meritocracy principles of Apache, and have
already worked on the various source codebases. We will follow the normal meritocracy rules
also with other potential contributors.
+ 
+ === Community ===
+ 
+ There is not yet a clear Droids community. Instead we have a number of people and related
projects with an understanding that an intelligent standalone robot framework project would
best serve everyone's interests. The primary goal of the incubating project is to build a
self-sustaining community around this shared vision.
+ 
+ === Core Developers ===
+ 
+ The initial set of developers comes from various backgrounds, with different but compatible
needs for the proposed project.
+ 
+ 

---------------------------------------------------------------------
To unsubscribe, e-mail: cvs-unsubscribe@incubator.apache.org
For additional commands, e-mail: cvs-help@incubator.apache.org


Mime
View raw message