xml-commons-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Michael Glavassevich <mrgla...@ca.ibm.com>
Subject Re: Fwd: Proposal: Commons SAX
Date Fri, 19 Dec 2008 04:16:49 GMT

Hi Jukka,

It sounds like it might be a good match. I think XML Commons always hoped
to add common XML utilities to the project.

I am curious though about what kind of community you were hoping would
build around this code. You should know that XML Commons has fairly low
activity [1] with only two components (XML Commons External [aka XML APIs]
and the XML Commons Resolver) that are being maintained by a few
committers. We do update them occasionally. For example this summer we
upgraded XML Commons External to JAXP 1.4 just prior to starting an
implementation of the new APIs in Xerces. Otherwise things are usually
fairly quiet.


[1] http://www.ohloh.net/p/xml-commons/contributors

Michael Glavassevich
XML Parser Development
IBM Toronto Lab
E-mail: mrglavas@ca.ibm.com
E-mail: mrglavas@apache.org

"Jukka Zitting" <jukka.zitting@gmail.com> wrote on 12/18/2008 08:14:21 AM:

> Hi,
> See below for a proposal I made at the Apache Commons project. I was
> pointed to the Xerces project as the current home of the XML Commons
> subproject, so I wanted to check with you on your thoughts about the
> matter. Would this (a library of SAX utilities) be a good match for
> the existing XML Commons project?
> BR,
> Jukka Zitting
> ---------- Forwarded message ----------
> From: Jukka Zitting <jukka.zitting@gmail.com>
> Date: Wed, Dec 17, 2008 at 2:09 PM
> Subject: Proposal: Commons SAX
> To: Jakarta Commons Developers List <dev@commons.apache.org>
> Hi,
> In the Apache Tika project [1] we use SAX quite a lot, and have
> written a set of quite useful general utility classes for SAX
> handling.
> For example, in org.apache.tika.sax [2] we have the following:
> * ContentHandlerDecorator - Convenient base class for writing
> ContentHandler decorators
> * EmbeddedContentHandler - Decorator that blocks startDocument() and
> endDocument() calls
> * TeeContentHandler - Forwards SAX events to multiple handlers
> * TextContentHandler - Decorator that blocks everything but character
> events (and start/endDocument)
> * WriteOutContentHandler - Writes the contents of all character events
> to a Writer
> In org.apache.tika.sax.xpath [3] we have a simple XPath subset
> implementation that supports streaming and filtering of SAX events. In
> other words, the implementation doesn't need a DOM tree to evaluate
> XPath statements.
> I believe this code would be useful also outside Tika, and I was
> thinking that it might perhaps make sense to create a Commons project
> for this. I also know of some SAX processing classes in Cocoon and
> Jackrabbit that could well be of interest to a wider audience.
> Do you think something like this would be interesting as a Commons
> project? Are there other similar efforts that I should know of? I
> looked at XML Commons in xml.apache.org, but it seems pretty dormant.
> [1] http://lucene.apache.org/tika/
> [2] http://lucene.apache.
> org/tika/apidocs/org/apache/tika/sax/package-summary.html
> [3] http://lucene.apache.
> org/tika/apidocs/org/apache/tika/sax/xpath/package-summary.html
> BR,
> Jukka Zitting
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: j-dev-unsubscribe@xerces.apache.org
> For additional commands, e-mail: j-dev-help@xerces.apache.org
View raw message