commons-dev mailing list archives

From Jeremias Maerki <...@jeremias-maerki.ch>
Subject Re: Proposal: Commons SAX
Date Thu, 18 Dec 2008 08:25:53 GMT
Oversight of and responsibility for XML Commons was transferred to the
Xerces PMC in November 2006. The XML project itself is slated to be
shut down as soon as the last open issues (AxKit and Xindice) have been
dealt with. Xerces currently uses XML Commons only to maintain the XML
APIs; not much else is going on there. So if this is not a good fit for
Apache Commons, we'd have to talk to the Xerces people.

On 18.12.2008 07:40:26 Martin Cooper wrote:
> "I looked at XML Commons in xml.apache.org, but it seems pretty dormant."
> 
> Perhaps this would be an opportunity to re-vitalize it? I really dislike the
> implication that Commons should accept a new project simply because a more
> appropriate parent project wasn't paying attention.
> 
> --
> Martin Cooper
> 
> 
> On Wed, Dec 17, 2008 at 5:09 AM, Jukka Zitting <jukka.zitting@gmail.com> wrote:
> 
> > Hi,
> >
> > In the Apache Tika project [1] we use SAX quite a lot, and have
> > written a set of quite useful general utility classes for SAX
> > handling.
> >
> > For example, in org.apache.tika.sax [2] we have the following:
> >
> > * ContentHandlerDecorator - Convenient base class for writing
> > ContentHandler decorators
> > * EmbeddedContentHandler - Decorator that blocks startDocument() and
> > endDocument() calls
> > * TeeContentHandler - Forwards SAX events to multiple handlers
> > * TextContentHandler - Decorator that blocks everything but character
> > events (and start/endDocument)
> > * WriteOutContentHandler - Writes the contents of all character events
> > to a Writer
> >
> > In org.apache.tika.sax.xpath [3] we have a simple XPath subset
> > implementation that supports streaming and filtering of SAX events. In
> > other words, the implementation doesn't need a DOM tree to evaluate
> > XPath statements.
> >
> > I believe this code would be useful also outside Tika, and I was
> > thinking that it might perhaps make sense to create a Commons project
> > for this. I also know of some SAX processing classes in Cocoon and
> > Jackrabbit that could well be of interest to a wider audience.
> >
> > Do you think something like this would be interesting as a Commons
> > project? Are there other similar efforts that I should know of? I
> > looked at XML Commons in xml.apache.org, but it seems pretty dormant.
> >
> > [1] http://lucene.apache.org/tika/
> > [2]
> > http://lucene.apache.org/tika/apidocs/org/apache/tika/sax/package-summary.html
> > [3]
> > http://lucene.apache.org/tika/apidocs/org/apache/tika/sax/xpath/package-summary.html
> >
> > BR,
> >
> > Jukka Zitting
> >



Jeremias Maerki



