commons-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Paul Libbrecht <p...@activemath.org>
Subject Re: Proposal: Commons SAX
Date Mon, 29 Dec 2008 11:02:19 GMT
It may be worth quoting that:
- it's livable to be registered to commons-dev without filter (you  
need a swift delete key)
- the filter practice is deep anchored until the netiquette for both  
commons-dev and commons-user so that mails without the project prefix  
are very quickly warned about and corrected

paul

Le 29-déc.-08 à 04:56, Henri Yandell a écrit :

> I don't see why it's not a good fit for Commons. We've a few xml
> related components in the dormant section. I think a component with
> things for both SAX and DOM would be fine, and letting it grow
> organically would also work well.
>
> Hen
>
> On Wed, Dec 17, 2008 at 5:09 AM, Jukka Zitting <jukka.zitting@gmail.com 
> > wrote:
>> Hi,
>>
>> In the Apache Tika project [1] we use SAX quite a lot, and have
>> written a set of quite useful general utility classes for SAX
>> handling.
>>
>> For example, in org.apache.tika.sax [2] we have the following:
>>
>> * ContentHandlerDecorator - Convenient base class for writing
>> ContentHandler decorators
>> * EmbeddedContentHandler - Decorator that blocks startDocument() and
>> endDocument() calls
>> * TeeContentHandler - Forwards SAX events to multiple handlers
>> * TextContentHandler - Decorator that blocks everything but character
>> events (and start/endDocument)
>> * WriteOutContentHandler - Writes the contents of all character  
>> events
>> to a Writer
>>
>> In org.apache.tika.sax.xpath [3] we have a simple XPath subset
>> implementation that supports streaming and filtering of SAX events.  
>> In
>> other words, the implementation doesn't need a DOM tree to evaluate
>> XPath statements.
>>
>> I believe this code would be useful also outside Tika, and I was
>> thinking that it might perhaps make sense to create a Commons project
>> for this. I also know of some SAX processing classes in Cocoon and
>> Jackrabbit that could well be of interest to a wider audience.
>>
>> Do you think something like this would be interesting as a Commons
>> project? Are there other similar efforts that I should know of? I
>> looked at XML Commons in xml.apache.org, but it seems pretty dormant.
>>
>> [1] http://lucene.apache.org/tika/
>> [2] http://lucene.apache.org/tika/apidocs/org/apache/tika/sax/package-summary.html
>> [3] http://lucene.apache.org/tika/apidocs/org/apache/tika/sax/xpath/package-summary.html
>>
>> BR,
>>
>> Jukka Zitting
>>
>> ---------------------------------------------------------------------
>> To unsubscribe, e-mail: dev-unsubscribe@commons.apache.org
>> For additional commands, e-mail: dev-help@commons.apache.org
>>
>>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: dev-unsubscribe@commons.apache.org
> For additional commands, e-mail: dev-help@commons.apache.org
>


---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@commons.apache.org
For additional commands, e-mail: dev-help@commons.apache.org


Mime
View raw message