Return-Path: Delivered-To: apmail-commons-dev-archive@www.apache.org Received: (qmail 45655 invoked from network); 29 Dec 2008 11:02:54 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 29 Dec 2008 11:02:54 -0000 Received: (qmail 5666 invoked by uid 500); 29 Dec 2008 11:02:53 -0000 Delivered-To: apmail-commons-dev-archive@commons.apache.org Received: (qmail 5591 invoked by uid 500); 29 Dec 2008 11:02:52 -0000 Mailing-List: contact dev-help@commons.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: "Commons Developers List" Delivered-To: mailing list dev@commons.apache.org Received: (qmail 5580 invoked by uid 99); 29 Dec 2008 11:02:52 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 29 Dec 2008 03:02:52 -0800 X-ASF-Spam-Status: No, hits=-0.0 required=10.0 tests=SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: local policy) Received: from [134.96.191.147] (HELO smtp.dfki.de) (134.96.191.147) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 29 Dec 2008 11:02:42 +0000 Received: from smtp.dfki.de (localhost [127.0.0.1]) by oelhorst (Postfix) with ESMTP id D0AFF3172B; Mon, 29 Dec 2008 11:03:54 +0100 (CET) Received: from mail.dfki.de (lnv-104.sb.dfki.de [134.96.191.146]) by smtp.dfki.de (Postfix) with ESMTP id C0CDC316B0; Mon, 29 Dec 2008 11:03:54 +0100 (CET) Received: from bledsoe.dfki.uni-sb.de (bledsoe.dfki.uni-sb.de [134.96.184.151]) (using TLSv1 with cipher AES128-SHA (128/128 bits)) (No client certificate requested) by mail.dfki.de (Postfix) with ESMTPSA id A90683115D; Mon, 29 Dec 2008 11:03:54 +0100 (CET) Cc: jukka.zitting@gmail.com Message-Id: <456BCC8B-24A6-4205-B0B0-BB1BFC803A5D@activemath.org> From: Paul Libbrecht To: "Commons Developers List" In-Reply-To: <31cc37360812281956v7c6124e9ycb131c45f768d4b5@mail.gmail.com> Content-Type: text/plain; charset=ISO-8859-1; format=flowed; delsp=yes Content-Transfer-Encoding: quoted-printable Mime-Version: 1.0 (Apple Message framework v930.3) Subject: Re: Proposal: Commons SAX Date: Mon, 29 Dec 2008 12:02:19 +0100 References: <510143ac0812170509n1725d005x5bd77aeba872f222@mail.gmail.com> <31cc37360812281956v7c6124e9ycb131c45f768d4b5@mail.gmail.com> X-Mailer: Apple Mail (2.930.3) X-Virus-Checked: Checked by ClamAV on apache.org It may be worth quoting that: - it's livable to be registered to commons-dev without filter (you =20 need a swift delete key) - the filter practice is deep anchored until the netiquette for both =20 commons-dev and commons-user so that mails without the project prefix =20= are very quickly warned about and corrected paul Le 29-d=E9c.-08 =E0 04:56, Henri Yandell a =E9crit : > I don't see why it's not a good fit for Commons. We've a few xml > related components in the dormant section. I think a component with > things for both SAX and DOM would be fine, and letting it grow > organically would also work well. > > Hen > > On Wed, Dec 17, 2008 at 5:09 AM, Jukka Zitting = > wrote: >> Hi, >> >> In the Apache Tika project [1] we use SAX quite a lot, and have >> written a set of quite useful general utility classes for SAX >> handling. >> >> For example, in org.apache.tika.sax [2] we have the following: >> >> * ContentHandlerDecorator - Convenient base class for writing >> ContentHandler decorators >> * EmbeddedContentHandler - Decorator that blocks startDocument() and >> endDocument() calls >> * TeeContentHandler - Forwards SAX events to multiple handlers >> * TextContentHandler - Decorator that blocks everything but character >> events (and start/endDocument) >> * WriteOutContentHandler - Writes the contents of all character =20 >> events >> to a Writer >> >> In org.apache.tika.sax.xpath [3] we have a simple XPath subset >> implementation that supports streaming and filtering of SAX events. =20= >> In >> other words, the implementation doesn't need a DOM tree to evaluate >> XPath statements. >> >> I believe this code would be useful also outside Tika, and I was >> thinking that it might perhaps make sense to create a Commons project >> for this. I also know of some SAX processing classes in Cocoon and >> Jackrabbit that could well be of interest to a wider audience. >> >> Do you think something like this would be interesting as a Commons >> project? Are there other similar efforts that I should know of? I >> looked at XML Commons in xml.apache.org, but it seems pretty dormant. >> >> [1] http://lucene.apache.org/tika/ >> [2] = http://lucene.apache.org/tika/apidocs/org/apache/tika/sax/package-summary.= html >> [3] = http://lucene.apache.org/tika/apidocs/org/apache/tika/sax/xpath/package-su= mmary.html >> >> BR, >> >> Jukka Zitting >> >> --------------------------------------------------------------------- >> To unsubscribe, e-mail: dev-unsubscribe@commons.apache.org >> For additional commands, e-mail: dev-help@commons.apache.org >> >> > > --------------------------------------------------------------------- > To unsubscribe, e-mail: dev-unsubscribe@commons.apache.org > For additional commands, e-mail: dev-help@commons.apache.org > --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscribe@commons.apache.org For additional commands, e-mail: dev-help@commons.apache.org