Return-Path: Delivered-To: apmail-xml-cocoon-dev-archive@xml.apache.org Received: (qmail 4930 invoked by uid 500); 31 Mar 2003 22:52:56 -0000 Mailing-List: contact cocoon-dev-help@xml.apache.org; run by ezmlm Precedence: bulk list-help: list-unsubscribe: list-post: Reply-To: cocoon-dev@xml.apache.org Delivered-To: mailing list cocoon-dev@xml.apache.org Received: (qmail 4917 invoked from network); 31 Mar 2003 22:52:56 -0000 Received: from natint.juniper.net (HELO merlot.juniper.net) (207.17.136.129) by daedalus.apache.org with SMTP; 31 Mar 2003 22:52:56 -0000 Received: from maroon.jnpr.net (maroon.jnpr.net [172.24.244.36]) by merlot.juniper.net (8.11.3/8.11.3) with ESMTP id h2VMr2S05131; Mon, 31 Mar 2003 14:53:02 -0800 (PST) (envelope-from rjohnst@juniper.net) Received: (from rjohnst@localhost) by maroon.jnpr.net (8.11.6/8.11.3) id h2VMr2u34170; Mon, 31 Mar 2003 14:53:02 -0800 (PST) (envelope-from rjohnst) Date: Mon, 31 Mar 2003 14:53:02 -0800 From: Rob Johnston To: cocoon-dev@xml.apache.org Subject: Re: Question about TextRecorder in the SAXTransformer Message-ID: <20030331225302.GJ89266@juniper.net> References: <20030330171843.GQ26892@juniper.net> Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: User-Agent: Mutt/1.4i X-Spam-Rating: daedalus.apache.org 1.6.2 0/1000/N Is the bug the fact that the data is being passed in two [characters] events or that the .trim() is in TextRecorder.java? I can nuke the .trim() locally but don't want to hose anything down the line. I just want to 'fix' it here locally since I'm on a time constraint, but I will file a PR for you all also. thanks, rob On Mon, 31 Mar 2003, Carsten Ziegeler wrote: > Hi Rob, > > this is a bug! > > Carsten > > > -----Original Message----- > > From: Rob Johnston [mailto:rjohnst@juniper.net] > > Sent: Sunday, March 30, 2003 7:19 PM > > To: cocoon-dev@xml.apache.org > > Subject: Question about TextRecorder in the SAXTransformer > > > > > > In the TextRecorder.java, I see the following: > > > > public void characters(char ary[], int start, int length) > > throws SAXException { > > String value = new String(ary, start, length).trim(); > > if (value.length() > 0) { > > buffer.append(value); > > } else { > > buffer.append(' '); > > } > > } > > > > I am wondering what the logic behind having the trim() in there is. > > Several times I have noticed Cocoon "eating" spaces out of the stream as > > evidenced in this log: > > > > INFO 2003-03-28 16:05:14.728 [sitemap.transformer.log ] (): > > [startElement] > > uri=http://xml.juniper.net/cocoon/work-schedule/1.0,local=start,ra > > w=sched:start > > DEBUG 2003-03-28 16:05:14.729 [sitemap.transformer.sche] (): > > BEGIN startTextRecording > > DEBUG 2003-03-28 16:05:14.730 [sitemap.transformer.sche] (): > > END startTextRecording > > INFO 2003-03-28 16:05:14.731 [sitemap.transformer.log ] (): > > [characters] 03/28/2003 > > INFO 2003-03-28 16:05:14.733 [sitemap.transformer.log ] (): > > [characters] 04:05 PM > > INFO 2003-03-28 16:05:14.734 [sitemap.transformer.log ] (): > > [endElement] > > uri=http://xml.juniper.net/cocoon/work-schedule/1.0,local=start,qn > > ame=sched:start > > DEBUG 2003-03-28 16:05:14.735 [sitemap.transformer.sche] (): > > BEGIN endTextRecording > > DEBUG 2003-03-28 16:05:14.736 [sitemap.transformer.sche] (): > > END endTextRecording text=03/28/200304:05 PM > > INFO 2003-03-28 16:05:14.737 [sitemap.transformer.log ] (): > > [endElement] > > uri=http://xml.juniper.net/cocoon/work-schedule/1.0,local=every,qn > > ame=sched:every > > ERROR 2003-03-28 16:05:14.739 [sitemap.transformer.sche] (): > > URLWorkScheduleTransformer: invalid time: 03/28/200304:05 PM > > > > 03/28/2003 04:05 PM > > > > is what is supposed to be being passed to the next transformer, but > > instead, it is showing up in two different [characters] events, and > > because of the trim(), it ends up being put back together as > > "03/28/200304:05 PM" which is invalid. > > > > I was just wondering why the trim() is in there, or if what I am seeing > > is expected behavior. > > > > thanks, > > rob