forrest-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Dave Brondsema <d...@brondsema.net>
Subject Re: BROKEN: UTFDataFormatException: String cannot be longer than 32k.
Date Wed, 01 Oct 2003 14:10:20 GMT
Quoting "Adam R. B. Jack" <ajack@trysybase.com>: 
 
> I am making some fun progress with Gump & Forrest: 
>  
> http://lsd.student.utwente.nl/gump/ 
> http://lsd.student.utwente.nl/gump/todos.html 
>  
> and will continue to work on configuration stuff. 
>  
> I am, however, getting this error: 
>  
> BROKEN: UTFDataFormatException: String cannot be longer than 32k. 
>  
> .. and Forrest is (eventually) exiting with a '1'. 
>  
> I get lines like: 
>  
> X [0] avalon/update/update_avalon.html	BROKEN: 
UTFDataFormatException: 
> String cannot be longer than 32k. 
>  
> but I still see: 
>  
> http://lsd.student.utwente.nl/gump/avalon/update/update_avalon.html 
>  
> For the full output, see: 
>  
> http://lsd.student.utwente.nl/gump/forrest.txt 
> http://lsd.student.utwente.nl/gump/gumpy.html 
>  
> Is this as simple as a source or a table or something can't be that long. Is 
> it failing, or working? Ought I be changing what I do (please say no :-) or 
> filling a bug report or ???? 
>  
 
I have had this problem with large .txt files being transformed.  IIRC, the 
error actually comes from the XML parser, not forrest or even cocoon.  I did 
some googling a while back and found a patch (I didn't test it) but it looked 
like an ugly hack that just broke the string up into parts and put it back 
together again. 
 
It looks like Jeff has filed this already as a cocoon bug. See 
http://nagoya.apache.org/bugzilla/show_bug.cgi?id=23299 
 
--  
Dave Brondsema 
dave@brondsema.net 
http://www.brondsema.net - personal 
http://www.splike.com - programming 

Mime
View raw message