cocoon-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Mike Pogue <mpo...@apache.org>
Subject regexp in Java [was: Re: LGPL + APL = not good!]
Date Mon, 17 Apr 2000 19:54:45 GMT
FYI -- the regexp code should be in Xerces-J at this point.
It is Perl-regex-centric, as per the XML Schema draft.

I am talking to some people about taking it all the way
to the Unicode regexp (in Open Source, of course!).]

Mike

Michael French wrote:
> 
> > -----Original Message-----
> > From: Stefano Mazzocchi [mailto:stefano@apache.org]
> > Sent: Sunday, April 16, 2000 6:21 PM
> > To: Cocoon
> > Subject: LGPL + APL = not good!
> 
> > So, we should consider LGPL out of our radar.
> >
> > Specifially for our needs, this doesn't change anything:
> >
> >  1) complete regexp are _NOT_ going to be used for the sitemap.
> > What Pier is referring to are not "perl-like regular expressions"
> > but sort of "wildcards on steroyds" using * ? and **.
> > The reason is dead simple: if you need regexps to match your
> > URI space complexity, your URI partition
> > is stupid. Using regexp make the good old mod_rewrite hack
> > hunt us back.
> 
> I agree with the design point.
> 
> But if regexp is needed somewhere,
> then it should use the Unicode regexp:
> 
>         http://www.unicode.org/unicode/reports/tr18/
> 
> The sitemap is XML, and it would need a 'pattern' facet
> to have an XML Schema definition.
> 
>         http://www.w3.org/TR/xmlschema-0/#regexAppendix
>         http://www.w3.org/TR/xmlschema-2/#regexs
> 
> There will be open source code for these regexps, either in Xerces,
> or from a W3C validator, or from the ICU project
> (currently regexp is down as a 'future'):
> 
>         http://oss.software.ibm.com/icu/develop/tasks.html
> 
> Mike

Mime
View raw message