incubator-ooo-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Dennis E. Hamilton" <dennis.hamil...@acm.org>
Subject RE: RegExp replacement (was Re: Some more strange files in the OOo code)
Date Thu, 23 Jun 2011 23:15:39 GMT
+1

That would certainly be a clean choice for those regular expressions that show up in the OpenDocument
format of ODF 1.2 documents.  It is an easy way to document the implementation-dependent choice.

The ICU License is on this page <http://userguide.icu-project.org/intro>.  It is on
the BSD model.

 - Dennis

PS: And welcome, Eike, it is great to see you here. 



-----Original Message-----
From: Eike Rathke [mailto:ooo@erack.de] 
Sent: Thursday, June 23, 2011 15:36
To: ooo-dev@incubator.apache.org
Subject: Re: RegExp replacement (was Re: Some more strange files in the OOo code)

Hi Pedro,

On Thursday, 2011-06-23 12:39:49 -0700, Pedro F. Giffuni wrote:

> OpenOffice has a C++ interface to GNU regex so Google's
> RE2 seemed like a natural fit there. Unfortunately I see
> TextSearch::RESrchBkwrd in textsearch.cxx so I assume we
> need backreferences. The Re2 website says:
> 
> "If you absolutely need backreferences and generalized
> assertions, then RE2 is not for you, but you might be
> interested in irregexp, Google Chrome's regular expression
> engine."

I strongly propose to go for ICU's RE instead. OOo already makes heavy
use of ICU, the ICU REs support Unicode conforming to TR18
http://www.unicode.org/reports/tr18/ and seem to have all we need. See
http://userguide.icu-project.org/strings/regexp

  Eike

-- 
 PGP/OpenPGP/GnuPG encrypted mail preferred in all private communication.
 Key ID: 0x293C05FD - 997A 4C60 CE41 0149 0DB3  9E96 2F1A D073 293C 05FD


Mime
View raw message