incubator-ooo-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Eike Rathke <...@erack.de>
Subject Re: RegExp replacement (was Re: Some more strange files in the OOo code)
Date Thu, 23 Jun 2011 22:35:45 GMT
Hi Pedro,

On Thursday, 2011-06-23 12:39:49 -0700, Pedro F. Giffuni wrote:

> OpenOffice has a C++ interface to GNU regex so Google's
> RE2 seemed like a natural fit there. Unfortunately I see
> TextSearch::RESrchBkwrd in textsearch.cxx so I assume we
> need backreferences. The Re2 website says:
> 
> "If you absolutely need backreferences and generalized
> assertions, then RE2 is not for you, but you might be
> interested in irregexp, Google Chrome's regular expression
> engine."

I strongly propose to go for ICU's RE instead. OOo already makes heavy
use of ICU, the ICU REs support Unicode conforming to TR18
http://www.unicode.org/reports/tr18/ and seem to have all we need. See
http://userguide.icu-project.org/strings/regexp

  Eike

-- 
 PGP/OpenPGP/GnuPG encrypted mail preferred in all private communication.
 Key ID: 0x293C05FD - 997A 4C60 CE41 0149 0DB3  9E96 2F1A D073 293C 05FD

Mime
View raw message