spamassassin-users mailing list archives

From: "Kevin A. McGrail" <kmcgr...@apache.org>
Subject: use bytes was Re: Non-ascii subjects with images
Date: Tue, 04 Sep 2018 15:04:16 GMT

On 9/4/2018 10:57 AM, RW wrote:
> My understanding is that for historic reasons there is heavy use of
> 'use bytes', so SpamAssassin sees text as a series of bytes in whatever
> character set it's written in. normalize_charset allows text to be
> converted to UTF-8, which makes it easier to match byte sequences, but a
> byte can't be an emoticon etc.  
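
To make the byte-versus-character distinction above concrete, here is a
minimal sketch in Python. It is only an analogy, not SpamAssassin's Perl
code: the sample subject and all variable names are made up for
illustration. It shows that a single emoticon is one character but four
UTF-8 bytes, so a byte-oriented match has to spell out the whole byte
sequence.

```python
import re

# U+1F600 GRINNING FACE, as a character string and as UTF-8 bytes.
# The subject text here is a hypothetical example.
emoji = "\U0001F600"
subject_chars = "Hello " + emoji
subject_bytes = subject_chars.encode("utf-8")

char_len = len(subject_chars)   # counts code points
byte_len = len(subject_bytes)   # counts bytes

# Character-level pattern: any single code point in the emoticons block.
char_pattern = re.compile("[\U0001F600-\U0001F64F]")

# Byte-level pattern: the exact 4-byte UTF-8 sequence for U+1F600.
byte_pattern = re.compile(re.escape(emoji.encode("utf-8")))

char_match = char_pattern.search(subject_chars) is not None
byte_match = byte_pattern.search(subject_bytes) is not None

# No individual byte of the subject decodes to the emoticon on its own.
single_byte_is_emoji = any(
    bytes([b]).decode("utf-8", "ignore") == emoji for b in subject_bytes
)

print(char_len, byte_len, char_match, byte_match, single_byte_is_emoji)
```

In this sketch the character-level pattern can use a code point range,
while the byte-level pattern only matches one specific sequence; covering
a whole range of emoticons at the byte level would need a more elaborate
pattern.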

3.4 has use bytes removed.  See
https://bz.apache.org/SpamAssassin/show_bug.cgi?id=7232

Mark had concerns about backporting it, though, so I was considering
reopening this issue.

Any comment on that?

-- 
Kevin A. McGrail
VP Fundraising, Apache Software Foundation
Chair Emeritus Apache SpamAssassin Project
https://www.linkedin.com/in/kmcgrail - 703.798.0171

