spamassassin-users mailing list archives

From RW <rwmailli...@googlemail.com>
Subject Re: Bayes underperforming, HTML entities?
Date Thu, 08 Nov 2018 15:06:29 GMT
On Wed, 7 Nov 2018 12:33:35 -0700
Amir Caspi wrote:

> In many cases, it would appear that these spams have either very
> little (real) text (besides the usual attempt at Bayes poisoning)
> and/or are using HTML-entity encoding to try to bypass Bayes.  Here
> are a couple of spamples:
> 
> https://pastebin.com/peiXZivJ
> https://pastebin.com/3h3r7r7j
> 
> Does SA decode HTML entities as part of normalize_charset?  If
> not ... can this be added?

Ordinarily yes, but these messages don't actually have a separate HTML
part because the MIME is broken: the separators don't match. Presumably
the raw HTML is being treated as plain text.
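
As a rough illustration of that failure mode, here is a minimal sketch
using Python's standard email package. It is not SpamAssassin's own
parser, so it only shows the general effect. The boundary names, the
build() and html_parts() helpers, and the sample body are all made up
for the example. When the boundary declared in the header never
appears in the body, the parser finds no text/html part, and the
entity-encoded markup is left sitting in an undecoded blob.

```python
import email
import html


def build(declared, used):
    """Build a tiny message; 'declared' goes in the header, 'used' in the body."""
    return (
        "MIME-Version: 1.0\n"
        f'Content-Type: multipart/alternative; boundary="{declared}"\n'
        "\n"
        f"--{used}\n"
        "Content-Type: text/html\n"
        "\n"
        "<p>&#99;&#104;&#101;&#97;&#112;</p>\n"
        f"--{used}--\n"
    )


def html_parts(raw):
    """Return every part that the parser recognises as text/html."""
    msg = email.message_from_string(raw)
    return [p for p in msg.walk() if p.get_content_type() == "text/html"]


good = build("sep", "sep")      # separators match
broken = build("sep", "other")  # separators do not match

print(len(html_parts(good)))    # one HTML part is found
print(len(html_parts(broken)))  # no HTML part is found

# In the broken case the whole body is one raw string, entities included.
raw_body = email.message_from_string(broken).get_payload()
print("&#99;" in raw_body)

# Entity decoding only helps once the text is treated as HTML.
print(html.unescape("&#99;&#104;&#101;&#97;&#112;"))
```

If SA handles the broken message in a similar way, the tokens Bayes
sees would be the literal entity strings rather than the decoded words.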

