spamassassin-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From bugzilla-dae...@bugzilla.spamassassin.org
Subject [Bug 6149] New: False positives for ISO-2022-JP (Japanese)
Date Tue, 07 Jul 2009 15:38:13 GMT
https://issues.apache.org/SpamAssassin/show_bug.cgi?id=6149

           Summary: False positives for ISO-2022-JP (Japanese)
           Product: Spamassassin
           Version: 3.2.5
          Platform: All
        OS/Version: All
            Status: NEW
          Severity: normal
          Priority: P2
         Component: Score Generation
        AssignedTo: dev@spamassassin.apache.org
        ReportedBy: thomas@e-interwave.com


High rate of false positives occur for ISO-2022-JP encoded e-mails.  This is a
very critical problem for all Japanese users.

As examples, merely having any one of the following 3 legitimate lines of text
in the Subject will trigger PLING_QUERY (2.2) and TVD_SPACE_RATIO (2.9),
flagging the e-mail as SPAM:

【麻生内閣メールマガジン 第36号】安心・活力・責任(2009/06/25)

コンピュータサービスの件

[物件内見]メールフォームからメールが送信されました

In the e-mail's Body, certain legitimate text may also trigger WEIRD_QUOTING
(2.8), OBSCURED_EMAIL (1.9), or other conditions.  Examples will be supplied
upon request.

Such false positives DO NOT OCCUR for utf-8 encoding using the exact same text.

(This problem may be due to double-byte-character processing.  If so, other
double-byte-character sets should be checked for the same problem, such as Big5
in Chinese.)

-- 
Configure bugmail: https://issues.apache.org/SpamAssassin/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug.
Mime
View raw message