Return-Path: Delivered-To: apmail-spamassassin-users-archive@www.apache.org Received: (qmail 9567 invoked from network); 4 Aug 2009 01:18:08 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 4 Aug 2009 01:18:08 -0000 Received: (qmail 613 invoked by uid 500); 4 Aug 2009 01:18:10 -0000 Delivered-To: apmail-spamassassin-users-archive@spamassassin.apache.org Received: (qmail 555 invoked by uid 500); 4 Aug 2009 01:18:09 -0000 Mailing-List: contact users-help@spamassassin.apache.org; run by ezmlm Precedence: bulk list-help: list-unsubscribe: List-Post: List-Id: Delivered-To: mailing list users@spamassassin.apache.org Received: (qmail 547 invoked by uid 99); 4 Aug 2009 01:18:09 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 04 Aug 2009 01:18:09 +0000 X-ASF-Spam-Status: No, hits=1.2 required=10.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: local policy) Received: from [218.101.54.17] (HELO mailsrv2.trimble.co.nz) (218.101.54.17) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 04 Aug 2009 01:17:59 +0000 Received: (qmail 22464 invoked by uid 500); 4 Aug 2009 13:17:31 +1200 Received: from crom.trimble.co.nz by mailsrv2.trimble.co.nz (envelope-from , uid 108) with qmail-scanner-2.07 (clamdscan: 0.95.2/9648. trophie: 8.700-1004/337/468118. sophie: 3.06/2.85.0/4.40. spamassassin: 3.2.5. Clear:RC:1(10.3.0.198):. Processed in 0.053697 secs); 04 Aug 2009 01:17:31 -0000 Received: from crom.trimble.co.nz (10.3.0.198) by mailsrv2.trimble.co.nz with (DHE-RSA-AES256-SHA encrypted) SMTP; 4 Aug 2009 13:17:31 +1200 Received: (qmail 18052 invoked by uid 501); 4 Aug 2009 13:17:31 +1200 Received: from 10.3.0.198 by crom.trimble.co.nz (envelope-from , uid 485) with qmail-scanner-2.07 (clamdscan: 0.95.2/9648. trophie: 8.700-1004/337/468118. sophie: 3.06/2.87.1/4.42. spamassassin: 3.2.5. Clear:RC:1(10.3.0.198):. Processed in 0.121134 secs); 04 Aug 2009 01:17:31 -0000 Received: from unknown (HELO crom.trimble.co.nz) (10.3.0.198) by crom.trimble.co.nz with SMTP; 4 Aug 2009 13:17:30 +1200 Message-ID: <4A778C2A.7010901@trimble.co.nz> Date: Tue, 04 Aug 2009 13:17:30 +1200 From: Jason Haar Organization: Trimble Navigation Ltd. User-Agent: Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.9.1b3pre) Gecko/20090513 Fedora/3.0-2.3.beta2.fc11 Thunderbird/3.0b2 MIME-Version: 1.0 To: users@spamassassin.apache.org Subject: large unicode email nails CPU Content-Type: multipart/alternative; boundary="------------000905090608000307070707" X-Virus-Checked: Checked by ClamAV on apache.org This is a multi-part message in MIME format. --------------000905090608000307070707 Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: 7bit Hi there We're got a few people subscribed to Serbian mailing-lists, and one in particular is having difficulty getting email to us - spamc/spamd times out and is never able to process the message. While it is running, spamd takes 100% of the CPU for 1.5+minutes. Here's an example: http://pastebin.com/m75f39d72 strace shows spamd running around looking for unicore/lib/gc_sc files - which is related to unicode "stuff". I don't know if that's the problem - but that's all I could find. "spamassassin -D " doesn't show anything strange other than massively long times to process DNSBLs. They are not believable: the "slow" DNSBLs change from invocation to invocation (of the same message), and "dig" shows no such issues - the DNSBLs SA says are taking 99sec to complete return instantly via dig (and yes, local caching DNS). Also, it is specifically a problem with these emails - in general we are not seeing any problems with any other email. We've also got SA in several countries, all on CentOS5 servers (perl-5.8.8,spamassassin-3.2.5-1) and they all show the same symptoms - so I don't think it's network related but rather CPU: basically these emails nail SA and it's slow to finish for them? Any ideas? Thanks! -- Cheers Jason Haar Information Security Manager, Trimble Navigation Ltd. Phone: +64 3 9635 377 Fax: +64 3 9635 417 PGP Fingerprint: 7A2E 0407 C9A6 CAF6 2B9F 8422 C063 5EBB FE1D 66D1 --------------000905090608000307070707 Content-Type: text/html; charset=UTF-8 Content-Transfer-Encoding: 7bit Hi there

We're got a few people subscribed to Serbian mailing-lists, and one in particular is having difficulty getting email to us - spamc/spamd times out and is never able to process the message. While it is running, spamd takes 100% of the CPU for 1.5+minutes.

Here's an example: http://pastebin.com/m75f39d72

strace shows spamd running around looking for unicore/lib/gc_sc files - which is related to unicode "stuff". I don't know if that's the problem - but that's all I could find. "spamassassin -D " doesn't show anything strange other than massively long times to process DNSBLs. They are not believable: the "slow" DNSBLs change from invocation to invocation (of the same message), and "dig" shows no such issues - the DNSBLs SA says are taking 99sec to complete return instantly via dig (and yes, local caching DNS). Also, it is specifically a problem with these emails - in general we are not seeing any problems with any other email.

We've also got SA in several countries, all on CentOS5 servers (perl-5.8.8,spamassassin-3.2.5-1) and they all show the same symptoms - so I don't think it's network related but rather CPU: basically these emails nail SA and it's slow to finish for them?

Any ideas? Thanks!

-- 
Cheers

Jason Haar
Information Security Manager, Trimble Navigation Ltd.
Phone: +64 3 9635 377 Fax: +64 3 9635 417
PGP Fingerprint: 7A2E 0407 C9A6 CAF6 2B9F 8422 C063 5EBB FE1D 66D1
--------------000905090608000307070707--