Subject: Fwd: Re: Problem with 72_scores.cf generation
From: "Kevin A. McGrail"
To: sysadmins@spamassassin.apache.org
Date: Tue, 4 Jul 2017 23:54:11 -0400
Message-ID: <0c36cf34-a139-bd39-284d-2e73e09c3c17@mcgrail.com>
In-Reply-To: <52cd20a1-c03c-8607-6766-d5b2fdb8b018@mcgrail.com>

Resending without the screenshot...

Hi John,

Thanks. We also spent Friday trying to identify what's going on, and I worked with CrashPlan to restore the data I had for the old Solaris box. It was in an old format because the Solaris client was deprecated, so it took a lot of juggling.

In any case, A) I will look at this information again, and B) I run CrashPlan on the ASF servers because ASF Infra doesn't do backups. I configured it to retain a pretty insane (unlimited) number of revisions as an anti-malware protection. This means you can go in and restore a specific version of 72_scores.cf.

(Screenshot removed: it was too large and the mailing list rejected it.)

I put the credentials for the ASF CrashPlan in SVN, and I'm happy to help walk you through things to see if I have a backup of the file you want to compare.

Regards,
KAM

On 7/4/2017 12:59 PM, Dave Jones wrote:
> Kevin,
>
> I have spent about 5 hours this morning trying to track down the
> 72_scores.cf generation problem. I haven't pinpointed the problem yet
> but here's what I have found so far:
>
> NOTE: su - automc for proper paths below.
>
> 1. ~/svn/masses/rule-update-score-gen/generate-new-scores.sh is the
> script in question
>
> https://svn.apache.org/viewvc/spamassassin/trunk/masses/rule-update-score-gen/generate-new-scores.sh?revision=1798589&view=markup
>
> Line 271 runs "runGA"
>
> 2. runGA creates
> ~/tmp/generate-new-scores/trunk-new-rules-set0/masses/gen-set0-5-5.0-6000-ga/scores
>
> This 'scores' file has 345 scores in it. I wish we had a copy of this
> file from mid March to see if it also had around the same number of
> scores, to confirm that runGA/garescorer is not the problem. However,
> this file is a temp file that used to be in /tmp, so it's probably not
> backed up anywhere and definitely not in SVN.
>
> 3. Back in generate-new-scores.sh at line 289, the
> "extract-new-scores" script creates scores-new from the scores file
> but excludes/culls out anything manually scored in 50_scores.cf.
>
> The culled scores-new file has the same 42 lines and ends at
> MILLION_USD just like our 72_scores.cf, so this is the smoking gun, but
> I haven't found what pulled the trigger yet. Something about this step
> is different from March 15th, when we had our last good 72_scores.cf.
>
>
> THINGS I HAVE CHECKED:
>
> At first I thought that 50_scores.cf had changed a lot, which would
> cause more exclusion/culling in 72_scores.cf, but that's not it. The
> revision log only shows a few minor changes in 50_scores.cf:
>
> https://svn.apache.org/viewvc/spamassassin/trunk/rules/50_scores.cf?view=log
>
> Next I looked at the garescorer, since it gets compiled from
> garescorer.c every run. The garescorer.c is identical in the backups
> and in SVN, so that's not it. Again, I wish I had a 'scores' file from
> mid March to compare against.
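
For comparing files around the culling step in item 3 above (345 scores in, 42 lines out), a rough Python sketch follows. It is not the real extract-new-scores script. It assumes each relevant line has the form "score RULE_NAME value [value ...]" and that "#" starts a comment; the helper names parse_scores, cull_manual, and missing_rules are hypothetical. The idea is to reproduce the "drop anything manually scored in 50_scores.cf" rule independently and to list which rule names a restored 72_scores.cf has that the current one lacks.

```python
def parse_scores(text):
    """Return {rule_name: [score values]} for every 'score' line in text.

    Assumption: lines look like 'score RULE_NAME 1.0 [more values]';
    anything after '#' is treated as a comment and ignored.
    """
    rules = {}
    for line in text.splitlines():
        tokens = line.split("#", 1)[0].split()
        if len(tokens) >= 3 and tokens[0] == "score":
            rules[tokens[1]] = tokens[2:]
    return rules


def cull_manual(ga_scores, manual_scores):
    """Keep only generated scores whose rule is not manually scored."""
    return {name: vals for name, vals in ga_scores.items()
            if name not in manual_scores}


def missing_rules(old, new):
    """Rule names present in the old file but absent from the new one."""
    return sorted(set(old) - set(new))


if __name__ == "__main__":
    # Tiny made-up inputs, only to show the shape of the comparison.
    ga = parse_scores("score RULE_A 1.5\nscore RULE_B 0.2 0.3\nscore RULE_C 2.0\n")
    manual = parse_scores("# hand-tuned\nscore RULE_B 1.0\n")
    culled = cull_manual(ga, manual)
    print(len(ga), "generated,", len(culled), "after culling")
    print("dropped by culling:", missing_rules(ga, culled))
```

Run against the real 'scores' file and 50_scores.cf, the culled count can be checked against the 42 lines in scores-new. Run against a restored March 72_scores.cf and the current one, missing_rules gives the names that disappeared.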