james-server-dev mailing list archives

From "Stefano Bagnara (JIRA)" <server-...@james.apache.org>
Subject [jira] Commented: (JAMES-387) Exception in BayesianAnalysis
Date Sun, 20 Nov 2005 22:20:29 GMT
    [ http://issues.apache.org/jira/browse/JAMES-387?page=comments#action_12358105 ] 

Stefano Bagnara commented on JAMES-387:
---------------------------------------

I increased the total memory for my "personal" James instance to 800MB. It only handles my own
mail (around 1000 messages per day), but it still stops checking my messages.

Here is the exception:

20/11/05 00:14:57 INFO  James.Mailet: BayesianAnalysis: Exception: java.lang.Integer
java.lang.ClassCastException: java.lang.Integer
        at org.apache.james.util.BayesianAnalyzer.getTokenProbabilityStrengths(BayesianAnalyzer.java:591)
        at org.apache.james.util.BayesianAnalyzer.computeSpamProbability(BayesianAnalyzer.java:340)
        at org.apache.james.transport.mailets.BayesianAnalysis.service(BayesianAnalysis.java:289)
        at org.apache.james.transport.LinearProcessor.service(LinearProcessor.java:407)
        at org.apache.james.transport.JamesSpoolManager.process(JamesSpoolManager.java:460)
        at org.apache.james.transport.JamesSpoolManager.run(JamesSpoolManager.java:369)
        at java.lang.Thread.run(Unknown Source)

If I restart James it works for almost a day and then it breaks again.

My bayesiananalysis_spam table contains 853685 rows, while the ham table contains 21253.

I configured James to automatically feed spam and ham from the messages I recognize as such, so
that the Bayesian filter can improve. Maybe my continuous feeding is not good for the Bayesian mailet.
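
For reference, on JVMs of that era the text after "ClassCastException:" is typically the runtime class of the object that was being cast, so the trace above suggests that something which was really an Integer was cast to another type. The sketch below only illustrates that general failure mode. It assumes, without having been checked against BayesianAnalyzer.java, that a token map could temporarily hold raw Integer counts where a reader expects Double probabilities, for example while new spam/ham is being fed. The class MixedCorpusDemo, the method readProbability and the 0.5 value are all hypothetical.

```java
import java.util.HashMap;
import java.util.Map;

public class MixedCorpusDemo {

    // Hypothetical reader: expects every value in the map to be a Double
    // probability, and reports the exception class name if the cast fails.
    static String readProbability(Map<String, Object> corpus, String token) {
        try {
            Double p = (Double) corpus.get(token);
            return "probability=" + p;
        } catch (ClassCastException e) {
            return e.getClass().getName();
        }
    }

    public static void main(String[] args) {
        Map<String, Object> corpus = new HashMap<>();

        // Assumption: a raw occurrence count that has not yet been
        // converted to a probability (42771 is the FONT count quoted
        // in the issue description).
        corpus.put("FONT", Integer.valueOf(42771));

        // Assumption: an entry that has already been converted.
        // 0.5 is an arbitrary placeholder, not a measured value.
        corpus.put("the", Double.valueOf(0.5));

        System.out.println(readProbability(corpus, "FONT"));
        System.out.println(readProbability(corpus, "the"));
    }
}
```

If something like this were the cause, it would fit the symptom that the exceptions depend on the state of the spam/ham data rather than on any particular message, but that is a guess and not a diagnosis.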

Any ideas?
Who is using these Matchers/Mailets? What are your spam/ham sizes? How often do you feed ham/spam?
How much memory have you reserved for James? How many messages go through the Bayesian mailets?


> Exception in BayesianAnalysis
> -----------------------------
>
>          Key: JAMES-387
>          URL: http://issues.apache.org/jira/browse/JAMES-387
>      Project: James
>         Type: Bug
>   Components: Matchers/Mailets (bundled)
>     Versions: 3.0
>  Environment: James from svn-trunk 2005-08-01.
> MySQL 4.0
>     Reporter: Stefano Bagnara
>     Assignee: Vincenzo Gianferrari Pini
>     Priority: Minor
>
> Got this exception for every incoming mail:
> 02/08/05 00:39:25 INFO  James.Mailet: BayesianAnalysis: Exception: java.lang.Integer
> java.lang.ClassCastException: java.lang.Integer
>         at org.apache.james.util.BayesianAnalyzer.getTokenProbabilityStrengths(BayesianAnalyzer.java:591)
>         at org.apache.james.util.BayesianAnalyzer.computeSpamProbability(BayesianAnalyzer.java:340)
>         at org.apache.james.transport.mailets.BayesianAnalysis.service(BayesianAnalysis.java:289)
>         at org.apache.james.transport.LinearProcessor.service(LinearProcessor.java:407)
>         at org.apache.james.transport.JamesSpoolManager.process(JamesSpoolManager.java:460)
>         at org.apache.james.transport.JamesSpoolManager.run(JamesSpoolManager.java:369)
>         at java.lang.Thread.run(Unknown Source)
> If I clean my spam/ham db the exceptions disappear, but they start again when the spam/ham db
> becomes large.
> My bayesiananalysis_spam contains 200000 rows.
> The following are the spam tokens with the highest "occurrences".
> +---------------------------+-------------+
> | token                     | occurrences |
> +---------------------------+-------------+
> | 3D                        |       82151 |
> | a                         |       59953 |
> | the                       |       45295 |
> | FONT                      |       42771 |
> | Content-Type              |       39058 |
> | to                        |       36626 |
> | com                       |       32902 |
> | http                      |       32886 |
> | of                        |       32504 |
> | font                      |       31803 |
> | and                       |       31577 |
> | Content-Transfer-Encoding |       31576 |
> | p                         |       29746 |
> | text                      |       29482 |
> | in                        |       29418 |
> | it                        |       28498 |
> | br                        |       28037 |
> | DIV                       |       27431 |

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators:
   http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see:
   http://www.atlassian.com/software/jira



