james-mime4j-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "ASF GitHub Bot (JIRA)" <mime4j-...@james.apache.org>
Subject [jira] [Commented] (MIME4J-218) Content-Type Fallback Character Set
Date Sat, 01 Nov 2014 10:49:33 GMT

    [ https://issues.apache.org/jira/browse/MIME4J-218?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14193077#comment-14193077

ASF GitHub Bot commented on MIME4J-218:

GitHub user WolfgangFahl reopened a pull request:


    fix for MIME4J-218

    fixes https://issues.apache.org/jira/browse/MIME4J-218

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/WolfgangFahl/james-mime4j trunk

Alternatively you can review and apply these changes as the patch at:


To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #1
commit a8cf35f49a06292f9ac60e29dfb8019bcddcbda8
Author: Wolfgang Fahl <wf@bitplan.com>
Date:   2014-10-31T20:32:45Z

    fixes license header

commit bbd9483e84421ff4744bde29b6196151696f68be
Author: Wolfgang Fahl <wf@bitplan.com>
Date:   2014-11-01T10:43:03Z

    improves defaultCharset handling and messages for limit exceptions with
    details about limit sizes


> Content-Type Fallback Character Set
> -----------------------------------
>                 Key: MIME4J-218
>                 URL: https://issues.apache.org/jira/browse/MIME4J-218
>             Project: James Mime4j
>          Issue Type: Bug
>    Affects Versions: 0.7.2
>            Reporter: Rickard Ekeroth
>             Fix For: 0.8.0
> Would it be possible to add a feature that would allow for specifying a fallback character
set to use when the character set in a 'Content-Type' header is not recognized by Java? In
the old 0.6.2 version, that we used before, the character set 'ISO-8859-1' was used as a fallback
but in the 0.7.2 version an UnsupportedEncodingException is thrown when the parser encounters
an unknown character set in a Content-Type header.
> Here is the relevant part of the exception stack trace:
> Caused by: java.io.UnsupportedEncodingException: x-user-defined
> at sun.nio.cs.StreamDecoder.forInputStreamReader(StreamDecoder.java:52)
> at java.io.InputStreamReader.<init>(InputStreamReader.java:83)
> at org.apache.james.mime4j.message.BasicTextBody.getReader(BasicTextBody.java:49)
> We receive, parse and archive a vast number of confidential e-mail messages (for which
we use Mime4J) and every now and then we get an e-mail message that contains a non-standard
character encoding name (in this case 'x-user-defined'). With the old (0.6) Mime4J version
we were still able to parse and read most of those e-mail messages because of the fallback
character set in the parser.
> I can unfortunately not post the entire message here but the content-type header that
caused the above exception looks like this:
> Content-Type: text/plain; charset="x-user-defined" 

This message was sent by Atlassian JIRA

View raw message