Return-Path: Delivered-To: apmail-jackrabbit-dev-archive@www.apache.org Received: (qmail 5627 invoked from network); 10 Mar 2009 14:51:29 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 10 Mar 2009 14:51:29 -0000 Received: (qmail 90359 invoked by uid 500); 10 Mar 2009 14:51:13 -0000 Delivered-To: apmail-jackrabbit-dev-archive@jackrabbit.apache.org Received: (qmail 90295 invoked by uid 500); 10 Mar 2009 14:51:13 -0000 Mailing-List: contact dev-help@jackrabbit.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@jackrabbit.apache.org Delivered-To: mailing list dev@jackrabbit.apache.org Received: (qmail 90198 invoked by uid 99); 10 Mar 2009 14:51:13 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 10 Mar 2009 07:51:12 -0700 X-ASF-Spam-Status: No, hits=-2000.0 required=10.0 tests=ALL_TRUSTED X-Spam-Check-By: apache.org Received: from [140.211.11.140] (HELO brutus.apache.org) (140.211.11.140) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 10 Mar 2009 14:51:12 +0000 Received: from brutus (localhost [127.0.0.1]) by brutus.apache.org (Postfix) with ESMTP id D7F61234C04C for ; Tue, 10 Mar 2009 07:50:51 -0700 (PDT) Message-ID: <90037569.1236696651883.JavaMail.jira@brutus> Date: Tue, 10 Mar 2009 07:50:51 -0700 (PDT) From: "Thomas Mueller (JIRA)" To: dev@jackrabbit.apache.org Subject: [jira] Commented: (JCR-2012) BufferedStringValue corrupts non ISO-8859-1 characters on large Strings In-Reply-To: <856460891.1236693050942.JavaMail.jira@brutus> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 X-Virus-Checked: Checked by ClamAV on apache.org [ https://issues.apache.org/jira/browse/JCR-2012?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12680490#action_12680490 ] Thomas Mueller commented on JCR-2012: ------------------------------------- I'm working on this problem as well. There is another bug: the reported length is wrong. My patch is larger. It also improves performance and simplified the code. I will post a patch later today. > BufferedStringValue corrupts non ISO-8859-1 characters on large Strings > ----------------------------------------------------------------------- > > Key: JCR-2012 > URL: https://issues.apache.org/jira/browse/JCR-2012 > Project: Jackrabbit Content Repository > Issue Type: Bug > Components: jackrabbit-core > Affects Versions: core 1.4.5 > Environment: Sun JDK 1.6, Win 2000 > Reporter: Henryk Paluch > Priority: Critical > Attachments: BufferedStringValue.java.encFix.diff, BufferedStringValueTest.java > > > When storing,retrieving large String values (for example large "sv:property" named "content" - which contains text of paragraphs) then non-ISO-8859-1 characters are lost. > This is caused becaus of improper handling of Temporary files in BufferedStringValue - they use Readers/Writers without specifying encoding - so national characters could be lost if system wide encoding does not support them. > Pending attachments: > - JUnit test Case > - Proposed fix -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.