From: Erik Hatcher
To: general@lucene.apache.org
Subject: Re: Little improvement for SimpleHTMLEncoder
Date: Tue, 23 Aug 2005 14:11:32 -0400
In-Reply-To: <430B3DDD.9000604@gmx.de>
Message-Id: <535F3DBA-9457-444C-850E-806F03C8A752@ehatchersolutions.com>

Could you add this to a Bugzilla issue so it doesn't get lost in the
never-ending pile of e-mail that we all have?

Thanks,
    Erik

On Aug 23, 2005, at 11:16 AM, Stefan Wachter wrote:

> Hi all!
>
> The SimpleHTMLEncoder could be improved slightly: all characters
> with code >= 128 should be encoded as character entities. The
> reason is that the encoder does not know the encoding that is used
> for the response. Therefore it is safer to encode all characters
> beyond ASCII as character entities. Can someone judge this proposal
> and commit it eventually?
>
> Greetings!
> --Stefan
>
>
> default:
>   if (c < 128) {
>     result.append(c);
>   } else {
>     result.append("&#").append((int)c).append(";");
>   }
>
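
For anyone picking this up in Bugzilla, the quoted default branch can be tried out in context with a minimal sketch like the one below. It is not the Lucene SimpleHTMLEncoder source: the class name AsciiSafeHtmlEncoder, the method name htmlEncode, and the four markup cases around the default branch are assumptions made for illustration. Only the default branch comes from Stefan's message.

```java
// Hypothetical standalone sketch; NOT the Lucene SimpleHTMLEncoder source.
// Only the default branch below is taken from the proposal in this thread.
final class AsciiSafeHtmlEncoder {

    private AsciiSafeHtmlEncoder() {
        // static utility, no instances
    }

    /**
     * Escapes the four HTML markup characters and writes every char with
     * code >= 128 as a decimal numeric character reference (&#NNN;), so the
     * output is plain ASCII and does not depend on the response encoding.
     */
    static String htmlEncode(String plainText) {
        if (plainText == null || plainText.isEmpty()) {
            return "";
        }
        StringBuilder result = new StringBuilder(plainText.length());
        for (int i = 0; i < plainText.length(); i++) {
            char c = plainText.charAt(i);
            switch (c) {
                case '"':
                    result.append("&quot;");
                    break;
                case '&':
                    result.append("&amp;");
                    break;
                case '<':
                    result.append("&lt;");
                    break;
                case '>':
                    result.append("&gt;");
                    break;
                default:
                    // Proposed change: pass ASCII through, encode the rest.
                    if (c < 128) {
                        result.append(c);
                    } else {
                        result.append("&#").append((int) c).append(";");
                    }
            }
        }
        return result.toString();
    }
}
```

With this sketch, AsciiSafeHtmlEncoder.htmlEncode("café") returns "caf&#233;". One caveat of the per-char approach: a Java char is a UTF-16 code unit, so a character outside the Basic Multilingual Plane would come out as two surrogate references rather than one reference to its code point. Iterating by code point would avoid that, but it goes beyond what was proposed here.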