Return-Path: X-Original-To: apmail-lucy-user-archive@www.apache.org Delivered-To: apmail-lucy-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 29F7FDE89 for ; Thu, 16 Aug 2012 15:26:31 +0000 (UTC) Received: (qmail 87329 invoked by uid 500); 16 Aug 2012 15:26:31 -0000 Delivered-To: apmail-lucy-user-archive@lucy.apache.org Received: (qmail 87241 invoked by uid 500); 16 Aug 2012 15:26:31 -0000 Mailing-List: contact user-help@lucy.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@lucy.apache.org Delivered-To: mailing list user@lucy.apache.org Received: (qmail 87233 invoked by uid 99); 16 Aug 2012 15:26:31 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 16 Aug 2012 15:26:31 +0000 X-ASF-Spam-Status: No, hits=-0.7 required=5.0 tests=FSL_RCVD_USER,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of leegee@gmail.com designates 74.125.82.179 as permitted sender) Received: from [74.125.82.179] (HELO mail-we0-f179.google.com) (74.125.82.179) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 16 Aug 2012 15:26:23 +0000 Received: by weyx10 with SMTP id x10so1636384wey.10 for ; Thu, 16 Aug 2012 08:26:01 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=message-id:date:from:reply-to:user-agent:mime-version:to:subject :content-type:content-transfer-encoding; bh=2hMKgv2cphGI2PBLstajJhvg3vzEbvuh8VQOU+zojLI=; b=eVNI6hg+cay5FlXpmGMMBXsuLtBYF4YGUT6wMaaYbekx4z0tFN//nR2px6pyB67F75 36mwDdjGX1jwgkKPpneWNY9TiW+6UGpQ87TNpgHI4XH1TDHERZ58wacjBNJUeVoEdnMa ecAJ/Mv1D2XO0mIV69qIbFsbhsM4hZz6lQ4A0gtISBOiF5eIKO0cHtHV4qzJBOxcVKsn AdDHl0u5/TGYIA/W2CFTrzmXofcSu1+yyEZZaAd+FOWa3o4TYDaRNiwGYUPzswwTq+gQ 9+eKzEvfH5vfRV/jw+iigu0sGUFy9IVlw0UaMWSo+Ma0nDa4wumoavKc0iHKSy35QCyB 6MTg== Received: by 10.180.74.33 with SMTP id q1mr6964305wiv.4.1345130761810; Thu, 16 Aug 2012 08:26:01 -0700 (PDT) Received: from Lee.local (catv-178-48-88-150.catv.broadband.hu. [178.48.88.150]) by mx.google.com with ESMTPS id cu1sm4333303wib.6.2012.08.16.08.25.59 (version=TLSv1/SSLv3 cipher=OTHER); Thu, 16 Aug 2012 08:26:01 -0700 (PDT) Message-ID: <502D1106.90004@leegoddard.net> Date: Thu, 16 Aug 2012 17:25:58 +0200 From: Lee Goddard Reply-To: lee@leegoddard.net User-Agent: Mozilla/5.0 (Macintosh; Intel Mac OS X 10.6; rv:14.0) Gecko/20120713 Thunderbird/14.0 MIME-Version: 1.0 To: user@lucy.apache.org Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: 7bit X-Virus-Checked: Checked by ClamAV on apache.org Subject: [lucy-user] UTF-8 Error Just a quick question. If I do this $highlighter->create_excerpt($hit); I get this: Invalid UTF-8 header byte: 00000095 lucy_StrHelp_decode_utf8_char at .../Lucy-0.3.2/core/Lucy/Util/StringHelper.c line 216 I'm not doing anything to make sure my search subject is or isn't utf8, I've only been playing with Lucy for a few hours. I just thought this looked like the sort of error a Perl user isn't intended to see. Lee