Return-Path: Delivered-To: apmail-lucene-java-user-archive@www.apache.org Received: (qmail 97929 invoked from network); 5 Sep 2005 03:27:20 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (209.237.227.199) by minotaur.apache.org with SMTP; 5 Sep 2005 03:27:20 -0000 Received: (qmail 40904 invoked by uid 500); 5 Sep 2005 03:27:19 -0000 Delivered-To: apmail-lucene-java-user-archive@lucene.apache.org Received: (qmail 40015 invoked by uid 500); 5 Sep 2005 03:27:16 -0000 Mailing-List: contact java-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: java-user@lucene.apache.org Delivered-To: mailing list java-user@lucene.apache.org Received: (qmail 40002 invoked by uid 99); 5 Sep 2005 03:27:16 -0000 Received: from asf.osuosl.org (HELO asf.osuosl.org) (140.211.166.49) by apache.org (qpsmtpd/0.29) with ESMTP; Sun, 04 Sep 2005 20:27:16 -0700 X-ASF-Spam-Status: No, hits=0.5 required=10.0 tests=HTML_20_30,HTML_MESSAGE,RCVD_BY_IP,SPF_HELO_PASS,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (asf.osuosl.org: domain of hackingbear@gmail.com designates 66.249.82.203 as permitted sender) Received: from [66.249.82.203] (HELO xproxy.gmail.com) (66.249.82.203) by apache.org (qpsmtpd/0.29) with ESMTP; Sun, 04 Sep 2005 20:27:29 -0700 Received: by xproxy.gmail.com with SMTP id h28so1170653wxd for ; Sun, 04 Sep 2005 20:27:13 -0700 (PDT) DomainKey-Signature: a=rsa-sha1; q=dns; c=nofws; s=beta; d=gmail.com; h=received:message-id:date:from:to:subject:mime-version:content-type; b=ZpgPg43YL0A+VGEvkI+sVNJZ629in+YC9n9zeNrnKZXOqxDrS2jiQATiUJ8NJyFZeuOogsAgb5wKpAGhayTdii3siXc3PjD75iKqNex4vacCYfJAWw6qTOZQAeYob2Ft05DbIb5VZ3ZbSXvI1fwhS3PKzRWIgtxjxqg9NQMViGM= Received: by 10.70.50.16 with SMTP id x16mr68281wxx; Sun, 04 Sep 2005 20:27:13 -0700 (PDT) Received: by 10.70.12.11 with HTTP; Sun, 4 Sep 2005 20:27:13 -0700 (PDT) Message-ID: <3feb1c060509042027163fb4bd@mail.gmail.com> Date: Mon, 5 Sep 2005 11:27:13 +0800 From: Hacking Bear To: java-user@lucene.apache.org Subject: Handling of unknown/multiple lanuage documents Mime-Version: 1.0 Content-Type: multipart/alternative; boundary="----=_Part_177_29505178.1125890833858" X-Virus-Checked: Checked by ClamAV on apache.org X-Spam-Rating: minotaur.apache.org 1.6.2 0/1000/N ------=_Part_177_29505178.1125890833858 Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable Content-Disposition: inline Hello, I'm new to Lucene. After some readings, I'm still not quite sure which=20 Analyzer I should be using for handling documents in unknown or multiple=20 lanugages. The documents I want to index may be written in lanuages other= =20 than the user/system's default language and one document may contain text i= n=20 different lanuages. By the way, I hope there is a search capablity in website of Lucene search= =20 engine. :-) - HB ------=_Part_177_29505178.1125890833858--