From: KK
Date: Thu, 4 Jun 2009 18:14:30 +0530
Message-ID: <8db6d74a0906040544l4b2b520dif8a7f167a24c47d3@mail.gmail.com>
Subject: Re: How to support stemming and case folding for english content mixed with non-english content?
To: java-user@lucene.apache.org

Uwe, thanks for your lightning-fast response :-). I'm looking into that and
will see how far I can get. Also, I request Muir to point me to the exact
analyzer he mentioned in the previous mail.

Thanks,
KK

On Thu, Jun 4, 2009 at 6:10 PM, Uwe Schindler wrote:

> > I request Uwe to give me some more ideas on using the analyzers from Solr
> > that will do the job for me, handling a mix of both English and
> > non-English content.
>
> Look here:
>
> http://lucene.apache.org/solr/api/org/apache/solr/analysis/package-summary.html
>
> As you see, the Solr analyzers are just standard Lucene analyzers.
> So you can drop the Solr core jar into your project and just use them :-)
>
> Currently I am not sure which analyzer Robert means, the one that can do
> English stemming and detect non-English parts, but that is the place to
> look for it.
>
> Uwe
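
For readers following the thread, here is a minimal sketch of the idea under discussion: case-fold and stem only the English (plain Latin-letter) tokens and pass everything else through untouched. It is plain Java and does not use the Lucene or Solr API. The class name `MixedScriptNormalizer`, the helper `toyStem`, and the "all ASCII letters means English" rule are assumptions made for illustration; `toyStem` is a toy suffix stripper standing in for a real English stemmer, and it is not the analyzer Robert referred to.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;

public class MixedScriptNormalizer {

    // Assumption for this sketch: a token counts as "English" when every
    // character is a basic Latin letter (A-Z or a-z).
    static boolean isAsciiWord(String token) {
        if (token.isEmpty()) {
            return false;
        }
        for (int i = 0; i < token.length(); i++) {
            char c = token.charAt(i);
            boolean letter = (c >= 'a' && c <= 'z') || (c >= 'A' && c <= 'Z');
            if (!letter) {
                return false;
            }
        }
        return true;
    }

    // Toy suffix stripper (hypothetical stand-in for a real stemmer):
    // removes a trailing "ing" from longer words, or a trailing plural "s".
    static String toyStem(String word) {
        if (word.length() > 5 && word.endsWith("ing")) {
            return word.substring(0, word.length() - 3);
        }
        if (word.length() > 3 && word.endsWith("s") && !word.endsWith("ss")) {
            return word.substring(0, word.length() - 1);
        }
        return word;
    }

    // Splits on whitespace; lowercases and stems ASCII words, and leaves
    // every other token (for example, non-Latin-script text) unchanged.
    static List<String> normalize(String text) {
        List<String> out = new ArrayList<>();
        for (String token : text.trim().split("\\s+")) {
            if (token.isEmpty()) {
                continue;
            }
            if (isAsciiWord(token)) {
                out.add(toyStem(token.toLowerCase(Locale.ROOT)));
            } else {
                out.add(token);
            }
        }
        return out;
    }

    public static void main(String[] args) {
        // The escaped token is a Devanagari word, written with escapes so the
        // source file stays encoding-safe; it is only a sample input.
        System.out.println(normalize("Indexing Documents \u0939\u093f\u0928\u094d\u0926\u0940"));
    }
}
```

In a real Lucene setup the same split would normally live inside a custom token filter placed in the analyzer chain, so that indexing and querying apply identical rules.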