Return-Path: Delivered-To: apmail-jakarta-lucene-user-archive@apache.org Received: (qmail 2808 invoked from network); 17 Jun 2003 20:41:14 -0000 Received: from exchange.sun.com (192.18.33.10) by daedalus.apache.org with SMTP; 17 Jun 2003 20:41:14 -0000 Received: (qmail 24809 invoked by uid 97); 17 Jun 2003 20:43:38 -0000 Delivered-To: qmlist-jakarta-archive-lucene-user@nagoya.betaversion.org Received: (qmail 24802 invoked from network); 17 Jun 2003 20:43:37 -0000 Received: from daedalus.apache.org (HELO apache.org) (208.185.179.12) by nagoya.betaversion.org with SMTP; 17 Jun 2003 20:43:37 -0000 Received: (qmail 2469 invoked by uid 500); 17 Jun 2003 20:41:11 -0000 Mailing-List: contact lucene-user-help@jakarta.apache.org; run by ezmlm Precedence: bulk List-Unsubscribe: List-Subscribe: List-Help: List-Post: List-Id: "Lucene Users List" Reply-To: "Lucene Users List" Delivered-To: mailing list lucene-user@jakarta.apache.org Received: (qmail 2447 invoked from network); 17 Jun 2003 20:41:10 -0000 Received: from unknown (HELO krayton.imanage.com) (66.54.186.39) by daedalus.apache.org with SMTP; 17 Jun 2003 20:41:10 -0000 X-MimeOLE: Produced By Microsoft Exchange V6.0.6249.0 content-class: urn:content-classes:message Subject: Any tools to detect language of document Date: Tue, 17 Jun 2003 15:41:04 -0500 MIME-Version: 1.0 Content-Type: text/plain; charset="us-ascii" Message-ID: <7AFD8C59F5C4A044A7AC48D356469D1619B9E0@krayton.imanage.com> Content-Transfer-Encoding: quoted-printable X-MS-Has-Attach: X-MS-TNEF-Correlator: Thread-Topic: Weighted Search by Field using MultiFieldQueryParser Thread-Index: AcM04gEGKL9RH1YKTom9xWdFCK5OtQALlzvQ From: "Randy Darling" To: "Lucene Users List" X-Spam-Rating: daedalus.apache.org 1.6.2 0/1000/N X-Spam-Rating: daedalus.apache.org 1.6.2 0/1000/N I am attempting to come up with an automated way to select which language analyzer to use on a document. Anyone know of any algorithms available to detect what language the document may be written in? Are there any special Analyzers that attempt to support multiple languages? Thanks, Randy --------------------------------------------------------------------- To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org For additional commands, e-mail: lucene-user-help@jakarta.apache.org