Return-Path: X-Original-To: apmail-lucene-dev-archive@www.apache.org Delivered-To: apmail-lucene-dev-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 60D3C9E15 for ; Tue, 7 Feb 2012 18:31:22 +0000 (UTC) Received: (qmail 24508 invoked by uid 500); 7 Feb 2012 18:31:21 -0000 Delivered-To: apmail-lucene-dev-archive@lucene.apache.org Received: (qmail 24283 invoked by uid 500); 7 Feb 2012 18:31:20 -0000 Mailing-List: contact dev-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@lucene.apache.org Delivered-To: mailing list dev@lucene.apache.org Received: (qmail 24275 invoked by uid 99); 7 Feb 2012 18:31:20 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 07 Feb 2012 18:31:20 +0000 X-ASF-Spam-Status: No, hits=-2000.0 required=5.0 tests=ALL_TRUSTED,T_RP_MATCHES_RCVD X-Spam-Check-By: apache.org Received: from [140.211.11.116] (HELO hel.zones.apache.org) (140.211.11.116) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 07 Feb 2012 18:31:19 +0000 Received: from hel.zones.apache.org (hel.zones.apache.org [140.211.11.116]) by hel.zones.apache.org (Postfix) with ESMTP id 1E7821A7CF6 for ; Tue, 7 Feb 2012 18:30:59 +0000 (UTC) Date: Tue, 7 Feb 2012 18:30:59 +0000 (UTC) From: "Christian Moen (Created) (JIRA)" To: dev@lucene.apache.org Message-ID: <2109162825.9424.1328639459126.JavaMail.tomcat@hel.zones.apache.org> Subject: [jira] [Created] (SOLR-3107) Disable random sampling in LangDetectLanguageIdentifierUpdateProcessor MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 Disable random sampling in LangDetectLanguageIdentifierUpdateProcessor ---------------------------------------------------------------------- Key: SOLR-3107 URL: https://issues.apache.org/jira/browse/SOLR-3107 Project: Solr Issue Type: Improvement Components: contrib - LangId Affects Versions: 3.6, 4.0 Reporter: Christian Moen Priority: Minor The {{language-detection}} library used by {{LangDetectLanguageIdentifierUpdateProcessor}} uses a random sampling feature enabled by default as a means of avoiding local noise in input. The feature has its merits, but it can also be confusing to users who aren't aware of it since it may give different on the same input. I recommend turning it off to prevent confusion. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org For additional commands, e-mail: dev-help@lucene.apache.org