Date: Thu, 26 Oct 2017 12:02:33 -0700 (MST)
From: Chris Hostetter
To: Lucene Users
Subject: Re: ClassicAnalyzer Behavior on accent character

Classic is ... "classic" ... it exists largely for historical purposes, to
provide a tokenizer that does exactly what the javadocs say it does
(regarding punctuation, "product numbers", and email addresses), so that
people who depend on that behavior can continue to rely on it.

Standard is ... "standard" ... it implements the Unicode Standard text
segmentation rules (Unicode Standard Annex #29).

: Date: Fri, 20 Oct 2017 18:58:35 +0530
: From: Chitra
: Reply-To: java-user@lucene.apache.org
: To: Lucene Users
: Subject: Re: ClassicAnalyzer Behavior on accent character
:
: Hi,
:      I found the difference and now understand the behavior of both
: tokenizers.
:
: Could you please suggest which one is better to use:
: ClassicTokenizer or StandardTokenizer?
:
: --
: Regards,
: Chitra

-Hoss
http://www.lucidworks.com/
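
For anyone weighing the two tokenizers against their own data, one way to see the difference is to run the same text through both and compare the terms. The following is a minimal sketch, not taken from the thread: the class name TokenizerComparison, the tokenize helper, and the sample sentence are all made up for illustration. It assumes Lucene 7.x or 8.x with lucene-core and lucene-analyzers-common on the classpath. In Lucene 9.0 and later, ClassicTokenizer is in the org.apache.lucene.analysis.classic package, so the import would need adjusting.

```java
import java.io.IOException;
import java.io.StringReader;
import java.util.ArrayList;
import java.util.List;

import org.apache.lucene.analysis.Tokenizer;
// Assumption: Lucene 7.x/8.x package layout. In 9.0+ use
// org.apache.lucene.analysis.classic.ClassicTokenizer instead.
import org.apache.lucene.analysis.standard.ClassicTokenizer;
import org.apache.lucene.analysis.standard.StandardTokenizer;
import org.apache.lucene.analysis.tokenattributes.CharTermAttribute;

public class TokenizerComparison {

    /** Hypothetical helper: feeds the text to the tokenizer and collects each term. */
    static List<String> tokenize(Tokenizer tokenizer, String text) throws IOException {
        List<String> terms = new ArrayList<>();
        try (Tokenizer t = tokenizer) {
            CharTermAttribute term = t.addAttribute(CharTermAttribute.class);
            t.setReader(new StringReader(text));
            t.reset();                       // required before the first incrementToken()
            while (t.incrementToken()) {
                terms.add(term.toString());  // copy, since the attribute is reused per token
            }
            t.end();                         // signals end of stream before close()
        }
        return terms;
    }

    public static void main(String[] args) throws IOException {
        // Made-up sample covering the cases named in the ClassicTokenizer javadocs
        // (email address, hyphenated product number) plus an accented word.
        String sample = "Mail jane.doe@example.com about part XL-5000 at the café.";

        System.out.println("Classic:  " + tokenize(new ClassicTokenizer(), sample));
        System.out.println("Standard: " + tokenize(new StandardTokenizer(), sample));
    }
}
```

Neither tokenizer lowercases or strips accents by itself, so any differences printed here come from the segmentation rules alone. Analyzer-level filters would need to be added separately to reproduce what ClassicAnalyzer or StandardAnalyzer emit.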