Date: Wed, 20 Aug 2014 07:59:06 +1000
Subject: Re: Can some terms from analysis be silently dropped when indexing? Because I'm pretty sure I'm seeing that happen.
From: Trejkaz
To: Lucene Users Mailing List

On Tue, Aug 19, 2014 at 5:27 PM, Uwe Schindler wrote:
> Hi,
>
> You forgot to close (or commit) IndexWriter before opening the reader.

Huh? The code I posted is closing it:

    try (IndexWriter writer = new IndexWriter(directory,
            new IndexWriterConfig(Version.LUCENE_36, analyser))) {
        Document document = new Document();
        document.add(new Field("content",
            "blah blah commercial blah blah \u79CB\u8449\u539F blah blah",
            Field.Store.NO, Field.Index.ANALYZED));
        writer.addDocument(document);
    } // <-- closed here

And anyway, if it weren't being closed, how would you explain the terms
ending up in the index?

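To illustrate the point about the closing brace, here is a minimal sketch of try-with-resources semantics that runs without Lucene. RecordingWriter is a hypothetical stand-in for IndexWriter (it is not a Lucene class) and only records the order in which its methods are called; the assumption is simply that the real writer, being Closeable, is treated the same way by the language.

```java
import java.util.ArrayList;
import java.util.List;

public class TryWithResourcesDemo {

    /** Hypothetical stand-in for IndexWriter; it only records call order. */
    static final class RecordingWriter implements AutoCloseable {
        private final List<String> events;

        RecordingWriter(List<String> events) {
            this.events = events;
            events.add("open");
        }

        void addDocument(String text) {
            events.add("add:" + text);
        }

        @Override
        public void close() {
            events.add("close");
        }
    }

    /** Runs one try-with-resources block and returns the recorded events. */
    static List<String> run() {
        List<String> events = new ArrayList<>();
        try (RecordingWriter writer = new RecordingWriter(events)) {
            writer.addDocument("blah");
        } // close() is invoked here, before the next statement runs
        events.add("after-try");
        return events;
    }

    public static void main(String[] args) {
        // prints [open, add:blah, close, after-try]
        System.out.println(run());
    }
}
```

The "close" event is recorded before "after-try", so any code placed after the block (such as opening a reader) sees an already-closed writer.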
TX