Return-Path: Delivered-To: apmail-jakarta-lucene-user-archive@apache.org Received: (qmail 95585 invoked from network); 30 Jul 2002 15:04:58 -0000 Received: from unknown (HELO nagoya.betaversion.org) (192.18.49.131) by daedalus.apache.org with SMTP; 30 Jul 2002 15:04:58 -0000 Received: (qmail 12515 invoked by uid 97); 30 Jul 2002 15:05:17 -0000 Delivered-To: qmlist-jakarta-archive-lucene-user@jakarta.apache.org Received: (qmail 12470 invoked by uid 97); 30 Jul 2002 15:05:16 -0000 Mailing-List: contact lucene-user-help@jakarta.apache.org; run by ezmlm Precedence: bulk List-Unsubscribe: List-Subscribe: List-Help: List-Post: List-Id: "Lucene Users List" Reply-To: "Lucene Users List" Delivered-To: mailing list lucene-user@jakarta.apache.org Received: (qmail 12454 invoked by uid 98); 30 Jul 2002 15:05:16 -0000 X-Antivirus: nagoya (v4198 created Apr 24 2002) Reply-To: From: "Nader S. Henein" To: "Lucene Users List" Subject: RE: Using Different Stop Analyzers... Date: Tue, 30 Jul 2002 19:07:14 +0400 Message-ID: MIME-Version: 1.0 Content-Type: text/plain; charset="iso-8859-1" Content-Transfer-Encoding: 7bit X-Priority: 3 (Normal) X-MSMail-Priority: Normal X-Mailer: Microsoft Outlook IMO, Build 9.0.2416 (9.0.2910.0) Importance: Normal X-MimeOLE: Produced By Microsoft MimeOLE V6.00.2600.0000 In-Reply-To: <919A8C811073E949901532C46927EE30126CD7@mailsrv1.copyright.com> X-Spam-Rating: daedalus.apache.org 1.6.2 0/1000/N X-Spam-Rating: daedalus.apache.org 1.6.2 0/1000/N Just have two different analyzers with two sets of stop words, both inheriting from the same analyzer (for you other analyzer criteria such as whitespace analysis), this would work but why would you want to do that -----Original Message----- From: Samir Satam [mailto:ssatam@copyright.com] Sent: Tuesday, July 30, 2002 6:57 PM To: Lucene Users List Subject: Using Different Stop Analyzers... Hi All, I have a question about Analyzers. Now I know that the documentation states that I need to use the same analyzers for both Indexing and Searching. But this question is only about the Stopanalyzer. Lets say I construct a StopAnalyzer with certain set of stopwords (stopwords(x)) while indexing. My guess is that Lucene will probably read those stopwords and remove them from all my indexed fields. Thus if I search on those stopwords I will not be able to find them. Can I pass a different set of stopwords (stopwords(y)) possibly a superset of stopwords(x)? And in this case will lucene think of stop words as totalstopwordset = stopwords(x) (these are already removed from the index) + stopwords(y) (these will be removed while searching) Can somebody please let me know if my thinking is correct on this one? The advantage of passing a different set while searching is that I can eliminate stopwords without re-indexing all my documents. thank you in advance, Samir -- To unsubscribe, e-mail: For additional commands, e-mail: -- To unsubscribe, e-mail: For additional commands, e-mail: