From: Otis Gospodnetic
To: Lucene Users List
Subject: Re: How does simple analyser work
Date: Tue, 11 Jun 2002 07:51:25 -0700 (PDT)
Message-ID: <20020611145125.77449.qmail@web12701.mail.yahoo.com>
In-Reply-To: <000801c21108$be30e080$2564a8c0@jade>

--- Harpreet S Walia wrote:
> Hi,
>
> Are there any resources available which explain how the simple
> analyser processes the data given to it?
> What I want to know is: suppose I have a set of words, what
> exact rules are applied to tokenize and index these words, and how can
> I customize them?
>
> My requirement is that the words be broken only by spaces and not at
> any other character. I understand that this can be done by writing
> a parser in JavaCC, but is there any simpler way of achieving this?

Actually, this can be done by writing your own custom Analyzer.
Check these:

./org/apache/lucene/analysis/standard/StandardAnalyzer.java
./org/apache/lucene/analysis/Analyzer.java
./org/apache/lucene/analysis/de/GermanAnalyzer.java
./org/apache/lucene/analysis/SimpleAnalyzer.java
./org/apache/lucene/analysis/StopAnalyzer.java
./org/apache/lucene/analysis/WhitespaceAnalyzer.java

Maybe this last one is what you are looking for.

Otis
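
To make the difference between the two tokenizing rules concrete, here is a rough standalone sketch in plain Java. It is not Lucene source and does not use the Lucene API; the class and method names (TokenRuleSketch, splitOnWhitespace, splitOnNonLetters) are made up for illustration. It assumes the behaviour usually described for these analyzers: WhitespaceAnalyzer breaks text only at whitespace, while SimpleAnalyzer breaks at every non-letter and lowercases the result.

```java
import java.util.ArrayList;
import java.util.List;

// Illustration only: hypothetical names, no Lucene classes involved.
class TokenRuleSketch {

    // Whitespace rule: a token is a maximal run of non-whitespace characters.
    static List<String> splitOnWhitespace(String text) {
        List<String> tokens = new ArrayList<>();
        StringBuilder current = new StringBuilder();
        for (int i = 0; i < text.length(); i++) {
            char c = text.charAt(i);
            if (Character.isWhitespace(c)) {
                flush(current, tokens);   // whitespace ends the current token
            } else {
                current.append(c);        // everything else stays in the token
            }
        }
        flush(current, tokens);
        return tokens;
    }

    // Letter rule: a token is a maximal run of letters, lowercased.
    static List<String> splitOnNonLetters(String text) {
        List<String> tokens = new ArrayList<>();
        StringBuilder current = new StringBuilder();
        for (int i = 0; i < text.length(); i++) {
            char c = text.charAt(i);
            if (Character.isLetter(c)) {
                current.append(Character.toLowerCase(c));
            } else {
                flush(current, tokens);   // digits, punctuation, spaces all split
            }
        }
        flush(current, tokens);
        return tokens;
    }

    // Adds the pending token (if any) to the list and resets the buffer.
    private static void flush(StringBuilder current, List<String> tokens) {
        if (current.length() > 0) {
            tokens.add(current.toString());
            current.setLength(0);
        }
    }

    public static void main(String[] args) {
        String text = "Wi-Fi x86_64 user@example.com";
        System.out.println(splitOnWhitespace(text));  // [Wi-Fi, x86_64, user@example.com]
        System.out.println(splitOnNonLetters(text));  // [wi, fi, x, user, example, com]
    }
}
```

With the whitespace rule, hyphens, digits, underscores and "@" all stay inside the token, which matches the "broken only by spaces" requirement. In Lucene itself the real tokenizing lives in the analyzer sources listed above, so check those for the exact behaviour of your version.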