Return-Path: Delivered-To: apmail-lucene-java-dev-archive@www.apache.org Received: (qmail 40450 invoked from network); 25 Nov 2006 10:07:29 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 25 Nov 2006 10:07:29 -0000 Received: (qmail 48567 invoked by uid 500); 25 Nov 2006 10:07:36 -0000 Delivered-To: apmail-lucene-java-dev-archive@lucene.apache.org Received: (qmail 48366 invoked by uid 500); 25 Nov 2006 10:07:35 -0000 Mailing-List: contact java-dev-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: java-dev@lucene.apache.org Delivered-To: mailing list java-dev@lucene.apache.org Received: (qmail 48348 invoked by uid 99); 25 Nov 2006 10:07:35 -0000 Received: from herse.apache.org (HELO herse.apache.org) (140.211.11.133) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 25 Nov 2006 02:07:35 -0800 X-ASF-Spam-Status: No, hits=0.0 required=10.0 tests= X-Spam-Check-By: apache.org Received-SPF: pass (herse.apache.org: local policy includes SPF record at spf.trusted-forwarder.org) Received: from [217.146.176.228] (HELO web25410.mail.ukl.yahoo.com) (217.146.176.228) by apache.org (qpsmtpd/0.29) with SMTP; Sat, 25 Nov 2006 02:07:22 -0800 Received: (qmail 64592 invoked by uid 60001); 25 Nov 2006 10:06:59 -0000 Message-ID: <20061125100659.64590.qmail@web25410.mail.ukl.yahoo.com> DomainKey-Signature: a=rsa-sha1; q=dns; c=nofws; s=s1024; d=yahoo.fr; h=X-YMail-OSG:Received:Date:From:Subject:To:MIME-Version:Content-Type:Content-Transfer-Encoding:Message-ID; b=v/q3G0pXL1PaVWn+o8zlbj9oO4+xDn1lYzglbnyrG9/+IlxzOb+jKYvp8g0OkT7h4aabU3VPYIznTge+AjIlBwoS4IFzuMy1zM7potENvyi1b39KOVLbrfbPUxkxhLeOyldy8ZUiPC2dJSspq/FdAMGowZi4y5fwa5a5T1JCpKY=; X-YMail-OSG: XQ5p6t0VM1mG4co2yaORjw1Ziu8LyIOXjrg7rhf2YSd_Vt_yOKtEAbyr_AoW4ysvYB5OsSg4wCOoacdmkfWiIl2VyFVH173HzgBxsA1x4RNYaWvRczezhOjP8QrTQINPqpUWiyg3qCkP Received: from [84.6.194.96] by web25410.mail.ukl.yahoo.com via HTTP; Sat, 25 Nov 2006 11:06:59 CET Date: Sat, 25 Nov 2006 11:06:59 +0100 (CET) From: christophe leroy Subject: I want to develop an analyzer based on StandardAnalyzer To: java-dev@lucene.apache.org MIME-Version: 1.0 Content-Type: text/plain; charset=iso-8859-1 Content-Transfer-Encoding: 8bit X-Virus-Checked: Checked by ClamAV on apache.org Hello, I need an Analyzer which doesn't separate the underscored words and with the StandardAnalyzer functionnalities. So, the solution is to create a new Analyzer . I have seen several mails about that. I tried to apply the proposed solutions but I have a problem. The idea is to compile from sources of lucene. The version of lucene is 1.4.3. Resume of actions: 1. I had add the line "\u002d","\u005f" in the StandardTokenizer.jj (for "_" and "-"): | < #LETTER: // unicode letters [ "\u0041"-"\u005a", "\u0061"-"\u007a", "\u00c0"-"\u00d6", "\u00d8"-"\u00f6", "\u00f8"-"\u00ff", "\u0100"-"\u1fff", "\u002d","\u005f" ] > 2. I used javacc to generate the java files and I have a warning C:\Documents and Settings\CLEROY\Bureau\pb_lucene2\lucene-1.4.3-src\lucene-1.4.3 \src\java\org\apache\lucene\analysis\standard>"C:\Documents and Settings\CLEROY\ Bureau\pb_lucene2\javacc-4.0\javacc-4.0\bin\javacc" StandardTokenizer.jj Java Compiler Compiler Version 4.0 (Parser Generator) (type "javacc" with no arguments for help) Reading from file StandardTokenizer.jj . . . Warning: Line 61, Column 3: Bad option name "OPTIMIZE_TOKEN_MANAGER". Option se tting will be ignored. Note: UNICODE_INPUT option is specified. Please make sure you create the parser/ lexer using a Reader with the correct character encoding. Parser generated with 0 errors and 1 warnings. C:\Documents and Settings\CLEROY\Bureau\pb_lucene2\lucene-1.4.3-src\lucene-1.4.3 \src\java\org\apache\lucene\analysis\standard> 2. And I try to create the jar of lucene but I have a lot of problems: C:\Documents and Settings\CLEROY\Bureau\pb_lucene2\lucene-1.4.3-src\lucene-1.4.3 >ant Buildfile: build.xml init: compile-core: [javac] Compiling 7 source files to C:\Documents and Settings\CLEROY\Bureau\ pb_lucene2\lucene-1.4.3-src\lucene-1.4.3\build\classes\java [javac] C:\Documents and Settings\CLEROY\Bureau\pb_lucene2\lucene-1.4.3-src\ lucene-1.4.3\src\java\org\apache\lucene\analysis\standard\StandardTokenizer.java :15: cannot resolve symbol [javac] symbol : class Reader [javac] location: class org.apache.lucene.analysis.standard.StandardTokenize r [javac] public StandardTokenizer(Reader reader) { [javac] ^ [javac] C:\Documents and Settings\CLEROY\Bureau\pb_lucene2\lucene-1.4.3-src\ lucene-1.4.3\src\java\org\apache\lucene\analysis\standard\StandardTokenizer.java :24: cannot resolve symbol [javac] symbol : class IOException [javac] location: class org.apache.lucene.analysis.standard.StandardTokenize r [javac] final public org.apache.lucene.analysis.Token next() throws ParseE xception, IOException { [javac] ^ [javac] C:\Documents and Settings\CLEROY\Bureau\pb_lucene2\lucene-1.4.3-src\ lucene-1.4.3\src\java\org\apache\lucene\analysis\standard\StandardTokenizer.java :24: next() in org.apache.lucene.analysis.standard.StandardTokenizer cannot over ride next() in org.apache.lucene.analysis.TokenStream; overridden method does no t throw org.apache.lucene.analysis.standard.ParseException [javac] final public org.apache.lucene.analysis.Token next() throws ParseE xception, IOException { [javac] ^ [javac] C:\Documents and Settings\CLEROY\Bureau\pb_lucene2\lucene-1.4.3-src\ lucene-1.4.3\src\java\org\apache\lucene\analysis\standard\StandardTokenizer.java :15: recursive constructor invocation [javac] public StandardTokenizer(Reader reader) { [javac] ^ [javac] 4 errors BUILD FAILED C:\Documents and Settings\CLEROY\Bureau\pb_lucene2\lucene-1.4.3-src\lucene-1.4.3 \build.xml:140: Compile failed; see the compiler error output for details. Total time: 2 seconds Could You help me. It is very important. My chief will kill me... Thank, Christophe ___________________________________________________________________________ Yahoo! Mail r�invente le mail ! D�couvrez le nouveau Yahoo! Mail et son interface r�volutionnaire. http://fr.mail.yahoo.com --------------------------------------------------------------------- To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org For additional commands, e-mail: java-dev-help@lucene.apache.org