lucene-dev mailing list archives

From christophe leroy <hawa...@yahoo.fr>
Subject I want to develop an analyzer based on StandardAnalyzer
Date Sat, 25 Nov 2006 10:06:59 GMT
Hello,

I need an Analyzer that has the functionality of
StandardAnalyzer but does not split words containing
underscores.
So the solution is to create a new Analyzer.

I have seen several mails about this and tried to apply
the proposed solutions, but I have run into a problem.

The idea is to compile Lucene from source. The
Lucene version is 1.4.3.

Summary of actions:
1. I added the line "\u002d","\u005f" to the #LETTER
definition in StandardTokenizer.jj (for "-" and "_"):
| < #LETTER:                      // unicode letters
      [
       "\u0041"-"\u005a",
       "\u0061"-"\u007a",
       "\u00c0"-"\u00d6",
       "\u00d8"-"\u00f6",
       "\u00f8"-"\u00ff",
       "\u0100"-"\u1fff",
       "\u002d","\u005f"
      ]
  >
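
For illustration, the intended effect of that character class can be sketched in plain Java. This is not the Lucene tokenizer and does not use any Lucene API: the class name LetterClassSketch and the methods isLetter and tokenize are hypothetical, and the sketch only treats runs of characters from the modified #LETTER ranges as one token, ignoring the other rules of the real grammar (digits, e-mail addresses, host names and so on).

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical stand-alone sketch; it only mirrors the modified #LETTER class.
public class LetterClassSketch {

    // True if c falls in one of the ranges listed in the #LETTER definition,
    // including the two added code points 0x002d ('-') and 0x005f ('_').
    static boolean isLetter(char c) {
        return (c >= 0x0041 && c <= 0x005a)
            || (c >= 0x0061 && c <= 0x007a)
            || (c >= 0x00c0 && c <= 0x00d6)
            || (c >= 0x00d8 && c <= 0x00f6)
            || (c >= 0x00f8 && c <= 0x00ff)
            || (c >= 0x0100 && c <= 0x1fff)
            || c == 0x002d
            || c == 0x005f;
    }

    // Splits text into maximal runs of "letter" characters.
    static List<String> tokenize(String text) {
        List<String> tokens = new ArrayList<>();
        StringBuilder current = new StringBuilder();
        for (int i = 0; i < text.length(); i++) {
            char c = text.charAt(i);
            if (isLetter(c)) {
                current.append(c);
            } else if (current.length() > 0) {
                tokens.add(current.toString());
                current.setLength(0);
            }
        }
        if (current.length() > 0) {
            tokens.add(current.toString());
        }
        return tokens;
    }

    public static void main(String[] args) {
        // prints [my_field-name, and, other]
        System.out.println(tokenize("my_field-name and other"));
    }
}
```

With "-" and "_" in the class, "my_field-name" stays a single token; without them it would be cut at each of those characters.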

2. I ran JavaCC to generate the Java files and got
a warning:

C:\Documents and Settings\CLEROY\Bureau\pb_lucene2\lucene-1.4.3-src\lucene-1.4.3\src\java\org\apache\lucene\analysis\standard>"C:\Documents and Settings\CLEROY\Bureau\pb_lucene2\javacc-4.0\javacc-4.0\bin\javacc" StandardTokenizer.jj
Java Compiler Compiler Version 4.0 (Parser Generator)
(type "javacc" with no arguments for help)
Reading from file StandardTokenizer.jj . . .
Warning: Line 61, Column 3: Bad option name "OPTIMIZE_TOKEN_MANAGER".  Option setting will be ignored.
Note: UNICODE_INPUT option is specified. Please make sure you create the parser/lexer using a Reader with the correct character encoding.
Parser generated with 0 errors and 1 warnings.

C:\Documents and Settings\CLEROY\Bureau\pb_lucene2\lucene-1.4.3-src\lucene-1.4.3\src\java\org\apache\lucene\analysis\standard>

3. Then I tried to build the Lucene jar, but I got a
lot of errors:

C:\Documents and Settings\CLEROY\Bureau\pb_lucene2\lucene-1.4.3-src\lucene-1.4.3>ant
Buildfile: build.xml

init:

compile-core:
    [javac] Compiling 7 source files to C:\Documents and Settings\CLEROY\Bureau\pb_lucene2\lucene-1.4.3-src\lucene-1.4.3\build\classes\java
    [javac] C:\Documents and Settings\CLEROY\Bureau\pb_lucene2\lucene-1.4.3-src\lucene-1.4.3\src\java\org\apache\lucene\analysis\standard\StandardTokenizer.java:15: cannot resolve symbol
    [javac] symbol  : class Reader
    [javac] location: class org.apache.lucene.analysis.standard.StandardTokenizer
    [javac]   public StandardTokenizer(Reader reader) {
    [javac]                            ^
    [javac] C:\Documents and Settings\CLEROY\Bureau\pb_lucene2\lucene-1.4.3-src\lucene-1.4.3\src\java\org\apache\lucene\analysis\standard\StandardTokenizer.java:24: cannot resolve symbol
    [javac] symbol  : class IOException
    [javac] location: class org.apache.lucene.analysis.standard.StandardTokenizer
    [javac]   final public org.apache.lucene.analysis.Token next() throws ParseException, IOException {
    [javac]                                                                               ^
    [javac] C:\Documents and Settings\CLEROY\Bureau\pb_lucene2\lucene-1.4.3-src\lucene-1.4.3\src\java\org\apache\lucene\analysis\standard\StandardTokenizer.java:24: next() in org.apache.lucene.analysis.standard.StandardTokenizer cannot override next() in org.apache.lucene.analysis.TokenStream; overridden method does not throw org.apache.lucene.analysis.standard.ParseException
    [javac]   final public org.apache.lucene.analysis.Token next() throws ParseException, IOException {
    [javac]                                                 ^
    [javac] C:\Documents and Settings\CLEROY\Bureau\pb_lucene2\lucene-1.4.3-src\lucene-1.4.3\src\java\org\apache\lucene\analysis\standard\StandardTokenizer.java:15: recursive constructor invocation
    [javac]   public StandardTokenizer(Reader reader) {
    [javac]          ^
    [javac] 4 errors

BUILD FAILED
C:\Documents and Settings\CLEROY\Bureau\pb_lucene2\lucene-1.4.3-src\lucene-1.4.3\build.xml:140: Compile failed; see the compiler error output for details.

Total time: 2 seconds

Could you help me? It is very important. My boss will
kill me...

Thanks,

Christophe


	

	
		