lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Combs, Craig" <Craig.Co...@compuware.com>
Subject Why are tokens not being indexed?
Date Wed, 30 Nov 2005 13:04:56 GMT
I have a body of text which is being added to a document as unstored.  All
the words in the body text are coming through in the token stream for
analyzing.  For some reason I can search on some of tokens and others I can
not.

Take the following string:

"L'amministrazione di Uniface View consente un elevato grado di flessibilità
e può essere realizzata in base ai requisiti del proprio sito. Nel manuale
Guida all'amministrazione di Uniface View  vengono descritte le procedure di
amministrazione del sistema. Utilizzare le informazioni riportate di seguito
per creare il portale più adatto alle proprie esigenze."

If I search for "consente" or "elevato" no results are found.  If I search
for "view" or "grado" results are found.  I find an explanation for this
behavior.  These tokens are being returned to the reader but don't seem to
be making it into the index.

Any ideas on how to debugs or an explanation why this might be happening?

Regards,
Craig Combs




The contents of this e-mail are intended for the named addressee only. It
contains information that may be confidential. Unless you are the named
addressee or an authorized designee, you may not copy or use it, or disclose
it to anyone else. If you received it in error please notify us immediately
and then destroy it. 


---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org


Mime
View raw message