mahout-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Mengfei Ren <>
Subject Write my Analyzer and Fillter, what would the dictionary-file-0 be like?
Date Fri, 21 Jun 2013 17:32:37 GMT

I'm try to write my own analyzer for Naive Bayes classification.
When I run the seq2sparse command and called -a MyANalyzer, The output
shows the algorithim used my filter and analyzer:

### Using length filter
### Using stopword fileter
### Using my filter
### Using my keyword filter

But if I check the dictionary-file-0 generated in this step, I  still find
all the tokens (not filtered!). Does that mean my analyzer didn't work, or
the dictionary-file-0 just record all tokens but the algorithm actually use
the filtered data?


Mengfei Ren

  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message