opennlp-users mailing list archives

From Jörn Kottmann
Subject Re: abbreviation diccionary format
Date Tue, 10 Apr 2012 13:18:11 GMT
On 04/10/2012 03:15 PM, Jim - FooBar(); wrote:
> But you still cannot "train" anything (maxent/perceptron) on the 
> dictionary, can you???
> One needs training data for that yes? 

The dictionary is used to produce additional features on top of our
standard feature set.
Therefore you still need training data to train our statistical tokenizer,
even though the feature generation can use a dictionary to produce
features.
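
To illustrate the idea, here is a minimal sketch of dictionary-backed
feature generation. It is not the actual OpenNLP code: the class name
AbbreviationFeatureSketch, the method features, and the feature strings
"prev=", "next=" and "abbr" are all made up for this example. It assumes
1 <= splitIndex < token.length().

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Set;

// Hypothetical illustration only; names and feature strings are assumptions,
// not the OpenNLP API.
public class AbbreviationFeatureSketch {

    // Builds the features for the decision "split the token before splitIndex?".
    static List<String> features(String token, int splitIndex, Set<String> abbreviations) {
        List<String> feats = new ArrayList<>();

        // Stand-in for the standard feature set: the characters on either
        // side of the candidate split position.
        feats.add("prev=" + token.charAt(splitIndex - 1));
        feats.add("next=" + token.charAt(splitIndex));

        // Additional feature derived from the dictionary: fires when the
        // whole token is a known abbreviation.
        if (abbreviations.contains(token)) {
            feats.add("abbr");
        }
        return feats;
    }
}
```

The dictionary only contributes the extra "abbr" feature. How much weight
that feature gets when deciding whether to split is still learned by the
maxent/perceptron trainer from annotated training data, which is why the
dictionary alone is not enough.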

