opennlp-users mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Jim - FooBar();" <jimpil1...@gmail.com>
Subject Re: abbreviation diccionary format
Date Tue, 10 Apr 2012 13:20:10 GMT
On 10/04/12 14:18, Jörn Kottmann wrote:
> On 04/10/2012 03:15 PM, Jim - FooBar(); wrote:
>>
>> But you still cannot "train" anything (maxent/perceptron) on the 
>> dictionary, can you???
>> One needs training data for that yes? 
>
> The dictionary is used to produce additional features to our standard 
> feature set.
> Therefor you need training data to train our statistical tokenizer, 
> even so the feature
> generation can use a dictionary to produce features.
>
> Jörn

aha ok, that makes sense...

Jim

Mime
View raw message