opennlp-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Jeffrey Zemerick <>
Subject Hardcoded length in prefix and suffix feature generators
Date Thu, 09 Feb 2017 21:55:52 GMT

I noticed that the length is hardcoded to 4 in the PrefixFeatureGenerator
and the SuffixFeatureGenerator. I made this value configurable in the XML
for each feature generator. I also add a check for the length to keep
duplicate prefixes or suffixes being returned. (If the token is "yes" with
a length of 4 there would be two "yes" features returned.) If a value is
not provided in the XML it uses the default value of 4.

You can preview the changes here:

If this is a change that's desired by the group I can make a JIRA and a
pull request.


  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message