incubator-ooo-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Sahand.T" <>
Subject Need help with Hunspell's .aff syntax
Date Thu, 10 May 2012 00:56:01 GMT
Hello, I just joined this mailing list for the purpose of understanding
Hunspell better. I am trying to create a spell checker for central
kurdish/sorani and am currently looking through examples and playing with
the .aff file.

I don't really know how mailing lists works but if anyone has answers to
these things I'd appreciate it (follow up questions may arise).

1.What does the TRY attribute actually do? I found the manuals cryptical in
their explanation. I understand that it is used to determine wrong
characters in words, I don't get how it does it though or how I should set
it up for my needs.

2.Taken from manual4:

 *"Personal dictionaries are simple word lists. Asterisk at the first
> character position signs prohibition. A second word separated by a slash
> sets the affixation.
> **
> **foo
> **Foo/Simpson
> ***bar
> **
> **In this example, "foo" and "Foo" are personal words, plus Foo will be
> recognized with affixes of Simpson (Foo’s etc.) and bar is a forbidden
> word."*

What does the "affixes of Simpson" mean? Is Simpson a flag/class in the
.aff file or what? Or does it  mean "FooSimpson" will be allowed?

3. What does this compoundrule from an en_US.aff mean and how does it make
the rules for adding "st", "th", "nd", "rd" to numbers properly?

*# ordinal numbers
> **# only in compounds: 1th, 2th, 3th
> **# compound rules:
> **# 1. [0-9]*1[0-9]th (10th, 11th, 12th, 56714th, etc.)
> **# 2. [0-9]*[02-9](1st|2nd|3rd|[4-9]th) (21st, 22nd, 123rd, 1234th, etc.)
> **WORDCHARS 0123456789 *

4. When I've created all the rules and a dictionary. Do I then use Hunspell
to generate better .dic/.aff files? If so, how are they better? (words with
prefixes are removed?)

What else do you need the hunspell source and executables for? Is it for
the testing features or is there something I've missed that is awesome
about having the Hunspell source?

Thanks for reading! It's bedtime for me now so I won't be answering until
tomorrow. Nitey!


  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message