Table of Contents
fsa_morph - perform morphological analysis of a word
[ options ] [ <infile ] [ >outfile ]
fsa_morph reads lines
from the input. Each line contains one word. For each (inflected) word,
its morphological analysis is printed. It consists of one or more pairs
- -d dictionary
- use that dictionary. Several dictionaries
may be given. At least one dictionary must be specified. Dictionaries are
automata built using fsa_ubuild or fsa_build. The data for them must be
prepared in a special way. Each line of data consists of the inflected
form, followed by a separator, followed by a code, followed by the ending
of the lexeme, followed by the separator, followed by tags. The code specifies
how many characters from the end of the inflected form are different from
last characters of the lexeme (i.e. how many characters constitute the inflected
word ending). 'A' means that no characters are to be rejected, 'B' - 1, 'C' - 2,
and so on. Though the data is different from the normal word list format,
the automaton is the same (its content is different), so the magic number
is the same as with the normal data.
- -i input_file
- specifies an input file
- file that contains words that should be analysed. More than one file can
be specified (i.e. the option can be used more than once). In absence of
-i option, standard input is used.
- -l language_file
- specifies a file that
hold language specific information, i.e. (for now) characters that form
words, and pairs of (lowercase, uppercase) characters for case conversion.
If the option is not specified, latin letters with standard case conversions
wil be used.
Note: whether this information will be used or not depends
on the module used for I/O handling (one_word_io or text_io).
that the dictionary contains coded prefixes (see fsa_morph(5)
that the dictionary contains coded infixes (see fsa_morph(5)
version details, including compile options used to build the program.
Send bug reports to the
author: Jan Daciuk, email@example.com (correct the stuttering!).
- Invalid options, or lack of a required option.
file could not be opened.
- Not enough memory.
Table of Contents