Berkeley DB indexed file.
It may be created with the
src/utilities/indexdict program provided with FreeLing. The source
file must contain the sense list for one lemma-PoS pair per line.
Each line has the format: lemma:PoS synset1 synset2 .... E.g.
cebolla:N 05760066 08734429 08734702
The sense annotation module assumes that the first sense code in the list is the most frequent sense for that lemma-PoS pair. This only takes effect when the value msf is selected for the SenseAnnotation option.
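The line format and the first-sense convention can be illustrated with a small Python sketch. This is not FreeLing code and does not read the Berkeley DB index: it only parses the plain-text source file described above. The function names (parse_sense_line, load_sense_source, most_frequent_sense) are hypothetical, and splitting the key on its last colon is an assumption made for the sketch.

```python
from typing import Dict, Iterable, List, Optional, Tuple

SenseKey = Tuple[str, str]  # (lemma, PoS)


def parse_sense_line(line: str) -> Tuple[SenseKey, List[str]]:
    """Parse one 'lemma:PoS sense1 sense2 ...' line of the source file."""
    fields = line.split()
    if len(fields) < 2:
        raise ValueError(f"expected a key and at least one sense code: {line!r}")
    # Assumption: the PoS follows the last colon of the first field.
    lemma, sep, pos = fields[0].rpartition(":")
    if not sep or not lemma or not pos:
        raise ValueError(f"malformed lemma:PoS key: {fields[0]!r}")
    return (lemma, pos), fields[1:]


def load_sense_source(lines: Iterable[str]) -> Dict[SenseKey, List[str]]:
    """Build an in-memory map from (lemma, PoS) to its ordered sense list."""
    senses: Dict[SenseKey, List[str]] = {}
    for raw in lines:
        if not raw.strip():
            continue  # skip blank lines
        key, codes = parse_sense_line(raw)
        senses[key] = codes
    return senses


def most_frequent_sense(
    senses: Dict[SenseKey, List[str]], lemma: str, pos: str
) -> Optional[str]:
    """Return the first listed sense, mirroring the 'msf' convention."""
    codes = senses.get((lemma, pos))
    return codes[0] if codes else None


if __name__ == "__main__":
    table = load_sense_source(["cebolla:N 05760066 08734429 08734702"])
    print(most_frequent_sense(table, "cebolla", "N"))
```

Because sense codes are kept as opaque strings, the same sketch works for any coding scheme, not only WordNet synset codes.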
Sense codes can be anything (assuming your later processes know what to do with them). The provided files contain WordNet 1.6 synset codes.
Currently, only the PoS tag selected by the tagger is annotated, though the library is designed to support sense annotation for all possible tags of each word.