HMM tagger parameter file

Initial probabilities, transition probabilities, lexical probabilities, etc. The file has six sections: <Tag>, <Bigram>, <Trigram>, <Initial>, <Word>, and <Smoothing>. Each section is closed by it corresponding tag </Tag>, </Bigram>, </Trigram>, etc.

The tag (unigram), bigram, and trigram probabilities are used in Linear Interpolation smoothing by the tagger. The package includes a perl script that may be used to generate an appropriate config file from a tagged corpus. See the file src/utilities/hmm_smooth.perl for details.

2008-01-24