Trigger-based language models: a maximum entropy approach
- 1 January 1993
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- Vol. 2, 45-48 vol.2
- https://doi.org/10.1109/icassp.1993.319225
Abstract
Ongoing efforts at adaptive statistical language modeling are described. To extract information from the document history, trigger pairs are used as the basic information-bearing elements. To combine statistical evidence from multiple triggers, the principle of maximum entropy (ME) is used. To combine the trigger-based model with the static model, the latter is absorbed into the ME formalism. Given consistent statistical evidence, a unique ME solution is guaranteed to exist, and an iterative algorithm exists which is guaranteed to converge to it. Among the advantages of this approach are its simplicity, generality, and incremental nature. Among its disadvantages are its computational requirements. The model described here was trained on five million words of Wall Street Journal text. It used some 40000 unigram constraints, 200000 bigram constraints, 200000 trigram constraints, and 60000 trigger constraints. After 13 iterations, it produced a language model whose perplexity was 12% lower than that of a conventional trigram, as measured on independent data.Keywords
This publication has 7 references indexed in Scilit:
- Improvements in stochastic language modelingPublished by Association for Computational Linguistics (ACL) ,1992
- A dynamic language model for speech recognitionPublished by Association for Computational Linguistics (ACL) ,1991
- A Maximum Likelihood Approach to Continuous Speech RecognitionPublished by Institute of Electrical and Electronics Engineers (IEEE) ,1983
- Generalized Iterative Scaling for Log-Linear ModelsThe Annals of Mathematical Statistics, 1972
- Maximum Entropy for Hypothesis Formulation, Especially for Multidimensional Contingency TablesThe Annals of Mathematical Statistics, 1963
- Information Theory and Statistical MechanicsPhysical Review B, 1957
- The Population Frequencies of Species and the Estimation of Population ParametersBiometrika, 1953