TINA. A probabilistic syntactic parser for speech understanding systems
- 13 January 2003
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
Abstract
A natural language system, TINA, which integrates key ideas from context-free grammars, augmented transition networks (ATNs), and unification grammars has been developed. The parser uses a best-first search strategy, with probability assignments on all arcs obtained automatically from a set of example sentences. An initial context-free grammar, derived from the example sentences, is converted to an implicit probabilistic network structure. Control includes both top-down and bottom-up cycles, and key parameters are passed among nodes to deal with long-distance movement and agreement constraints. The probabilities provide a natural mechanism for exploring more common grammatical constructions first. Arc probabilities reduce test-set perplexity sixfold. A strategy for dealing with movement that can handle nested and chained gaps efficiently and rejects crossed gaps is introduced.Keywords
This publication has 4 references indexed in Scilit:
- TINA. A probabilistic syntactic parser for speech understanding systemsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2003
- Efficient Parsing for Natural LanguagePublished by Springer Nature ,1986
- Grammatically-based automatic word class formationInformation Processing & Management, 1975
- Transition network grammars for natural language analysisCommunications of the ACM, 1970