Inference of k-testable languages in the strict sense and application to syntactic pattern recognition
- 1 September 1990
- journal article
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- Vol. 12 (9) , 920-925
- https://doi.org/10.1109/34.57687
Abstract
The inductive inference of the class of k-testable languages in the strict sense (k-TSSL) is considered. A k-TSSL is essentially defined by a finite set of substrings of length k that are permitted to appear in the strings of the language. Given a positive sample R of strings of an unknown language, a deterministic finite-state automation that recognizes the smallest k-TSSL containing R is obtained. The inferred automation is shown to have a number of transitions bounded by O(m) where m is the number of substrings defining this k-TSSL, and the inference algorithm works in O(kn log m) where n is the sum of the lengths of all the strings in R. The proposed methods are illustrated through syntactic pattern recognition experiments in which a number of strings generated by ten given (source) non-k-TSSL grammars are used to infer ten k-TSSL stochastic automata, which are further used to classify new strings generated by the same source grammars. The results of these experiments are consistent with the theory and show the ability of (stochastic) k-TSSLs to approach other classes of regular languages.<>Keywords
This publication has 27 references indexed in Scilit:
- Application of a sequential pattern learning system to connected speech recognitionPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2005
- Local Languages, the Succesor Method, and a Step Towards a General Methodology for the Inference of Regular GrammarsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,1987
- Approximating grammar probabilitiesJournal of the ACM, 1986
- A General Fuzzy-Parsing Scheme for Speech RecognitionPublished by Springer Nature ,1985
- Complexity of automaton identification from given dataInformation and Control, 1978
- Inference of Finite-State Probabilistic GrammarsIEEE Transactions on Computers, 1977
- Grammatical Inference: Introduction and Survey - Part IIEEE Transactions on Systems, Man, and Cybernetics, 1975
- Locally testable languagesJournal of Computer and System Sciences, 1972
- Language identification in the limitInformation and Control, 1967
- Prediction and Entropy of Printed EnglishBell System Technical Journal, 1951