UNIX Time-Sharing System: Statistical Text Processing
- 8 July 1978
- journal article
- Published by Institute of Electrical and Electronics Engineers (IEEE) in Bell System Technical Journal
- Vol. 57 (6) , 2137-2154
- https://doi.org/10.1002/j.1538-7305.1978.tb02146.x
Abstract
Several studies of the statistical properties of English text have used the UNIX system and UNIX programming tools. This paper describes several of the useful UNIX facilities for statistical studies and summarizes some studies that have been made at the character level, the character-string level, and the level of English words. The descriptions give a sample of the results obtained and constitute a short introduction, by case-study, on how to use UNIX tools for studying the statistics of English.
This publication has 1 reference indexed in Scilit:
- Computer detection of typographical errors. IEEE Transactions on Dependable and Secure Computing, 1975