Models for power law relations in linguistics and information science
- 1 May 1998
- journal article
- research article
- Published by Taylor & Francis in Journal of Quantitative Linguistics
- Vol. 5 (1-2) , 35-61
- https://doi.org/10.1080/09296179808590110
Abstract
Two well‐established robust laws in behavioural sciences are Zipf's law in Linguistics, and Bradford's Law in Informetrics. Both are similar power‐law functions. In an earlier work (1992) the authors developed an information theoretic model for Zipf's law based on the classical Shannon theory. A newer version, based on algorithmic information theory, was proposed in 1996. These two models are now extended to Bradford's law. The meaning of a discourse, ignored in Shannon's theory, is given special significance in the algorithmic information theory for language discourses and Zipf's law is shown to be a consequence of an optimum meaning‐preserving code of the discourse. Such a code exhibits characteristics of complex adaptive systems ‐ a mixture of elements of order and randomness. A complexity function was defined for a discourse, which is maximum for a state intermediate between order and disorder, and it was shown that it is nearly maximal for natural discourses. A general discussion of power law distributions reveals the uniqueness of Zipf's law as the only one associated with an optimal meaning‐preserving code. Power laws are a natural consequence of scale invariance; an elementary mathematical treatment is presented. Functions more complicated than a simple power law also show scale invariance and one such function may in fact conform to some observations. Some general comments about the nature of scientific laws conclude with a suggestion that Zipf's law may indeed qualify for the title ‘mathematical law’.Keywords
This publication has 32 references indexed in Scilit:
- Quantitative linguistics and complex system studies*Journal of Quantitative Linguistics, 1996
- Power laws and universalityNature, 1995
- Informetric distributions, part I: Unified overviewJournal of the American Society for Information Science, 1990
- Algorithmic randomness and physical entropyPhysical Review A, 1989
- TOWARDS INFORMETRICS: HAITUN, LAPLACE, ZIPF, BRADFORD AND THE ALVEY PROGRAMMEJournal of Documentation, 1984
- POWER LAW RELATIONS IN SCIENCE BIBLIOGRAPHY—A SELF‐CONSISTENT INTERPRETATIONJournal of Documentation, 1971
- Programming languages in mechanized documentationJournal of Documentation, 1971
- Bradford's Law of Bibliography of Science: an InterpretationNature, 1970
- Bradford's Law and Library AcquisitionsNature, 1970
- THE RELATION BETWEEN THE DICTIONARY DISTRIBUTION AND THE OCCURRENCE DISTRIBUTION OF WORD LENGTH AND ITS IMPORTANCE FOR THE STUDY OF QUANTITATIVE LINGUISTICSBiometrika, 1958