Languages cool as they expand: Allometric scaling and the decreasing need for new words
Top Cited Papers
Open Access
- 10 December 2012
- journal article
- research article
- Published by Springer Nature in Scientific Reports
- Vol. 2 (1) , 943
- https://doi.org/10.1038/srep00943
Abstract
We analyze the occurrence frequencies of over 15 million words recorded in millions of books published during the past two centuries in seven different languages. For all languages and chronological subsets of the data we confirm that two scaling regimes characterize the word frequency distributions, with only the more common words obeying the classic Zipf law. Using corpora of unprecedented size, we test the allometric scaling relation between the corpus size and the vocabulary size of growing languages to demonstrate a decreasing marginal need for new words, a feature that is likely related to the underlying correlations between words. We calculate the annual growth fluctuations of word use which has a decreasing trend as the corpus size increases, indicating a slowdown in linguistic evolution following language expansion. This “cooling pattern” forms the basis of a third statistical regularity, which unlike the Zipf and the Heaps law, is dynamical in nature.All Related Versions
This publication has 52 references indexed in Scilit:
- Evolution of the most common English words and phrases over the centuriesJournal of The Royal Society Interface, 2012
- On the origin of long-range correlations in textsProceedings of the National Academy of Sciences, 2012
- Statistical Laws Governing Fluctuations in Word Use from Word Birth to Word DeathScientific Reports, 2012
- Culturomics meets random fractal theory: insights into long-range correlations of social and natural phenomena over the past two centuriesJournal of The Royal Society Interface, 2012
- Quantitative Analysis of Culture Using Millions of Digitized BooksScience, 2011
- Computational Social ScienceScience, 2009
- The size variance relationship of business firm growth ratesProceedings of the National Academy of Sciences, 2008
- Laws of population growthProceedings of the National Academy of Sciences, 2008
- Growth, innovation, scaling, and the pace of life in citiesProceedings of the National Academy of Sciences, 2007
- Hierarchical structures induce long-range dynamical correlations in written textsProceedings of the National Academy of Sciences, 2006