The application of principal component analysis to stylometry
- 1 December 1999
- journal article
- Published by Oxford University Press (OUP) in Literary and Linguistic Computing
- Vol. 14 (4) , 445-466
- https://doi.org/10.1093/llc/14.4.445
Abstract
In recent years principal component analysis has become popular for investigations in computational stylistics, particularly for studies of authorship. The mathematical nature of the theory that underpins the method makes it rather inaccessible to linguists and literary scholars. Consequently, confidence in its correct application is diminished. By first restricting the procedure to the use of two marker words, a pictorial description of its operation is derived. Some characteristics of the method are then examined. Finally, in the context of a Shakespearean example the technique is extended to p words, and suggestions are advanced to alleviate possible shortcomings.Keywords
This publication has 0 references indexed in Scilit: