Linguistic approaches to the analysis of sequence information