Comparative statistics for DNA and protein sequences: multiple sequence analysis.

AUTOR(ES)
RESUMO

Concepts and methods [Karlin, S. & Ghandour, G. (1985) Proc. Natl. Acad. Sci. USA 82, 5800-5804] for the analysis of patterns and relationships are extended to multiple DNA and protein sequences. Functionals include multiple sequence common word occurrence distributions, characterizations of high frequency shared words, and ascertainment of long block identities. Various comparisons of sequences using natural alphabets obtained from grouping nucleotides or amino acids by their chemical and functional characteristics are described. Specific applications are given to globin genes, mitochondrial genomes, and a variety of mammalian viruses.

Documentos Relacionados