Hidden Markov models of biological primary sequence information.

Baldi, P

Hidden Markov models of biological primary sequence information.

AUTOR(ES)

Baldi, P

RESUMO

Hidden Markov model (HMM) techniques are used to model families of biological sequences. A smooth and convergent algorithm is introduced to iteratively adapt the transition and emission parameters of the models from the examples in a given family. The HMM approach is applied to three protein families: globins, immunoglobulins, and kinases. In all cases, the models derived capture the important statistical characteristics of the family and can be used for a number of tasks, including multiple alignments, motif detection, and classification. For K sequences of average length N, this approach yields an effective multiple-alignment algorithm which requires O(KN2) operations, linear in the number of sequences.

ACESSO AO ARTIGO

http://www.pubmedcentral.nih.gov/articlerender.fcgi?artid=521453

Documentos Relacionados

Hidden Markov models from molecular dynamics simulations on DNA
Mining Bacillus subtilis chromosome heterogeneities using hidden Markov models
Hidden Markov models applied to a subsequence of the Xylella fastidiosa genome
Preparation of name and address data for record linkage using hidden Markov models
PrediÃÃo de palavras baseada em modelos ocultos de Markov