An approach to identify over-represented cis-elements in related sequences
AUTOR(ES)
Zheng, Jiashun
FONTE
Oxford University Press
RESUMO
Computational identification of transcription factor binding sites is an important research area of computational biology. Positional weight matrix (PWM) is a model to describe the sequence pattern of binding sites. Usually, transcription factor binding sites prediction methods based on PWMs require user-defined thresholds. The arbitrary threshold and also the relatively low specificity of the algorithm prevent the result of such an analysis from being properly interpreted. In this study, a method was developed to identify over-represented cis-elements with PWM-based similarity scores. Three sets of closely related promoters were analyzed, and only over- represented motifs with high PWM similarity scores were reported. The thresholds to evaluate the similarity scores to the PWMs of putative transcription factors binding sites can also be automatically determined during the analysis, which can also be used in further research with the same PWMs. The online program is available on the website: http://www.bioinfo.tsinghua.edu.cn/∼zhengjsh/OTFBS/.
ACESSO AO ARTIGO
http://www.pubmedcentral.nih.gov/articlerender.fcgi?artid=152803Documentos Relacionados
- Statistical analysis of over-represented words in human promoter sequences
- Complex cis-elements determine an RNA editing site in pea mitochondria
- Long W tracts are over-represented in the Escherichia coli and Haemophilus influenzae genomes.
- Silencer binding proteins function on multiple cis-elements in the glutathione transferase P gene.
- Transcriptional Similarities, Dissimilarities, and Conservation of cis-Elements in Duplicated Genes of Arabidopsis1[w]