ProbeMatch: rapid alignment of oligonucleotides to genome allowing both gaps and mismatches
AUTHOR(S)
Jung Kim, You
SOURCE
Oxford University Press
ABSTRACT
Summary: We have developed ProbeMatch, a tool for matching a large set of oligonucleotide sequences against a genome database using gapped alignments. Unlike most existing tools, such as ELAND, which perform only ungapped alignments with at most two mismatches, ProbeMatch generates both ungapped and gapped alignments with up to three errors, counting insertions, deletions and mismatches. To speed up sequence alignment, ProbeMatch uses gapped q-grams and q-grams of various patterns to identify target hits for a query sequence. This approach leaves fewer initial sequences to examine with no loss in sensitivity. ProbeMatch was used to align against the human genome 169 095 Illumina GAII reads that ELAND could not map, and it found alignments for 28 625 of these reads in less than 3 h.
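The filter-then-verify idea described in the abstract can be illustrated with a minimal Python sketch. This is not ProbeMatch's implementation: the shape string "1101", the function names and the toy sequences are all hypothetical, and a single shape like this one does not guarantee lossless filtering (the abstract attributes that property to the tool's combination of several q-gram patterns). The sketch indexes the genome by one gapped q-gram shape, collects candidate start positions for a query, and keeps those where the query aligns with at most three edits.

```python
from collections import defaultdict

def gapped_qgram(seq, start, shape):
    """Characters of seq under the '1' positions of shape, or None if the shape overruns seq."""
    if start + len(shape) > len(seq):
        return None
    return "".join(seq[start + i] for i, c in enumerate(shape) if c == "1")

def build_index(genome, shape):
    """Map each gapped q-gram to the genome positions where it occurs."""
    index = defaultdict(list)
    for pos in range(len(genome) - len(shape) + 1):
        index[gapped_qgram(genome, pos, shape)].append(pos)
    return index

def best_edits_in_window(query, window):
    """Fewest edits (mismatch, insertion, deletion) aligning all of query to any substring of window."""
    prev = [0] * (len(window) + 1)          # the alignment may start anywhere in the window
    for i, qc in enumerate(query, 1):
        cur = [i] + [0] * len(window)
        for j, wc in enumerate(window, 1):
            cur[j] = min(prev[j - 1] + (qc != wc),  # match or mismatch
                         prev[j] + 1,               # query base with no genome partner
                         cur[j - 1] + 1)            # genome base with no query partner
        prev = cur
    return min(prev)                         # the alignment may end anywhere in the window

def find_hits(query, genome, index, shape, max_errors=3):
    """Return {candidate_start: edits} for candidates that verify within max_errors."""
    starts = set()
    for j in range(len(query) - len(shape) + 1):
        for p in index.get(gapped_qgram(query, j, shape), ()):
            starts.add(p - j)                # diagonal implied by the seed match
    hits = {}
    for s in sorted(starts):
        lo = max(0, s - max_errors)
        hi = min(len(genome), s + len(query) + max_errors)
        d = best_edits_in_window(query, genome[lo:hi])
        if d <= max_errors:
            hits[s] = d
    return hits

if __name__ == "__main__":
    shape = "1101"                           # hypothetical shape: positions 0, 1 and 3 must match
    genome = "TTGACCATGCGTAAGCTAGGCTTACGATCGGATACCTGAA"
    index = build_index(genome, shape)
    read = "GTAAGTTAGGCTTAGATCG"             # genome[10:30] with one mismatch and one deleted base
    print(find_hits(read, genome, index, shape))
```

Nearby candidate starts can verify against overlapping windows, so a real tool would merge them into one reported alignment; the sketch leaves them separate for brevity.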
ARTICLE ACCESS
http://www.pubmedcentral.nih.gov/articlerender.fcgi?artid=2682521
RELATED DOCUMENTS
- Mind the gaps: Progress in progressive alignment
- Automated Whole-Genome Multiple Alignment of Rat, Mouse, and Human
- 1H NMR determination of base-pair lifetimes in oligonucleotides containing single base mismatches
- Rapid detection, classification and accurate alignment of up to a million or more related protein sequences
- Frequent oligonucleotides and peptides of the Haemophilus influenzae genome.