Origin of noncoding DNA sequences: molecular fossils of genome evolution.

AUTOR(ES)
RESUMO

The total amount of noncoding sequences on chromosomes of contemporary organisms varies significantly from species to species. We propose a hypothesis for the origin of these noncoding sequences that assumes that (i) an approximately equal to 0.55-kilobase (kb)-long reading frame composed the primordial gene and (ii) a 20-kb-long single-stranded polynucleotide is the longest molecule (as a genome) that was polymerized at random and without a specific template in the primordial soup/cell. The statistical distribution of stop codons allows examination of the probability of generating reading frames of approximately equal to 0.55 kb in this primordial polynucleotide. This analysis reveals that with three stop codons, a run of at least 0.55-kb equivalent length of nonstop codons would occur in 4.6% of 20-kb-long polynucleotide molecules. We attempt to estimate the total amount of noncoding sequences that would be present on the chromosomes of contemporary species assuming that present-day chromosomes retain the prototype primordial genome structure. Theoretical estimates thus obtained for most eukaryotes do not differ significantly from those reported for these specific organisms, with only a few exceptions. Furthermore, analysis of possible stop-codon distributions suggests that life on earth would not exist, at least in its present form, had two or four stop codons been selected early in evolution.

Documentos Relacionados