### Abstract

Motivation: Topological entropy has been one of the most difficult to implement of all the entropy-theoretic notions. This is primarily due to finite sample effects and high-dimensionality problems. In particular, topological entropy has been implemented in previous literature to conclude that entropy of exons is higher than of introns, thus implying that exons are more 'random' than introns.Results: We define a new approximation to topological entropy free from the aforementioned difficulties. We compute its expected value and apply this definition to the intron and exon regions of the human genome to observe that as expected, the entropy of introns are significantly higher than that of exons. We also find that introns are less random than expected: their entropy is lower than the computed expected value. We also observe the perplexing phenomena that introns on chromosome Y have atypically low and bimodal entropy, possibly corresponding to random sequences (high entropy) and sequences that posses hidden structure or function (low entropy).

Original language | English (US) |
---|---|

Article number | btr077 |

Pages (from-to) | 1061-1067 |

Number of pages | 7 |

Journal | Bioinformatics |

Volume | 27 |

Issue number | 8 |

DOIs | |

State | Published - Apr 1 2011 |

### Fingerprint

### All Science Journal Classification (ASJC) codes

- Statistics and Probability
- Biochemistry
- Molecular Biology
- Computer Science Applications
- Computational Theory and Mathematics
- Computational Mathematics

### Cite this

*Bioinformatics*,

*27*(8), 1061-1067. [btr077]. https://doi.org/10.1093/bioinformatics/btr077

}

*Bioinformatics*, vol. 27, no. 8, btr077, pp. 1061-1067. https://doi.org/10.1093/bioinformatics/btr077

**Topological entropy of DNA sequences.** / Koslicki, David.

Research output: Contribution to journal › Article

TY - JOUR

T1 - Topological entropy of DNA sequences

AU - Koslicki, David

PY - 2011/4/1

Y1 - 2011/4/1

N2 - Motivation: Topological entropy has been one of the most difficult to implement of all the entropy-theoretic notions. This is primarily due to finite sample effects and high-dimensionality problems. In particular, topological entropy has been implemented in previous literature to conclude that entropy of exons is higher than of introns, thus implying that exons are more 'random' than introns.Results: We define a new approximation to topological entropy free from the aforementioned difficulties. We compute its expected value and apply this definition to the intron and exon regions of the human genome to observe that as expected, the entropy of introns are significantly higher than that of exons. We also find that introns are less random than expected: their entropy is lower than the computed expected value. We also observe the perplexing phenomena that introns on chromosome Y have atypically low and bimodal entropy, possibly corresponding to random sequences (high entropy) and sequences that posses hidden structure or function (low entropy).

AB - Motivation: Topological entropy has been one of the most difficult to implement of all the entropy-theoretic notions. This is primarily due to finite sample effects and high-dimensionality problems. In particular, topological entropy has been implemented in previous literature to conclude that entropy of exons is higher than of introns, thus implying that exons are more 'random' than introns.Results: We define a new approximation to topological entropy free from the aforementioned difficulties. We compute its expected value and apply this definition to the intron and exon regions of the human genome to observe that as expected, the entropy of introns are significantly higher than that of exons. We also find that introns are less random than expected: their entropy is lower than the computed expected value. We also observe the perplexing phenomena that introns on chromosome Y have atypically low and bimodal entropy, possibly corresponding to random sequences (high entropy) and sequences that posses hidden structure or function (low entropy).

UR - http://www.scopus.com/inward/record.url?scp=79954465218&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=79954465218&partnerID=8YFLogxK

U2 - 10.1093/bioinformatics/btr077

DO - 10.1093/bioinformatics/btr077

M3 - Article

C2 - 21317142

AN - SCOPUS:79954465218

VL - 27

SP - 1061

EP - 1067

JO - Bioinformatics

JF - Bioinformatics

SN - 1367-4803

IS - 8

M1 - btr077

ER -