VNTR allele frequency distributions under the stepwise mutation model

A computer simulation approach

Mark Shriver, L. Jin, R. Chakraborty, E. Boerwinkle

Research output: Contribution to journal (Article)

269 Citations (Scopus)

Abstract

Variable numbers of tandem repeats (VNTRs) are a class of highly informative and widely dispersed genetic markers. Despite their wide application in biological science, little is known about their mutational mechanisms or population dynamics. The objective of this work was to investigate four summary measures of VNTR allele frequency distributions (number of alleles, number of modes, range in allele size, and heterozygosity) using computer simulations of the one-step stepwise mutation model (SMM). We estimated these measures and their probability distributions for a wide range of mutation rates and compared the simulation results with predictions from analytical formulations of the one-step SMM. The average heterozygosity from the simulations agreed with the analytical expectation under the SMM. The average number of alleles, however, was larger in the simulations than the analytical expectation of the SMM. We then compared our simulation expectations with actual data reported in the literature. We used the sample size and observed heterozygosity to determine the expected value and the 5th and 95th percentiles for the other three summary measures: allelic size range, number of modes, and number of alleles. The loci analyzed were classified into three groups based on the size of the repeat unit: microsatellites (1-2 base pair (bp) repeat unit), short tandem repeats (STRs; 3-5 bp repeat unit), and minisatellites (15-70 bp repeat unit). In general, STR loci were most similar to the simulation results under the SMM for the three summary measures (number of alleles, number of modes, and range in allele size), followed by the microsatellite loci and then by the minisatellite loci, which showed deviations in the direction of the infinite allele model (IAM). Based on these differences, we hypothesize that these three classes of loci are subject to different mutational forces.
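To make the simulation idea in the abstract concrete, the following is a minimal sketch of a one-step SMM simulation and of the four summary measures. It is not the authors' program: the Wright-Fisher resampling scheme, the function names (`simulate_smm`, `summary_measures`, `expected_heterozygosity_smm`), the default parameter values, and the definition of a mode as a local peak in the allele-size frequency distribution are all assumptions made for illustration. The analytical comparison uses the commonly cited Ohta-Kimura expectation for the one-step SMM, H = 1 - 1/sqrt(1 + 8N\mu) for a diploid population of size N, which may differ in detail from the formulations used in the paper.

```python
import math
import random
from collections import Counter


def simulate_smm(n_genes=200, mu=0.001, generations=2000, seed=0):
    """Wright-Fisher population of n_genes gene copies under a one-step SMM.

    Alleles are integer repeat counts. Each generation the gene copies are
    resampled with replacement (drift); each copy then mutates with
    probability mu, gaining or losing one repeat with equal probability.
    All defaults are illustrative assumptions.
    """
    rng = random.Random(seed)
    pop = [0] * n_genes  # monomorphic start; 0 is an arbitrary reference size
    for _ in range(generations):
        pop = rng.choices(pop, k=n_genes)
        pop = [a + rng.choice((-1, 1)) if rng.random() < mu else a for a in pop]
    return pop


def summary_measures(sample):
    """Four summary measures for a sample (at least 2) of allele sizes."""
    counts = Counter(sample)
    n = len(sample)
    sizes = sorted(counts)
    # Unbiased gene diversity: n/(n-1) * (1 - sum of squared frequencies).
    het = n / (n - 1) * (1 - sum((c / n) ** 2 for c in counts.values()))
    # Counts for every size class in the observed range, zero-padded at both ends.
    freqs = [0] + [counts.get(s, 0) for s in range(sizes[0], sizes[-1] + 1)] + [0]
    # Collapse runs of equal counts so that a plateau counts as a single peak.
    flat = [f for i, f in enumerate(freqs) if i == 0 or f != freqs[i - 1]]
    # Assumed definition: a mode is a class more frequent than both neighbours.
    n_modes = sum(
        1 for i in range(1, len(flat) - 1) if flat[i - 1] < flat[i] > flat[i + 1]
    )
    return {
        "n_alleles": len(counts),
        "n_modes": n_modes,
        "size_range": sizes[-1] - sizes[0],  # in repeat units
        "heterozygosity": het,
    }


def expected_heterozygosity_smm(n_genes, mu):
    """Ohta-Kimura expectation, with n_genes = 2N so that 8*N*mu = 4*n_genes*mu."""
    return 1 - 1 / math.sqrt(1 + 4 * n_genes * mu)


if __name__ == "__main__":
    population = simulate_smm()
    sample = random.Random(1).sample(population, 100)
    print(summary_measures(sample))
    print(expected_heterozygosity_smm(200, 0.001))
```

A single run gives one realization of the measures; approximating their probability distributions, as the abstract describes, would require repeating the simulation over many seeds and a range of mutation rates.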

Original language: English (US)
Pages (from-to): 983-993
Number of pages: 11
Journal: Genetics
Volume: 134
Issue number: 3
State: Published - Jan 1 1993

Fingerprint

Minisatellite Repeats
Gene Frequency
Computer Simulation
Alleles
Mutation
Microsatellite Repeats
Base Pairing
Biological Science Disciplines
Population Dynamics
Mutation Rate
Genetic Markers
Sample Size

All Science Journal Classification (ASJC) codes

  • Genetics

Cite this

Shriver, Mark; Jin, L.; Chakraborty, R.; Boerwinkle, E. / VNTR allele frequency distributions under the stepwise mutation model: A computer simulation approach. In: Genetics. 1993; Vol. 134, No. 3. pp. 983-993.
@article{a75c0db5bc244b63be635b3e706c57bb,
title = "VNTR allele frequency distributions under the stepwise mutation model: A computer simulation approach",
author = "Mark Shriver and L. Jin and R. Chakraborty and E. Boerwinkle",
year = "1993",
month = "1",
day = "1",
language = "English (US)",
volume = "134",
pages = "983--993",
journal = "Genetics",
issn = "0016-6731",
publisher = "Genetics Society of America",
number = "3",

}

TY - JOUR

T1 - VNTR allele frequency distributions under the stepwise mutation model

T2 - A computer simulation approach

AU - Shriver, Mark

AU - Jin, L.

AU - Chakraborty, R.

AU - Boerwinkle, E.

PY - 1993/1/1

Y1 - 1993/1/1

UR - http://www.scopus.com/inward/record.url?scp=0027229993&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0027229993&partnerID=8YFLogxK

M3 - Article

VL - 134

SP - 983

EP - 993

JO - Genetics

JF - Genetics

SN - 0016-6731

IS - 3

ER -