Identification of Recurrent DNA Copy Number Aberrations in Tumors

Vonn Walter, Andrew B. Nobel, D. Neil Hayes, Fred A. Wright

Research output: Chapter in Book/Report/Conference proceedingChapter

Abstract

Genetic mutations and alterations are the hallmark of cancer. When these alterations change the expression or protein product of a gene, increased invasiveness into surrounding tissue can result from unchecked cell cycle progression and improper regulation of cell death. In turn, these can contribute to tumor genesis, development, and expansion. A variety of somatic mutations can occur in tumor tissue, including point mutations, changes in methylation status, and gains and losses of chromosomal regions. Here we will focus primarily on mutations of the last type, which are termed DNA copy number aberrations (CNAs). Many CNAs can arise due to general genomic instability, and occur sporadically in locations throughout the genome. A smaller subset of CNAs appears to be recurrent, occurring repeatedly in the same region across multiple individuals. Recurrent CNAs are thought to be due to regional chromosome structure, or to a selection effect in which gain or loss of important regions leads to increased tumor growth rate. The identification of true recurrent CNAs is important, because these regions may play a role in the initiation and progression of tumors, perhaps even highlighting individual genes for further study or targeted treatment. The detection of recurrent CNAs is largely a statistical problem, and a number of methods have been proposed to address this problem. In this chapter, we survey several methods for analyzing DNA copy number data in tumors, with the DiNAMIC approach [1] described in some detail. The nomenclature in the literature on DNA copy number mutations has sometimes been inconsistent, so we begin by providing relevant definitions. Next we discuss the biological changes that can lead to alterations in tumor DNA copy number, as well as how tumors can result from these changes. The analysis of DNA copy number relies crucially on genomic technologies, and we survey platforms for assaying copy number, noting some of the challenges associated with these data. In Sections 13.3 and 13.4, we survey some of the available methods for analyzing DNA copy number data. Methods for detecting recurrent CNAs share common features including computation of summary statistics in genomic regions, use of resampling in order to create "null" distributions, and adjustments for multiple comparisons. Some of these methods require specific preprocessing steps, and we describe these as well. Section 13.5 is devoted to DiNAMIC. This method uses a novel permutation scheme called cyclic shift to compute its null distribution, and we describe comparisons to other permutation schemes. Although some of the issues may seem technical, the simulation results of Walter et al. [1] suggest that DiNAMIC's cyclic shift procedure is attractive in comparison to other permutation schemes, and leads to proper control of error rates under a variety of realistic marker correlation structures. We conclude by introducing a confidence interval procedure for recurrent CNAs [2]. Publicly available tumor datasets were analyzed with DiNAMIC and the confidence interval procedure, and the results briefly surveyed here and in [2] have underlying biological support.

Original languageEnglish (US)
Title of host publicationStatistical Diagnostics for Cancer
Subtitle of host publicationAnalyzing High-Dimensional Data
PublisherWiley-VCH
Pages239-260
Number of pages22
Volume3
ISBN (Print)9783527332625
DOIs
StatePublished - Apr 8 2013

Fingerprint

Aberrations
Tumors
DNA
Neoplasms
Genes
Mutation
Tissue
Confidence Intervals
Chromosome Structures
Methylation
Cell death
Terminology
Chromosomes
Genomic Instability
Point Mutation
Cells
Cell Cycle
Statistics
Cell Death
Genome

All Science Journal Classification (ASJC) codes

  • Biochemistry, Genetics and Molecular Biology(all)

Cite this

Walter, V., Nobel, A. B., Hayes, D. N., & Wright, F. A. (2013). Identification of Recurrent DNA Copy Number Aberrations in Tumors. In Statistical Diagnostics for Cancer: Analyzing High-Dimensional Data (Vol. 3, pp. 239-260). Wiley-VCH. https://doi.org/10.1002/9783527665471.ch13
Walter, Vonn ; Nobel, Andrew B. ; Hayes, D. Neil ; Wright, Fred A. / Identification of Recurrent DNA Copy Number Aberrations in Tumors. Statistical Diagnostics for Cancer: Analyzing High-Dimensional Data. Vol. 3 Wiley-VCH, 2013. pp. 239-260
@inbook{0ed1a96d46e7481aaeb9743638747914,
title = "Identification of Recurrent DNA Copy Number Aberrations in Tumors",
abstract = "Genetic mutations and alterations are the hallmark of cancer. When these alterations change the expression or protein product of a gene, increased invasiveness into surrounding tissue can result from unchecked cell cycle progression and improper regulation of cell death. In turn, these can contribute to tumor genesis, development, and expansion. A variety of somatic mutations can occur in tumor tissue, including point mutations, changes in methylation status, and gains and losses of chromosomal regions. Here we will focus primarily on mutations of the last type, which are termed DNA copy number aberrations (CNAs). Many CNAs can arise due to general genomic instability, and occur sporadically in locations throughout the genome. A smaller subset of CNAs appears to be recurrent, occurring repeatedly in the same region across multiple individuals. Recurrent CNAs are thought to be due to regional chromosome structure, or to a selection effect in which gain or loss of important regions leads to increased tumor growth rate. The identification of true recurrent CNAs is important, because these regions may play a role in the initiation and progression of tumors, perhaps even highlighting individual genes for further study or targeted treatment. The detection of recurrent CNAs is largely a statistical problem, and a number of methods have been proposed to address this problem. In this chapter, we survey several methods for analyzing DNA copy number data in tumors, with the DiNAMIC approach [1] described in some detail. The nomenclature in the literature on DNA copy number mutations has sometimes been inconsistent, so we begin by providing relevant definitions. Next we discuss the biological changes that can lead to alterations in tumor DNA copy number, as well as how tumors can result from these changes. The analysis of DNA copy number relies crucially on genomic technologies, and we survey platforms for assaying copy number, noting some of the challenges associated with these data. In Sections 13.3 and 13.4, we survey some of the available methods for analyzing DNA copy number data. Methods for detecting recurrent CNAs share common features including computation of summary statistics in genomic regions, use of resampling in order to create {"}null{"} distributions, and adjustments for multiple comparisons. Some of these methods require specific preprocessing steps, and we describe these as well. Section 13.5 is devoted to DiNAMIC. This method uses a novel permutation scheme called cyclic shift to compute its null distribution, and we describe comparisons to other permutation schemes. Although some of the issues may seem technical, the simulation results of Walter et al. [1] suggest that DiNAMIC's cyclic shift procedure is attractive in comparison to other permutation schemes, and leads to proper control of error rates under a variety of realistic marker correlation structures. We conclude by introducing a confidence interval procedure for recurrent CNAs [2]. Publicly available tumor datasets were analyzed with DiNAMIC and the confidence interval procedure, and the results briefly surveyed here and in [2] have underlying biological support.",
author = "Vonn Walter and Nobel, {Andrew B.} and Hayes, {D. Neil} and Wright, {Fred A.}",
year = "2013",
month = "4",
day = "8",
doi = "10.1002/9783527665471.ch13",
language = "English (US)",
isbn = "9783527332625",
volume = "3",
pages = "239--260",
booktitle = "Statistical Diagnostics for Cancer",
publisher = "Wiley-VCH",

}

Walter, V, Nobel, AB, Hayes, DN & Wright, FA 2013, Identification of Recurrent DNA Copy Number Aberrations in Tumors. in Statistical Diagnostics for Cancer: Analyzing High-Dimensional Data. vol. 3, Wiley-VCH, pp. 239-260. https://doi.org/10.1002/9783527665471.ch13

Identification of Recurrent DNA Copy Number Aberrations in Tumors. / Walter, Vonn; Nobel, Andrew B.; Hayes, D. Neil; Wright, Fred A.

Statistical Diagnostics for Cancer: Analyzing High-Dimensional Data. Vol. 3 Wiley-VCH, 2013. p. 239-260.

Research output: Chapter in Book/Report/Conference proceedingChapter

TY - CHAP

T1 - Identification of Recurrent DNA Copy Number Aberrations in Tumors

AU - Walter, Vonn

AU - Nobel, Andrew B.

AU - Hayes, D. Neil

AU - Wright, Fred A.

PY - 2013/4/8

Y1 - 2013/4/8

N2 - Genetic mutations and alterations are the hallmark of cancer. When these alterations change the expression or protein product of a gene, increased invasiveness into surrounding tissue can result from unchecked cell cycle progression and improper regulation of cell death. In turn, these can contribute to tumor genesis, development, and expansion. A variety of somatic mutations can occur in tumor tissue, including point mutations, changes in methylation status, and gains and losses of chromosomal regions. Here we will focus primarily on mutations of the last type, which are termed DNA copy number aberrations (CNAs). Many CNAs can arise due to general genomic instability, and occur sporadically in locations throughout the genome. A smaller subset of CNAs appears to be recurrent, occurring repeatedly in the same region across multiple individuals. Recurrent CNAs are thought to be due to regional chromosome structure, or to a selection effect in which gain or loss of important regions leads to increased tumor growth rate. The identification of true recurrent CNAs is important, because these regions may play a role in the initiation and progression of tumors, perhaps even highlighting individual genes for further study or targeted treatment. The detection of recurrent CNAs is largely a statistical problem, and a number of methods have been proposed to address this problem. In this chapter, we survey several methods for analyzing DNA copy number data in tumors, with the DiNAMIC approach [1] described in some detail. The nomenclature in the literature on DNA copy number mutations has sometimes been inconsistent, so we begin by providing relevant definitions. Next we discuss the biological changes that can lead to alterations in tumor DNA copy number, as well as how tumors can result from these changes. The analysis of DNA copy number relies crucially on genomic technologies, and we survey platforms for assaying copy number, noting some of the challenges associated with these data. In Sections 13.3 and 13.4, we survey some of the available methods for analyzing DNA copy number data. Methods for detecting recurrent CNAs share common features including computation of summary statistics in genomic regions, use of resampling in order to create "null" distributions, and adjustments for multiple comparisons. Some of these methods require specific preprocessing steps, and we describe these as well. Section 13.5 is devoted to DiNAMIC. This method uses a novel permutation scheme called cyclic shift to compute its null distribution, and we describe comparisons to other permutation schemes. Although some of the issues may seem technical, the simulation results of Walter et al. [1] suggest that DiNAMIC's cyclic shift procedure is attractive in comparison to other permutation schemes, and leads to proper control of error rates under a variety of realistic marker correlation structures. We conclude by introducing a confidence interval procedure for recurrent CNAs [2]. Publicly available tumor datasets were analyzed with DiNAMIC and the confidence interval procedure, and the results briefly surveyed here and in [2] have underlying biological support.

AB - Genetic mutations and alterations are the hallmark of cancer. When these alterations change the expression or protein product of a gene, increased invasiveness into surrounding tissue can result from unchecked cell cycle progression and improper regulation of cell death. In turn, these can contribute to tumor genesis, development, and expansion. A variety of somatic mutations can occur in tumor tissue, including point mutations, changes in methylation status, and gains and losses of chromosomal regions. Here we will focus primarily on mutations of the last type, which are termed DNA copy number aberrations (CNAs). Many CNAs can arise due to general genomic instability, and occur sporadically in locations throughout the genome. A smaller subset of CNAs appears to be recurrent, occurring repeatedly in the same region across multiple individuals. Recurrent CNAs are thought to be due to regional chromosome structure, or to a selection effect in which gain or loss of important regions leads to increased tumor growth rate. The identification of true recurrent CNAs is important, because these regions may play a role in the initiation and progression of tumors, perhaps even highlighting individual genes for further study or targeted treatment. The detection of recurrent CNAs is largely a statistical problem, and a number of methods have been proposed to address this problem. In this chapter, we survey several methods for analyzing DNA copy number data in tumors, with the DiNAMIC approach [1] described in some detail. The nomenclature in the literature on DNA copy number mutations has sometimes been inconsistent, so we begin by providing relevant definitions. Next we discuss the biological changes that can lead to alterations in tumor DNA copy number, as well as how tumors can result from these changes. The analysis of DNA copy number relies crucially on genomic technologies, and we survey platforms for assaying copy number, noting some of the challenges associated with these data. In Sections 13.3 and 13.4, we survey some of the available methods for analyzing DNA copy number data. Methods for detecting recurrent CNAs share common features including computation of summary statistics in genomic regions, use of resampling in order to create "null" distributions, and adjustments for multiple comparisons. Some of these methods require specific preprocessing steps, and we describe these as well. Section 13.5 is devoted to DiNAMIC. This method uses a novel permutation scheme called cyclic shift to compute its null distribution, and we describe comparisons to other permutation schemes. Although some of the issues may seem technical, the simulation results of Walter et al. [1] suggest that DiNAMIC's cyclic shift procedure is attractive in comparison to other permutation schemes, and leads to proper control of error rates under a variety of realistic marker correlation structures. We conclude by introducing a confidence interval procedure for recurrent CNAs [2]. Publicly available tumor datasets were analyzed with DiNAMIC and the confidence interval procedure, and the results briefly surveyed here and in [2] have underlying biological support.

UR - http://www.scopus.com/inward/record.url?scp=84888665754&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84888665754&partnerID=8YFLogxK

U2 - 10.1002/9783527665471.ch13

DO - 10.1002/9783527665471.ch13

M3 - Chapter

AN - SCOPUS:84888665754

SN - 9783527332625

VL - 3

SP - 239

EP - 260

BT - Statistical Diagnostics for Cancer

PB - Wiley-VCH

ER -

Walter V, Nobel AB, Hayes DN, Wright FA. Identification of Recurrent DNA Copy Number Aberrations in Tumors. In Statistical Diagnostics for Cancer: Analyzing High-Dimensional Data. Vol. 3. Wiley-VCH. 2013. p. 239-260 https://doi.org/10.1002/9783527665471.ch13