Pitfalls of exome sequencing

A case study of the attribution of HABP2 rs7080536 in familial non-medullary thyroid cancer

Glenn S. Gerhard, Darrin V. Bann, James Broach, David Goldenberg

Research output: Contribution to journalReview article

4 Citations (Scopus)

Abstract

Next-generation sequencing using exome capture is a common approach used for analysis of familial cancer syndromes. Despite the development of robust computational algorithms, the accrued experience of analyzing exome data sets and published guidelines, the analytical process remains an ad hoc series of important decisions and interpretations that require significant oversight. Processes and tools used for sequence data generation have matured and are standardized to a significant degree. For the remainder of the analytical pipeline, however, the results can be highly dependent on the choices made and careful review of results. We used primary exome sequence data, generously provided by the corresponding author, from a family with highly penetrant familial non-medullary thyroid cancer reported to be caused by HABP2 rs7080536 to review the importance of several key steps in the application of exome sequencing for discovery of new familial cancer genes. Differences in allele frequencies across populations, probabilities of familial segregation, functional impact predictions, corroborating biological support, and inconsistent replication studies can play major roles in influencing interpretation of results. In the case of HABP2 rs7080536 and familial non-medullary thyroid cancer, these factors led to the conclusion of an association that most data and our re-analysis fail to support, although larger studies from diverse populations will be needed to definitively determine its role.

Original languageEnglish (US)
Article number8
Journalnpj Genomic Medicine
Volume2
Issue number1
DOIs
StatePublished - Dec 1 2017

Fingerprint

Exome
Neoplasm Genes
Gene Frequency
Population
Guidelines
Familial medullary thyroid carcinoma
Neoplasms

All Science Journal Classification (ASJC) codes

  • Genetics
  • Molecular Biology
  • Genetics(clinical)

Cite this

@article{d2c0a87f4388431a953f75a85cd399d9,
title = "Pitfalls of exome sequencing: A case study of the attribution of HABP2 rs7080536 in familial non-medullary thyroid cancer",
abstract = "Next-generation sequencing using exome capture is a common approach used for analysis of familial cancer syndromes. Despite the development of robust computational algorithms, the accrued experience of analyzing exome data sets and published guidelines, the analytical process remains an ad hoc series of important decisions and interpretations that require significant oversight. Processes and tools used for sequence data generation have matured and are standardized to a significant degree. For the remainder of the analytical pipeline, however, the results can be highly dependent on the choices made and careful review of results. We used primary exome sequence data, generously provided by the corresponding author, from a family with highly penetrant familial non-medullary thyroid cancer reported to be caused by HABP2 rs7080536 to review the importance of several key steps in the application of exome sequencing for discovery of new familial cancer genes. Differences in allele frequencies across populations, probabilities of familial segregation, functional impact predictions, corroborating biological support, and inconsistent replication studies can play major roles in influencing interpretation of results. In the case of HABP2 rs7080536 and familial non-medullary thyroid cancer, these factors led to the conclusion of an association that most data and our re-analysis fail to support, although larger studies from diverse populations will be needed to definitively determine its role.",
author = "Gerhard, {Glenn S.} and Bann, {Darrin V.} and James Broach and David Goldenberg",
year = "2017",
month = "12",
day = "1",
doi = "10.1038/s41525-017-0011-x",
language = "English (US)",
volume = "2",
journal = "npj Genomic Medicine",
issn = "2056-7944",
publisher = "Nature Publishing Group",
number = "1",

}

Pitfalls of exome sequencing : A case study of the attribution of HABP2 rs7080536 in familial non-medullary thyroid cancer. / Gerhard, Glenn S.; Bann, Darrin V.; Broach, James; Goldenberg, David.

In: npj Genomic Medicine, Vol. 2, No. 1, 8, 01.12.2017.

Research output: Contribution to journalReview article

TY - JOUR

T1 - Pitfalls of exome sequencing

T2 - A case study of the attribution of HABP2 rs7080536 in familial non-medullary thyroid cancer

AU - Gerhard, Glenn S.

AU - Bann, Darrin V.

AU - Broach, James

AU - Goldenberg, David

PY - 2017/12/1

Y1 - 2017/12/1

N2 - Next-generation sequencing using exome capture is a common approach used for analysis of familial cancer syndromes. Despite the development of robust computational algorithms, the accrued experience of analyzing exome data sets and published guidelines, the analytical process remains an ad hoc series of important decisions and interpretations that require significant oversight. Processes and tools used for sequence data generation have matured and are standardized to a significant degree. For the remainder of the analytical pipeline, however, the results can be highly dependent on the choices made and careful review of results. We used primary exome sequence data, generously provided by the corresponding author, from a family with highly penetrant familial non-medullary thyroid cancer reported to be caused by HABP2 rs7080536 to review the importance of several key steps in the application of exome sequencing for discovery of new familial cancer genes. Differences in allele frequencies across populations, probabilities of familial segregation, functional impact predictions, corroborating biological support, and inconsistent replication studies can play major roles in influencing interpretation of results. In the case of HABP2 rs7080536 and familial non-medullary thyroid cancer, these factors led to the conclusion of an association that most data and our re-analysis fail to support, although larger studies from diverse populations will be needed to definitively determine its role.

AB - Next-generation sequencing using exome capture is a common approach used for analysis of familial cancer syndromes. Despite the development of robust computational algorithms, the accrued experience of analyzing exome data sets and published guidelines, the analytical process remains an ad hoc series of important decisions and interpretations that require significant oversight. Processes and tools used for sequence data generation have matured and are standardized to a significant degree. For the remainder of the analytical pipeline, however, the results can be highly dependent on the choices made and careful review of results. We used primary exome sequence data, generously provided by the corresponding author, from a family with highly penetrant familial non-medullary thyroid cancer reported to be caused by HABP2 rs7080536 to review the importance of several key steps in the application of exome sequencing for discovery of new familial cancer genes. Differences in allele frequencies across populations, probabilities of familial segregation, functional impact predictions, corroborating biological support, and inconsistent replication studies can play major roles in influencing interpretation of results. In the case of HABP2 rs7080536 and familial non-medullary thyroid cancer, these factors led to the conclusion of an association that most data and our re-analysis fail to support, although larger studies from diverse populations will be needed to definitively determine its role.

UR - http://www.scopus.com/inward/record.url?scp=85042295440&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85042295440&partnerID=8YFLogxK

U2 - 10.1038/s41525-017-0011-x

DO - 10.1038/s41525-017-0011-x

M3 - Review article

VL - 2

JO - npj Genomic Medicine

JF - npj Genomic Medicine

SN - 2056-7944

IS - 1

M1 - 8

ER -