The effect of reduction in cross-validation intervals on the performance of multifactor dimensionality reduction

Alison A. Motsinger, Marylyn Deriggi Ritchie

Research output: Contribution to journalArticle

37 Citations (Scopus)

Abstract

Multifactor Dimensionality Reduction (MDR) was developed to detect genetic polymorphisms that present an increased risk of disease. Cross-validation (CV) is an important part of the MDR algorithm, as it prevents over-fitting and allows the predictive ability of a model to be evaluated. CV is a computationally intensive step in the MDR algorithm. Traditionally, MDR has been implemented using 10-fold CV In order to reduce computation time and therefore allow MDR analysis to be applied to larger datasets, we evaluated the possibility of eliminating or reducing the number of CV intervals used for analysis. We found that eliminating CV made final model selection impossible, but that reducing the number of CV intervals from ten to five caused no loss of power, thereby reducing the computation time of the algorithm by half. The validity of this reduction was confirmed with data from an Alzheimer's disease (AD) study.

Original languageEnglish (US)
Pages (from-to)546-555
Number of pages10
JournalGenetic Epidemiology
Volume30
Issue number6
DOIs
StatePublished - Sep 1 2006

Fingerprint

Multifactor Dimensionality Reduction
Aptitude
Genetic Polymorphisms
Alzheimer Disease

All Science Journal Classification (ASJC) codes

  • Epidemiology
  • Genetics(clinical)

Cite this

Motsinger, Alison A. ; Ritchie, Marylyn Deriggi. / The effect of reduction in cross-validation intervals on the performance of multifactor dimensionality reduction. In: Genetic Epidemiology. 2006 ; Vol. 30, No. 6. pp. 546-555.
@article{f01d6790f920489e9fec569e8f5a4fa3,
title = "The effect of reduction in cross-validation intervals on the performance of multifactor dimensionality reduction",
abstract = "Multifactor Dimensionality Reduction (MDR) was developed to detect genetic polymorphisms that present an increased risk of disease. Cross-validation (CV) is an important part of the MDR algorithm, as it prevents over-fitting and allows the predictive ability of a model to be evaluated. CV is a computationally intensive step in the MDR algorithm. Traditionally, MDR has been implemented using 10-fold CV In order to reduce computation time and therefore allow MDR analysis to be applied to larger datasets, we evaluated the possibility of eliminating or reducing the number of CV intervals used for analysis. We found that eliminating CV made final model selection impossible, but that reducing the number of CV intervals from ten to five caused no loss of power, thereby reducing the computation time of the algorithm by half. The validity of this reduction was confirmed with data from an Alzheimer's disease (AD) study.",
author = "Motsinger, {Alison A.} and Ritchie, {Marylyn Deriggi}",
year = "2006",
month = "9",
day = "1",
doi = "10.1002/gepi.20166",
language = "English (US)",
volume = "30",
pages = "546--555",
journal = "Genetic Epidemiology",
issn = "0741-0395",
publisher = "Wiley-Liss Inc.",
number = "6",

}

The effect of reduction in cross-validation intervals on the performance of multifactor dimensionality reduction. / Motsinger, Alison A.; Ritchie, Marylyn Deriggi.

In: Genetic Epidemiology, Vol. 30, No. 6, 01.09.2006, p. 546-555.

Research output: Contribution to journalArticle

TY - JOUR

T1 - The effect of reduction in cross-validation intervals on the performance of multifactor dimensionality reduction

AU - Motsinger, Alison A.

AU - Ritchie, Marylyn Deriggi

PY - 2006/9/1

Y1 - 2006/9/1

N2 - Multifactor Dimensionality Reduction (MDR) was developed to detect genetic polymorphisms that present an increased risk of disease. Cross-validation (CV) is an important part of the MDR algorithm, as it prevents over-fitting and allows the predictive ability of a model to be evaluated. CV is a computationally intensive step in the MDR algorithm. Traditionally, MDR has been implemented using 10-fold CV In order to reduce computation time and therefore allow MDR analysis to be applied to larger datasets, we evaluated the possibility of eliminating or reducing the number of CV intervals used for analysis. We found that eliminating CV made final model selection impossible, but that reducing the number of CV intervals from ten to five caused no loss of power, thereby reducing the computation time of the algorithm by half. The validity of this reduction was confirmed with data from an Alzheimer's disease (AD) study.

AB - Multifactor Dimensionality Reduction (MDR) was developed to detect genetic polymorphisms that present an increased risk of disease. Cross-validation (CV) is an important part of the MDR algorithm, as it prevents over-fitting and allows the predictive ability of a model to be evaluated. CV is a computationally intensive step in the MDR algorithm. Traditionally, MDR has been implemented using 10-fold CV In order to reduce computation time and therefore allow MDR analysis to be applied to larger datasets, we evaluated the possibility of eliminating or reducing the number of CV intervals used for analysis. We found that eliminating CV made final model selection impossible, but that reducing the number of CV intervals from ten to five caused no loss of power, thereby reducing the computation time of the algorithm by half. The validity of this reduction was confirmed with data from an Alzheimer's disease (AD) study.

UR - http://www.scopus.com/inward/record.url?scp=33747613579&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=33747613579&partnerID=8YFLogxK

U2 - 10.1002/gepi.20166

DO - 10.1002/gepi.20166

M3 - Article

VL - 30

SP - 546

EP - 555

JO - Genetic Epidemiology

JF - Genetic Epidemiology

SN - 0741-0395

IS - 6

ER -