Comparing methods for detecting multilocus adaptation with multivariate genotype–environment associations

Brenna R. Forester, Jesse R. Lasky, Helene H. Wagner, Dean L. Urban

Research output: Contribution to journalArticle

26 Citations (Scopus)

Abstract

Identifying adaptive loci can provide insight into the mechanisms underlying local adaptation. Genotype–environment association (GEA) methods, which identify these loci based on correlations between genetic and environmental data, are particularly promising. Univariate methods have dominated GEA, despite the high dimensional nature of genotype and environment. Multivariate methods, which analyse many loci simultaneously, may be better suited to these data as they consider how sets of markers covary in response to environment. These methods may also be more effective at detecting adaptive processes that result in weak, multilocus signatures. Here, we evaluate four multivariate methods and five univariate and differentiation-based approaches, using published simulations of multilocus selection. We found that Random Forest performed poorly for GEA. Univariate GEAs performed better, but had low detection rates for loci under weak selection. Constrained ordinations, particularly redundancy analysis (RDA), showed a superior combination of low false-positive and high true-positive rates across all levels of selection. These results were robust across the demographic histories, sampling designs, sample sizes and weak population structure tested here. The value of combining detections from different methods was variable and depended on the study goals and knowledge of the drivers of selection. Re-analysis of genomic data from grey wolves highlighted the unique, covarying sets of adaptive loci that could be identified using RDA. Although additional testing is needed, this study indicates that RDA is an effective means of detecting adaptation, including signatures of weak, multilocus selection, providing a powerful tool for investigating the genetic basis of local adaptation.

Original languageEnglish (US)
Pages (from-to)2215-2233
Number of pages19
JournalMolecular ecology
Volume27
Issue number9
DOIs
StatePublished - May 1 2018

Fingerprint

loci
local adaptation
methodology
demographic history
Canis lupus
ordination
Sample Size
population structure
method
genomics
genotype
demographic statistics
Genotype
Demography
sampling
history
analysis
Population
simulation
testing

All Science Journal Classification (ASJC) codes

  • Ecology, Evolution, Behavior and Systematics
  • Genetics

Cite this

Forester, Brenna R. ; Lasky, Jesse R. ; Wagner, Helene H. ; Urban, Dean L. / Comparing methods for detecting multilocus adaptation with multivariate genotype–environment associations. In: Molecular ecology. 2018 ; Vol. 27, No. 9. pp. 2215-2233.
@article{17d52f9908704558a9072112ac12a6ea,
title = "Comparing methods for detecting multilocus adaptation with multivariate genotype–environment associations",
abstract = "Identifying adaptive loci can provide insight into the mechanisms underlying local adaptation. Genotype–environment association (GEA) methods, which identify these loci based on correlations between genetic and environmental data, are particularly promising. Univariate methods have dominated GEA, despite the high dimensional nature of genotype and environment. Multivariate methods, which analyse many loci simultaneously, may be better suited to these data as they consider how sets of markers covary in response to environment. These methods may also be more effective at detecting adaptive processes that result in weak, multilocus signatures. Here, we evaluate four multivariate methods and five univariate and differentiation-based approaches, using published simulations of multilocus selection. We found that Random Forest performed poorly for GEA. Univariate GEAs performed better, but had low detection rates for loci under weak selection. Constrained ordinations, particularly redundancy analysis (RDA), showed a superior combination of low false-positive and high true-positive rates across all levels of selection. These results were robust across the demographic histories, sampling designs, sample sizes and weak population structure tested here. The value of combining detections from different methods was variable and depended on the study goals and knowledge of the drivers of selection. Re-analysis of genomic data from grey wolves highlighted the unique, covarying sets of adaptive loci that could be identified using RDA. Although additional testing is needed, this study indicates that RDA is an effective means of detecting adaptation, including signatures of weak, multilocus selection, providing a powerful tool for investigating the genetic basis of local adaptation.",
author = "Forester, {Brenna R.} and Lasky, {Jesse R.} and Wagner, {Helene H.} and Urban, {Dean L.}",
year = "2018",
month = "5",
day = "1",
doi = "10.1111/mec.14584",
language = "English (US)",
volume = "27",
pages = "2215--2233",
journal = "Molecular Ecology",
issn = "0962-1083",
publisher = "Wiley-Blackwell",
number = "9",

}

Comparing methods for detecting multilocus adaptation with multivariate genotype–environment associations. / Forester, Brenna R.; Lasky, Jesse R.; Wagner, Helene H.; Urban, Dean L.

In: Molecular ecology, Vol. 27, No. 9, 01.05.2018, p. 2215-2233.

Research output: Contribution to journalArticle

TY - JOUR

T1 - Comparing methods for detecting multilocus adaptation with multivariate genotype–environment associations

AU - Forester, Brenna R.

AU - Lasky, Jesse R.

AU - Wagner, Helene H.

AU - Urban, Dean L.

PY - 2018/5/1

Y1 - 2018/5/1

N2 - Identifying adaptive loci can provide insight into the mechanisms underlying local adaptation. Genotype–environment association (GEA) methods, which identify these loci based on correlations between genetic and environmental data, are particularly promising. Univariate methods have dominated GEA, despite the high dimensional nature of genotype and environment. Multivariate methods, which analyse many loci simultaneously, may be better suited to these data as they consider how sets of markers covary in response to environment. These methods may also be more effective at detecting adaptive processes that result in weak, multilocus signatures. Here, we evaluate four multivariate methods and five univariate and differentiation-based approaches, using published simulations of multilocus selection. We found that Random Forest performed poorly for GEA. Univariate GEAs performed better, but had low detection rates for loci under weak selection. Constrained ordinations, particularly redundancy analysis (RDA), showed a superior combination of low false-positive and high true-positive rates across all levels of selection. These results were robust across the demographic histories, sampling designs, sample sizes and weak population structure tested here. The value of combining detections from different methods was variable and depended on the study goals and knowledge of the drivers of selection. Re-analysis of genomic data from grey wolves highlighted the unique, covarying sets of adaptive loci that could be identified using RDA. Although additional testing is needed, this study indicates that RDA is an effective means of detecting adaptation, including signatures of weak, multilocus selection, providing a powerful tool for investigating the genetic basis of local adaptation.

AB - Identifying adaptive loci can provide insight into the mechanisms underlying local adaptation. Genotype–environment association (GEA) methods, which identify these loci based on correlations between genetic and environmental data, are particularly promising. Univariate methods have dominated GEA, despite the high dimensional nature of genotype and environment. Multivariate methods, which analyse many loci simultaneously, may be better suited to these data as they consider how sets of markers covary in response to environment. These methods may also be more effective at detecting adaptive processes that result in weak, multilocus signatures. Here, we evaluate four multivariate methods and five univariate and differentiation-based approaches, using published simulations of multilocus selection. We found that Random Forest performed poorly for GEA. Univariate GEAs performed better, but had low detection rates for loci under weak selection. Constrained ordinations, particularly redundancy analysis (RDA), showed a superior combination of low false-positive and high true-positive rates across all levels of selection. These results were robust across the demographic histories, sampling designs, sample sizes and weak population structure tested here. The value of combining detections from different methods was variable and depended on the study goals and knowledge of the drivers of selection. Re-analysis of genomic data from grey wolves highlighted the unique, covarying sets of adaptive loci that could be identified using RDA. Although additional testing is needed, this study indicates that RDA is an effective means of detecting adaptation, including signatures of weak, multilocus selection, providing a powerful tool for investigating the genetic basis of local adaptation.

UR - http://www.scopus.com/inward/record.url?scp=85045850789&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85045850789&partnerID=8YFLogxK

U2 - 10.1111/mec.14584

DO - 10.1111/mec.14584

M3 - Article

C2 - 29633402

AN - SCOPUS:85045850789

VL - 27

SP - 2215

EP - 2233

JO - Molecular Ecology

JF - Molecular Ecology

SN - 0962-1083

IS - 9

ER -