TY - JOUR
T1 - Recombination and lineage-specific mutations linked to the emergence of SARS-CoV-2
AU - Patiño-Galindo, Juan Ángel
AU - Filip, Ioan
AU - Chowdhury, Ratul
AU - Maranas, Costas D.
AU - Sorger, Peter K.
AU - AlQuraishi, Mohammed
AU - Rabadan, Raul
N1 - Funding Information:
R.R. is a member of the SAB of AimedBio, consults for Arquimea Research and is founder of Genotwin. PKS is a member of the SAB or Board of Directors of Applied Biomath LLC, Glencoe Software Inc., and RareCyte Inc. and has equity in these companies; his is also on the SAB of NanoString Inc. In the last 5 years, the Sorger lab has received research funding from Novartis and Merck. Sorger declares that none of these relationships are directly or indirectly related to the content of this manuscript. The remaining authors declare that they have no competing interests.
Funding Information:
This work has been funded by NIH grants R01 GM117591 and U54-CA225088 and by DARPA/DOD grant W911NF-14-1-0397.
Funding Information:
We thank GISAID and all the laboratories where the data used in this study was collected and processed (Additional File 1 : Table S4). We would also like to thank Karen Gomez and Andrew Chen for their help editing the manuscript and Zixuan Wang for her help on the figures. The icons for the host species in our figures were created with BioRender.
Publisher Copyright:
© 2021, The Author(s).
PY - 2021/12
Y1 - 2021/12
N2 - Background: The emergence of SARS-CoV-2 underscores the need to better understand the evolutionary processes that drive the emergence and adaptation of zoonotic viruses in humans. In the betacoronavirus genus, which also includes SARS-CoV and MERS-CoV, recombination frequently encompasses the receptor binding domain (RBD) of the Spike protein, which is responsible for viral binding to host cell receptors. In this work, we reconstruct the evolutionary events that have accompanied the emergence of SARS-CoV-2, with a special emphasis on the RBD and its adaptation for binding to its receptor, human ACE2. Methods: By means of phylogenetic and recombination analyses, we found evidence of a recombination event in the RBD involving ancestral linages to both SARS-CoV and SARS-CoV-2. We then assessed the effect of this recombination at protein level by reconstructing the RBD of the closest ancestors to SARS-CoV-2, SARS-CoV, and other Sarbecoviruses, including the most recent common ancestor of the recombining clade. The resulting information was used to measure and compare, in silico, their ACE2-binding affinities using the physics-based trRosetta algorithm. Results: We show that, through an ancestral recombination event, SARS-CoV and SARS-CoV-2 share an RBD sequence that includes two insertions (positions 432-436 and 460-472), as well as the variants 427N and 436Y. Both 427N and 436Y belong to a helix that interacts directly with the human ACE2 (hACE2) receptor. Reconstruction of ancestral states, combined with protein-binding affinity analyses, suggests that the recombination event involving ancestral strains of SARS-CoV and SARS-CoV-2 led to an increased affinity for hACE2 binding and that alleles 427N and 436Y significantly enhanced affinity as well. Conclusions: We report an ancestral recombination event affecting the RBD of both SARS-CoV and SARS-CoV-2 that was associated with an increased binding affinity to hACE2. Structural modeling indicates that ancestors of SARS-CoV-2 may have acquired the ability to infect humans decades ago. The binding affinity with the human receptor would have been subsequently boosted in SARS-CoV and SARS-CoV-2 through further mutations in RBD.
AB - Background: The emergence of SARS-CoV-2 underscores the need to better understand the evolutionary processes that drive the emergence and adaptation of zoonotic viruses in humans. In the betacoronavirus genus, which also includes SARS-CoV and MERS-CoV, recombination frequently encompasses the receptor binding domain (RBD) of the Spike protein, which is responsible for viral binding to host cell receptors. In this work, we reconstruct the evolutionary events that have accompanied the emergence of SARS-CoV-2, with a special emphasis on the RBD and its adaptation for binding to its receptor, human ACE2. Methods: By means of phylogenetic and recombination analyses, we found evidence of a recombination event in the RBD involving ancestral linages to both SARS-CoV and SARS-CoV-2. We then assessed the effect of this recombination at protein level by reconstructing the RBD of the closest ancestors to SARS-CoV-2, SARS-CoV, and other Sarbecoviruses, including the most recent common ancestor of the recombining clade. The resulting information was used to measure and compare, in silico, their ACE2-binding affinities using the physics-based trRosetta algorithm. Results: We show that, through an ancestral recombination event, SARS-CoV and SARS-CoV-2 share an RBD sequence that includes two insertions (positions 432-436 and 460-472), as well as the variants 427N and 436Y. Both 427N and 436Y belong to a helix that interacts directly with the human ACE2 (hACE2) receptor. Reconstruction of ancestral states, combined with protein-binding affinity analyses, suggests that the recombination event involving ancestral strains of SARS-CoV and SARS-CoV-2 led to an increased affinity for hACE2 binding and that alleles 427N and 436Y significantly enhanced affinity as well. Conclusions: We report an ancestral recombination event affecting the RBD of both SARS-CoV and SARS-CoV-2 that was associated with an increased binding affinity to hACE2. Structural modeling indicates that ancestors of SARS-CoV-2 may have acquired the ability to infect humans decades ago. The binding affinity with the human receptor would have been subsequently boosted in SARS-CoV and SARS-CoV-2 through further mutations in RBD.
UR - http://www.scopus.com/inward/record.url?scp=85112610808&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85112610808&partnerID=8YFLogxK
U2 - 10.1186/s13073-021-00943-6
DO - 10.1186/s13073-021-00943-6
M3 - Article
C2 - 34362430
AN - SCOPUS:85112610808
SN - 1756-994X
VL - 13
JO - Genome Medicine
JF - Genome Medicine
IS - 1
M1 - 124
ER -