Mutational landscape of SARS-CoV-2 genome in Turkey and impact of mutations on spike protein structure.

Creative Commons License

Hatirnaz N., Akyoney S., Sahin I., Soykam H., Bayram A., Ozdemir Ö., ...

PloS one, vol.16, no.12, 2021 (SCI-Expanded)

  • Publication Type: Article
  • Volume: 16 Issue: 12
  • Publication Date: 2021
  • Doi Number: 10.1371/journal.pone.0260438
  • Journal Name: PloS one
  • Journal Indexes: Science Citation Index Expanded (SCI-EXPANDED), Scopus, Academic Search Premier, Agricultural & Environmental Science Database, Animal Behavior Abstracts, Aquatic Science & Fisheries Abstracts (ASFA), BIOSIS, Biotechnology Research Abstracts, Chemical Abstracts Core, EMBASE, Food Science & Technology Abstracts, Index Islamicus, Linguistic Bibliography, MEDLINE, Pollution Abstracts, Psycinfo, zbMATH, Directory of Open Access Journals
  • Acibadem Mehmet Ali Aydinlar University Affiliated: Yes


Coronavirus Disease 2019 (COVID-19) was declared a pandemic in March 2020 by the World Health Organization (WHO). As of May 25th, 2021, 2,059,941 SARS-CoV-2 genome sequences with numerous variations had been submitted to the GISAID database. Here, we aim to analyze the SARS-CoV-2 genome data submitted to the GISAID database from Turkey and to determine the variant and clade distributions by the end of May 2021, in accordance with their appearance timeline. We also compared these findings to data from the USA, Europe, and Asia. In addition, we used 3D modeling algorithms to evaluate the effects of spike protein variations detected in a group of genome sequences from 13 patients who presented to our clinic. For this purpose, we analyzed 4607 SARS-CoV-2 genome sequences submitted by different laboratories in Turkey to the GISAID database between March 2020 and May 2021. The described mutations were also introduced in silico into the spike protein structure to analyze their isolated impacts on the protein structure. The most abundant clade was GR, followed by G, GH, and GRY; we did not detect any V clade sequences. The most common variant was B.1, followed by B.1.1 and the UK variant, B.1.1.7. Our results clearly show a concordance between the variant distributions, the number of cases, and the timelines of different variant accumulations in Turkey. The 3D simulations indicate an increase in the surface hydrophilicity of the reference spike protein and the detected mutations. The increase in surface hydrophilicity was smaller for the Asp614Gly mutation, which exhibits a more compact conformation around the ACE-2 receptor binding domain region, rendering the structure in a "down" conformation. Our genomic findings can help to model vaccination programs, and protein modeling may lead to different approaches for COVID-19 treatment strategies.
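
The clade and variant ranking described above amounts to a frequency tally over per-sequence metadata. The following is a minimal sketch of such a tally, not the authors' pipeline: the function name `clade_distribution`, the field names `clade` and `lineage`, and the toy records are all assumptions made for illustration, and none of the values come from the study.

```python
from collections import Counter


def clade_distribution(records, key="clade"):
    """Tally one metadata field and return (label, count, percent) tuples,
    sorted from most to least frequent.

    `records` is assumed to be an iterable of dicts, one per genome sequence.
    Records with a missing or empty value for `key` are skipped.
    """
    counts = Counter(r[key] for r in records if r.get(key))
    total = sum(counts.values())
    return [(label, n, 100.0 * n / total) for label, n in counts.most_common()]


# Toy records for illustration only; these are NOT data from the study.
toy_records = [
    {"clade": "GR", "lineage": "B.1.1"},
    {"clade": "GR", "lineage": "B.1.1"},
    {"clade": "GR", "lineage": "B.1.1.7"},
    {"clade": "G", "lineage": "B.1"},
    {"clade": "G", "lineage": "B.1"},
    {"clade": "GH", "lineage": "B.1"},
]

if __name__ == "__main__":
    for label, n, pct in clade_distribution(toy_records):
        print(f"{label}: {n} sequences ({pct:.1f}%)")
```

Passing `key="lineage"` ranks variants in the same way. Restricting `records` to a date window before the call would give a per-period distribution, which is one way to follow variant accumulation along a timeline.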