Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Nov 1;21(1):775.
doi: 10.1186/s12967-023-04588-4.

Genetic risk factors for severe and fatigue dominant long COVID and commonalities with ME/CFS identified by combinatorial analysis

Affiliations

Genetic risk factors for severe and fatigue dominant long COVID and commonalities with ME/CFS identified by combinatorial analysis

Krystyna Taylor et al. J Transl Med. .

Abstract

Background: Long COVID is a debilitating chronic condition that has affected over 100 million people globally. It is characterized by a diverse array of symptoms, including fatigue, cognitive dysfunction and respiratory problems. Studies have so far largely failed to identify genetic associations, the mechanisms behind the disease, or any common pathophysiology with other conditions such as myalgic encephalomyelitis/chronic fatigue syndrome (ME/CFS) that present with similar symptoms.

Methods: We used a combinatorial analysis approach to identify combinations of genetic variants significantly associated with the development of long COVID and to examine the biological mechanisms underpinning its various symptoms. We compared two subpopulations of long COVID patients from Sano Genetics' Long COVID GOLD study cohort, focusing on patients with severe or fatigue dominant phenotypes. We evaluated the genetic signatures previously identified in an ME/CFS population against this long COVID population to understand similarities with other fatigue disorders that may be triggered by a prior viral infection. Finally, we also compared the output of this long COVID analysis against known genetic associations in other chronic diseases, including a range of metabolic and neurological disorders, to understand the overlap of pathophysiological mechanisms.

Results: Combinatorial analysis identified 73 genes that were highly associated with at least one of the long COVID populations included in this analysis. Of these, 9 genes have prior associations with acute COVID-19, and 14 were differentially expressed in a transcriptomic analysis of long COVID patients. A pathway enrichment analysis revealed that the biological pathways most significantly associated with the 73 long COVID genes were mainly aligned with neurological and cardiometabolic diseases. Expanded genotype analysis suggests that specific SNX9 genotypes are a significant contributor to the risk of or protection against severe long COVID infection, but that the gene-disease relationship is context dependent and mediated by interactions with KLF15 and RYR3. Comparison of the genes uniquely associated with the Severe and Fatigue Dominant long COVID patients revealed significant differences between the pathways enriched in each subgroup. The genes unique to Severe long COVID patients were associated with immune pathways such as myeloid differentiation and macrophage foam cells. Genes unique to the Fatigue Dominant subgroup were enriched in metabolic pathways such as MAPK/JNK signaling. We also identified overlap in the genes associated with Fatigue Dominant long COVID and ME/CFS, including several involved in circadian rhythm regulation and insulin regulation. Overall, 39 SNPs associated in this study with long COVID can be linked to 9 genes identified in a recent combinatorial analysis of ME/CFS patient from UK Biobank. Among the 73 genes associated with long COVID, 42 are potentially tractable for novel drug discovery approaches, with 13 of these already targeted by drugs in clinical development pipelines. From this analysis for example, we identified TLR4 antagonists as repurposing candidates with potential to protect against long term cognitive impairment pathology caused by SARS-CoV-2. We are currently evaluating the repurposing potential of these drug targets for use in treating long COVID and/or ME/CFS.

Conclusion: This study demonstrates the power of combinatorial analytics for stratifying heterogeneous populations in complex diseases that do not have simple monogenic etiologies. These results build upon the genetic findings from combinatorial analyses of severe acute COVID-19 patients and an ME/CFS population and we expect that access to additional independent, larger patient datasets will further improve the disease insights and validate potential treatment options in long COVID.

Keywords: Combinatorial analytics; Long COVID; ME/CFS; PASC; POTS; Patient stratification; Post-Covid; Post-acute COVID syndrome; Post-acute sequelae of COVID-19.

PubMed Disclaimer

Conflict of interest statement

K.T., M.P., S.D., K.C., J.S, and S.G. are employees of PrecisionLife, Ltd. S.G. is a shareholder of PrecisionLife, Ltd.

Figures

Fig. 1
Fig. 1
Monthly distribution of the first self-reported COVID-19 diagnosis for the 1829 individuals included in Sano Genetics’ Long COVID GOLD study (2020–2022)
Fig. 2
Fig. 2
Distribution of the (a) ‘Total Change’ score for cases and controls in the Severe long COVID and (b) ‘Fatigue Change’ (part of ‘Total Change’ score) score in the Fatigue Dominant long COVID cohorts
Fig. 3
Fig. 3
Conceptual representation of features, combinations and disease signatures that form part of PrecisionLife’s combinatorial analytics methodology. In the case of the long COVID study all features were SNP genotypes, but other feature types, e.g., a patient’s expression level of a specific protein, medication history or clinical features such as their eosinophil level, can also be used, independently from or in combination with the genotype data
Fig. 4
Fig. 4
Hypothetical example of an expanded genotypes analysis for a disease signature comprised of two SNPs. After controlling for the confounding effects of the interacting SNP genotype, patients with one or two copies of the critical SNP minor allele (genotypes “1” and “2”) have consistently elevated odds of disease relative to patients with the wild type genotype (“0”) at the critical SNP
Fig. 5
Fig. 5
Variation of the long COVID symptom-based ‘Total Change’ scores with COVID-19 diagnosis for 1829 individuals in the Sano Genetics’ Long COVID GOLD study (2020–2022)
Fig. 6
Fig. 6
Distribution of combinatorial order (i.e., number of component SNPs) for the validated combinatorial disease signatures from the Severe and Fatigue Dominant long COVID cohorts
Fig. 7
Fig. 7
Disease architecture diagrams representing (a) the Severe and (b) Fatigue Dominant long COVID patient populations generated by the PrecisionLife platform. Each circle represents a disease-associated SNP genotype, and edges represent their co-association in patients in disease signature(s). The critical SNP genotypes identified in each case population are highlighted in dark green
Fig. 8
Fig. 8
Pathway enrichment plot for disease-associated genes found in the Severe and Fatigue Dominant long COVID cohorts. GeneRatio represents the ratio of genes found in the pathway compared to the genes associated with a cohort and p.adjust represents the p-value adjusted for multiple testing. The dots in the plot are colour-coded based on their corresponding p.adjust values
Fig. 9
Fig. 9
Heatmap plot showing 19 biological processes (Gene Ontology biological process terms) shared between 73 long COVID genes identified in the GOLD cohort and genes with genetic evidence in one or more indication groups (neurodegenerative, mental or behavioral, cardiovascular, gastrointestinal, autoimmune and metabolic disorders). For each indication group, only the significantly enriched biological processes (p < 0.05) are shown in blue and the intensity of the color is based on the p values of the Gene Ontology term in each indication group

Similar articles

Cited by

References

    1. World Health Organization WHO Fact Sheets Post COVID-19 condition (Long COVID) available from https://www.who.int/europe/news-room/fact-sheets/item/post-covid-19-cond.... Accessed 8 Oct 2023
    1. O'Mahoney LL, Routen A, Gillies C, Ekezie W, Welford A, Zhang A, Karamchandani U, Simms-Williams N, Cassambai S, et al. The prevalence and long-term health effects of long Covid among hospitalised and non-hospitalised populations: a systematic review and meta-analysis. EClinicalMedicine. 2022;1(55):101762. doi: 10.1016/j.eclinm.2022.101762.Erratum.In:EClinicalMedicine.2023May;59:101959. - DOI - PMC - PubMed
    1. WHO Coronavirus (COVID-19) Dashboard https://covid19.who.int/, last Accessed 4 June 2023
    1. Mallick D, Goyal L, Chourasia P, Zapata MR, Yashi K, Surani S. COVID-19 induced postural orthostatic tachycardia syndrome (POTS): a review. Cureus. 2023;15(3):e36955. doi: 10.7759/cureus.36955. - DOI - PMC - PubMed
    1. Ballouz T, Menges D, Anagnostopoulos A, Domenghino A, Aschmann HE, Frei A, Fehr JS, Puhan MA. Recovery and symptom trajectories up to two years after SARS-CoV-2 infection: population based, longitudinal cohort study. BMJ. 2023;31(381):e074425. doi: 10.1136/bmj-2022-074425. - DOI - PMC - PubMed

Publication types