1
|
Fareed MM, Shityakov S. Quantifying pleiotropy through directed signaling networks: A synchronous Boolean network approach and in-silico pleiotropic scoring. Biosystems 2025; 250:105416. [PMID: 39988275 DOI: 10.1016/j.biosystems.2025.105416] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2024] [Revised: 02/07/2025] [Accepted: 02/09/2025] [Indexed: 02/25/2025]
Abstract
Pleiotropy refers to a gene's ability to influence multiple phenotypes or traits. In the context of human genetic diseases, pleiotropy manifests as different pathological effects resulting from mutations in the same gene. This phenomenon plays a crucial role in understanding gene-gene interactions in system-level biological diseases. Previous studies have largely focused on pleiotropy within undirected molecular correlation networks, leaving a gap in examining pleiotropy induced by directed signaling networks, which can better explain dynamic gene-gene interactions. In this study, we utilized a synchronous Boolean network model to explore pleiotropic dynamics induced by various mutations in large-scale networks. We introduced an in-silico Pleiotropic Score (sPS) to quantify the impact of gene mutations and validated the model against observational pleiotropy data from the Human Phenotype Ontology (HPO). Our results indicate a significant correlation between sPS and network structural characteristics, including degree centrality and feedback loop involvement. The highest correlation was observed between closeness centrality and sPS (0.6), suggesting that genes more central in the network exhibit higher pleiotropic potential. Furthermore, genes involved in feedback loops demonstrated higher sPS values (p < 0.0001), supporting the role of feedback loops in amplifying pleiotropic behavior. Our model provides a novel approach for quantifying pleiotropy through directed network dynamics, complementing traditional observational methods.
Collapse
Affiliation(s)
- Muhammad Mazhar Fareed
- School of Science and Engineering, Department of Computer Science, Università degli studi di Verona, Verona, Italy.
| | - Sergey Shityakov
- Laboratory of Chemoinformatics, Infochemistry Scientific Center, ITMO University, Saint Petersburg, Russia.
| |
Collapse
|
2
|
Chitra U, Arnold B, Raphael BJ. Resolving discrepancies between chimeric and multiplicative measures of higher-order epistasis. Nat Commun 2025; 16:1711. [PMID: 39962081 PMCID: PMC11833126 DOI: 10.1038/s41467-025-56986-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2024] [Accepted: 02/06/2025] [Indexed: 02/20/2025] Open
Abstract
Epistasis - the interaction between alleles at different genetic loci - plays a fundamental role in biology. However, several recent approaches quantify epistasis using a chimeric formula that measures deviations from a multiplicative fitness model on an additive scale, thus mixing two scales. Here, we show that for pairwise interactions, the chimeric formula yields a different magnitude but the same sign of epistasis compared to the multiplicative formula that measures both fitness and deviations on a multiplicative scale. However, for higher-order interactions, we show that the chimeric formula can have both different magnitude and sign compared to the multiplicative formula. We resolve these inconsistencies by deriving mathematical relationships between the different epistasis formulae and different parametrizations of the multivariate Bernoulli distribution. We argue that the chimeric formula does not appropriately model interactions between the Bernoulli random variables. In simulations, we show that the chimeric formula is less accurate than the classical multiplicative/additive epistasis formulae and may falsely detect higher-order epistasis. Analyzing multi-gene knockouts in yeast, multi-way drug interactions in E. coli, and deep mutational scanning of several proteins, we find that approximately 10% to 60% of inferred higher-order interactions change sign using the multiplicative/additive formula compared to the chimeric formula.
Collapse
Affiliation(s)
- Uthsav Chitra
- Department of Computer Science, Princeton University, Princeton, NJ, USA
| | - Brian Arnold
- Department of Computer Science, Princeton University, Princeton, NJ, USA
- Center for Statistics and Machine Learning, Princeton University, Princeton, NJ, USA
| | - Benjamin J Raphael
- Department of Computer Science, Princeton University, Princeton, NJ, USA.
| |
Collapse
|
3
|
Li H, Zeng J, Snyder MP, Zhang S. Modeling gene interactions in polygenic prediction via geometric deep learning. Genome Res 2025; 35:178-187. [PMID: 39562137 PMCID: PMC11789630 DOI: 10.1101/gr.279694.124] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2024] [Accepted: 11/14/2024] [Indexed: 11/21/2024]
Abstract
Polygenic risk score (PRS) is a widely used approach for predicting individuals' genetic risk of complex diseases, playing a pivotal role in advancing precision medicine. Traditional PRS methods, predominantly following a linear structure, often fall short in capturing the intricate relationships between genotype and phenotype. In this study, we present PRS-Net, an interpretable geometric deep learning-based framework that effectively models the nonlinearity of biological systems for enhanced disease prediction and biological discovery. PRS-Net begins by deconvoluting the genome-wide PRS at the single-gene resolution and then explicitly encapsulates gene-gene interactions leveraging a graph neural network (GNN) for genetic risk prediction, enabling a systematic characterization of molecular interplay underpinning diseases. An attentive readout module is introduced to facilitate model interpretation. Extensive tests across multiple complex traits and diseases demonstrate the superior prediction performance of PRS-Net compared with a wide range of conventional PRS methods. The interpretability of PRS-Net further enhances the identification of disease-relevant genes and gene programs. PRS-Net provides a potent tool for concurrent genetic risk prediction and biological discovery for complex diseases.
Collapse
Affiliation(s)
- Han Li
- School of Mathematical Sciences and LPMC, Nankai University, Tianjin, 300071, China
- Institute for Interdisciplinary Information Sciences, Tsinghua University, Beijing, 100084, China
| | - Jianyang Zeng
- School of Engineering, Research Center for Industries of the Future, Westlake University, Hangzhou, 310030, Zhejiang, China;
| | - Michael P Snyder
- Department of Genetics, Center for Genomics and Personalized Medicine, Stanford University School of Medicine, Stanford, California 94304, USA;
| | - Sai Zhang
- Department of Epidemiology, University of Florida, Gainesville, Florida 32603, USA;
- Departments of Biostatistics & Biomedical Engineering, UF Genetics Institute, University of Florida, Gainesville, Florida 32603, USA
| |
Collapse
|
4
|
Zhang M, Zhang M, Han R, Yu X, Song Z. The correlation between miR-21 single nucleotide polymorphisms and the susceptibility of non-small cell lung cancer. J Cardiothorac Surg 2025; 20:76. [PMID: 39833839 PMCID: PMC11748343 DOI: 10.1186/s13019-024-03322-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2024] [Accepted: 12/26/2024] [Indexed: 01/22/2025] Open
Abstract
BACKGROUND There are still gaps in the study of the miRNA and its SNPs in some diseases such as non-small cell lung cancer (NSCLC). The study aimed to provide useful information on the treatment of NSCLC by investigating the association between miR-21 and its SNPs and NSCLC susceptibility. METHODS The serum of NSCLC patients (n = 205) and cancer-free controls (n = 217) were collected in this study for RNA extraction. The qRT-PCR was used to evaluate the expression of miR-21 and Taqman qPCR was used for genotyping and quantifying miR-21 SNPs (rs1292037, rs6504593). The association of the expression of miR-21, the miR-21 SNPs and their interactions with the susceptibility of NSCLC patients were analysed using logistic regression analysis in this study. RESULTS This study showed that the overexpression of miR-21 was related to NSCLC. The C allele and CC genotypes of rs1292037 and rs6504593 were associated with the increased risk of NSCLC susceptibility. Moreover, the interactions of rs1292037 and rs6504593 were also a risk factor for NSCLC. The CC genotypes of rs1292037 and rs6504593 were associated with the increase of miR-21 expression. CONCLUSION The overexpression of miR-21, the miR-21 SNPs rs1292037 and rs6504593 and their interactions were associated with NSCLC susceptibility. MiR-21 and its SNPs have potential for being targets in the therapy of NSCLC. This study provided important information for the treatment of NSCLC.
Collapse
Affiliation(s)
- Miao Zhang
- Department of Respiratory, Jiaozhou Central Hospital of Qingdao, Qingdao, 266300, China
| | - Ming Zhang
- Department of Clinical Medicine, Anhui Medical College, Hefei, 230601, China
| | - Ruixue Han
- Department of Oncology, Yuhuan Second People's Hospital, Yuhuan, 317605, China
| | - Xin Yu
- Department of Respiratory Medicine, Traditional Chinese Medical Hospital of Zhuji, No. 521, East 2nd Road, Huandong Street, Zhuji, 311800, China.
| | - Zhaolu Song
- Department of Urology Surgery, Jiaozhou Central Hospital of Qingdao, No. 99, Yunxi Henan Road, Jiaozhou, Qingdao, 266300, China.
| |
Collapse
|
5
|
Chen L, Wang X, Xie N, Zhang Z, Xu X, Xue M, Yang Y, Liu L, Su L, Bjaanæs M, Karlsson A, Planck M, Staaf J, Helland Å, Esteller M, Christiani DC, Chen F, Zhang R. A two-phase epigenome-wide four-way gene-smoking interaction study of overall survival for early-stage non-small cell lung cancer. Mol Oncol 2025; 19:173-187. [PMID: 39630602 PMCID: PMC11705728 DOI: 10.1002/1878-0261.13766] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2024] [Revised: 10/05/2024] [Accepted: 11/07/2024] [Indexed: 12/07/2024] Open
Abstract
High-order interactions associated with non-small cell lung cancer (NSCLC) survival may elucidate underlying molecular mechanisms and identify potential therapeutic targets. Our previous work has identified a three-way interaction among pack-year of smoking (the number of packs of cigarettes smoked per day multiplied by the number of years the person has smoked) and two DNA methylation probes (cg05293407TRIM27 and cg00060500KIAA0226). However, whether a four-way interaction exists remains unclear. Therefore, we adopted a two-phase design to identify the four-way gene-smoking interactions by a hill-climbing strategy on the basis of the previously detected three-way interaction. One CpG probe, cg16658473SHISA9, was identified with FDR-q ≤ 0.05 in the discovery phase and P ≤ 0.05 in the validation phase. Meanwhile, the four-way interaction improved the discrimination ability for the prognostic prediction model, as indicated by the area under the receiver operating characteristic curve (AUC) for both 3- and 5-year survival. In summary, we identified a four-way interaction associated with NSCLC survival among pack-year of smoking, cg05293407TRIM27, cg00060500KIAA0226 and g16658473SHISA9, providing novel insights into the complex mechanisms underlying NSCLC progression.
Collapse
Affiliation(s)
- Leyi Chen
- Department of Biostatistics, Center for Global Health, School of Public HealthNanjing Medical UniversityChina
| | - Xiang Wang
- Department of Biostatistics, Center for Global Health, School of Public HealthNanjing Medical UniversityChina
| | - Ning Xie
- Department of Biostatistics, Center for Global Health, School of Public HealthNanjing Medical UniversityChina
| | - Zhongwen Zhang
- Department of Biostatistics, Center for Global Health, School of Public HealthNanjing Medical UniversityChina
| | - Xiaowen Xu
- Department of Biostatistics, Center for Global Health, School of Public HealthNanjing Medical UniversityChina
| | - Maojie Xue
- Department of Biostatistics, Center for Global Health, School of Public HealthNanjing Medical UniversityChina
- Department of Health Inspection and Quarantine, Center for Global Health, School of Public HealthNanjing Medical UniversityChina
| | - Yuqing Yang
- Department of Biostatistics, Center for Global Health, School of Public HealthNanjing Medical UniversityChina
| | - Liya Liu
- School of Public Health, Health Science CenterNingbo UniversityChina
| | - Li Su
- Department of Environmental HealthHarvard T.H. Chan School of Public HealthBostonMAUSA
- Pulmonary and Critical Care Division, Department of MedicineMassachusetts General Hospital and Harvard Medical SchoolBostonMAUSA
| | - Maria Bjaanæs
- Department of Cancer Genetics, Institute for Cancer ResearchOslo University HospitalNorway
| | - Anna Karlsson
- Division of Oncology, Department of Clinical Sciences Lund and CREATE Health Strategic Center for Translational Cancer ResearchLund UniversitySweden
| | - Maria Planck
- Division of Oncology, Department of Clinical Sciences Lund and CREATE Health Strategic Center for Translational Cancer ResearchLund UniversitySweden
| | - Johan Staaf
- Division of Oncology, Department of Clinical Sciences Lund and CREATE Health Strategic Center for Translational Cancer ResearchLund UniversitySweden
| | - Åslaug Helland
- Department of Cancer Genetics, Institute for Cancer ResearchOslo University HospitalNorway
- Institute of Clinical MedicineUniversity of OsloNorway
| | - Manel Esteller
- Josep Carreras Leukaemia Research InstituteBarcelonaSpain
- Centro de Investigacion Biomedica en Red CancerMadridSpain
- Institucio Catalana de Recerca i Estudis AvançatsBarcelonaSpain
- Physiological Sciences Department, School of Medicine and Health SciencesUniversity of BarcelonaSpain
| | - David C. Christiani
- Department of Environmental HealthHarvard T.H. Chan School of Public HealthBostonMAUSA
- Pulmonary and Critical Care Division, Department of MedicineMassachusetts General Hospital and Harvard Medical SchoolBostonMAUSA
| | - Feng Chen
- Department of Biostatistics, Center for Global Health, School of Public HealthNanjing Medical UniversityChina
| | - Ruyang Zhang
- Department of Biostatistics, Center for Global Health, School of Public HealthNanjing Medical UniversityChina
- China International Cooperation Center for Environment and Human HealthNanjing Medical UniversityChina
- Changzhou Medical CenterNanjing Medical UniversityChangzhouChina
- Information CenterThe Affiliated Changzhou Second People's Hospital of Nanjing Medical UniversityChangzhouChina
| |
Collapse
|
6
|
Malakhov MM, Pan W. Co-expression-wide association studies link genetically regulated interactions with complex traits. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2024:2024.10.02.24314813. [PMID: 39711708 PMCID: PMC11661334 DOI: 10.1101/2024.10.02.24314813] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/24/2024]
Abstract
Transcriptome- and proteome-wide association studies (TWAS/PWAS) have proven successful in prioritizing genes and proteins whose genetically regulated expression modulates disease risk, but they ignore potential co-expression and interaction effects. To address this limitation, we introduce the co-expression-wide association study (COWAS) method, which can identify pairs of genes or proteins whose genetically regulated co-expression is associated with complex traits. COWAS first trains models to predict expression and co-expression conditional on genetic variation, and then tests for association between imputed co-expression and the trait of interest while also accounting for direct effects from each exposure. We applied our method to plasma proteomic concentrations from the UK Biobank, identifying dozens of interacting protein pairs associated with cholesterol levels, Alzheimer's disease, and Parkinson's disease. Notably, our results demonstrate that co-expression between proteins may affect complex traits even if neither protein is detected to influence the trait when considered on its own. We also show how COWAS can help disentangle direct and interaction effects, providing a richer picture of the molecular networks that mediate genetic effects on disease outcomes.
Collapse
Affiliation(s)
- Mykhaylo M. Malakhov
- Division of Biostatistics and Health Data Science, School of Public Health, University of Minnesota, Minneapolis, MN, USA
| | - Wei Pan
- Division of Biostatistics and Health Data Science, School of Public Health, University of Minnesota, Minneapolis, MN, USA
| |
Collapse
|
7
|
Juárez-Cedillo T, Martínez-Rodríguez N, Juárez-Cedillo E, Ramirez A, Suerna-Hernández A. Intergenic Interactions of ESR1, GSTO1 and AGER and Risk of Dementia in Community-Dwelling Elderly (SADEM Study). Genes (Basel) 2024; 15:1395. [PMID: 39596595 PMCID: PMC11594218 DOI: 10.3390/genes15111395] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2024] [Revised: 10/24/2024] [Accepted: 10/25/2024] [Indexed: 11/28/2024] Open
Abstract
BACKGROUND Dementia causes the loss of functional independence, resulting in a decrease in the quality of life of those who suffer from it. AIMS This study aimed to investigate the interactions influencing susceptibility to the development of dementia through multifactor dimensionality reduction (MDR). METHODS the study population was made up of 221 cases and 534 controls. We performed an MDR analysis as well as a bioinformatic analysis to identify interactions between the genes GSTO1_rs4925, AGER_rs2070600, and ESR1_rs3844508 associated with susceptibility to dementia. RESULTS We observed associations between the polymorphism of GSTO1 and risk of dementia for the site rs4925 with the recessive model (OR = 1.720, 95% CI = 1.166-2.537 p = 0.006). Similarly, the site AGER rs2070600 showed risk of dementia with an additive model of inheritance (OR = 7.278, 95% CI = 3.140-16.868; p < 0.001). Furthermore, we identified the best risk model with a high precision of 79.6% that, when combined with three environmental risk factors, did not give an OR = 26.662 95%CI (16.164-43.979) with p < 0.001. CONCLUSIONS The MDR and bioinformatics results provide new information on the molecular pathogenesis of dementia, allowing identification of possible diagnostic markers and new therapeutic targets.
Collapse
Affiliation(s)
- Teresa Juárez-Cedillo
- Unidad de Investigación Médica en Epidemiológica Clínica, Centro Médico Nacional Siglo XXI, Instituto Mexicano del Seguro Social, Mexico City 06720, CP, Mexico
| | - Nancy Martínez-Rodríguez
- Epidemiology, Endocrinology and Nutrition Research Unit, Hospital Infantil de México Federico Gómez, Ministry of Health (SSA), Mexico City 06720, CP, Mexico;
| | - Enrique Juárez-Cedillo
- Facultad de Medicina, Universidad Nacional Autónoma de Mexico, Mexico City 04360, CP, Mexico; (E.J.-C.); (A.S.-H.)
| | - Alfredo Ramirez
- Division of Neurogenetics and Molecular Psychiatry, Department of Psychiatry and Psychotherapy, University of Cologne, 50923 Köln, Germany;
| | - Alan Suerna-Hernández
- Facultad de Medicina, Universidad Nacional Autónoma de Mexico, Mexico City 04360, CP, Mexico; (E.J.-C.); (A.S.-H.)
| |
Collapse
|
8
|
Ward A, Mauleon R, Ooi CY, Rosic N. Impact of Gene Modifiers on Cystic Fibrosis Phenotypic Profiles: A Systematic Review. Hum Mutat 2024; 2024:6165547. [PMID: 40225935 PMCID: PMC11919198 DOI: 10.1155/2024/6165547] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2023] [Revised: 08/05/2024] [Accepted: 08/23/2024] [Indexed: 04/15/2025]
Abstract
Cystic fibrosis (CF) is a complex monogenic disorder with a large variability in disease severity. Growing evidence suggests that the variation observed depends not only on variations in the cystic fibrosis transmembrane conductance regulator (CFTR) gene but also on modifier genes. Utilizing five databases (including CINAHL, PubMed, Science Direct, Scopus, and Web of Science), a systematic review was conducted to examine the current literature on the known impacts of genomic variations in modifier genes on the CF disease progression, severity, and therapeutic response. A total of 70 full-text articles describing over 80 gene modifiers associated with CF were selected. The modifier genes included genes associated with the CFTR interactome, the inflammatory response, microbial profiles, and other genes affecting the critical physiological pathways of multiple organ systems, such as the respiratory and gastrointestinal systems. Limitations of the existing literature embrace the lack of clinical studies investigating pharmacogenetic impacts and the significance of gene modifiers on the CF clinical picture, including a limited number of replication and validation studies. Further investigations into other potential gene modifiers using genome-wide association studies are needed to critically explore new therapeutic targets and provide a better understanding of the CF disease phenotype under specific drug treatments.
Collapse
Affiliation(s)
- Anastasia Ward
- Faculty of Health, Southern Cross University, Coolangatta, Gold Coast, Queensland, Australia
| | - Ramil Mauleon
- Faculty of Health, Southern Cross University, Coolangatta, Gold Coast, Queensland, Australia
- Rice Breeding Innovations, International Rice Research Institute, Los Banos, Laguna, Philippines
| | - Chee Y. Ooi
- School of Clinical Medicine, Discipline of Paediatrics & Child Health, Randwick Clinical Campus, UNSW Medicine & Health, UNSW, Sydney, Australia
- Department of Gastroenterology, Sydney Children's Hospital, Randwick, New South Wales, Australia
| | - Nedeljka Rosic
- Faculty of Health, Southern Cross University, Coolangatta, Gold Coast, Queensland, Australia
| |
Collapse
|
9
|
Wang JH, Hou PL, Chen YH. Multicategory Survival Outcomes Classification via Overlapping Group Screening Process Based on Multinomial Logistic Regression Model With Application to TCGA Transcriptomic Data. Cancer Inform 2024; 23:11769351241286710. [PMID: 39385930 PMCID: PMC11462568 DOI: 10.1177/11769351241286710] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2024] [Accepted: 09/05/2024] [Indexed: 10/12/2024] Open
Abstract
Objectives Under the classification of multicategory survival outcomes of cancer patients, it is crucial to identify biomarkers that affect specific outcome categories. The classification of multicategory survival outcomes from transcriptomic data has been thoroughly investigated in computational biology. Nevertheless, several challenges must be addressed, including the ultra-high-dimensional feature space, feature contamination, and data imbalance, all of which contribute to the instability of the diagnostic model. Furthermore, although most methods achieve accurate predicted performance for binary classification with high-dimensional transcriptomic data, their extension to multi-class classification is not straightforward. Methods We employ the One-versus-One strategy to transform multi-class classification into multiple binary classification, and utilize the overlapping group screening procedure with binary logistic regression to include pathway information for identifying important genes and gene-gene interactions for multicategory survival outcomes. Results A series of simulation studies are conducted to compare the classification accuracy of our proposed approach with some existing machine learning methods. In practical data applications, we utilize the random oversampling procedure to tackle class imbalance issues. We then apply the proposed method to analyze transcriptomic data from various cancers in The Cancer Genome Atlas, such as kidney renal papillary cell carcinoma, lung adenocarcinoma, and head and neck squamous cell carcinoma. Our aim is to establish an accurate microarray-based multicategory cancer diagnosis model. The numerical results illustrate that the new proposal effectively enhances cancer diagnosis compared to approaches that neglect pathway information. Conclusions We showcase the effectiveness of the proposed method in terms of class prediction accuracy through evaluations on simulated synthetic datasets as well as real dataset applications. We also identified the cancer-related gene-gene interaction biomarkers and reported the corresponding network structure. According to the identified major genes and gene-gene interactions, we can predict for each patient the probabilities that he/she belongs to each of the survival outcome classes.
Collapse
Affiliation(s)
- Jie-Huei Wang
- Department of Mathematics, National Chung Cheng University, Chiayi City, Taiwan
| | - Po-Lin Hou
- Department of Mathematics, National Chung Cheng University, Chiayi City, Taiwan
| | - Yi-Hau Chen
- Institute of Statistical Science, Academia Sinica, Taipei, Taiwan
| |
Collapse
|
10
|
Gandhi CK, Depicolzuane LC, Chen C, Roberts CM, Sicher N, Johnson Wegerson K, Thomas NJ, Wu R, Floros J. Association of SNP-SNP interactions of surfactant protein genes with severity of respiratory syncytial virus infection in children. Physiol Genomics 2024; 56:691-697. [PMID: 39222066 PMCID: PMC11495184 DOI: 10.1152/physiolgenomics.00045.2024] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2024] [Revised: 07/23/2024] [Accepted: 08/19/2024] [Indexed: 09/04/2024] Open
Abstract
The severity of respiratory syncytial virus (RSV) may be linked to host genetic susceptibility. Surfactant protein (SP) genetic variants have been associated with RSV severity, but the impact of single-nucleotide polymorphism (SNP)-SNP interactions remains unexplored. Therefore, we used a novel statistical model to investigate the association of SNP-SNP interactions of SFTP genes with RSV severity in two- and three-interaction models. We analyzed available genotype and clinical data from prospectively enrolled 405 children diagnosed with RSV, categorizing them into moderate or severe RSV groups. Using Wang's statistical model, we studied significant associations of SNP-SNP interactions with RSV severity in a case-control design. We observed, first, association of three interactions with increased risk of severe RSV in a two-SNP model. One intragenic interaction was between SNPs of SFTPA2, and the other two were intergenic, involving SNPs of hydrophilic and hydrophobic SPs alone. We also observed, second, association of 22 interactions with RSV severity in a three-SNP model. Among these, 20 were unique, with 12 and 10 interactions associated with increased or decreased risk of RSV severity, respectively, and included at least one SNP of either SFTPA1 or SFTPA2. All interactions were intergenic except one, among SNPs of SFTPA1. The remaining interactions were either among SNPs of hydrophilic SPs alone (n = 8) or among SNPs of both hydrophilic or hydrophobic SPs (n = 11). Our findings indicate that SNPs of all SFTPs may contribute to genetic susceptibility to RSV severity. However, the predominant involvement of SFTPA1 and/or SFTPA2 SNPs in these interactions underscores their significance in RSV severity.NEW & NOTEWORTHY Although surfactant protein (SP) genetic variants are associated with respiratory syncytial virus (RSV) severity, the impact of single-nucleotide polymorphism (SNP)-SNP interactions of SP genes remained unexplored. Using advanced statistical models, we uncovered 22 SNP-SNP interactions associated with RSV severity, with notable involvement of SFTPA1 and SFTPA2 SNPs. This highlights the comprehensive role of all SPs in genetic susceptibility to RSV severity, shedding light on potential avenues for targeted interventions.
Collapse
Affiliation(s)
- Chintan K Gandhi
- Department of Pediatrics, The Pennsylvania State College of Medicine, Hershey, Pennsylvania, United States
| | - Lynnlee C Depicolzuane
- Department of Pediatrics, The Pennsylvania State College of Medicine, Hershey, Pennsylvania, United States
| | - Chixiang Chen
- Department of Public Health Science, The Pennsylvania State College of Medicine, Hershey, Pennsylvania, United States
| | - Catherine M Roberts
- Department of Pediatrics, The Pennsylvania State College of Medicine, Hershey, Pennsylvania, United States
| | - Natalie Sicher
- Department of Pediatrics, The Pennsylvania State College of Medicine, Hershey, Pennsylvania, United States
| | - Katelyn Johnson Wegerson
- Department of Pediatrics, The Pennsylvania State College of Medicine, Hershey, Pennsylvania, United States
| | - Neal J Thomas
- Department of Pediatrics, The Pennsylvania State College of Medicine, Hershey, Pennsylvania, United States
| | - Rongling Wu
- Department of Public Health Science, The Pennsylvania State College of Medicine, Hershey, Pennsylvania, United States
| | - Joanna Floros
- Department of Pediatrics, The Pennsylvania State College of Medicine, Hershey, Pennsylvania, United States
- Department of Obstetrics and Gynecology, The Pennsylvania State College of Medicine, Hershey, Pennsylvania, United States
| |
Collapse
|
11
|
Kikuchi M, Viet J, Nagata K, Sato M, David G, Audic Y, Silverman MA, Yamamoto M, Akatsu H, Hashizume Y, Takeda S, Akamine S, Miyamoto T, Uozumi R, Gotoh S, Mori K, Ikeda M, Paillard L, Morihara T. Gene-gene functional relationships in Alzheimer's disease: CELF1 regulates KLC1 alternative splicing. Biochem Biophys Res Commun 2024; 721:150025. [PMID: 38768546 DOI: 10.1016/j.bbrc.2024.150025] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2024] [Revised: 04/16/2024] [Accepted: 04/26/2024] [Indexed: 05/22/2024]
Abstract
The causes of Alzheimer's disease (AD) are poorly understood, although many genes are known to be involved in this pathology. To gain insights into the underlying molecular mechanisms, it is essential to identify the relationships between individual AD genes. Previous work has shown that the splice variant E of KLC1 (KLC1_vE) promotes AD, and that the CELF1 gene, which encodes an RNA-binding protein involved in splicing regulation, is at a risk locus for AD. Here, we identified a functional link between CELF1 and KLC1 in AD pathogenesis. Transcriptomic data from human samples from different ethnic groups revealed that CELF1 mRNA levels are low in AD brains, and the splicing pattern of KLC1 is strongly correlated with CELF1 expression levels. Specifically, KLC1_vE is negatively correlated with CELF1. Depletion and overexpression experiments in cultured cells demonstrated that the CELF1 protein down-regulates KLC1_vE. In a cross-linking and immunoprecipitation sequencing (CLIP-seq) database, CELF1 directly binds to KLC1 RNA, following which it likely modulates terminal exon usage, hence KLC1_vE formation. These findings reveal a new pathogenic pathway where a risk allele of CELF1 is associated with reduced CELF1 expression, which up-regulates KLC1_vE to promote AD.
Collapse
Affiliation(s)
- Masataka Kikuchi
- Department of Genome Informatics, Graduate School of Medicine, Osaka University, Suita, Japan
| | - Justine Viet
- Université de Rennes, CNRS, IGDR (Institut de Génétique et Développement de Rennes), UMR 6290, F-35000, Rennes, France
| | - Kenichi Nagata
- Department of Functional Anatomy and Neuroscience, Graduate School of Medicine, Nagoya University, Nagoya, Japan
| | - Masahiro Sato
- Department of Psychiatry, Graduate School of Medicine, Osaka University, Suita, Japan
| | - Geraldine David
- Université de Rennes, CNRS, IGDR (Institut de Génétique et Développement de Rennes), UMR 6290, F-35000, Rennes, France
| | - Yann Audic
- Université de Rennes, CNRS, IGDR (Institut de Génétique et Développement de Rennes), UMR 6290, F-35000, Rennes, France
| | - Michael A Silverman
- Department of Biological Sciences, Centre for Cell Biology, Development, and Disease, Simon Fraser University, Burnaby, Canada
| | - Mitsuko Yamamoto
- Department of Psychiatry, Graduate School of Medicine, Osaka University, Suita, Japan
| | - Hiroyasu Akatsu
- Department of Community-based Medical Education, Graduate School of Medicine, Nagoya City University, Nagoya, Japan; Choju Medical/Neuropathological Institute, Fukushimura Hospital, Toyohashi, Japan
| | | | - Shuko Takeda
- Department of Clinical Gene Therapy, Graduate School of Medicine, Osaka University, Suita, Japan; Osaka Psychiatric Medical Center, Osaka Psychiatric Research Center, Hirakata, Japan
| | - Shoshin Akamine
- Department of Psychiatry, Graduate School of Medicine, Osaka University, Suita, Japan
| | - Tesshin Miyamoto
- Department of Psychiatry, Graduate School of Medicine, Osaka University, Suita, Japan
| | - Ryota Uozumi
- Department of Psychiatry, Graduate School of Medicine, Osaka University, Suita, Japan
| | - Shiho Gotoh
- Department of Psychiatry, Graduate School of Medicine, Osaka University, Suita, Japan
| | - Kohji Mori
- Department of Psychiatry, Graduate School of Medicine, Osaka University, Suita, Japan
| | - Manabu Ikeda
- Department of Psychiatry, Graduate School of Medicine, Osaka University, Suita, Japan
| | - Luc Paillard
- Université de Rennes, CNRS, IGDR (Institut de Génétique et Développement de Rennes), UMR 6290, F-35000, Rennes, France.
| | - Takashi Morihara
- Department of Psychiatry, Graduate School of Medicine, Osaka University, Suita, Japan; Toyonaka Municipal Hospital, Toyonaka, Japan.
| |
Collapse
|
12
|
Jang MJ, Tan LJ, Park MY, Shin S, Kim JM. Identification of interactions between genetic risk scores and dietary patterns for personalized prevention of kidney dysfunction in a population-based cohort. Nutr Diabetes 2024; 14:62. [PMID: 39143076 PMCID: PMC11325018 DOI: 10.1038/s41387-024-00316-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/16/2023] [Revised: 07/09/2024] [Accepted: 07/16/2024] [Indexed: 08/16/2024] Open
Abstract
BACKGROUND & AIM Chronic kidney disease (CKD) is a heterogeneous disorder that affects the kidney structure and function. This study investigated the effect of the interaction between genetic factors and dietary pattern on kidney dysfunction in Korean adults. METHODS Baseline data were obtained from the Ansan and Ansung Study of the Korean Genome and Epidemiology Study involving 8230 participants aged 40-69 years. Kidney dysfunction was defined as an estimated glomerular filtration rate < 90 mL/minute/1.73 m2. Genomic DNAs genotyped on the Affymetrix® Genome-Wide Human SNP array 5.0 were isolated from peripheral blood. A genome-wide association study using a generalized linear model was performed on 1,590,162 single-nucleotide polymorphisms (SNPs). To select significant SNPs, the threshold criterion was set at P-value < 5 × 10-8. Linkage disequilibrium clumping was performed based on the R2 value, and 94 SNPs had a significant effect. Participants were divided into two groups based on their generic risk score (GRS): the low-GR group had GRS > 0, while the high-GR group had GRS ≤ 0. RESULTS Three distinct dietary patterns were extracted, namely, the "prudent pattern," "flour-based and animal food pattern," and "white rice pattern," to analyze the effect of dietary pattern on kidney function. In the "flour-based and animal food pattern," higher pattern scores were associated with a higher prevalence of kidney dysfunction in both the low and high GR groups (P for trend < 0.0001 in the low-, high-GR groups of model 1; 0.0050 and 0.0065 in the low-, high-GR groups of model 2, respectively). CONCLUSIONS The results highlight a significant association between the 'flour-based and animal food pattern' and higher kidney dysfunction prevalence in individuals with both low and high GR. These findings suggest that personalized nutritional interventions based on GR profiles may become the basis for presenting GR-based individual dietary patterns for kidney dysfunction.
Collapse
Affiliation(s)
- Min-Jae Jang
- Department of Animal Science and Technology, Chung-Ang University, Gyeonggi-do, 17546, Korea
| | - Li-Juan Tan
- Department of Food and Nutrition, Chung-Ang University, Gyeonggi-do, 17546, Korea
| | - Min Young Park
- Department of Molecular Pathobiology, NYU College of Dentistry, New York, USA
| | - Sangah Shin
- Department of Food and Nutrition, Chung-Ang University, Gyeonggi-do, 17546, Korea.
| | - Jun-Mo Kim
- Department of Animal Science and Technology, Chung-Ang University, Gyeonggi-do, 17546, Korea.
| |
Collapse
|
13
|
Lin HY, Mazumder H, Sarkar I, Huang PY, Eeles RA, Kote-Jarai Z, Muir KR, Schleutker J, Pashayan N, Batra J, Neal DE, Nielsen SF, Nordestgaard BG, Grönberg H, Wiklund F, MacInnis RJ, Haiman CA, Travis RC, Stanford JL, Kibel AS, Cybulski C, Khaw KT, Maier C, Thibodeau SN, Teixeira MR, Cannon-Albright L, Brenner H, Kaneva R, Pandha H, Park JY. Cluster effect for SNP-SNP interaction pairs for predicting complex traits. Sci Rep 2024; 14:18677. [PMID: 39134575 PMCID: PMC11319716 DOI: 10.1038/s41598-024-66311-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2024] [Accepted: 07/01/2024] [Indexed: 08/15/2024] Open
Abstract
Single nucleotide polymorphism (SNP) interactions are the key to improving polygenic risk scores. Previous studies reported several significant SNP-SNP interaction pairs that shared a common SNP to form a cluster, but some identified pairs might be false positives. This study aims to identify factors associated with the cluster effect of false positivity and develop strategies to enhance the accuracy of SNP-SNP interactions. The results showed the cluster effect is a major cause of false-positive findings of SNP-SNP interactions. This cluster effect is due to high correlations between a causal pair and null pairs in a cluster. The clusters with a hub SNP with a significant main effect and a large minor allele frequency (MAF) tended to have a higher false-positive rate. In addition, peripheral null SNPs in a cluster with a small MAF tended to enhance false positivity. We also demonstrated that using the modified significance criterion based on the 3 p-value rules and the bootstrap approach (3pRule + bootstrap) can reduce false positivity and maintain high true positivity. In addition, our results also showed that a pair without a significant main effect tends to have weak or no interaction. This study identified the cluster effect and suggested using the 3pRule + bootstrap approach to enhance SNP-SNP interaction detection accuracy.
Collapse
Affiliation(s)
- Hui-Yi Lin
- Biostatistics and Data Science Program, School of Public Health, Louisiana State University Health Sciences Center, New Orleans, LA, 70112, USA.
| | - Harun Mazumder
- Biostatistics and Data Science Program, School of Public Health, Louisiana State University Health Sciences Center, New Orleans, LA, 70112, USA
| | - Indrani Sarkar
- Biostatistics and Data Science Program, School of Public Health, Louisiana State University Health Sciences Center, New Orleans, LA, 70112, USA
| | - Po-Yu Huang
- Information and Communications Research Laboratories, Industrial Technology Research Institute, Hsinchu, Taiwan
| | - Rosalind A Eeles
- The Institute of Cancer Research, London, SM2 5NG, UK
- Royal Marsden NHS Foundation Trust, London, SW3 6JJ, UK
| | | | - Kenneth R Muir
- Division of Population Health, Health Services Research and Primary Care, University of Manchester, Oxford Road, Manchester, M13 9PL, UK
| | - Johanna Schleutker
- Institute of Biomedicine, University of Turku, Turku, Finland
- Department of Medical Genetics, Genomics, Laboratory Division, Turku University Hospital, PO Box 52, 20521, Turku, Finland
| | - Nora Pashayan
- Department of Applied Health Research, University College London, London, WC1E 7HB, UK
- Centre for Cancer Genetic Epidemiology, Department of Oncology, University of Cambridge, Strangeways Laboratory, Worts Causeway, Cambridge, CB1 8RN, UK
| | - Jyotsna Batra
- Australian Prostate Cancer Research Centre-Qld, Institute of Health and Biomedical Innovation and School of Biomedical Science, Queensland University of Technology, Brisbane, QLD, 4059, Australia
- Translational Research Institute, Brisbane, QLD, 4102, Australia
| | - David E Neal
- Nuffield Department of Surgical Sciences, University of Oxford, John Radcliffe Hospital, Room 6603, Level 6, Headley Way, Headington, Oxford, OX3 9DU, UK
- Department of Oncology, University of Cambridge, Addenbrooke's Hospital, Hills Road, Box 279, Cambridge, CB2 0QQ, UK
- Cancer Research UK, Cambridge Research Institute, Li Ka Shing Centre, Cambridge, CB2 0RE, UK
| | - Sune F Nielsen
- Faculty of Health and Medical Sciences, University of Copenhagen, 2200, Copenhagen, Denmark
- Department of Clinical Biochemistry, Herlev and Gentofte Hospital, Copenhagen University Hospital, Herlev, 2200, Copenhagen, Denmark
| | - Børge G Nordestgaard
- Faculty of Health and Medical Sciences, University of Copenhagen, 2200, Copenhagen, Denmark
- Department of Clinical Biochemistry, Herlev and Gentofte Hospital, Copenhagen University Hospital, Herlev, 2200, Copenhagen, Denmark
| | - Henrik Grönberg
- Department of Medical Epidemiology and Biostatistics, Karolinska Institute, 171 77, Stockholm, Sweden
| | - Fredrik Wiklund
- Department of Medical Epidemiology and Biostatistics, Karolinska Institute, 171 77, Stockholm, Sweden
| | - Robert J MacInnis
- Cancer Epidemiology Division, Cancer Council Victoria, 200 Victoria Parade, East Melbourne, 3002, Australia
- Centre for Epidemiology and Biostatistics, Melbourne School of Population and Global Health, The University of Melbourne, Grattan Street, Parkville, VIC, 3010, Australia
| | - Christopher A Haiman
- Center for Genetic Epidemiology, Department of Preventive Medicine, Keck School of Medicine, University of Southern California/Norris Comprehensive Cancer Center, Los Angeles, CA, 90015, USA
| | - Ruth C Travis
- Cancer Epidemiology Unit, Nuffield Department of Population Health, University of Oxford, Oxford, OX3 7LF, UK
| | - Janet L Stanford
- Division of Public Health Sciences, Fred Hutchinson Cancer Research Center, Seattle, Washington, 98109-1024, USA
- Department of Epidemiology, School of Public Health, University of Washington, Seattle, WA, 98195, USA
| | - Adam S Kibel
- Division of Urologic Surgery, Brigham and Womens Hospital, 75 Francis Street, Boston, MA, 02115, USA
| | - Cezary Cybulski
- International Hereditary Cancer Center, Department of Genetics and Pathology, Pomeranian Medical University, 70-115, Szczecin, Poland
| | - Kay-Tee Khaw
- Clinical Gerontology Unit, University of Cambridge, Cambridge, CB2 2QQ, UK
| | - Christiane Maier
- Humangenetik Tuebingen, Paul-Ehrlich-Str 23, 72076, Tuebingen, Germany
| | - Stephen N Thibodeau
- Department of Laboratory Medicine and Pathology, Mayo Clinic, Rochester, MN, 55905, USA
| | - Manuel R Teixeira
- Department of Laboratory Genetics, Portuguese Oncology Institute of Porto (IPO Porto)/Porto Comprehensive Cancer Center, Porto, Portugal
- Cancer Genetics Group, IPO Porto Research Center (CI-IPOP)/RISE@CI-IPOP (Health Research Network), Portuguese Oncology Institute of Porto (IPO Porto)/Porto Comprehensive Cancer Center, Porto, Portugal
- School of Medicine and Biomedical Sciences (ICBAS), University of Porto, Porto, Portugal
| | - Lisa Cannon-Albright
- Division of Epidemiology, Department of Internal Medicine, University of Utah School of Medicine, Salt Lake City, UT, 84132, USA
- George E. Wahlen Department of Veterans Affairs Medical Center, Salt Lake City, UT, 84148, USA
| | - Hermann Brenner
- Division of Clinical Epidemiology and Aging Research, German Cancer Research Center (DKFZ), 69120, Heidelberg, Germany
- German Cancer Consortium (DKTK), German Cancer Research Center (DKFZ), 69120, Heidelberg, Germany
- Division of Preventive Oncology, German Cancer Research Center (DKFZ) and National Center for Tumor Diseases (NCT), Im Neuenheimer Feld 460, 69120, Heidelberg, Germany
| | - Radka Kaneva
- Molecular Medicine Center, Department of Medical Chemistry and Biochemistry, Medical University of Sofia, Sofia, 2 Zdrave Str., 1431, Sofia, Bulgaria
| | - Hardev Pandha
- The University of Surrey, Guildford, Surrey, GU2 7XH, UK
| | - Jong Y Park
- Department of Cancer Epidemiology, Moffitt Cancer Center, 12902 Magnolia Drive, Tampa, FL, 33612, USA
| |
Collapse
|
14
|
Hsieh AR, Tsai CY. Biomedical literature mining: graph kernel-based learning for gene-gene interaction extraction. Eur J Med Res 2024; 29:404. [PMID: 39095899 PMCID: PMC11297645 DOI: 10.1186/s40001-024-01983-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2023] [Accepted: 07/17/2024] [Indexed: 08/04/2024] Open
Abstract
The supervised machine learning method is often used for biomedical relationship extraction. The disadvantage is that it requires much time and money to manually establish an annotated dataset. Based on distant supervision, the knowledge base is combined with the corpus, thus, the training corpus can be automatically annotated. As many biomedical databases provide knowledge bases for study with a limited number of annotated corpora, this method is practical in biomedicine. The clinical significance of each patient's genetic makeup can be understood based on the healthcare provider's genetic database. Unfortunately, the lack of previous biomedical relationship extraction studies focuses on gene-gene interaction. The main purpose of this study is to develop extraction methods for gene-gene interactions that can help explain the heritability of human complex diseases. This study referred to the information on gene-gene interactions in the KEGG PATHWAY database, the abstracts in PubMed were adopted to generate the training sample set, and the graph kernel method was adopted to extract gene-gene interactions. The best assessment result was an F1-score of 0.79. Our developed distant supervision method automatically finds sentences through the corpus without manual labeling for extracting gene-gene interactions, which can effectively reduce the time cost for manual annotation data; moreover, the relationship extraction method based on a graph kernel can be successfully applied to extract gene-gene interactions. In this way, the results of this study are expected to help achieve precision medicine.
Collapse
Affiliation(s)
- Ai-Ru Hsieh
- Department of Statistics, Tamkang University, Tamsui District, New Taipei City, 251301, Taiwan.
| | - Chen-Yu Tsai
- Department of Statistics, Tamkang University, Tamsui District, New Taipei City, 251301, Taiwan
| |
Collapse
|
15
|
Chitra U, Arnold BJ, Raphael BJ. Quantifying higher-order epistasis: beware the chimera. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.07.17.603976. [PMID: 39071303 PMCID: PMC11275791 DOI: 10.1101/2024.07.17.603976] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/30/2024]
Abstract
Epistasis, or interactions in which alleles at one locus modify the fitness effects of alleles at other loci, plays a fundamental role in genetics, protein evolution, and many other areas of biology. Epistasis is typically quantified by computing the deviation from the expected fitness under an additive or multiplicative model using one of several formulae. However, these formulae are not all equivalent. Importantly, one widely used formula - which we call the chimeric formula - measures deviations from a multiplicative fitness model on an additive scale, thus mixing two measurement scales. We show that for pairwise interactions, the chimeric formula yields a different magnitude, but the same sign (synergistic vs. antagonistic) of epistasis compared to the multiplicative formula that measures both fitness and deviations on a multiplicative scale. However, for higher-order interactions, we show that the chimeric formula can have both different magnitude and sign compared to the multiplicative formula - thus confusing negative epistatic interactions with positive interactions, and vice versa. We resolve these inconsistencies by deriving fundamental connections between the different epistasis formulae and the parameters of the multivariate Bernoulli distribution . Our results demonstrate that the additive and multiplicative epistasis formulae are more mathematically sound than the chimeric formula. Moreover, we demonstrate that the mathematical issues with the chimeric epistasis formula lead to markedly different biological interpretations of real data. Analyzing multi-gene knockout data in yeast, multi-way drug interactions in E. coli , and deep mutational scanning (DMS) of several proteins, we find that 10 - 60% of higher-order interactions have a change in sign with the multiplicative or additive epistasis formula. These sign changes result in qualitatively different findings on functional divergence in the yeast genome, synergistic vs. antagonistic drug interactions, and and epistasis between protein mutations. In particular, in the yeast data, the more appropriate multiplicative formula identifies nearly 500 additional negative three-way interactions, thus extending the trigenic interaction network by 25%.
Collapse
|
16
|
Khemka N, Morris G, Kazemzadeh L, Costard LS, Neubert V, Bauer S, Rosenow F, Venø MT, Kjems J, Henshall DC, Prehn JHM, Connolly NMC. Integrative network analysis of miRNA-mRNA expression profiles during epileptogenesis in rats reveals therapeutic targets after emergence of first spontaneous seizure. Sci Rep 2024; 14:15313. [PMID: 38961125 PMCID: PMC11222454 DOI: 10.1038/s41598-024-66117-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2023] [Accepted: 06/27/2024] [Indexed: 07/05/2024] Open
Abstract
Epileptogenesis is the process by which a normal brain becomes hyperexcitable and capable of generating spontaneous recurrent seizures. The extensive dysregulation of gene expression associated with epileptogenesis is shaped, in part, by microRNAs (miRNAs) - short, non-coding RNAs that negatively regulate protein levels. Functional miRNA-mediated regulation can, however, be difficult to elucidate due to the complexity of miRNA-mRNA interactions. Here, we integrated miRNA and mRNA expression profiles sampled over multiple time-points during and after epileptogenesis in rats, and applied bi-clustering and Bayesian modelling to construct temporal miRNA-mRNA-mRNA interaction networks. Network analysis and enrichment of network inference with sequence- and human disease-specific information identified key regulatory miRNAs with the strongest influence on the mRNA landscape, and miRNA-mRNA interactions closely associated with epileptogenesis and subsequent epilepsy. Our findings underscore the complexity of miRNA-mRNA regulation, can be used to prioritise miRNA targets in specific systems, and offer insights into key regulatory processes in epileptogenesis with therapeutic potential for further investigation.
Collapse
Affiliation(s)
- Niraj Khemka
- Centre for Systems Medicine & Dept. of Physiology & Medical Physics, RCSI University of Medicine and Health Sciences, Dublin, Ireland
| | - Gareth Morris
- FutureNeuro SFI Research Centre, RCSI University of Medicine and Health Sciences, Dublin, Ireland
- Neuroscience, Physiology and Pharmacology, University College London, London, UK
- Division of Neuroscience, University of Manchester, Manchester, UK
| | - Laleh Kazemzadeh
- Centre for Systems Medicine & Dept. of Physiology & Medical Physics, RCSI University of Medicine and Health Sciences, Dublin, Ireland
| | - Lara S Costard
- Epilepsy Center, Department of Neurology, Philipps University Marburg, Marburg, Germany
- Epilepsy Center Frankfurt Rhine-Main, Neurocenter, University Hospital Frankfurt and Center for Personalized Translational Epilepsy Research, Goethe-University, Frankfurt, Germany
| | - Valentin Neubert
- Epilepsy Center, Department of Neurology, Philipps University Marburg, Marburg, Germany
- Epilepsy Center Frankfurt Rhine-Main, Neurocenter, University Hospital Frankfurt and Center for Personalized Translational Epilepsy Research, Goethe-University, Frankfurt, Germany
| | - Sebastian Bauer
- Epilepsy Center, Department of Neurology, Philipps University Marburg, Marburg, Germany
- Epilepsy Center Frankfurt Rhine-Main, Neurocenter, University Hospital Frankfurt and Center for Personalized Translational Epilepsy Research, Goethe-University, Frankfurt, Germany
| | - Felix Rosenow
- Epilepsy Center, Department of Neurology, Philipps University Marburg, Marburg, Germany
- Epilepsy Center Frankfurt Rhine-Main, Neurocenter, University Hospital Frankfurt and Center for Personalized Translational Epilepsy Research, Goethe-University, Frankfurt, Germany
| | - Morten T Venø
- Interdisciplinary Nanoscience Center, Dept. of Molecular Biology and Genetics, Aarhus University, Aarhus, Denmark
- Omiics ApS, Aarhus, Denmark
| | - Jørgen Kjems
- Interdisciplinary Nanoscience Center, Dept. of Molecular Biology and Genetics, Aarhus University, Aarhus, Denmark
| | - David C Henshall
- Centre for Systems Medicine & Dept. of Physiology & Medical Physics, RCSI University of Medicine and Health Sciences, Dublin, Ireland
- FutureNeuro SFI Research Centre, RCSI University of Medicine and Health Sciences, Dublin, Ireland
| | - Jochen H M Prehn
- Centre for Systems Medicine & Dept. of Physiology & Medical Physics, RCSI University of Medicine and Health Sciences, Dublin, Ireland.
- FutureNeuro SFI Research Centre, RCSI University of Medicine and Health Sciences, Dublin, Ireland.
| | - Niamh M C Connolly
- Centre for Systems Medicine & Dept. of Physiology & Medical Physics, RCSI University of Medicine and Health Sciences, Dublin, Ireland.
- FutureNeuro SFI Research Centre, RCSI University of Medicine and Health Sciences, Dublin, Ireland.
| |
Collapse
|
17
|
Aranda S, Jiménez E, Canales-Rodríguez EJ, Verdolini N, Alonso S, Sepúlveda E, Julià A, Marsal S, Bobes J, Sáiz PA, García-Portilla P, Menchón JM, Crespo JM, González-Pinto A, Pérez V, Arango C, Sierra P, Sanjuán J, Pomarol-Clotet E, Vieta E, Vilella E. Processing speed mediates the relationship between DDR1 and psychosocial functioning in euthymic patients with bipolar disorder presenting psychotic symptoms. Mol Psychiatry 2024; 29:2050-2058. [PMID: 38374360 DOI: 10.1038/s41380-024-02480-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/28/2023] [Revised: 02/01/2024] [Accepted: 02/05/2024] [Indexed: 02/21/2024]
Abstract
The DDR1 locus is associated with the diagnosis of schizophrenia and with processing speed in patients with schizophrenia and first-episode psychosis. Here, we investigated whether DDR1 variants are associated with bipolar disorder (BD) features. First, we performed a case‒control association study comparing DDR1 variants between patients with BD and healthy controls. Second, we performed linear regression analyses to assess the associations of DDR1 variants with neurocognitive domains and psychosocial functioning. Third, we conducted a mediation analysis to explore whether neurocognitive impairment mediated the association between DDR1 variants and psychosocial functioning in patients with BD. Finally, we studied the association between DDR1 variants and white matter microstructure. We did not find any statistically significant associations in the case‒control association study; however, we found that the combined genotypes rs1264323AA-rs2267641AC/CC were associated with worse neurocognitive performance in patients with BD with psychotic symptoms. In addition, the combined genotypes rs1264323AA-rs2267641AC/CC were associated with worse psychosocial functioning through processing speed. We did not find correlations between white matter microstructure abnormalities and the neurocognitive domains associated with the combined genotypes rs1264323AA-rs2267641AC/CC. Overall, the results suggest that DDR1 may be a marker of worse neurocognitive performance and psychosocial functioning in patients with BD, specifically those with psychotic symptoms.
Collapse
Affiliation(s)
- Selena Aranda
- Institut d'Investigació Sanitària Pere Virgili-CERCA, Reus, Spain
- Hospital Universitari Institut Pere Mata, Reus, Spain
- Universitat Rovira i Virgili, Reus, Spain
- Centro de Investigación Biomédica en Red en Salud Mental (CIBERSAM)-Instituto de Salud Carlos III, Madrid, Spain
| | - Esther Jiménez
- Centro de Investigación Biomédica en Red en Salud Mental (CIBERSAM)-Instituto de Salud Carlos III, Madrid, Spain
- Bipolar and Depressive Disorders Unit, Hospital Clinic, Institute of Neurosciences, University of Barcelona, IDIBAPS, Barcelona, Spain
- Department of Psychiatry, University of the Basque Country (UPV-EHU), Vitoria-Gasteiz, Spain
| | - Erick J Canales-Rodríguez
- Centro de Investigación Biomédica en Red en Salud Mental (CIBERSAM)-Instituto de Salud Carlos III, Madrid, Spain
- FIDMAG Germanes Hospitalàries Research Foundation, Sant Boi de Llobregat, Barcelona, Spain
- Signal Processing Laboratory (LTS5), École Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland
| | - Norma Verdolini
- Centro de Investigación Biomédica en Red en Salud Mental (CIBERSAM)-Instituto de Salud Carlos III, Madrid, Spain
- Bipolar and Depressive Disorders Unit, Hospital Clinic, Institute of Neurosciences, University of Barcelona, IDIBAPS, Barcelona, Spain
- FIDMAG Germanes Hospitalàries Research Foundation, Sant Boi de Llobregat, Barcelona, Spain
| | - Silvia Alonso
- Centro de Investigación Biomédica en Red en Salud Mental (CIBERSAM)-Instituto de Salud Carlos III, Madrid, Spain
- Bipolar and Depressive Disorders Unit, Hospital Clinic, Institute of Neurosciences, University of Barcelona, IDIBAPS, Barcelona, Spain
| | - Esteban Sepúlveda
- Institut d'Investigació Sanitària Pere Virgili-CERCA, Reus, Spain
- Hospital Universitari Institut Pere Mata, Reus, Spain
- Universitat Rovira i Virgili, Reus, Spain
- Centro de Investigación Biomédica en Red en Salud Mental (CIBERSAM)-Instituto de Salud Carlos III, Madrid, Spain
| | - Antonio Julià
- Rheumatology Research Group, Vall d'Hebron Research Institute, Barcelona, Spain
| | - Sara Marsal
- Rheumatology Research Group, Vall d'Hebron Research Institute, Barcelona, Spain
| | - Julio Bobes
- Centro de Investigación Biomédica en Red en Salud Mental (CIBERSAM)-Instituto de Salud Carlos III, Madrid, Spain
- Department of Psychiatry, Universidad de Oviedo, Oviedo, Spain
- nstituto de Investigación Sanitaria del Principado de Asturias (ISPA), Oviedo, Spain
- Instituto Universitario de Neurociencias del Principado de Asturias (INEUROPA), Oviedo, Spain
- Servicio de Salud del Principado de Asturias (SESPA) Oviedo, Oviedo, Spain
| | - Pilar A Sáiz
- Centro de Investigación Biomédica en Red en Salud Mental (CIBERSAM)-Instituto de Salud Carlos III, Madrid, Spain
- Department of Psychiatry, Universidad de Oviedo, Oviedo, Spain
- nstituto de Investigación Sanitaria del Principado de Asturias (ISPA), Oviedo, Spain
- Instituto Universitario de Neurociencias del Principado de Asturias (INEUROPA), Oviedo, Spain
- Servicio de Salud del Principado de Asturias (SESPA) Oviedo, Oviedo, Spain
| | - Paz García-Portilla
- Centro de Investigación Biomédica en Red en Salud Mental (CIBERSAM)-Instituto de Salud Carlos III, Madrid, Spain
- Department of Psychiatry, Universidad de Oviedo, Oviedo, Spain
- nstituto de Investigación Sanitaria del Principado de Asturias (ISPA), Oviedo, Spain
- Instituto Universitario de Neurociencias del Principado de Asturias (INEUROPA), Oviedo, Spain
- Servicio de Salud del Principado de Asturias (SESPA) Oviedo, Oviedo, Spain
| | - Jose M Menchón
- Centro de Investigación Biomédica en Red en Salud Mental (CIBERSAM)-Instituto de Salud Carlos III, Madrid, Spain
- Bellvitge Biomedical Research Institute-IDIBELL, Bellvitge University Hospital, Barcelona, Spain
- Department of Clinical Sciences, University of Barcelona, Barcelona, Spain
| | - José M Crespo
- Centro de Investigación Biomédica en Red en Salud Mental (CIBERSAM)-Instituto de Salud Carlos III, Madrid, Spain
- Bellvitge Biomedical Research Institute-IDIBELL, Bellvitge University Hospital, Barcelona, Spain
- Department of Clinical Sciences, University of Barcelona, Barcelona, Spain
| | - Ana González-Pinto
- Centro de Investigación Biomédica en Red en Salud Mental (CIBERSAM)-Instituto de Salud Carlos III, Madrid, Spain
- Department of Psychiatry, University of the Basque Country (UPV-EHU), Vitoria-Gasteiz, Spain
- Araba University Hospital, Bioaraba Research Institute, UPV/EHU, Vitoria-Gasteiz, Spain
| | - Víctor Pérez
- Centro de Investigación Biomédica en Red en Salud Mental (CIBERSAM)-Instituto de Salud Carlos III, Madrid, Spain
- Hospital de Mar. Mental Health Institute, Barcelona, Spain
- Neurosciences Research Unit, Hospital del Mar d'Investigacions Mèdiques, Barcelona, Spain
- Department of Experimental and Health Sciences, Pompeu Fabra University, Barcelona, Spain
| | - Celso Arango
- Centro de Investigación Biomédica en Red en Salud Mental (CIBERSAM)-Instituto de Salud Carlos III, Madrid, Spain
- Institute of Psychiatry and Mental Health, Madrid, Spain
- Hospital General Universitario Gregorio Marañón, Madrid, Spain
- Universidad Complutense, Madrid, Spain
| | - Pilar Sierra
- La Fe University and Polytechnic Hospital, Valencia, Spain
- Department of Psychiatry, School of Medicine, University of Valencia, Valencia, Spain
| | - Julio Sanjuán
- Centro de Investigación Biomédica en Red en Salud Mental (CIBERSAM)-Instituto de Salud Carlos III, Madrid, Spain
- Department of Psychiatry, School of Medicine, University of Valencia, Valencia, Spain
| | - Edith Pomarol-Clotet
- Centro de Investigación Biomédica en Red en Salud Mental (CIBERSAM)-Instituto de Salud Carlos III, Madrid, Spain
- FIDMAG Germanes Hospitalàries Research Foundation, Sant Boi de Llobregat, Barcelona, Spain
| | - Eduard Vieta
- Centro de Investigación Biomédica en Red en Salud Mental (CIBERSAM)-Instituto de Salud Carlos III, Madrid, Spain
- Bipolar and Depressive Disorders Unit, Hospital Clinic, Institute of Neurosciences, University of Barcelona, IDIBAPS, Barcelona, Spain
| | - Elisabet Vilella
- Institut d'Investigació Sanitària Pere Virgili-CERCA, Reus, Spain.
- Hospital Universitari Institut Pere Mata, Reus, Spain.
- Universitat Rovira i Virgili, Reus, Spain.
- Centro de Investigación Biomédica en Red en Salud Mental (CIBERSAM)-Instituto de Salud Carlos III, Madrid, Spain.
| |
Collapse
|
18
|
Wang X, Xin B, Guo M, Yu G, Wang J. GWASTool: A web pipeline for detecting SNP-phenotype associations. FUNDAMENTAL RESEARCH 2024; 4:761-769. [PMID: 39660349 PMCID: PMC11630686 DOI: 10.1016/j.fmre.2024.03.005] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/25/2023] [Revised: 02/19/2024] [Accepted: 03/11/2024] [Indexed: 12/12/2024] Open
Abstract
The genome-wide association study (GWAS) aims to detect associations between individual single nucleotide polymorphisms (SNPs) or SNP interactions and phenotypes to decipher the genetic mechanism. Existing GWAS analysis tools have different focuses and advantages, but suffer a series of tedious and heterogeneous configurations for computation. It is inconvenient for researchers to simply choose and apply these tools, statistically and biologically analyze their results for different usages. To address these issues, we develop a user friendly web pipeline GWASTool for detecting associations, which includes simulation data generation, associated loci detection, result visualization, analysis and comparison. GWASTool provides a unified and plugin-able framework to encapsulate the heterogeneity of GWAS algorithms, simplifies the analysis steps and energizes GWAS tasks. GWASTool is implemented in Java and is freely available for public use at http://www.sdu-idea.cn/GWASTool. The website hosts a comprehensive collection of resources, including a user manual, description of integrated algorithms, data examples and standalone version for download.
Collapse
Affiliation(s)
- Xin Wang
- School of Software, Shandong University, Jinan 250101, China
- Joint SDU-NTU Centre for Artificial Intelligence Research, Shandong University, Jinan 250101, China
| | - Beibei Xin
- College of Agronomy & Biotechnology, China Agricultural University, Beijing 100193, China
| | - Maozu Guo
- College of Electrical and Information Engineering, Beijing University of Civil Engineering and Architecture, Beijing 100044, China
| | - Guoxian Yu
- School of Software, Shandong University, Jinan 250101, China
- Joint SDU-NTU Centre for Artificial Intelligence Research, Shandong University, Jinan 250101, China
| | - Jun Wang
- Joint SDU-NTU Centre for Artificial Intelligence Research, Shandong University, Jinan 250101, China
| |
Collapse
|
19
|
Šetinc M, Celinšćak Ž, Bočkor L, Zajc Petranović M, Stojanović Marković A, Peričić Salihović M, Deelen J, Škarić-Jurić T. The role of longevity-related genetic variant interactions as predictors of survival after 85 years of age. Mech Ageing Dev 2024; 219:111926. [PMID: 38484896 PMCID: PMC11166054 DOI: 10.1016/j.mad.2024.111926] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2024] [Revised: 02/27/2024] [Accepted: 03/11/2024] [Indexed: 03/26/2024]
Abstract
Genome-wide association studies and candidate gene studies have identified several genetic variants that might play a role in achieving longevity. This study investigates interactions between pairs of those single nucleotide polymorphisms (SNPs) and their effect on survival above the age of 85 in a sample of 327 Croatian individuals. Although none of the SNPs individually showed a significant effect on survival in this sample, 14 of the 359 interactions tested (between SNPs not in LD) reached the level of nominal significance (p<0.05), showing a potential effect on late-life survival. Notably, SH2B3 rs3184504 interacted with different SNPs near TERC, TP53 rs1042522 with different SNPs located near the CDKN2B gene, and CDKN2B rs1333049 with different SNPs in FOXO3, as well as with LINC02227 rs2149954. The other interaction pairs with a possible effect on survival were FOXO3 rs2802292 and ERCC2 rs50871, IL6 rs1800795 and GHRHR rs2267723, LINC02227 rs2149954 and PARK7 rs225119, as well as PARK7 rs225119 and PTPN1 rs6067484. These interactions remained significant when tested together with a set of health-related variables that also had a significant effect on survival above 85 years. In conclusion, our results confirm the central role of genetic regulation of insulin signalling and cell cycle control in longevity.
Collapse
Affiliation(s)
- Maja Šetinc
- Institute for Anthropological Research, Zagreb 10000, Croatia; Centre for Applied Bioanthropology, Institute for Anthropological Research, Zagreb 10000, Croatia.
| | | | - Luka Bočkor
- Institute for Anthropological Research, Zagreb 10000, Croatia; Centre for Applied Bioanthropology, Institute for Anthropological Research, Zagreb 10000, Croatia
| | | | | | | | - Joris Deelen
- Max Planck Institute for Biology of Ageing, Cologne 50931, Germany; Cologne Excellence Cluster on Cellular Stress Responses in Ageing-Associated Diseases (CECAD), University of Cologne, Cologne 50931, Germany.
| | | |
Collapse
|
20
|
Dai L, Ye Y, Mugaany J, Hu Z, Huang J, Lu C. Leveraging pQTL-based Mendelian randomization to identify new treatment prospects for primary biliary cholangitis and primary sclerosing cholangitis. Aging (Albany NY) 2024; 16:9228-9250. [PMID: 38809509 PMCID: PMC11164478 DOI: 10.18632/aging.205867] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/26/2023] [Accepted: 04/15/2024] [Indexed: 05/30/2024]
Abstract
Primary biliary cholangitis (PBC) and primary sclerosing cholangitis (PSC) are autoimmune disorders characterized by progressive and chronic damage to the bile ducts, presenting clinicians with significant challenges. The objective of this study is to identify potential druggable targets to offer new avenues for treatment. A Mendelian randomization analysis was performed to identify druggable targets for PBC and PSC. This involved obtaining Cis-protein quantitative trait loci (Cis-pQTL) data from the deCODE database to serve as exposure. Outcome data for PBC (557 cases and 281,127 controls) and PSC (1,715 cases and 330,903 controls) were obtained from the FINNGEN database. Colocalization analysis was conducted to determine whether these features share the same associated SNPs. Validation of the expression level of druggable targets was done using the GSE119600 dataset and immunohistochemistry for clinical samples. Lastly, the DRUGBANK database was used to predict potential drugs. The MR analysis identified eight druggable targets each for PBC and PSC. Subsequent summary-data-based MR and colocalization analyses showed that LEFTY2 had strong evidence as a therapeutic candidate for PBC, while HSPB1 had moderate evidence. For PSC, only FCGR3B showed strong evidence as a therapeutic candidate. Additionally, upregulated expression of these genes was validated in PBC and PSC groups by GEO dataset and clinical samples. This study identifies two novel druggable targets with strong evidence for therapeutic candidates for PBC (LEFTY2 and HSPB1) and one for PSC (FCGR3B). These targets offer new therapeutic opportunities to address the challenging nature of PBC and PSC treatment.
Collapse
Affiliation(s)
- Lei Dai
- Department of Hepato-Pancreato-Biliary Surgery, Ningbo Medical Centre Lihuili Hospital, The Affiliated Hospital of Ningbo University, Ningbo, Zhejiang 315040, China
| | - Yunyan Ye
- Department of Ophthalmology, Ningbo Medical Centre Lihuili Hospital, The Affiliated Hospital of Ningbo University, Ningbo, Zhejiang 315040, China
| | - Joseph Mugaany
- Department of Hepato-Pancreato-Biliary Surgery, Ningbo Medical Centre Lihuili Hospital, The Affiliated Hospital of Ningbo University, Ningbo, Zhejiang 315040, China
- Health Science Center, Ningbo University, Ningbo 315211, China
| | - Zetong Hu
- Department of Hepato-Pancreato-Biliary Surgery, Ningbo Medical Centre Lihuili Hospital, The Affiliated Hospital of Ningbo University, Ningbo, Zhejiang 315040, China
- Health Science Center, Ningbo University, Ningbo 315211, China
| | - Jing Huang
- Department of Hepato-Pancreato-Biliary Surgery, Ningbo Medical Centre Lihuili Hospital, The Affiliated Hospital of Ningbo University, Ningbo, Zhejiang 315040, China
| | - Changjiang Lu
- Department of Hepato-Pancreato-Biliary Surgery, Ningbo Medical Centre Lihuili Hospital, The Affiliated Hospital of Ningbo University, Ningbo, Zhejiang 315040, China
| |
Collapse
|
21
|
Ozminkowski S, Solís‐Lemus C. Identifying microbial drivers in biological phenotypes with a Bayesian network regression model. Ecol Evol 2024; 14:e11039. [PMID: 38774136 PMCID: PMC11106058 DOI: 10.1002/ece3.11039] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2023] [Revised: 01/29/2024] [Accepted: 02/03/2024] [Indexed: 05/24/2024] Open
Abstract
In Bayesian Network Regression models, networks are considered the predictors of continuous responses. These models have been successfully used in brain research to identify regions in the brain that are associated with specific human traits, yet their potential to elucidate microbial drivers in biological phenotypes for microbiome research remains unknown. In particular, microbial networks are challenging due to their high dimension and high sparsity compared to brain networks. Furthermore, unlike in brain connectome research, in microbiome research, it is usually expected that the presence of microbes has an effect on the response (main effects), not just the interactions. Here, we develop the first thorough investigation of whether Bayesian Network Regression models are suitable for microbial datasets on a variety of synthetic and real data under diverse biological scenarios. We test whether the Bayesian Network Regression model that accounts only for interaction effects (edges in the network) is able to identify key drivers (microbes) in phenotypic variability. We show that this model is indeed able to identify influential nodes and edges in the microbial networks that drive changes in the phenotype for most biological settings, but we also identify scenarios where this method performs poorly which allows us to provide practical advice for domain scientists aiming to apply these tools to their datasets. BNR models provide a framework for microbiome researchers to identify connections between microbes and measured phenotypes. We allow the use of this statistical model by providing an easy-to-use implementation which is publicly available Julia package at https://github.com/solislemuslab/BayesianNetworkRegression.jl.
Collapse
Affiliation(s)
- Samuel Ozminkowski
- Department of Statistics and Wisconsin Institute for DiscoveryUniversity of Wisconsin‐MadisonMadisonWisconsinUSA
| | - Claudia Solís‐Lemus
- Department of Plant Pathology and Wisconsin Institute for DiscoveryUniversity of Wisconsin‐MadisonMadisonWisconsinUSA
| |
Collapse
|
22
|
Shiv R, Rakeswari, Farjana N, Subbiah U, Ajith A, Balaji A, Mohanasatheesh S. Characterization of missense nonsynonymous single-nucleotide polymorphism of runt-related transcription factor-2 gene - An in silico approach. Indian J Pharmacol 2024; 56:198-205. [PMID: 39078184 PMCID: PMC11286098 DOI: 10.4103/ijp.ijp_533_23] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/14/2023] [Revised: 02/23/2024] [Accepted: 06/04/2024] [Indexed: 07/31/2024] Open
Abstract
OBJECTIVES Single-nucleotide polymorphism (SNP) codes for multiple amino acids, impacting protein functions and disease prognosis. Runt-related transcription factor-2 (RUNX2), a transcription factor linked to osteoblast differentiation, regulates cell proliferation in endothelium and osteoblastic cells. Understanding Runx2's role in nonosseous tissues is rapidly advancing. This study aims to identify harmful SNPs of the RUNX2 gene that may alter disease susceptibility using computational techniques. METHODS The study uses various in silico methods to identify nonsynonymous SNPs (nsSNPs) of the RUNX2 gene, which could potentially alter protein structure and functions, with further analyses by I-Mutant, ConSurf, Netsurf 3.0, GeneMANIA, and Have (y)Our Protein Explained. RESULTS Six missense nsSNPs were identified as potentially harmful, disease-causing, and damaging. Four were found to be unstable, while five were conserved. All six nsSNPs had a coiled secondary structure. Five nsSNPs were found to be destabilized. CONCLUSION The RUNX2 gene's deleterious missense nsSNPs were identified by this study, and they may be exploited in future experimental studies. These high-risk nsSNPs might be considered target molecules in therapeutic and diagnostic therapies in teeth and bone development.
Collapse
Affiliation(s)
- Ragul Shiv
- Department of Periodontics, Sree Balaji Dental College and Hospital, Bharath Institute of Higher Education and Research, Chennai, Tamil Nadu, India
| | - Rakeswari
- Department of Periodontics, Sree Balaji Dental College and Hospital, Bharath Institute of Higher Education and Research, Chennai, Tamil Nadu, India
| | - Nilofer Farjana
- Department of Periodontics, Sree Balaji Dental College and Hospital, Bharath Institute of Higher Education and Research, Chennai, Tamil Nadu, India
| | - Usha Subbiah
- Human Genetics Research Centre, Sree Balaji Dental College and Hospital, Bharath Institute of Higher Education and Research, Chennai, Tamil Nadu, India
| | - Athira Ajith
- Human Genetics Research Centre, Sree Balaji Dental College and Hospital, Bharath Institute of Higher Education and Research, Chennai, Tamil Nadu, India
| | - Anitha Balaji
- Department of Periodontics, Sree Balaji Dental College and Hospital, Bharath Institute of Higher Education and Research, Chennai, Tamil Nadu, India
| | - S. Mohanasatheesh
- Department of Periodontics, Sree Balaji Dental College and Hospital, Bharath Institute of Higher Education and Research, Chennai, Tamil Nadu, India
| |
Collapse
|
23
|
Hajiaghabozorgi M, Fischbach M, Albrecht M, Wang W, Myers CL. BridGE: a pathway-based analysis tool for detecting genetic interactions from GWAS. Nat Protoc 2024; 19:1400-1435. [PMID: 38514837 PMCID: PMC11311251 DOI: 10.1038/s41596-024-00954-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2022] [Accepted: 11/22/2023] [Indexed: 03/23/2024]
Abstract
Genetic interactions have the potential to modulate phenotypes, including human disease. In principle, genome-wide association studies (GWAS) provide a platform for detecting genetic interactions; however, traditional methods for identifying them, which tend to focus on testing individual variant pairs, lack statistical power. In this protocol, we describe a novel computational approach, called Bridging Gene sets with Epistasis (BridGE), for discovering genetic interactions between biological pathways from GWAS data. We present a Python-based implementation of BridGE along with instructions for its application to a typical human GWAS cohort. The major stages include initial data processing and quality control, construction of a variant-level genetic interaction network, measurement of pathway-level genetic interactions, evaluation of statistical significance using sample permutations and generation of results in a standardized output format. The BridGE software pipeline includes options for running the analysis on multiple cores and multiple nodes for users who have access to computing clusters or a cloud computing environment. In a cluster computing environment with 10 nodes and 100 GB of memory per node, the method can be run in less than 24 h for typical human GWAS cohorts. Using BridGE requires knowledge of running Python programs and basic shell script programming experience.
Collapse
Affiliation(s)
- Mehrad Hajiaghabozorgi
- Department of Computer Science and Engineering, University of Minnesota, Minneapolis, MN, USA
| | - Mathew Fischbach
- Department of Computer Science and Engineering, University of Minnesota, Minneapolis, MN, USA
- Graduate Program in Bioinformatics and Computational Biology (BICB), University of Minnesota, Minneapolis, MN, USA
| | - Michael Albrecht
- Department of Computer Science and Engineering, University of Minnesota, Minneapolis, MN, USA
| | - Wen Wang
- Department of Computer Science and Engineering, University of Minnesota, Minneapolis, MN, USA.
| | - Chad L Myers
- Department of Computer Science and Engineering, University of Minnesota, Minneapolis, MN, USA.
- Graduate Program in Bioinformatics and Computational Biology (BICB), University of Minnesota, Minneapolis, MN, USA.
| |
Collapse
|
24
|
Zhang S, Zhou Y, Geng P, Lu Q. Functional Neural Networks for High-Dimensional Genetic Data Analysis. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2024; 21:383-393. [PMID: 38507390 PMCID: PMC11301578 DOI: 10.1109/tcbb.2024.3364614] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/22/2024]
Abstract
Artificial intelligence (AI) is a thriving research field with many successful applications in areas such as computer vision and speech recognition. Machine learning methods, such as artificial neural networks (ANN), play a central role in modern AI technology. While ANN also holds great promise for human genetic research, the high-dimensional genetic data and complex genetic structure bring tremendous challenges. The vast majority of genetic variants on the genome have small or no effects on diseases, and fitting ANN on a large number of variants without considering the underlying genetic structure (e.g., linkage disequilibrium) could bring a serious overfitting issue. Furthermore, while a single disease phenotype is often studied in a classic genetic study, in emerging research fields (e.g., imaging genetics), researchers need to deal with different types of disease phenotypes. To address these challenges, we propose a functional neural networks (FNN) method. FNN uses a series of basis functions to model high-dimensional genetic data and a variety of phenotype data and further builds a multi-layer functional neural network to capture the complex relationships between genetic variants and disease phenotypes. Through simulations, we demonstrate the advantages of FNN for high-dimensional genetic data analysis in terms of robustness and accuracy. The real data applications also showed that FNN attained higher accuracy than the existing methods.
Collapse
|
25
|
Ma J, Li J, Chen Y, Yang Z, He Y. Poor statistical power in population-based association study of gene interaction. BMC Med Genomics 2024; 17:111. [PMID: 38678264 PMCID: PMC11055307 DOI: 10.1186/s12920-024-01884-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2023] [Accepted: 04/19/2024] [Indexed: 04/29/2024] Open
Abstract
BACKGROUND Statistical epistasis, or "gene-gene interaction" in genetic association studies, means the nonadditive effects between the polymorphic sites on two different genes affecting the same phenotype. In the genetic association analysis of complex traits, nevertheless, the researchers haven't found enough clues of statistical epistasis so far. METHODS We developed a statistical model where the statistical epistasis was presented as an extra linkage disequilibrium between the polymorphic sites of different risk genes. The power of statistical test for identifying the gene-gene interaction was calculated and then compared in different hypothesis scenarios. RESULTS Our results show the statistical power increases with the increasing of interaction coefficient, relative risk, and linkage disequilibrium with genetic markers. However, the power of interaction discovery is much lower than that of regular single-site association test. When rigorous criteria were employed in statistical tests, the identification of gene-gene interaction became a very difficult task. Since the criterion of significance was given to be p-value ≤ 5.0 × 10-8, the same as that of many genome-wide association studies, there is little chance to identify the gene-gene interaction in all kind of circumstances. CONCLUSIONS The lack of epistasis tends to be an inevitable result caused by the statistical principles of methods in the genetic association studies and therefore is the inherent characteristic of the research itself.
Collapse
Affiliation(s)
- Jiarui Ma
- Shanghai Key Laboratory of Medical Epigenetics, International Co-Laboratory of Medical Epigenetics and Metabolism (Ministry of Science and Technology), Institutes of Biomedical Sciences, Fudan University, Shanghai, 200032, China
| | - Jian Li
- Shanghai Key Laboratory of Medical Epigenetics, International Co-Laboratory of Medical Epigenetics and Metabolism (Ministry of Science and Technology), Institutes of Biomedical Sciences, Fudan University, Shanghai, 200032, China
| | - Yuqi Chen
- Shanghai Key Laboratory of Medical Epigenetics, International Co-Laboratory of Medical Epigenetics and Metabolism (Ministry of Science and Technology), Institutes of Biomedical Sciences, Fudan University, Shanghai, 200032, China
| | - Zhen Yang
- Center for Medical Research and Innovation of Pudong Hospital, Intelligent Medicine Institute, Fudan University, Shanghai, 200032, China
| | - Yungang He
- Shanghai Fifth People's Hospital, Intelligent Medicine Institute, Fudan University, Shanghai, 200032, PR China.
| |
Collapse
|
26
|
Durvasula A, Price AL. Distinct explanations underlie gene-environment interactions in the UK Biobank. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2024:2023.09.22.23295969. [PMID: 37790574 PMCID: PMC10543037 DOI: 10.1101/2023.09.22.23295969] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/05/2023]
Abstract
The role of gene-environment (GxE) interaction in disease and complex trait architectures is widely hypothesized, but currently unknown. Here, we apply three statistical approaches to quantify and distinguish three different types of GxE interaction for a given trait and E variable. First, we detect locus-specific GxE interaction by testing for genetic correlation r g < 1 across E bins. Second, we detect genome-wide effects of the E variable on genetic variance by leveraging polygenic risk scores (PRS) to test for significant PRSxE in a regression of phenotypes on PRS, E, and PRSxE, together with differences in SNP-heritability across E bins. Third, we detect genome-wide proportional amplification of genetic and environmental effects as a function of the E variable by testing for significant PRSxE with no differences in SNP-heritability across E bins. Simulations show that these approaches achieve high sensitivity and specificity in distinguishing these three GxE scenarios. We applied our framework to 33 UK Biobank traits (25 quantitative traits and 8 diseases; average N = 325 K ) and 10 E variables spanning lifestyle, diet, and other environmental exposures. First, we identified 19 trait-E pairs with r g significantly < 1 (FDR<5%) (average r g = 0.95 ); for example, white blood cell count had r g = 0.95 (s.e. 0.01) between smokers and non-smokers. Second, we identified 28 trait-E pairs with significant PRSxE and significant SNP-heritability differences across E bins; for example, BMI had a significant PRSxE for physical activity (P=4.6e-5) with 5% larger SNP-heritability in the largest versus smallest quintiles of physical activity (P=7e-4). Third, we identified 15 trait-E pairs with significant PRSxE with no SNP-heritability differences across E bins; for example, waist-hip ratio adjusted for BMI had a significant PRSxE effect for time spent watching television (P=5e-3) with no SNP-heritability differences. Across the three scenarios, 8 of the trait-E pairs involved disease traits, whose interpretation is complicated by scale effects. Analyses using biological sex as the E variable produced additional significant findings in each of the three scenarios. Overall, we infer a significant contribution of GxE and GxSex effects to complex trait and disease variance.
Collapse
Affiliation(s)
- Arun Durvasula
- Center for Genetic Epidemiology, Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA
- Department of Genetics, Harvard Medical School, Cambridge, MA, USA
- Department of Human Evolutionary Biology, Harvard University, Cambridge, MA, USA
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, MA, USA
| | - Alkes L Price
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, MA, USA
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, USA
| |
Collapse
|
27
|
Behr M, Kumbier K, Cordova-Palomera A, Aguirre M, Ronen O, Ye C, Ashley E, Butte AJ, Arnaout R, Brown B, Priest J, Yu B. Learning epistatic polygenic phenotypes with Boolean interactions. PLoS One 2024; 19:e0298906. [PMID: 38625909 PMCID: PMC11020961 DOI: 10.1371/journal.pone.0298906] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2023] [Accepted: 01/31/2024] [Indexed: 04/18/2024] Open
Abstract
Detecting epistatic drivers of human phenotypes is a considerable challenge. Traditional approaches use regression to sequentially test multiplicative interaction terms involving pairs of genetic variants. For higher-order interactions and genome-wide large-scale data, this strategy is computationally intractable. Moreover, multiplicative terms used in regression modeling may not capture the form of biological interactions. Building on the Predictability, Computability, Stability (PCS) framework, we introduce the epiTree pipeline to extract higher-order interactions from genomic data using tree-based models. The epiTree pipeline first selects a set of variants derived from tissue-specific estimates of gene expression. Next, it uses iterative random forests (iRF) to search training data for candidate Boolean interactions (pairwise and higher-order). We derive significance tests for interactions, based on a stabilized likelihood ratio test, by simulating Boolean tree-structured null (no epistasis) and alternative (epistasis) distributions on hold-out test data. Finally, our pipeline computes PCS epistasis p-values that probabilisticly quantify improvement in prediction accuracy via bootstrap sampling on the test set. We validate the epiTree pipeline in two case studies using data from the UK Biobank: predicting red hair and multiple sclerosis (MS). In the case of predicting red hair, epiTree recovers known epistatic interactions surrounding MC1R and novel interactions, representing non-linearities not captured by logistic regression models. In the case of predicting MS, a more complex phenotype than red hair, epiTree rankings prioritize novel interactions surrounding HLA-DRB1, a variant previously associated with MS in several populations. Taken together, these results highlight the potential for epiTree rankings to help reduce the design space for follow up experiments.
Collapse
Affiliation(s)
- Merle Behr
- Faculty of Informatics and Data Science, University of Regensburg, Regensburg, Germany
| | - Karl Kumbier
- Department of Pharmaceutical Chemistry, University of California, San Francisco, San Francisco, CA, United States of America
| | | | - Matthew Aguirre
- Department of Pediatrics, Stanford Medicine, Stanford, CA, United States of America
- Department of Biomedical Data Science, Stanford Medicine, Stanford, CA, United States of America
| | - Omer Ronen
- Department of Statistics, University of California at Berkeley, Berkeley, CA, United States of America
| | - Chengzhong Ye
- Department of Statistics, University of California at Berkeley, Berkeley, CA, United States of America
| | - Euan Ashley
- Division of Cardiovascular Medicine, Stanford Medicine, Stanford, CA, United States of America
| | - Atul J. Butte
- Bakar Computational Health Sciences Institute, University of California, San Francisco, San Francisco, CA, United States of America
| | - Rima Arnaout
- Bakar Computational Health Sciences Institute, University of California, San Francisco, San Francisco, CA, United States of America
- Division of Cardiology, Department of Medicine, University of California, San Francisco, San Francisco, CA, United States of America
| | - Ben Brown
- Department of Statistics, University of California at Berkeley, Berkeley, CA, United States of America
- Biosciences Area, Lawrence Berkeley National Laboratory, Berkeley, CA, United States of America
| | - James Priest
- Department of Pediatrics, Stanford Medicine, Stanford, CA, United States of America
| | - Bin Yu
- Department of Statistics, University of California at Berkeley, Berkeley, CA, United States of America
- Department of Electrical Engineering and Computer Sciences and Center for Computational Biology, University of California at Berkeley, Berkeley, CA, United States of America
| |
Collapse
|
28
|
Westerman KE, Sofer T. Many roads to a gene-environment interaction. Am J Hum Genet 2024; 111:626-635. [PMID: 38579668 PMCID: PMC11023920 DOI: 10.1016/j.ajhg.2024.03.002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2023] [Revised: 02/29/2024] [Accepted: 03/01/2024] [Indexed: 04/07/2024] Open
Abstract
Despite the importance of gene-environment interactions (GxEs) in improving and operationalizing genetic discovery, interpretation of any GxEs that are discovered can be surprisingly difficult. There are many potential biological and statistical explanations for a statistically significant finding and, likewise, it is not always clear what can be claimed based on a null result. A better understanding of the possible underlying mechanisms leading to a detected GxE can help investigators decide which are and which are not relevant to their hypothesis. Here, we provide a detailed explanation of five "phenomena," or data-generating mechanisms, that can lead to nonzero interaction estimates, as well as a discussion of specific instances in which they might be relevant. We hope that, given this framework, investigators can design more targeted experiments and provide cleaner interpretations of the associated results.
Collapse
Affiliation(s)
- Kenneth E Westerman
- Department of Medicine, Clinical and Translational Epidemiology Unit, Mongan Institute, Massachusetts General Hospital, Boston, MA, USA; Metabolism Program, Broad Institute of MIT and Harvard, Cambridge, MA, USA; Department of Medicine, Harvard Medical School, Boston, MA, USA.
| | - Tamar Sofer
- Cardiovascular Institute, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston, MA, USA; Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, USA; Department of Medicine, Brigham and Women's Hospital, Boston, MA, USA
| |
Collapse
|
29
|
Tsouris A, Brach G, Friedrich A, Hou J, Schacherer J. Diallel panel reveals a significant impact of low-frequency genetic variants on gene expression variation in yeast. Mol Syst Biol 2024; 20:362-373. [PMID: 38355920 PMCID: PMC10987670 DOI: 10.1038/s44320-024-00021-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2023] [Revised: 01/29/2024] [Accepted: 01/30/2024] [Indexed: 02/16/2024] Open
Abstract
Unraveling the genetic sources of gene expression variation is essential to better understand the origins of phenotypic diversity in natural populations. Genome-wide association studies identified thousands of variants involved in gene expression variation, however, variants detected only explain part of the heritability. In fact, variants such as low-frequency and structural variants (SVs) are poorly captured in association studies. To assess the impact of these variants on gene expression variation, we explored a half-diallel panel composed of 323 hybrids originated from pairwise crosses of 26 natural Saccharomyces cerevisiae isolates. Using short- and long-read sequencing strategies, we established an exhaustive catalog of single nucleotide polymorphisms (SNPs) and SVs for this panel. Combining this dataset with the transcriptomes of all hybrids, we comprehensively mapped SNPs and SVs associated with gene expression variation. While SVs impact gene expression variation, SNPs exhibit a higher effect size with an overrepresentation of low-frequency variants compared to common ones. These results reinforce the importance of dissecting the heritability of complex traits with a comprehensive catalog of genetic variants at the population level.
Collapse
Affiliation(s)
- Andreas Tsouris
- Université de Strasbourg, CNRS, GMGM UMR 7156, Strasbourg, France
| | - Gauthier Brach
- Université de Strasbourg, CNRS, GMGM UMR 7156, Strasbourg, France
| | - Anne Friedrich
- Université de Strasbourg, CNRS, GMGM UMR 7156, Strasbourg, France
| | - Jing Hou
- Université de Strasbourg, CNRS, GMGM UMR 7156, Strasbourg, France.
| | - Joseph Schacherer
- Université de Strasbourg, CNRS, GMGM UMR 7156, Strasbourg, France.
- Institut Universitaire de France (IUF), Paris, France.
| |
Collapse
|
30
|
Alfayyadh MM, Maksemous N, Sutherland HG, Lea RA, Griffiths LR. Unravelling the Genetic Landscape of Hemiplegic Migraine: Exploring Innovative Strategies and Emerging Approaches. Genes (Basel) 2024; 15:443. [PMID: 38674378 PMCID: PMC11049430 DOI: 10.3390/genes15040443] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2024] [Accepted: 03/25/2024] [Indexed: 04/28/2024] Open
Abstract
Migraine is a severe, debilitating neurovascular disorder. Hemiplegic migraine (HM) is a rare and debilitating neurological condition with a strong genetic basis. Sequencing technologies have improved the diagnosis and our understanding of the molecular pathophysiology of HM. Linkage analysis and sequencing studies in HM families have identified pathogenic variants in ion channels and related genes, including CACNA1A, ATP1A2, and SCN1A, that cause HM. However, approximately 75% of HM patients are negative for these mutations, indicating there are other genes involved in disease causation. In this review, we explored our current understanding of the genetics of HM. The evidence presented herein summarises the current knowledge of the genetics of HM, which can be expanded further to explain the remaining heritability of this debilitating condition. Innovative bioinformatics and computational strategies to cover the entire genetic spectrum of HM are also discussed in this review.
Collapse
Affiliation(s)
| | | | | | | | - Lyn R. Griffiths
- Centre for Genomics and Personalised Health, Genomics Research Centre, School of Biomedical Sciences, Queensland University of Technology (QUT), Brisbane, QLD 4059, Australia; (M.M.A.); (N.M.); (H.G.S.); (R.A.L.)
| |
Collapse
|
31
|
Carré C, Carluer JB, Chaux C, Estoup-Streiff C, Roche N, Hosy E, Mas A, Krouk G. Next-Gen GWAS: full 2D epistatic interaction maps retrieve part of missing heritability and improve phenotypic prediction. Genome Biol 2024; 25:76. [PMID: 38523316 PMCID: PMC10962106 DOI: 10.1186/s13059-024-03202-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2023] [Accepted: 02/19/2024] [Indexed: 03/26/2024] Open
Abstract
The problem of missing heritability requires the consideration of genetic interactions among different loci, called epistasis. Current GWAS statistical models require years to assess the entire combinatorial epistatic space for a single phenotype. We propose Next-Gen GWAS (NGG) that evaluates over 60 billion single nucleotide polymorphism combinatorial first-order interactions within hours. We apply NGG to Arabidopsis thaliana providing two-dimensional epistatic maps at gene resolution. We demonstrate on several phenotypes that a large proportion of the missing heritability can be retrieved, that it indeed lies in epistatic interactions, and that it can be used to improve phenotype prediction.
Collapse
Affiliation(s)
- Clément Carré
- BionomeeX, Montpellier, France.
- IMAG, Univ. Montpellier, CNRS, Montpellier, France.
- IPSiM, Univ. Montpellier, CNRS, INRAE, Montpellier, France.
| | - Jean Baptiste Carluer
- IMAG, Univ. Montpellier, CNRS, Montpellier, France
- IPSiM, Univ. Montpellier, CNRS, INRAE, Montpellier, France
| | | | | | | | - Eric Hosy
- Interdisciplinary Institute for Neuroscience, University of Bordeaux, CNRS, Bordeaux, France
| | - André Mas
- BionomeeX, Montpellier, France.
- IMAG, Univ. Montpellier, CNRS, Montpellier, France.
| | - Gabriel Krouk
- BionomeeX, Montpellier, France.
- IPSiM, Univ. Montpellier, CNRS, INRAE, Montpellier, France.
| |
Collapse
|
32
|
Pravednikova AE, Nikitich A, Witkowicz A, Karabon L, Flouris AD, Vliora M, Nintou E, Dinas PC, Szulińska M, Bogdański P, Metsios GS, Kerchev VV, Yepiskoposyan L, Bylino OV, Larina SN, Shulgin B, Shidlovskii YV. Genotypes of the UCP1 gene polymorphisms and cardiometabolic diseases: A multifactorial study of association with disease probability. Biochimie 2024; 218:162-173. [PMID: 37863280 DOI: 10.1016/j.biochi.2023.10.012] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2023] [Revised: 10/16/2023] [Accepted: 10/18/2023] [Indexed: 10/22/2023]
Abstract
Cardiometabolic diseases (CMDs) are complex disorders with a heterogenous phenotype, which are caused by multiple factors including genetic factors. Single nucleotide polymorphisms (SNPs) rs45539933 (p.Ala64Thr), rs10011540 (c.-112A>C), rs3811791 (c.-1766A>G), and rs1800592 (c.-3826A>G) in the UCP1 gene have been analyzed for association with CMDs in many studies providing controversial results. However, previous studies only considered individual UCP1 SNPs and did not evaluate them in an integrated manner, which is a more powerful approach to uncover genetic component of complex diseases. This study aimed to investigate associations between UCP1 genotype combinations and CMDs or CMD risk factors in the context of non-genetic factors. We performed multiple logistic regression analysis and proposed new methodology of testing different combinations of SNP genotypes. We found that probability of CMDs increased in presence of the three-SNP combination of genotypes with minor alleles of c.-3826A>G and p.Ala64Thr and wild allele of c.-112A>C, with increasing age, body mass index (BMI), body fat percentage (BF%) and may differ between sexes and between countries. The combination of genotypes with c.-3826A>G minor allele and wild homozygotes of c.-112A>C and p.Ala64Thr was associated with increased probability of diabetes. While combination of genotypes with minor alleles of all three SNPs reduced the CMD probability. The present results suggest that age, BMI, sex, and UCP1 three-SNP combinations of genotypes significantly contribute to CMD probability. Varying of c.-112A>C alleles in the genotype combination with minor alleles of c.-3826A>G and p.Ala64Thr markedly changes CMD probability.
Collapse
Affiliation(s)
- Anna E Pravednikova
- Laboratory of Gene Expression Regulation in Development, Institute of Gene Biology, Russian Academy of Sciences, Moscow, Russia.
| | - Antonina Nikitich
- Center for Mathematical Modeling in Drug Development, Institute of Biodesign and Complex Systems Modeling, I.M. Sechenov First Moscow State Medical University, Ministry of Health of the Russian Federation, Moscow, Russia
| | - Agata Witkowicz
- Institute of Immunology and Experimental Therapy, Polish Academy of Sciences, Wroclaw, Poland
| | - Lidia Karabon
- Institute of Immunology and Experimental Therapy, Polish Academy of Sciences, Wroclaw, Poland
| | - Andreas D Flouris
- FAME Laboratory, Department of Physical Education and Sport Science, University of Thessaly, Trikala, Greece
| | - Maria Vliora
- FAME Laboratory, Department of Physical Education and Sport Science, University of Thessaly, Trikala, Greece
| | - Eleni Nintou
- FAME Laboratory, Department of Physical Education and Sport Science, University of Thessaly, Trikala, Greece
| | - Petros C Dinas
- FAME Laboratory, Department of Physical Education and Sport Science, University of Thessaly, Trikala, Greece
| | - Monika Szulińska
- Department of Treatment of Obesity, Metabolic Disorders and Clinical Dietetics, Poznan University of Medical Sciences, Poznan, Poland
| | - Paweł Bogdański
- Department of Treatment of Obesity, Metabolic Disorders and Clinical Dietetics, Poznan University of Medical Sciences, Poznan, Poland
| | - George S Metsios
- School of Physical Education, Sport Science and Dietetics, University of Thessaly, Trikala, Greece
| | - Victor V Kerchev
- Laboratory of Gene Expression Regulation in Development, Institute of Gene Biology, Russian Academy of Sciences, Moscow, Russia; Department of Biology and General Genetics, I.M. Sechenov First Moscow State Medical University, Ministry of Health of the Russian Federation, Moscow, Russia
| | - Levon Yepiskoposyan
- Laboratory of Evolutionary Genomics, Institute of Molecular Biology, National Academy of Sciences of the Republic of Armenia, Yerevan, Armenia
| | - Oleg V Bylino
- Laboratory of Gene Expression Regulation in Development, Institute of Gene Biology, Russian Academy of Sciences, Moscow, Russia
| | - Svetlana N Larina
- Laboratory of Gene Expression Regulation in Development, Institute of Gene Biology, Russian Academy of Sciences, Moscow, Russia; Department of Biology and General Genetics, I.M. Sechenov First Moscow State Medical University, Ministry of Health of the Russian Federation, Moscow, Russia
| | - Boris Shulgin
- Center for Mathematical Modeling in Drug Development, Institute of Biodesign and Complex Systems Modeling, I.M. Sechenov First Moscow State Medical University, Ministry of Health of the Russian Federation, Moscow, Russia; Department of Mathematics, Mechanics and Mathematical Modeling, Institute of Computer Science and Mathematical Modeling, I.M. Sechenov First Moscow State Medical University, Ministry of Health of the Russian Federation, Moscow, Russia
| | - Yulii V Shidlovskii
- Laboratory of Gene Expression Regulation in Development, Institute of Gene Biology, Russian Academy of Sciences, Moscow, Russia; Department of Biology and General Genetics, I.M. Sechenov First Moscow State Medical University, Ministry of Health of the Russian Federation, Moscow, Russia
| |
Collapse
|
33
|
Naake T, Zhu F, Alseekh S, Scossa F, Perez de Souza L, Borghi M, Brotman Y, Mori T, Nakabayashi R, Tohge T, Fernie AR. Genome-wide association studies identify loci controlling specialized seed metabolites in Arabidopsis. PLANT PHYSIOLOGY 2024; 194:1705-1721. [PMID: 37758174 PMCID: PMC10904349 DOI: 10.1093/plphys/kiad511] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/01/2023] [Revised: 08/01/2023] [Accepted: 08/24/2023] [Indexed: 10/03/2023]
Abstract
Plants synthesize specialized metabolites to facilitate environmental and ecological interactions. During evolution, plants diversified in their potential to synthesize these metabolites. Quantitative differences in metabolite levels of natural Arabidopsis (Arabidopsis thaliana) accessions can be employed to unravel the genetic basis for metabolic traits using genome-wide association studies (GWAS). Here, we performed metabolic GWAS on seeds of a panel of 315 A. thaliana natural accessions, including the reference genotypes C24 and Col-0, for polar and semi-polar seed metabolites using untargeted ultra-performance liquid chromatography-mass spectrometry. As a complementary approach, we performed quantitative trait locus (QTL) mapping of near-isogenic introgression lines between C24 and Col-0 for specific seed specialized metabolites. Besides common QTL between seeds and leaves, GWAS revealed seed-specific QTL for specialized metabolites, indicating differences in the genetic architecture of seeds and leaves. In seeds, aliphatic methylsulfinylalkyl and methylthioalkyl glucosinolates associated with the ALKENYL HYDROXYALKYL PRODUCING loci (GS-ALK and GS-OHP) on chromosome 4 containing alkenyl hydroxyalkyl producing 2 (AOP2) and 3 (AOP3) or with the GS-ELONG locus on chromosome 5 containing methylthioalkyl malate synthase (MAM1) and MAM3. We detected two unknown sulfur-containing compounds that were also mapped to these loci. In GWAS, some of the annotated flavonoids (kaempferol 3-O-rhamnoside-7-O-rhamnoside, quercetin 3-O-rhamnoside-7-O-rhamnoside) were mapped to transparent testa 7 (AT5G07990), encoding a cytochrome P450 75B1 monooxygenase. Three additional mass signals corresponding to quercetin-containing flavonols were mapped to UGT78D2 (AT5G17050). The association of the loci and associating metabolic features were functionally verified in knockdown mutant lines. By performing GWAS and QTL mapping, we were able to leverage variation of natural populations and parental lines to study seed specialized metabolism. The GWAS data set generated here is a high-quality resource that can be investigated in further studies.
Collapse
Affiliation(s)
- Thomas Naake
- Central Metabolism, Max Planck Institute of Molecular Plant Physiology, Am Muehlenberg 1, 14476 Potsdam-Golm, Germany
| | - Feng Zhu
- Central Metabolism, Max Planck Institute of Molecular Plant Physiology, Am Muehlenberg 1, 14476 Potsdam-Golm, Germany
| | - Saleh Alseekh
- Central Metabolism, Max Planck Institute of Molecular Plant Physiology, Am Muehlenberg 1, 14476 Potsdam-Golm, Germany
- Center of Plant Systems Biology and Biotechnology, 4000 Plovdiv, Bulgaria
| | - Federico Scossa
- Central Metabolism, Max Planck Institute of Molecular Plant Physiology, Am Muehlenberg 1, 14476 Potsdam-Golm, Germany
- Research Center for Genomics and Bioinformatics (CREA-GB), Council for Agricultural Research and Economics, Via Ardeatina 546, 00178 Rome, Italy
| | - Leonardo Perez de Souza
- Central Metabolism, Max Planck Institute of Molecular Plant Physiology, Am Muehlenberg 1, 14476 Potsdam-Golm, Germany
| | - Monica Borghi
- Central Metabolism, Max Planck Institute of Molecular Plant Physiology, Am Muehlenberg 1, 14476 Potsdam-Golm, Germany
- Department of Biology, Utah State University, 5305 Old Main Hill, Logan, UT 84321-5305, USA
| | - Yariv Brotman
- Department of Life Sciences, Ben-Gurion University of the Negev, 8410501 Be’er Sheva, Israel
| | - Tetsuya Mori
- RIKEN Center for Sustainable Resource Science, Tsurumi, 1-7-22 Suehiro, Yokohama, Kanagawa 230-0045, Japan
| | - Ryo Nakabayashi
- RIKEN Center for Sustainable Resource Science, Tsurumi, 1-7-22 Suehiro, Yokohama, Kanagawa 230-0045, Japan
| | - Takayuki Tohge
- Graduate School of Biological Science, Nara Institute of Science and Technology, 8916-5 Takayama-cho, Ikoma, Nara 630-0192, Japan
| | - Alisdair R Fernie
- Central Metabolism, Max Planck Institute of Molecular Plant Physiology, Am Muehlenberg 1, 14476 Potsdam-Golm, Germany
- Center of Plant Systems Biology and Biotechnology, 4000 Plovdiv, Bulgaria
| |
Collapse
|
34
|
Fong WJ, Tan HM, Garg R, Teh AL, Pan H, Gupta V, Krishna B, Chen ZH, Purwanto NY, Yap F, Tan KH, Chan KYJ, Chan SY, Goh N, Rane N, Tan ESE, Jiang Y, Han M, Meaney M, Wang D, Keppo J, Tan GCY. Comparing feature selection and machine learning approaches for predicting CYP2D6 methylation from genetic variation. Front Neuroinform 2024; 17:1244336. [PMID: 38449836 PMCID: PMC10915285 DOI: 10.3389/fninf.2023.1244336] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2023] [Accepted: 10/18/2023] [Indexed: 03/08/2024] Open
Abstract
Introduction Pharmacogenetics currently supports clinical decision-making on the basis of a limited number of variants in a few genes and may benefit paediatric prescribing where there is a need for more precise dosing. Integrating genomic information such as methylation into pharmacogenetic models holds the potential to improve their accuracy and consequently prescribing decisions. Cytochrome P450 2D6 (CYP2D6) is a highly polymorphic gene conventionally associated with the metabolism of commonly used drugs and endogenous substrates. We thus sought to predict epigenetic loci from single nucleotide polymorphisms (SNPs) related to CYP2D6 in children from the GUSTO cohort. Methods Buffy coat DNA methylation was quantified using the Illumina Infinium Methylation EPIC beadchip. CpG sites associated with CYP2D6 were used as outcome variables in Linear Regression, Elastic Net and XGBoost models. We compared feature selection of SNPs from GWAS mQTLs, GTEx eQTLs and SNPs within 2 MB of the CYP2D6 gene and the impact of adding demographic data. The samples were split into training (75%) sets and test (25%) sets for validation. In Elastic Net model and XGBoost models, optimal hyperparameter search was done using 10-fold cross validation. Root Mean Square Error and R-squared values were obtained to investigate each models' performance. When GWAS was performed to determine SNPs associated with CpG sites, a total of 15 SNPs were identified where several SNPs appeared to influence multiple CpG sites. Results Overall, Elastic Net models of genetic features appeared to perform marginally better than heritability estimates and substantially better than Linear Regression and XGBoost models. The addition of nongenetic features appeared to improve performance for some but not all feature sets and probes. The best feature set and Machine Learning (ML) approach differed substantially between CpG sites and a number of top variables were identified for each model. Discussion The development of SNP-based prediction models for CYP2D6 CpG methylation in Singaporean children of varying ethnicities in this study has clinical application. With further validation, they may add to the set of tools available to improve precision medicine and pharmacogenetics-based dosing.
Collapse
Affiliation(s)
- Wei Jing Fong
- Computational Biology, National University of Singapore, Singapore, Singapore
| | - Hong Ming Tan
- Computational Biology, National University of Singapore, Singapore, Singapore
| | - Rishabh Garg
- Computational Biology, National University of Singapore, Singapore, Singapore
| | - Ai Ling Teh
- Singapore Institute for Clinical Sciences (SICS), Agency for Science, Technology and Research (A*STAR), Singapore, Singapore
- Bioinformatics Institute (BII), Agency for Science, Technology and Research (A*STAR), Singapore, Singapore
| | - Hong Pan
- Singapore Institute for Clinical Sciences (SICS), Agency for Science, Technology and Research (A*STAR), Singapore, Singapore
| | - Varsha Gupta
- Singapore Institute for Clinical Sciences (SICS), Agency for Science, Technology and Research (A*STAR), Singapore, Singapore
- Bioinformatics Institute (BII), Agency for Science, Technology and Research (A*STAR), Singapore, Singapore
| | - Bernadus Krishna
- Computational Biology, National University of Singapore, Singapore, Singapore
| | - Zou Hui Chen
- Computational Biology, National University of Singapore, Singapore, Singapore
| | | | - Fabian Yap
- KK Women's and Children's Hospital, Singapore, Singapore
| | - Kok Hian Tan
- KK Women's and Children's Hospital, Singapore, Singapore
- Duke NUS Medical School, Singapore, Singapore
| | - Kok Yen Jerry Chan
- KK Women's and Children's Hospital, Singapore, Singapore
- Duke NUS Medical School, Singapore, Singapore
| | - Shiao-Yng Chan
- Singapore Institute for Clinical Sciences (SICS), Agency for Science, Technology and Research (A*STAR), Singapore, Singapore
- National University Hospital, Singapore, Singapore
| | | | - Nikita Rane
- Institute of Mental Health,Singapore, Singapore
| | | | | | - Mei Han
- Computational Biology, National University of Singapore, Singapore, Singapore
| | - Michael Meaney
- Singapore Institute for Clinical Sciences (SICS), Agency for Science, Technology and Research (A*STAR), Singapore, Singapore
| | - Dennis Wang
- Singapore Institute for Clinical Sciences (SICS), Agency for Science, Technology and Research (A*STAR), Singapore, Singapore
- National Heart and Lung Institute, Imperial College London, London, United Kingdom
| | - Jussi Keppo
- Computational Biology, National University of Singapore, Singapore, Singapore
| | - Geoffrey Chern-Yee Tan
- Computational Biology, National University of Singapore, Singapore, Singapore
- Institute of Mental Health,Singapore, Singapore
| |
Collapse
|
35
|
Zhang Q, Liu J, Liu H, Ao L, Xi Y, Chen D. Genome-wide epistasis analysis reveals gene-gene interaction network on an intermediate endophenotype P-tau/Aβ 42 ratio in ADNI cohort. Sci Rep 2024; 14:3984. [PMID: 38368488 PMCID: PMC10874417 DOI: 10.1038/s41598-024-54541-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2023] [Accepted: 02/14/2024] [Indexed: 02/19/2024] Open
Abstract
Alzheimer's disease (AD) is a progressive neurodegenerative disorder and the most common cause of dementia in the elderly worldwide. The exact etiology of AD, particularly its genetic mechanisms, remains incompletely understood. Traditional genome-wide association studies (GWAS), which primarily focus on single-nucleotide polymorphisms (SNPs) with main effects, provide limited explanations for the "missing heritability" of AD, while there is growing evidence supporting the important role of epistasis. In this study, we performed a genome-wide SNP-SNP interaction detection using a linear regression model and employed multiple GPUs for parallel computing, significantly enhancing the speed of whole-genome analysis. The cerebrospinal fluid (CSF) phosphorylated tau (P-tau)/amyloid-[Formula: see text] (A[Formula: see text]) ratio was used as a quantitative trait (QT) to enhance statistical power. Age, gender, and clinical diagnosis were included as covariates to control for potential non-genetic factors influencing AD. We identified 961 pairs of statistically significant SNP-SNP interactions, explaining a high-level variance of P-tau/A[Formula: see text] level, all of which exhibited marginal main effects. Additionally, we replicated 432 previously reported AD-related genes and found 11 gene-gene interaction pairs overlapping with the protein-protein interaction (PPI) network. Our findings may contribute to partially explain the "missing heritability" of AD. The identified subnetwork may be associated with synaptic dysfunction, Wnt signaling pathway, oligodendrocytes, inflammation, hippocampus, and neuronal cells.
Collapse
Affiliation(s)
- Qiushi Zhang
- School of Computer Science, Northeast Electric Power University, 169 Changchun Street, Jilin, 132012, China
| | - Junfeng Liu
- School of Computer Science, Northeast Electric Power University, 169 Changchun Street, Jilin, 132012, China
| | - Hongwei Liu
- College of Intelligent Systems Science and Engineering, Harbin Engineering University, 145 Nantong Street, Harbin, China
| | - Lang Ao
- School of Computer Science, Northeast Electric Power University, 169 Changchun Street, Jilin, 132012, China
| | - Yang Xi
- School of Computer Science, Northeast Electric Power University, 169 Changchun Street, Jilin, 132012, China
| | - Dandan Chen
- School of Automation Engineering, Northeast Electric Power University, 169 Changchun Street, Jilin, 132012, China.
- College of Intelligent Systems Science and Engineering, Harbin Engineering University, 145 Nantong Street, Harbin, China.
| |
Collapse
|
36
|
Behairy MY, Tawfik NZ, Eid RA, Nasser Binjawhar D, Alshaya DS, Fayad E, Elkhatib WF, Abdallah HY. Mannose-binding lectin gene polymorphism in psoriasis and vitiligo: an observational study and computational analysis. Front Med (Lausanne) 2024; 10:1340703. [PMID: 38404462 PMCID: PMC10885344 DOI: 10.3389/fmed.2023.1340703] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2023] [Accepted: 12/28/2023] [Indexed: 02/27/2024] Open
Abstract
Introduction Psoriasis and vitiligo are inflammatory autoimmune skin disorders with remarkable genetic involvement. Mannose-binding lectin (MBL) represents a significant immune molecule with one of its gene variants strongly linked to autoimmune diseases. Therefore, in this study, we investigated the role of the MBL variant, rs1800450, in psoriasis and vitiligo disease susceptibility. Methods The study comprised performing in silico analysis, performing an observational study regarding psoriasis patients, and performing an observational study regarding vitiligo patients. Various in silico tools were used to investigate the impact of the selected mutation on the function, stability, post-translational modifications (PTMs), and secondary structures of the protein. In addition, a total of 489 subjects were enrolled in this study, including their demographic and clinicopathological data. Genotyping analysis was performed using real-time PCR for the single nucleotide polymorphism (SNP) rs1800450 on codon 54 of the MBL gene, utilizing TaqMan genotyping technology. In addition, implications of the studied variant on disease susceptibility and various clinicopathological data were analyzed. Results Computational analysis demonstrated the anticipated effects of the mutation on MBL protein. Furthermore, regarding the observational studies, rs1800450 SNP on codon 54 displayed comparable results in our population relative to global frequencies reported via the 1,000 Genomes Project. This SNP showed no significant association with either psoriasis or vitiligo disease risk in all genetic association models. Furthermore, rs1800450 SNP did not significantly correlate with any of the demographic or clinicopathological features of both psoriasis and vitiligo. Discussion Our findings highlighted that the rs1800450 SNP on the MBL2 gene has no role in the disease susceptibility to autoimmune skin diseases, such as psoriasis and vitiligo, among Egyptian patients. In addition, our analysis advocated the notion of the redundancy of MBL and revealed the lack of significant impact on both psoriasis and vitiligo disorders.
Collapse
Affiliation(s)
- Mohammed Y. Behairy
- Department of Microbiology and Immunology, Faculty of Pharmacy, University of Sadat City, Sadat City, Egypt
| | - Noha Z. Tawfik
- Department of Dermatology, Venereology and Andrology, Faculty of Medicine, Suez Canal University, Ismailia, Egypt
| | - Refaat A. Eid
- Pathology Department, College of Medicine, King Khalid University, Abha, Saudi Arabia
| | - Dalal Nasser Binjawhar
- Department of Chemistry, College of Science, Princess Nourah bint Abdulrahman University, Riyadh, Saudi Arabia
| | - Dalal Sulaiman Alshaya
- Department of Biology, College of Science, Princess Nourah bint Abdulrahman University, Riyadh, Saudi Arabia
| | - Eman Fayad
- Department of Biotechnology, College of Sciences, Taif University, Taif, Saudi Arabia
| | - Walid F. Elkhatib
- Microbiology and Immunology Department, Faculty of Pharmacy, Ain Shams University, African Union Organization St., Abbassia, Cairo, Egypt
- Department of Microbiology and Immunology, Faculty of Pharmacy, Galala University, Suez, Egypt
| | - Hoda Y. Abdallah
- Department of Histology and Cell Biology (Genetics Unit), Faculty of Medicine, Suez Canal University, Ismailia, Egypt
- Center of Excellence in Molecular and Cellular Medicine, Faculty of Medicine, Suez Canal University, Ismailia, Egypt
| |
Collapse
|
37
|
Tejwani L, Ravindra NG, Lee C, Cheng Y, Nguyen B, Luttik K, Ni L, Zhang S, Morrison LM, Gionco J, Xiang Y, Yoon J, Ro H, Haidery F, Grijalva RM, Bae E, Kim K, Martuscello RT, Orr HT, Zoghbi HY, McLoughlin HS, Ranum LPW, Shakkottai VG, Faust PL, Wang S, van Dijk D, Lim J. Longitudinal single-cell transcriptional dynamics throughout neurodegeneration in SCA1. Neuron 2024; 112:362-383.e15. [PMID: 38016472 PMCID: PMC10922326 DOI: 10.1016/j.neuron.2023.10.039] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2022] [Revised: 09/10/2023] [Accepted: 10/27/2023] [Indexed: 11/30/2023]
Abstract
Neurodegeneration is a protracted process involving progressive changes in myriad cell types that ultimately results in the death of vulnerable neuronal populations. To dissect how individual cell types within a heterogeneous tissue contribute to the pathogenesis and progression of a neurodegenerative disorder, we performed longitudinal single-nucleus RNA sequencing of mouse and human spinocerebellar ataxia type 1 (SCA1) cerebellar tissue, establishing continuous dynamic trajectories of each cell population. Importantly, we defined the precise transcriptional changes that precede loss of Purkinje cells and, for the first time, identified robust early transcriptional dysregulation in unipolar brush cells and oligodendroglia. Finally, we applied a deep learning method to predict disease state accurately and identified specific features that enable accurate distinction of wild-type and SCA1 cells. Together, this work reveals new roles for diverse cerebellar cell types in SCA1 and provides a generalizable analysis framework for studying neurodegeneration.
Collapse
Affiliation(s)
- Leon Tejwani
- Interdepartmental Neuroscience Program, Yale School of Medicine, New Haven, CT 06510, USA; Department of Neuroscience, Yale School of Medicine, New Haven, CT 06510, USA.
| | - Neal G Ravindra
- Cardiovascular Research Center, Section of Cardiovascular Medicine, Department of Internal Medicine, Yale School of Medicine, New Haven, CT 06510, USA; Department of Computer Science, Yale University, New Haven, CT 06510, USA
| | - Changwoo Lee
- Interdepartmental Neuroscience Program, Yale School of Medicine, New Haven, CT 06510, USA; Department of Neuroscience, Yale School of Medicine, New Haven, CT 06510, USA
| | - Yubao Cheng
- Department of Genetics, Yale School of Medicine, New Haven, CT 06510, USA
| | - Billy Nguyen
- University of California, San Francisco School of Medicine, San Francisco, CA 94143, USA
| | - Kimberly Luttik
- Interdepartmental Neuroscience Program, Yale School of Medicine, New Haven, CT 06510, USA; Department of Neuroscience, Yale School of Medicine, New Haven, CT 06510, USA
| | - Luhan Ni
- Department of Genetics, Yale School of Medicine, New Haven, CT 06510, USA
| | - Shupei Zhang
- Department of Genetics, Yale School of Medicine, New Haven, CT 06510, USA
| | - Logan M Morrison
- Department of Neurology, University of Texas Southwestern Medical Center, Dallas, TX 75390, USA; Neuroscience Graduate Program, University of Michigan, Ann Arbor, MI 48109, USA
| | - John Gionco
- Department of Pathology and Cell Biology, Columbia University Irving Medical Center and the New York Presbyterian Hospital, New York, NY 10032, USA
| | - Yangfei Xiang
- Department of Genetics, Yale School of Medicine, New Haven, CT 06510, USA; Yale Stem Cell Center, Yale School of Medicine, New Haven, CT 06510, USA
| | | | - Hannah Ro
- Yale College, New Haven, CT 06510, USA
| | | | - Rosalie M Grijalva
- Interdepartmental Neuroscience Program, Yale School of Medicine, New Haven, CT 06510, USA; Department of Neuroscience, Yale School of Medicine, New Haven, CT 06510, USA
| | | | - Kristen Kim
- Interdepartmental Neuroscience Program, Yale School of Medicine, New Haven, CT 06510, USA; Department of Psychiatry, Yale School of Medicine, New Haven, CT 06510, USA
| | - Regina T Martuscello
- Department of Pathology and Cell Biology, Columbia University Irving Medical Center and the New York Presbyterian Hospital, New York, NY 10032, USA
| | - Harry T Orr
- Department of Laboratory Medicine and Pathology, University of Minnesota, Minneapolis, MN 55455, USA
| | - Huda Y Zoghbi
- Jan and Dan Duncan Neurological Research Institute at Texas Children's Hospital, Houston, TX 77030, USA; Department of Pediatrics, Baylor College of Medicine, Houston, TX 77030, USA; Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA; Howard Hughes Medical Institute, Baylor College of Medicine, Houston, TX 77030, USA
| | - Hayley S McLoughlin
- Department of Neurology, University of Michigan, Ann Arbor, MI 48109-2200, USA
| | - Laura P W Ranum
- Department of Molecular Genetics and Microbiology, Center for Neurogenetics, College of Medicine, Genetics Institute, McKnight Brain Institute, University of Florida, Gainesville, FL 32610, USA
| | - Vikram G Shakkottai
- Department of Neurology, University of Texas Southwestern Medical Center, Dallas, TX 75390, USA
| | - Phyllis L Faust
- Department of Pathology and Cell Biology, Columbia University Irving Medical Center and the New York Presbyterian Hospital, New York, NY 10032, USA
| | - Siyuan Wang
- Department of Genetics, Yale School of Medicine, New Haven, CT 06510, USA; Department of Cell Biology, Yale School of Medicine, New Haven, CT 06510, USA.
| | - David van Dijk
- Cardiovascular Research Center, Section of Cardiovascular Medicine, Department of Internal Medicine, Yale School of Medicine, New Haven, CT 06510, USA; Department of Computer Science, Yale University, New Haven, CT 06510, USA.
| | - Janghoo Lim
- Interdepartmental Neuroscience Program, Yale School of Medicine, New Haven, CT 06510, USA; Department of Neuroscience, Yale School of Medicine, New Haven, CT 06510, USA; Department of Genetics, Yale School of Medicine, New Haven, CT 06510, USA; Yale Stem Cell Center, Yale School of Medicine, New Haven, CT 06510, USA; Program in Cellular Neuroscience, Neurodegeneration and Repair, Yale School of Medicine, New Haven, CT 06510, USA; Wu Tsai Institute, Yale School of Medicine, New Haven, CT 06510, USA.
| |
Collapse
|
38
|
Yaldız B, Erdoğan O, Rafatov S, Iyigün C, Aydın Son Y. Revealing third-order interactions through the integration of machine learning and entropy methods in genomic studies. BioData Min 2024; 17:3. [PMID: 38291454 PMCID: PMC10826120 DOI: 10.1186/s13040-024-00355-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2023] [Accepted: 01/16/2024] [Indexed: 02/01/2024] Open
Abstract
BACKGROUND Non-linear relationships at the genotype level are essential in understanding the genetic interactions of complex disease traits. Genome-wide association Studies (GWAS) have revealed statistical association of the SNPs in many complex diseases. As GWAS results could not thoroughly reveal the genetic background of these disorders, Genome-Wide Interaction Studies have started to gain importance. In recent years, various statistical approaches, such as entropy-based methods, have been suggested for revealing these non-additive interactions between variants. This study presents a novel prioritization workflow integrating two-step Random Forest (RF) modeling and entropy analysis after PLINK filtering. PLINK-RF-RF workflow is followed by an entropy-based 3-way interaction information (3WII) method to capture the hidden patterns resulting from non-linear relationships between genotypes in Late-Onset Alzheimer Disease to discover early and differential diagnosis markers. RESULTS Three models from different datasets are developed by integrating PLINK-RF-RF analysis and entropy-based three-way interaction information (3WII) calculation method, which enables the detection of the third-order interactions, which are not primarily considered in epistatic interaction studies. A reduced SNP set is selected for all three datasets by 3WII analysis by PLINK filtering and prioritization of SNP with RF-RF modeling, promising as a model minimization approach. Among SNPs revealed by 3WII, 4 SNPs out of 19 from GenADA, 1 SNP out of 27 from ADNI, and 4 SNPs out of 106 from NCRAD are mapped to genes directly associated with Alzheimer Disease. Additionally, several SNPs are associated with other neurological disorders. Also, the genes the variants mapped to in all datasets are significantly enriched in calcium ion binding, extracellular matrix, external encapsulating structure, and RUNX1 regulates estrogen receptor-mediated transcription pathways. Therefore, these functional pathways are proposed for further examination for a possible LOAD association. Besides, all 3WII variants are proposed as candidate biomarkers for the genotyping-based LOAD diagnosis. CONCLUSION The entropy approach performed in this study reveals the complex genetic interactions that significantly contribute to LOAD risk. We benefited from the entropy-based 3WII as a model minimization step and determined the significant 3-way interactions between the prioritized SNPs by PLINK-RF-RF. This framework is a promising approach for disease association studies, which can also be modified by integrating other machine learning and entropy-based interaction methods.
Collapse
Affiliation(s)
- Burcu Yaldız
- Department of Health Informatics, Graduate School of Informatics, METU, Ankara, Turkey
| | - Onur Erdoğan
- Department of Health Informatics, Graduate School of Informatics, METU, Ankara, Turkey
| | - Sevda Rafatov
- Department of Health Informatics, Graduate School of Informatics, METU, Ankara, Turkey
| | - Cem Iyigün
- Department of Industrial Engineering, METU, Ankara, Turkey
| | - Yeşim Aydın Son
- Department of Health Informatics, Graduate School of Informatics, METU, Ankara, Turkey.
- Graduate School of Informatics, ODTU-NOROM, METU, Ankara, Turkey.
| |
Collapse
|
39
|
Zhao Y, Deng W, Wang Z, Wang Y, Zheng H, Zhou K, Xu Q, Bai L, Liu H, Ren Z, Jiang Z. Genetics of congenital heart disease. Clin Chim Acta 2024; 552:117683. [PMID: 38030030 DOI: 10.1016/j.cca.2023.117683] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2023] [Revised: 11/22/2023] [Accepted: 11/24/2023] [Indexed: 12/01/2023]
Abstract
During embryonic development, the cardiovascular system and the central nervous system exhibit a coordinated developmental process through intricate interactions. Congenital heart disease (CHD) refers to structural or functional abnormalities that occur during embryonic or prenatal heart development and is the most common congenital disorder. One of the most common complications in CHD patients is neurodevelopmental disorders (NDD). However, the specific mechanisms, connections, and precise ways in which CHD co-occurs with NDD remain unclear. According to relevant research, both genetic and non-genetic factors are significant contributors to the co-occurrence of sporadic CHD and NDD. Genetic variations, such as chromosomal abnormalities and gene mutations, play a role in the susceptibility to both CHD and NDD. Further research should aim to identify common molecular mechanisms that underlie the co-occurrence of CHD and NDD, possibly originating from shared genetic mutations or shared gene regulation. Therefore, this review article summarizes the current advances in the genetics of CHD co-occurring with NDD, elucidating the application of relevant gene detection techniques. This is done with the aim of exploring the genetic regulatory mechanisms of CHD co-occurring with NDD at the gene level and promoting research and treatment of developmental disorders related to the cardiovascular and central nervous systems.
Collapse
Affiliation(s)
- Yuanqin Zhao
- Institute of Cardiovascular Disease, Key Lab for Arteriosclerology of Hunan Province, International Joint Laboratory for Arteriosclerotic Disease Research of Hunan Province, University of South China, Hengyang 421001, China.
| | - Wei Deng
- Institute of Cardiovascular Disease, Key Lab for Arteriosclerology of Hunan Province, International Joint Laboratory for Arteriosclerotic Disease Research of Hunan Province, University of South China, Hengyang 421001, China.
| | - Zhaoyue Wang
- Institute of Cardiovascular Disease, Key Lab for Arteriosclerology of Hunan Province, International Joint Laboratory for Arteriosclerotic Disease Research of Hunan Province, University of South China, Hengyang 421001, China.
| | - Yanxia Wang
- Institute of Cardiovascular Disease, Key Lab for Arteriosclerology of Hunan Province, International Joint Laboratory for Arteriosclerotic Disease Research of Hunan Province, University of South China, Hengyang 421001, China.
| | - Hongyu Zheng
- Institute of Cardiovascular Disease, Key Lab for Arteriosclerology of Hunan Province, International Joint Laboratory for Arteriosclerotic Disease Research of Hunan Province, University of South China, Hengyang 421001, China.
| | - Kun Zhou
- Institute of Cardiovascular Disease, Key Lab for Arteriosclerology of Hunan Province, International Joint Laboratory for Arteriosclerotic Disease Research of Hunan Province, University of South China, Hengyang 421001, China.
| | - Qian Xu
- Institute of Cardiovascular Disease, Key Lab for Arteriosclerology of Hunan Province, International Joint Laboratory for Arteriosclerotic Disease Research of Hunan Province, University of South China, Hengyang 421001, China.
| | - Le Bai
- Institute of Cardiovascular Disease, Key Lab for Arteriosclerology of Hunan Province, International Joint Laboratory for Arteriosclerotic Disease Research of Hunan Province, University of South China, Hengyang 421001, China.
| | - Huiting Liu
- Institute of Cardiovascular Disease, Key Lab for Arteriosclerology of Hunan Province, International Joint Laboratory for Arteriosclerotic Disease Research of Hunan Province, University of South China, Hengyang 421001, China.
| | - Zhong Ren
- Institute of Cardiovascular Disease, Key Lab for Arteriosclerology of Hunan Province, International Joint Laboratory for Arteriosclerotic Disease Research of Hunan Province, University of South China, Hengyang 421001, China.
| | - Zhisheng Jiang
- Institute of Cardiovascular Disease, Key Lab for Arteriosclerology of Hunan Province, International Joint Laboratory for Arteriosclerotic Disease Research of Hunan Province, University of South China, Hengyang 421001, China.
| |
Collapse
|
40
|
Yu EYW, Tang QY, Chen YT, Zhang YX, Dai YN, Wu YX, Li WC, Mehrkanoon S, Wang SZ, Zeegers MP, Wesselius A. Genome-wide exploration of genetic interactions for bladder cancer risk. Int J Cancer 2024; 154:81-93. [PMID: 37638657 DOI: 10.1002/ijc.34690] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2023] [Revised: 07/14/2023] [Accepted: 07/31/2023] [Indexed: 08/29/2023]
Abstract
Although GWASs have been conducted to investigate genetic variation of bladder tumorigenesis, little is known about genetic interactions that may influence bladder cancer (BC) risk. By leveraging large-scale participants from UK Biobank, we established a discovery database with 4000 Caucasian participants (2000 cases vs 2000 non-cases), a database with 1648 Caucasian participants (824 cases vs 824 non-cases) and 856 non-Caucasian participants (428 cases vs 428 non-cases) as validation. We then performed a genome-wide SNP-SNP interaction investigation related to BC risk based a machine learning approach (ie, GenEpi). Moreover, we used the selected interactions to build a BC screening model with an integrated interaction-empowered polygenic risk score (iPRS) based on Cox proportional hazard model. With Bonferroni correction, we identified 10 statistically significant pairs of SNPs, which located in 17 chromosomes. Of these, four SNP-SNP interactions were found to be positively associated with BC risk among Caucasian participants (ORs 1.57-2.03), while six SNP-SNP interactions showed negatively associated with BC risk (ORs 0.54-0.65). Only four of the SNP-SNP interactions were consistently identified in non-Caucasian participants located in ST7L-ADSS2, FHIT-CHDH, LARP4B-LHPP and RBFOX3-MPRIP. In addition, the iPRS showed a HR of 1.81 (95% CI: 1.46-2.09) compared the highest tertile to the lowest tertile, with an enhanced AUC (0.91; 95% CI:0.85-0.97) than PRS (AUC: 0.86; 95% CI:0.76-0.95; P-DeLong test = 2.2 × 10-4 ). In summary, this study identified several important SNP-SNP interactions for BC risk, and developed an iPRS model for BC screening, which may help to identify the people at high-risk state of BC before early manifestation.
Collapse
Affiliation(s)
- Evan Yi-Wen Yu
- Key Laboratory of Environmental Medicine and Engineering of Ministry of Education, School of Public Health, Southeast University, Nanjing, China
- Department of Epidemiology & Biostatistics, School of Public Health, Southeast University, Nanjing, China
- Department of Epidemiology, CAPHRI Care and Public Health Research Institute, School of Nutrition and Translational Research in Metabolism, Maastricht University, Maastricht, the Netherlands
| | - Qiu-Yi Tang
- Medical School of Southeast University, Nanjing, China
| | - Ya-Ting Chen
- Key Laboratory of Environmental Medicine and Engineering of Ministry of Education, School of Public Health, Southeast University, Nanjing, China
- Department of Epidemiology & Biostatistics, School of Public Health, Southeast University, Nanjing, China
| | - Yan-Xi Zhang
- Key Laboratory of Environmental Medicine and Engineering of Ministry of Education, School of Public Health, Southeast University, Nanjing, China
- Department of Epidemiology & Biostatistics, School of Public Health, Southeast University, Nanjing, China
| | - Ya-Nan Dai
- Department of Epidemiology, CAPHRI Care and Public Health Research Institute, School of Nutrition and Translational Research in Metabolism, Maastricht University, Maastricht, the Netherlands
| | - Yu-Xuan Wu
- Key Laboratory of Environmental Medicine and Engineering of Ministry of Education, School of Public Health, Southeast University, Nanjing, China
- Department of Epidemiology & Biostatistics, School of Public Health, Southeast University, Nanjing, China
| | - Wen-Chao Li
- Department of Urology, Affiliated Zhongda Hospital of Southeast University, Nanjing, China
| | - Siamak Mehrkanoon
- Information and Computing Sciences, Utrecht University, Utrecht, Netherlands
| | - Shi-Zhi Wang
- Key Laboratory of Environmental Medicine Engineering, Ministry of Education, School of Public Health, Southeast University, Nanjing, China
| | - Maurice P Zeegers
- Department of Epidemiology, CAPHRI Care and Public Health Research Institute, School of Nutrition and Translational Research in Metabolism, Maastricht University, Maastricht, the Netherlands
| | - Anke Wesselius
- Department of Epidemiology, CAPHRI Care and Public Health Research Institute, School of Nutrition and Translational Research in Metabolism, Maastricht University, Maastricht, the Netherlands
| |
Collapse
|
41
|
Li F, Zhao Y, Xu T, Zhang Y. Distributed multi-objective optimization for SNP-SNP interaction detection. Methods 2024; 221:55-64. [PMID: 38061496 DOI: 10.1016/j.ymeth.2023.11.016] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2023] [Revised: 11/20/2023] [Accepted: 11/29/2023] [Indexed: 01/16/2024] Open
Abstract
The detection of complex interactions between single nucleotide polymorphisms (SNPs) plays a vital role in genome-wide association analysis (GWAS). The multi-objective evolutionary algorithm is a promising technique for SNP-SNP interaction detection. However, as the scale of SNP data further increases, the exponentially growing search space gradually becomes the dominant factor, causing evolutionary algorithm (EA)-based approaches to fall into local optima. In addition, multi-objective genetic operations consume significant amounts of time and computational resources. To this end, this study proposes a distributed multi-objective evolutionary framework (DM-EF) to identify SNP-SNP interactions on large-scale datasets. DM-EF first partitions the entire search space into several subspaces based on a space-partitioning strategy, which is nondestructive because it guarantees that each feasible solution is assigned to a specific subspace. Thereafter, each subspace is optimized using a multi-objective EA optimizer, and all subspaces are optimized in parallel. A decomposition-based multi-objective firework optimizer (DCFWA) with several problem-guided operators was designed. Finally, the final output is selected from the Pareto-optimal solutions in the historical search of each subspace. DM-EF avoids the preference for a single objective function, handles the heavy computational burden, and enhances the diversity of the population to avoid local optima. Notably, DM-EF is load-balanced and scalable because it can flexibly partition the space according to the number of available computational nodes and problem size. Experiments on both artificial and real-world datasets demonstrate that the proposed method significantly improves the search speed and accuracy.
Collapse
Affiliation(s)
- Fangting Li
- School of Computer Science and Engineering, Northeastern University, Shenyang, China.
| | - Yuhai Zhao
- School of Computer Science and Engineering, Northeastern University, Shenyang, China.
| | - Tongze Xu
- School of Computer Science and Engineering, Northeastern University, Shenyang, China.
| | - Yuhan Zhang
- College of Medicine and Biological information Engineering, Northeastern University, Shenyang, China.
| |
Collapse
|
42
|
Ren F, Li S, Wen Z, Liu Y, Tang D. The Spherical Evolutionary Multi-Objective (SEMO) Algorithm for Identifying Disease Multi-Locus SNP Interactions. Genes (Basel) 2023; 15:11. [PMID: 38275593 PMCID: PMC10815643 DOI: 10.3390/genes15010011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2023] [Revised: 11/21/2023] [Accepted: 12/18/2023] [Indexed: 01/27/2024] Open
Abstract
Single-nucleotide polymorphisms (SNPs), as disease-related biogenetic markers, are crucial in elucidating complex disease susceptibility and pathogenesis. Due to computational inefficiency, it is difficult to identify high-dimensional SNP interactions efficiently using combinatorial search methods, so the spherical evolutionary multi-objective (SEMO) algorithm for detecting multi-locus SNP interactions was proposed. The algorithm uses a spherical search factor and a feedback mechanism of excellent individual history memory to enhance the balance between search and acquisition. Moreover, a multi-objective fitness function based on the decomposition idea was used to evaluate the associations by combining two functions, K2-Score and LR-Score, as an objective function for the algorithm's evolutionary iterations. The performance evaluation of SEMO was compared with six state-of-the-art algorithms on a simulated dataset. The results showed that SEMO outperforms the comparative methods by detecting SNP interactions quickly and accurately with a shorter average run time. The SEMO algorithm was applied to the Wellcome Trust Case Control Consortium (WTCCC) breast cancer dataset and detected two- and three-point SNP interactions that were significantly associated with breast cancer, confirming the effectiveness of the algorithm. New combinations of SNPs associated with breast cancer were also identified, which will provide a new way to detect SNP interactions quickly and accurately.
Collapse
Affiliation(s)
- Fuxiang Ren
- College of Medical Information Engineering, Guangdong Pharmaceutical University, Guangzhou 510006, China; (F.R.); (S.L.); (Y.L.)
| | - Shiyin Li
- College of Medical Information Engineering, Guangdong Pharmaceutical University, Guangzhou 510006, China; (F.R.); (S.L.); (Y.L.)
| | - Zihao Wen
- College of Mathematics and Informatics, College of Software Engineering, South China Agricultural University, Guangzhou 510642, China
- Faculty of Information Technology, Monash University, Melbourne, VIC 3800, Australia
| | - Yidi Liu
- College of Medical Information Engineering, Guangdong Pharmaceutical University, Guangzhou 510006, China; (F.R.); (S.L.); (Y.L.)
| | - Deyu Tang
- College of Medical Information Engineering, Guangdong Pharmaceutical University, Guangzhou 510006, China; (F.R.); (S.L.); (Y.L.)
- College of Mathematics and Informatics, College of Software Engineering, South China Agricultural University, Guangzhou 510642, China
| |
Collapse
|
43
|
Zuo P, Zhang C, Gao Y, Zhao L, Guo J, Yang Y, Yu Q, Li Y, Wang Z, Yang H. Genome-wide unraveling SNP pairwise epistatic effects associated with sheep body weight. Anim Biotechnol 2023; 34:3416-3427. [PMID: 36495095 DOI: 10.1080/10495398.2022.2152349] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
Epistatic effects are an important part of the genetic effect of complex traits in livestock. In this study, we used 218 synthetic ewes from the Xinjiang Academy of Agricultural Reclamation in China to identify interacting paired with genome-wide single nucleotide polymorphisms (SNPs) associated with birth weight, weaning weight, and one-yearling weight. We detected 2 and 66 SNP-SNP interactions of sheep birth weight and weaning weight, respectively. No significant epistatic interaction of one-year-old body weight was detected. The genetic interaction of sheep body weight is dynamic and time-dependent. Most significant interactions of weaning body weight contributed 1% or higher. In the weaning weight trait, 66 significant SNP pairs consisted of 98 single SNPs covering 23 chromosomes, 5 of which were nonsynonymous SNPs (nsSNPs), resulting in single amino acid substitution. We found that genes that interact with transcription factors (TFs) are target genes for the corresponding TFs. Four epitron networks affecting weaning weight, including subnetworks of HIVEP3 and BACH2 transcription factors, constructed using significant SNP pairs, were also analyzed and annotated. These results suggest that transcription factors may play an important role in explaining epistatic effects. It provides a new idea to study the genetic mechanism of weight developing.
Collapse
Affiliation(s)
- Peng Zuo
- College of Science, Northeast Agricultural University, Harbin
- Bioinformatics Center, Northeast Agricultural University, Harbin, China
| | - Chaoxin Zhang
- Bioinformatics Center, Northeast Agricultural University, Harbin, China
- College of Animal Science and Technology, Northeast Agricultural University, Harbin, China
| | - Yupeng Gao
- Bioinformatics Center, Northeast Agricultural University, Harbin, China
- College of Engineering, Northeast Agricultural University, Harbin, China
| | - Lijunyi Zhao
- Bioinformatics Center, Northeast Agricultural University, Harbin, China
- College of Information and Electrical Engineering, Northeast Agricultural University, Harbin, China
| | - Jiaxu Guo
- Bioinformatics Center, Northeast Agricultural University, Harbin, China
- College of Life Sciences, Northeast Agricultural University, Harbin, China
| | - Yonglin Yang
- State Key Laboratory of Sheep Genetic Improvement and Healthy Production, Xinjiang Academy of Agricultural Reclamation, Shihezi, Hebei, China
| | - Qian Yu
- State Key Laboratory of Sheep Genetic Improvement and Healthy Production, Xinjiang Academy of Agricultural Reclamation, Shihezi, Hebei, China
| | - Yunna Li
- Bioinformatics Center, Northeast Agricultural University, Harbin, China
- College of Animal Science and Technology, Northeast Agricultural University, Harbin, China
| | - Zhipeng Wang
- Bioinformatics Center, Northeast Agricultural University, Harbin, China
- College of Animal Science and Technology, Northeast Agricultural University, Harbin, China
- State Key Laboratory of Sheep Genetic Improvement and Healthy Production, Xinjiang Academy of Agricultural Reclamation, Shihezi, Hebei, China
| | - Hua Yang
- State Key Laboratory of Sheep Genetic Improvement and Healthy Production, Xinjiang Academy of Agricultural Reclamation, Shihezi, Hebei, China
| |
Collapse
|
44
|
Sun NA, Wang YU, Chu J, Han Q, Shen Y. Bayesian Approaches in Exploring Gene-environment and Gene-gene Interactions: A Comprehensive Review. Cancer Genomics Proteomics 2023; 20:669-678. [PMID: 38035701 PMCID: PMC10687732 DOI: 10.21873/cgp.20414] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2023] [Revised: 10/30/2023] [Accepted: 11/01/2023] [Indexed: 12/02/2023] Open
Abstract
Rapid advancements in high-throughput biological techniques have facilitated the generation of high-dimensional omics datasets, which have provided a solid foundation for precision medicine and prognosis prediction. Nonetheless, the problem of missing heritability persists. To solve this problem, it is essential to explain the genetic structure of disease incidence risk and prognosis by incorporating interactions. The development of the Bayesian theory has provided new approaches for developing models for interaction identification and estimation. Several Bayesian models have been developed to improve the accuracy of model and identify the main effect, gene-environment (G×E) and gene-gene (G×G) interactions. Studies based on single-nucleotide polymorphisms (SNPs) are significant for the exploration of rare and common variants. Models based on the effect heredity principle and group-based models are relatively flexible and do not require strict constraints when dealing with the hierarchical structure between the main effect and interactions (M-I). These models have a good interpretability of biological mechanisms. Machine learning-based Bayesian approaches are highly competitive in improving prediction accuracy. These models provide insights into the mechanisms underlying the occurrence and progression of complex diseases, identify more reliable biomarkers, and develop higher predictive accuracy. In this paper, we provide a comprehensive review of these Bayesian approaches.
Collapse
Affiliation(s)
- N A Sun
- Department of Epidemiology and Biostatistics, School of Public Health, Medical College of Soochow University, Suzhou, P.R. China
| | - Y U Wang
- Department of Epidemiology and Biostatistics, School of Public Health, Medical College of Soochow University, Suzhou, P.R. China
| | - Jiadong Chu
- Department of Epidemiology and Biostatistics, School of Public Health, Medical College of Soochow University, Suzhou, P.R. China
| | - Qiang Han
- Department of Epidemiology and Biostatistics, School of Public Health, Medical College of Soochow University, Suzhou, P.R. China
| | - Yueping Shen
- Department of Epidemiology and Biostatistics, School of Public Health, Medical College of Soochow University, Suzhou, P.R. China
| |
Collapse
|
45
|
Zhang W, Wang Y, Deng S, Zhu YC. LncRNA RP11-10E18.7 cooperates with lncRNA RP11-481C4.2 to affect the overall survival of breast cancer patients: a TCGA-based retrospective study. Transl Cancer Res 2023; 12:3156-3165. [PMID: 38130297 PMCID: PMC10731347 DOI: 10.21037/tcr-23-1941] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2023] [Accepted: 11/07/2023] [Indexed: 12/23/2023]
Abstract
BACKGROUND As either oncogenes or tumor suppressor genes, long non-coding RNAs (lncRNAs) have a major role in both tumorigenesis and progression of human cancers, including breast cancer (BC). However, the statistical correlation between the lncRNA-lncRNA interaction and prognosis of BC remains unclear. METHODS We analyzed the fragments per kilobase per million (FPKM) lncRNA expression data in tumor tissue samples from 890 female patients with BC in The Cancer Genome Atlas (TCGA) between May 2021 and October 2022. The Cox proportional hazards model adjusted for age, race, clinical stage, neoadjuvant therapy, estrogen receptor (ER), and progesterone receptor (PR) was adopted to evaluate the lncRNA-lncRNA interaction regarding overall survival (OS) of BC. The multiple comparison was corrected by Bonferroni method. RESULTS RP11-10E18.7×RP11-481C4.2 was significantly associated with OS of BC patients [hazard ratio (HR)interaction =1.04, 95% confidence interval (CI): 1.03-1.06, P=3.35×10-9]. Then, gene-gene interaction analysis was performed for genes co-expressed with lncRNAs. FOXA1×U2SURP (HRinteraction =1.49, 95% CI: 1.28-1.73, P=2.16×10-7) was found to have a similar interactive pattern to RP11-10E18.7×RP11-481C4.2. after classifying the patients by intersection (3.47), we observed that the effect of FOXA1 opposite in patients with different U2SURP expression level (HRhigh vs. low =0.58, 95% CI: 0.34-0.99, P=0.046 in low expression of U2SURP; HRhigh vs. low =1.56, 95% CI: 1.18-2.87, P=0.029 in high expression of U2SURP). CONCLUSIONS Our comprehensive study identified RP11-10E18.7×RP11-481C4.2 as a potential biomarker of BC prognosis. The results play an essential role in the impact of lncRNA-lncRNA interaction on BC survival. Our findings elucidated potential molecular mechanisms of BC progression under complex association patterns and provided potential dynamic and reversible therapeutic targets for BC patients.
Collapse
Affiliation(s)
- Wenzhong Zhang
- Department of Surgery, Pudong New Area People’s Hospital Affiliated to Shanghai University of Medicine and Health Sciences, Shanghai, China
| | - Yueming Wang
- Department of Surgery, Pudong New Area People’s Hospital Affiliated to Shanghai University of Medicine and Health Sciences, Shanghai, China
| | - Shuhao Deng
- Department of Ultrasound, Pudong New Area People’s Hospital Affiliated to Shanghai University of Medicine and Health Sciences, Shanghai, China
| | - Yi-Cheng Zhu
- Department of Ultrasound, Pudong New Area People’s Hospital Affiliated to Shanghai University of Medicine and Health Sciences, Shanghai, China
| |
Collapse
|
46
|
Behairy MY, Eid RA, Otifi HM, Mohammed HM, Alshehri MA, Asiri A, Aldehri M, Zaki MSA, Darwish KM, Elhady SS, El-Shaer NH, Eldeen MA. Unraveling Extremely Damaging IRAK4 Variants and Their Potential Implications for IRAK4 Inhibitor Efficacy. J Pers Med 2023; 13:1648. [PMID: 38138875 PMCID: PMC10744719 DOI: 10.3390/jpm13121648] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2023] [Revised: 11/02/2023] [Accepted: 11/07/2023] [Indexed: 12/24/2023] Open
Abstract
Interleukin-1-receptor-associated kinase 4 (IRAK4) possesses a crucial function in the toll-like receptor (TLR) signaling pathway, and the dysfunction of this molecule could lead to various infectious and immune-related diseases in addition to cancers. IRAK4 genetic variants have been linked to various types of diseases. Therefore, we conducted a comprehensive analysis to recognize the missense variants with the most damaging impacts on IRAK4 with the employment of diverse bioinformatics tools to study single-nucleotide polymorphisms' effects on function, stability, secondary structures, and 3D structure. The residues' location on the protein domain and their conservation status were investigated as well. Moreover, docking tools along with structural biology were engaged in analyzing the SNPs' effects on one of the developed IRAK4 inhibitors. By analyzing IRAK4 gene SNPs, the analysis distinguished ten variants as the most detrimental missense variants. All variants were situated in highly conserved positions on an important protein domain. L318S and L318F mutations were linked to changes in IRAK4 secondary structures. Eight SNPs were revealed to have a decreasing effect on the stability of IRAK4 via both I-Mutant 2.0 and Mu-Pro tools, while Mu-Pro tool identified a decreasing effect for the G198E SNP. In addition, detrimental effects on the 3D structure of IRAK4 were also discovered for the selected variants. Molecular modeling studies highlighted the detrimental impact of these identified SNP mutant residues on the druggability of the IRAK4 ATP-binding site towards the known target inhibitor, HG-12-6, as compared to the native protein. The loss of important ligand residue-wise contacts, altered protein global flexibility, increased steric clashes, and even electronic penalties at the ligand-binding site interfaces were all suggested to be associated with SNP models for hampering the HG-12-6 affinity towards IRAK4 target protein. This given model lays the foundation for the better prediction of various disorders relevant to IRAK4 malfunction and sheds light on the impact of deleterious IRAK4 variants on IRAK4 inhibitor efficacy.
Collapse
Affiliation(s)
- Mohammed Y. Behairy
- Department of Microbiology and Immunology, Faculty of Pharmacy, University of Sadat City, Sadat City 32897, Egypt;
| | - Refaat A. Eid
- Department of Pathology, College of Medicine, King Khalid University, Abha P.O. Box 61421, Saudi Arabia; (R.A.E.); (H.M.O.)
| | - Hassan M. Otifi
- Department of Pathology, College of Medicine, King Khalid University, Abha P.O. Box 61421, Saudi Arabia; (R.A.E.); (H.M.O.)
| | - Heitham M. Mohammed
- Department of Anatomy, College of Medicine, King Khalid University, Abha P.O. Box 61421, Saudi Arabia; (H.M.M.); (M.A.); (M.S.A.Z.)
| | - Mohammed A. Alshehri
- Department of Child Health, College of Medicine, King Khalid University, Abha P.O. Box 62529, Saudi Arabia; (M.A.A.)
| | - Ashwag Asiri
- Department of Child Health, College of Medicine, King Khalid University, Abha P.O. Box 62529, Saudi Arabia; (M.A.A.)
| | - Majed Aldehri
- Department of Anatomy, College of Medicine, King Khalid University, Abha P.O. Box 61421, Saudi Arabia; (H.M.M.); (M.A.); (M.S.A.Z.)
| | - Mohamed Samir A. Zaki
- Department of Anatomy, College of Medicine, King Khalid University, Abha P.O. Box 61421, Saudi Arabia; (H.M.M.); (M.A.); (M.S.A.Z.)
| | - Khaled M. Darwish
- Department of Medicinal Chemistry, Faculty of Pharmacy, Suez Canal University, Ismailia 41522, Egypt;
| | - Sameh S. Elhady
- Department of Natural Products, Faculty of Pharmacy, King Abdulaziz University, Jeddah 21589, Saudi Arabia;
| | - Nahla H. El-Shaer
- Department of Zoology, Faculty of Science, Zagazig University, Zagazig 44511, Egypt;
| | - Muhammad Alaa Eldeen
- Department of Zoology, Faculty of Science, Zagazig University, Zagazig 44511, Egypt;
| |
Collapse
|
47
|
Skodvin SN, Gjessing HK, Jugessur A, Romanowska J, Page CM, Corfield EC, Lee Y, Håberg SE, Gjerdevik M. Statistical methods to detect mother-father genetic interaction effects on risk of infertility: A genome-wide approach. Genet Epidemiol 2023; 47:503-519. [PMID: 37638522 DOI: 10.1002/gepi.22534] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2023] [Revised: 05/25/2023] [Accepted: 08/01/2023] [Indexed: 08/29/2023]
Abstract
Infertility is a heterogeneous phenotype, and for many couples, the causes of fertility problems remain unknown. One understudied hypothesis is that allelic interactions between the genotypes of the two parents may influence the risk of infertility. Our aim was, therefore, to investigate how allelic interactions can be modeled using parental genotype data linked to 15,789 pregnancies selected from the Norwegian Mother, Father, and Child Cohort Study. The newborns in 1304 of these pregnancies were conceived using assisted reproductive technologies (ART), and the remainder were conceived naturally. Treating the use of ART as a proxy for infertility, different parameterizations were implemented in a genome-wide screen for interaction effects between maternal and paternal alleles at the same locus. Some of the models were more similar in the way they were parameterized, and some produced similar results when implemented on a genome-wide scale. The results showed near-significant interaction effects in genes relevant to the phenotype under study, such as Dynein axonemal heavy chain 17 (DNAH17) with a recognized role in male infertility. More generally, the interaction models presented here are readily adaptable to the study of other phenotypes in which maternal and paternal allelic interactions are likely to be involved.
Collapse
Affiliation(s)
- Siri N Skodvin
- Centre for Fertility and Health, Norwegian Institute of Public Health, Oslo, Norway
- Department of Global Public Health and Primary Care, University of Bergen, Bergen, Norway
| | - Håkon K Gjessing
- Centre for Fertility and Health, Norwegian Institute of Public Health, Oslo, Norway
- Department of Global Public Health and Primary Care, University of Bergen, Bergen, Norway
| | - Astanand Jugessur
- Centre for Fertility and Health, Norwegian Institute of Public Health, Oslo, Norway
- Department of Global Public Health and Primary Care, University of Bergen, Bergen, Norway
| | - Julia Romanowska
- Centre for Fertility and Health, Norwegian Institute of Public Health, Oslo, Norway
- Department of Global Public Health and Primary Care, University of Bergen, Bergen, Norway
| | - Christian M Page
- Department of Physical Health and Ageing, Division of Mental and Physical Health, Norwegian Institute of Public Health, Oslo, Norway
| | - Elizabeth C Corfield
- Department of Mental Disorders, Norwegian Institute of Public Health, Oslo, Norway
- Nic Waals Institute, Lovisenberg Diaconal Hospital, Oslo, Norway
| | - Yunsung Lee
- Centre for Fertility and Health, Norwegian Institute of Public Health, Oslo, Norway
| | - Siri E Håberg
- Centre for Fertility and Health, Norwegian Institute of Public Health, Oslo, Norway
| | - Miriam Gjerdevik
- Centre for Fertility and Health, Norwegian Institute of Public Health, Oslo, Norway
- Department of Computer Science, Electrical Engineering and Mathematical Sciences, Western Norway University of Applied Sciences, Bergen, Norway
| |
Collapse
|
48
|
Tang S, Mao S, Chen Y, Tan F, Duan L, Pian C, Zeng X. LRBmat: A novel gut microbial interaction and individual heterogeneity inference method for colorectal cancer. J Theor Biol 2023; 571:111538. [PMID: 37257720 DOI: 10.1016/j.jtbi.2023.111538] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2022] [Revised: 05/07/2023] [Accepted: 05/18/2023] [Indexed: 06/02/2023]
Abstract
The gut microbial community has been shown to play a significant role in various diseases, including colorectal cancer (CRC), which is a major public health concern worldwide. The accurate diagnosis and etiological analysis of CRC are crucial issues. Numerous methods have utilized gut microbiota to address these challenges; however, few have considered the complex interactions and individual heterogeneity of the gut microbiota, which are important issues in genetics and intestinal microbiology, particularly in high-dimensional cases. This paper presents a novel method called Binary matrix based on Logistic Regression (LRBmat) to address these concerns. The binary matrix in LRBmat can directly mitigate or eliminate the influence of heterogeneity, while also capturing information on gut microbial interactions with any order. LRBmat is highly adaptable and can be combined with any machine learning method to enhance its capabilities. The proposed method was evaluated using real CRC data and demonstrated superior classification performance compared to state-of-the-art methods. Furthermore, the association rules extracted from the binary matrix of the real data align well with biological properties and existing literature, thereby aiding in the etiological analysis of CRC.
Collapse
Affiliation(s)
- Shan Tang
- Department of Statistics, Hunan University, Changsha 410006, China
| | - Shanjun Mao
- Department of Statistics, Hunan University, Changsha 410006, China.
| | - Yangyang Chen
- Department of Computer Science, University of Tsukuba, Tsukuba 3058577, Japan
| | - Falong Tan
- Department of Statistics, Hunan University, Changsha 410006, China
| | - Lihua Duan
- Department of Rheumatology and Clinical Immunology, Jiangxi Provincial People's Hospital, The First Affiliated Hospital of Nanchang University, Nanchang 330006, China
| | - Cong Pian
- College of Sciences, Nanjing Agricultural University, Nanjing 210095, China
| | - Xiangxiang Zeng
- Department of Computer Science, Hunan University, Changsha 410086, China
| |
Collapse
|
49
|
Abstract
Despite monumental advances in molecular technology to generate genome sequence data at scale, there is still a considerable proportion of heritability in most complex diseases that remains unexplained. Because many of the discoveries have been single-nucleotide variants with small to moderate effects on disease, the functional implication of many of the variants is still unknown and, thus, we have limited new drug targets and therapeutics. We, and many others, posit that one primary factor that has limited our ability to identify novel drug targets from genome-wide association studies may be due to gene interactions (epistasis), gene-environment interactions, network/pathway effects, or multiomic relationships. We propose that many of these complex models explain much of the underlying genetic architecture of complex disease. In this review, we discuss the evidence from multiple research avenues, ranging from pairs of alleles to multiomic integration studies and pharmacogenomics, that supports the need for further investigation of gene interactions (or epistasis) in genetic and genomic studies of human disease. Our goal is to catalog the mounting evidence for epistasis in genetic studies and the connections between genetic interactions and human health and disease that could enable precision medicine of the future.
Collapse
Affiliation(s)
- Pankhuri Singhal
- Genetics and Epigenetics Graduate Group, University of Pennsylvania Perelman School of Medicine, Philadelphia, Pennsylvania, USA
| | - Shefali Setia Verma
- Department of Pathology and Laboratory Medicine, University of Pennsylvania Perelman School of Medicine, Philadelphia, Pennsylvania, USA
| | - Marylyn D Ritchie
- Department of Genetics, University of Pennsylvania Perelman School of Medicine, Philadelphia, Pennsylvania, USA;
- Penn Institute for Biomedical Informatics, University of Pennsylvania, Philadelphia, Pennsylvania, USA
| |
Collapse
|
50
|
Tsouris A, Brach G, Friedrich A, Hou J, Schacherer J. Diallel panel reveals a significant impact of low-frequency genetic variants on gene expression variation in yeast. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.07.21.550015. [PMID: 37503053 PMCID: PMC10370210 DOI: 10.1101/2023.07.21.550015] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/29/2023]
Abstract
Unraveling the genetic sources of gene expression variation is essential to better understand the origins of phenotypic diversity in natural populations. Genome-wide association studies identified thousands of variants involved in gene expression variation, however, variants detected only explain part of the heritability. In fact, variants such as low-frequency and structural variants (SVs) are poorly captured in association studies. To assess the impact of these variants on gene expression variation, we explored a half-diallel panel composed of 323 hybrids originated from pairwise crosses of 26 natural Saccharomyces cerevisiae isolates. Using short- and long-read sequencing strategies, we established an exhaustive catalog of single nucleotide polymorphisms (SNPs) and SVs for this panel. Combining this dataset with the transcriptomes of all hybrids, we comprehensively mapped SNPs and SVs associated with gene expression variation. While SVs impact gene expression variation, SNPs exhibit a higher effect size with an overrepresentation of low-frequency variants compared to common ones. These results reinforce the importance of dissecting the heritability of complex traits with a comprehensive catalog of genetic variants at the population level.
Collapse
Affiliation(s)
- Andreas Tsouris
- Université de Strasbourg, CNRS, GMGM UMR 7156, Strasbourg, France
| | - Gauthier Brach
- Université de Strasbourg, CNRS, GMGM UMR 7156, Strasbourg, France
| | - Anne Friedrich
- Université de Strasbourg, CNRS, GMGM UMR 7156, Strasbourg, France
| | - Jing Hou
- Université de Strasbourg, CNRS, GMGM UMR 7156, Strasbourg, France
| | - Joseph Schacherer
- Université de Strasbourg, CNRS, GMGM UMR 7156, Strasbourg, France
- Institut Universitaire de France (IUF), Paris, France
| |
Collapse
|