Published online Aug 28, 2021. doi: 10.35712/aig.v2.i4.105
Peer-review started: March 16, 2021
First decision: April 15, 2021
Revised: April 22, 2021
Accepted: July 9, 2021
Article in press: July 9, 2021
Published online: August 28, 2021
Processing time: 166 Days and 12.5 Hours
The microbiome has been identified as a causing factor for many cancers. Helicobacter pylori contributes to the development of gastric cancer (GC) and impacts disease treatments. The rapid development of sequencing technology is increasingly producing large-scale and complex big data. However, there are many obstacles in the analysis of these data by humans, which limit clinicians from making rapid decisions. Recently, the emergence of artificial intelligence (AI), including machine learning and deep learning, has greatly assisted clinicians in processing and interpreting large microbiome data. This paper reviews the application of AI in the study of the microbiome and discusses its potential in the diagnosis and therapy of GC. We also exemplify strategies for implementing microbiome-based precision medicines for patients with GC.
Core Tip: Artificial intelligence (AI) helps us understand the role of the microbiome in gastric cancer (GC) and further promote the development precision medicine. AI can be applied in the following three aspects: (1) AI improves the diagnostic accuracy for GC based on big data and gastric microbiome; (2) AI aids pathologists to diagnose gastric biopsies rapidly by sensitively detecting low abundance microbes; and (3) AI regulates individual’s dietary intake by giving new insight into host-microbiome interactions.
- Citation: Li ZM, Zhuang X. Application of artificial intelligence in microbiome study promotes precision medicine for gastric cancer. Artif Intell Gastroenterol 2021; 2(4): 105-110
- URL: https://www.wjgnet.com/2644-3236/full/v2/i4/105.htm
- DOI: https://dx.doi.org/10.35712/aig.v2.i4.105
Gastric cancer (GC, also known as stomach cancer) is the second leading cause of cancer-related mortality globally, with over 70000 new cases diagnosed every year[1]. The 5-year survival rate of GC is lower than 15%, even in the United States[2]. According to Lauren's criteria, GC can be classified into two main types: Diffuse and intestinal. The diffuse type usually appears in younger patients and tends to be more aggressive, whereas the intestinal type is usually found in older patients and is caused by chronic infection with Helicobacter pylori (H. pylori)[3]. The microbiota in the stomach is extremely rich and complex[4]. DNA sequencing and computational methods are making astounding advances in the identification of conserved ribosomal RNA (rRNA) genes for pathogenic microorganisms. More than 100 phylotypes have been uncovered in humans, and the majority of gastric microbiota falls within five phyla, including Bacteroidetes, Firmicutes, Proteobacteria, Actinobacteria, and Fusobacteria. H. pylori belongs to Proteobacteria. H. pylori infection triggers multistep progression from chronic gastritis, atrophic gastritis, and intestinal metaplasia to carcinoma finally. However, the issue of how the gastric microbiota interplays with H. pylori (namely, does the gastric microbiota lead to a more virulent H. pylori or, vice versa, does H. pylori facilitate the carcinogenesis of the microbiota?) is still not clear. This might have implications for clinical management.
Artificial intelligence (AI) is the simulation of human intelligence processes by computers and has been applied in various fields, such as image processing and natural language processing. AI is playing an increasingly important role in healthcare. It has been demonstrated that AI algorithms can support humans in simplifying the multidimensional, complex metagenomic data of gene profiling and elucidating the peculiar signatures of beneficial microbes in the gastrointestinal tract[5]. As a core branch of AI, machine learning (ML) focuses on building mathematical models that help machines make predictions or decisions without being explicitly programmed. In the field of ML, deep learning (DL) has become the dominant approach for ongoing work with big data. DL, a subset of ML, is inspired by the information processing system discovered in the human brain. DL uses numerous layers of algorithms (artificial neural networks) to extract higher-level features from raw input. Briefly, ML is a core branch of AI, and DL is performed to implement ML. ML and DL have been successfully used to predict the risk of GC[6].
Gastroenterology is a field where AI can make a significant difference. Traditional diagnostic methods have insufficient resolution ability to estimate the invasion depth of early GC in the clinic. Thus, over one-third of advanced GC cases with lesions around the cardia are not easily detected by image-based methods[7]. However, AI-assisted image analysis using endoscopic detection can make more accurate assessments and provide more details than conventional analysis[8]. There are still two main limitations in AI-assisted image analysis. First, there are relatively few data serving as learning and testing materials for building DL models. Second, the diagnostic accuracy is greatly affected when low-resolution images, which endoscopists usually encounter in clinical practice, are input. The above two points may cause certain defects in medical decisions based on image analysis. Remarkably, the combination of AI and the microbiome shows great potential in precision medicine for GC.
High-throughput sequencing is becoming a common technology for typing microbial isolates, especially in clinical samples. Many gene mutations, transcriptional differences, translational differences, epigenetic variations, and metabolic changes have been identified as being associated with the heterogeneity and stage of GC. High-throughput sequencing generates massive microbial data. A deep understanding of microbial data is helpful to explain the relationship between microbes and diseases[9]. Virulence among H. pylori strains and host genetic polymorphisms contribute to GC susceptibility. AI algorithms effectively improve our understanding of the gastric microbiota due to two major advantages. First, AI methods can be applied to extract microbial genomic DNA from sequencing samples. Second, AI methods can simultaneously examine all genes in all organisms contained in a sample. Combined with other parameters, such as food habits, duration of infection, and physical activity, AI algorithms can provide better health advice to GC patients. A recent study has started to explore the ability of DL to treat diseases related to gut dysbiosis based on the individual’s microbiome pattern[10]. In the future, researchers can develop AI algorithms to regulate the individual’s dietary intake and plan their meals when we fully understand the microbiome differences between people with and without disease (Figure 1).
Studying the microbiome composition of primary samples provides a chance to understand the role of pathogenic microorganisms in disease development. In the late 2000s, two large-scale international human microbiome projects (HMPs), Metagenomics of the Human Intestinal Tract[11] and the HMP[12], were initiated to study microorganisms in the human body and to develop computational methods that analyze sequenced metagenomes. However, it seems challenging due to the low number of microbial DNA relative to the host DNA. Accurate identification of the microbiome requires the removal of all possible sequencing reads that originate from human DNA. Bacterial identification was commonly completed by characterization of uniform genomic coverage[13]. For example, the sequence identity of 16S rRNA gene fragments greater than 97% can be classified into separate operational taxonomic units (OTUs), which means the phylogenetic boundaries of different bacterial species[14]. Bacterial identification can also be completed based on coverage along a narrow region of their genomes. For example, analysis of amplicon sequence variants improves the sensitivity and specificity and decreases the problem of inflated microbiota datasets due to falsely identified OTUs originating from misclustered sequences[15]. Recently, Lupolova et al[16] found that ML algorithms made a good attribution of the host sources of S. enterica serovar Typhimurium isolates[16]. The combination of 16S rRNA gene sequencing data and AI algorithms may reveal the essential role of low-abundance bacteria in the alteration of the gut microbiota composition.
It is challenging to quantify and characterize microbiome profiling in samples where the bacterial content is relatively low. The microbial community in the stomach is typically restricted by the lower luminal pH, which selects for acid-resistant bacterial populations and usually limits the colonization densities to < 1000 colony-forming units per gram (CFU/g)[17]. The current approach for detecting the bacteria of fecal or environmental samples cannot be directly used to analyze the microbiome from the upper gastrointestinal tract, such as the stomach. This is partly because the high amount of human DNA in the samples confounds microbial identification. Klein et al[18] designed a DL algorithm that can be used to detect H. pylori on regular whole slide images of gastric biopsies, achieving a sensitivity of 100%[18]. Detecting the low abundance bacteria without sample processing facilitates the establishment of a rapid diagnostic method. Recently, we designed magnetic nanoparticles with a broad range of capture potentials via electrostatic attractions[19]. This system can rapidly and efficiently capture bacteria at a low concentration of 10 CFU/mL within 1 h. The capture efficiency was more than 90%. It can be used to evaluate the microbiome profile of gastric biopsies in future studies.
A comparative study of GC and chronic gastritis using an approach targeting the 16S rRNA gene of mucosal biopsies showed that bacterial diversity was decreased in GC patients[20]. Patients with GC had a large number of non-Helicobacter Proteobacteria. Colonization with bacteria other than H. pylori breaks the balance between the resident gastric microbiota and the host, which may increase the risk for H. pylori-related cancer. Another study evaluated the microbiota composition in normal, peritumoral, and tumoral tissues by 16S rRNA gene profiling and found that microbial diversity was significantly reduced in peritumoral and tumoral microhabitats[21]. H. pylori, Prevotella copri, and Bacteroides uniformis were relatively less abundant in the tumoral microhabitat, whereas Prevotella melaninogenica, Streptococcus anginosus, and Propionibacterium acnes were more abundant. The authors proposed the hypothesis that chronic atrophic gastritis with atrophy (the acidity of the microenvironment of the stomach is reduced) was attributed to H. pylori substitution by a cancer-prone microbiota[22]. Additionally, the same research team found a close relationship between the subtype of immune cells (regulatory T cells and plasmacytoid dendritic cells) and gastric microbiota dysbiosis within the tumor microenvironment. It is already known that H. pylori infection functions in the development of precancerous lesions, such as chronic gastritis. Nevertheless, the dramatic changes in the composition of the stomach microbiome play a more direct role in the later stages of cancer. Moreover, the microbiome affects the therapeutic response of GC patients, and the treatment also impacts microbial composition. Distal gastrectomy impacts postoperative gut microbiota composition, leading to higher abundances of Escherichia, Shigella, Veillonella, and Clostridium XVIII and a lower abundance of Bacteroides[23]. Immune checkpoint inhibitors targeting programmed cell death 1 (PD-1)/programmed cell death ligand 1 were recently added to the therapeutic arsenal for GC. The microbiome composition interferes with the response to these inhibitors. A recent study reported that nonresponders to PD-1 blockade immunotherapy can be distinguished from responders according to the ratio of putatively favorable to unfavorable bacteria[24]. Thus, the role of the microbiome in cancer-immune interactions is gaining much attention. When we learn more about host-microbiome interactions, nonresponders to checkpoint inhibitors are easier to select and treat by personalized immunotherapy.
Due to the practical limitations of analysis methods, there are still large gaps on how the microbiome mechanically affects host function at the system and community levels. Notably, the past few decades has seen significant work on AI in filling these existing gaps. AI algorithms can co-analyze heterogeneous datasets and capture changes at the microbial and host levels. These methods can be classified into four types: Interfering protein-protein interactions, interfering RNA-mediated interactions, interfering microbe-host metabolic networks, and integrating multiple interspecies and intraspecies networks and omic datasets[25]. The powerful multiomics tools and rapidly developed AI algorithms can greatly enhance or perhaps revolutionize microbiome research. This collaboration provides hopeful expectations to improve our current understanding of GC mechanisms, as well as better detection and treatment.
We live in a world surrounded by data and microbes. The gastric microbiome occupies an important position in maintaining the individual’s health. A large quantity of complex sequencing data are generated by high-throughput technologies. However, inherent challenges still exist in data processing, including confounding variables from abundant organisms, the integration of different omics data, and the relationships between microbes and their hosts. Currently, big data are easier than ever to analyze due to the assistance of AI technologies. AI is evolving as an important tool for the proposal of new biological hypotheses and the discovery of biomarkers from the available data. In the future, the renewal of the stomach of dysbiosis patients may be achieved by synthetic biology and food engineering based on our understanding of the microbiome and the performance of AI.
Manuscript source: Invited manuscript
Specialty type: Gastroenterology and hepatology
Country/Territory of origin: China
Peer-review report’s scientific quality classification
Grade A (Excellent): 0
Grade B (Very good): B
Grade C (Good): C, C, C
Grade D (Fair): 0
Grade E (Poor): 0
P-Reviewer: Abreu de Melo MI, Kinami S, Moradi L, Sharma J S-Editor: Fan JR L-Editor: Wang TQ P-Editor: Li JH
1. | Brawner KM, Morrow CD, Smith PD. Gastric microbiome and gastric cancer. Cancer J. 2014;20:211-216. [PubMed] [DOI] [Cited in This Article: ] [Cited by in Crossref: 101] [Cited by in F6Publishing: 93] [Article Influence: 8.5] [Reference Citation Analysis (0)] |
2. | Polk DB, Peek RM Jr. Helicobacter pylori: gastric cancer and beyond. Nat Rev Cancer. 2010;10:403-414. [PubMed] [DOI] [Cited in This Article: ] [Cited by in Crossref: 718] [Cited by in F6Publishing: 822] [Article Influence: 54.8] [Reference Citation Analysis (0)] |
3. | Quadri HS, Smaglo BG, Morales SJ, Phillips AC, Martin AD, Chalhoub WM, Haddad NG, Unger KR, Levy AD, Al-Refaie WB. Gastric Adenocarcinoma: A Multimodal Approach. Front Surg. 2017;4:42. [PubMed] [DOI] [Cited in This Article: ] [Cited by in Crossref: 12] [Cited by in F6Publishing: 11] [Article Influence: 1.4] [Reference Citation Analysis (0)] |
4. | Knight ZA, Tan K, Birsoy K, Schmidt S, Garrison JL, Wysocki RW, Emiliano A, Ekstrand MI, Friedman JM. Molecular profiling of activated neurons by phosphorylated ribosome capture. Cell. 2012;151:1126-1137. [PubMed] [DOI] [Cited in This Article: ] [Cited by in Crossref: 212] [Cited by in F6Publishing: 233] [Article Influence: 19.4] [Reference Citation Analysis (1)] |
5. | Qu K, Guo F, Liu X, Lin Y, Zou Q. Application of Machine Learning in Microbiology. Front Microbiol. 2019;10:827. [PubMed] [DOI] [Cited in This Article: ] [Cited by in Crossref: 120] [Cited by in F6Publishing: 95] [Article Influence: 15.8] [Reference Citation Analysis (0)] |
6. | Taninaga J, Nishiyama Y, Fujibayashi K, Gunji T, Sasabe N, Iijima K, Naito T. Prediction of future gastric cancer risk using a machine learning algorithm and comprehensive medical check-up data: A case-control study. Sci Rep. 2019;9:12384. [PubMed] [DOI] [Cited in This Article: ] [Cited by in Crossref: 36] [Cited by in F6Publishing: 51] [Article Influence: 8.5] [Reference Citation Analysis (0)] |
7. | Sung IK, Kim YC, Yun JW, Seo HI, Park DI, Cho YK, Kim HJ, Park JH, Sohn CI, Jeon WK, Kim BI, Oh SJ, Son BH, Yoo CH, Sohn JH, Lee HY, Won KH. Characteristics of advanced gastric cancer undetected on gastroscopy. Korean J Gastroenterol. 2011;57:288-293. [PubMed] [DOI] [Cited in This Article: ] [Cited by in Crossref: 9] [Cited by in F6Publishing: 10] [Article Influence: 0.7] [Reference Citation Analysis (0)] |
8. | Mori Y, Kudo SE, Misawa M, Saito Y, Ikematsu H, Hotta K, Ohtsuka K, Urushibara F, Kataoka S, Ogawa Y, Maeda Y, Takeda K, Nakamura H, Ichimasa K, Kudo T, Hayashi T, Wakamura K, Ishida F, Inoue H, Itoh H, Oda M, Mori K. Real-Time Use of Artificial Intelligence in Identification of Diminutive Polyps During Colonoscopy: A Prospective Study. Ann Intern Med. 2018;169:357-366. [PubMed] [DOI] [Cited in This Article: ] [Cited by in Crossref: 299] [Cited by in F6Publishing: 324] [Article Influence: 46.3] [Reference Citation Analysis (1)] |
9. | Shi XJ, Wei Y, Ji B. Systems Biology of Gastric Cancer: Perspectives on the Omics-Based Diagnosis and Treatment. Front Mol Biosci. 2020;7:203. [PubMed] [DOI] [Cited in This Article: ] [Cited by in Crossref: 14] [Cited by in F6Publishing: 15] [Article Influence: 3.0] [Reference Citation Analysis (0)] |
10. | Zhou YH, Gallins P. A Review and Tutorial of Machine Learning Methods for Microbiome Host Trait Prediction. Front Genet. 2019;10:579. [PubMed] [DOI] [Cited in This Article: ] [Cited by in Crossref: 77] [Cited by in F6Publishing: 100] [Article Influence: 16.7] [Reference Citation Analysis (0)] |
11. | Qin J, Li R, Raes J, Arumugam M, Burgdorf KS, Manichanh C, Nielsen T, Pons N, Levenez F, Yamada T, Mende DR, Li J, Xu J, Li S, Li D, Cao J, Wang B, Liang H, Zheng H, Xie Y, Tap J, Lepage P, Bertalan M, Batto JM, Hansen T, Le Paslier D, Linneberg A, Nielsen HB, Pelletier E, Renault P, Sicheritz-Ponten T, Turner K, Zhu H, Yu C, Jian M, Zhou Y, Li Y, Zhang X, Qin N, Yang H, Wang J, Brunak S, Doré J, Guarner F, Kristiansen K, Pedersen O, Parkhill J, Weissenbach J; MetaHIT Consortium; Bork P, Ehrlich SD. A human gut microbial gene catalogue established by metagenomic sequencing. Nature. 2010;464:59-65. [PubMed] [DOI] [Cited in This Article: ] [Cited by in Crossref: 7052] [Cited by in F6Publishing: 7580] [Article Influence: 505.3] [Reference Citation Analysis (4)] |
12. | Kim Y, Koh I, Rho M. Deciphering the human microbiome using next-generation sequencing data and bioinformatics approaches. Methods. 2015;79-80:52-59. [PubMed] [DOI] [Cited in This Article: ] [Cited by in Crossref: 37] [Cited by in F6Publishing: 38] [Article Influence: 3.5] [Reference Citation Analysis (0)] |
13. | Zhang C, Cleveland K, Schnoll-Sussman F, McClure B, Bigg M, Thakkar P, Schultz N, Shah MA, Betel D. Identification of low abundance microbiome in clinical samples using whole genome sequencing. Genome Biol. 2015;16:265. [PubMed] [DOI] [Cited in This Article: ] [Cited by in Crossref: 65] [Cited by in F6Publishing: 68] [Article Influence: 6.8] [Reference Citation Analysis (0)] |
14. | Fricker AM, Podlesny D, Fricke WF. What is new and relevant for sequencing-based microbiome research? J Adv Res. 2019;19:105-112. [PubMed] [DOI] [Cited in This Article: ] [Cited by in Crossref: 69] [Cited by in F6Publishing: 70] [Article Influence: 11.7] [Reference Citation Analysis (0)] |
15. | Kopylova E, Navas-Molina JA, Mercier C, Xu ZZ, Mahé F, He Y, Zhou HW, Rognes T, Caporaso JG, Knight R. Open-Source Sequence Clustering Methods Improve the State Of the Art. mSystems. 2016;1. [PubMed] [DOI] [Cited in This Article: ] [Cited by in Crossref: 121] [Cited by in F6Publishing: 110] [Article Influence: 12.2] [Reference Citation Analysis (0)] |
16. | Lupolova N, Lycett SJ, Gally DL. A guide to machine learning for bacterial host attribution using genome sequence data. Microb Genom. 2019;5. [PubMed] [DOI] [Cited in This Article: ] [Cited by in Crossref: 13] [Cited by in F6Publishing: 23] [Article Influence: 4.6] [Reference Citation Analysis (0)] |
17. | Sheh A, Fox JG. The role of the gastrointestinal microbiome in Helicobacter pylori pathogenesis. Gut Microbes. 2013;4:505-531. [PubMed] [DOI] [Cited in This Article: ] [Cited by in Crossref: 127] [Cited by in F6Publishing: 141] [Article Influence: 11.8] [Reference Citation Analysis (0)] |
18. | Klein S, Gildenblat J, Ihle MA, Merkelbach-Bruse S, Noh KW, Peifer M, Quaas A, Büttner R. Deep learning for sensitive detection of Helicobacter Pylori in gastric biopsies. BMC Gastroenterol. 2020;20:417. [PubMed] [DOI] [Cited in This Article: ] [Cited by in Crossref: 13] [Cited by in F6Publishing: 19] [Article Influence: 3.8] [Reference Citation Analysis (0)] |
19. | Li Z, Ma J, Ruan J, Zhuang X. Using Positively Charged Magnetic Nanoparticles to Capture Bacteria at Ultralow Concentration. Nanoscale Res Lett. 2019;14:195. [PubMed] [DOI] [Cited in This Article: ] [Cited by in Crossref: 69] [Cited by in F6Publishing: 34] [Article Influence: 5.7] [Reference Citation Analysis (0)] |
20. | Companioni O, Bonet C, Muñoz X, Weiderpass E, Panico S, Tumino R, Palli D, Agnoli C, Vineis P, Boutron-Ruault MC, Racine A, Clavel-Chapelon F, Travis RC, Khaw KT, Riboli E, Murphy N, Vergnaud AC, Trichopoulou A, Benetou V, Trichopoulos D, Lund E, Johansen D, Lindkvist B, Johansson M, Sund M, Ardanaz E, Sánchez-Cantalejo E, Huerta JM, Dorronsoro M, Ramón Quirós J, Tjonneland A, Mortensen LM, Overvad K, Chang-Claude J, Rizzato C, Boeing H, Bueno-de-Mesquita HB, Siersema P, Peeters PH, Numans ME, Carneiro F, Licaj I, Freisling H, Sala N, González CA. Polymorphisms of Helicobacter pylori signaling pathway genes and gastric cancer risk in the European Prospective Investigation into Cancer-Eurgast cohort. Int J Cancer. 2014;134:92-101. [PubMed] [DOI] [Cited in This Article: ] [Cited by in Crossref: 31] [Cited by in F6Publishing: 33] [Article Influence: 3.0] [Reference Citation Analysis (0)] |
21. | Malfertheiner P, Sipponen P, Naumann M, Moayyedi P, Mégraud F, Xiao SD, Sugano K, Nyrén O; Lejondal H. pylori-Gastric Cancer Task Force. Helicobacter pylori eradication has the potential to prevent gastric cancer: a state-of-the-art critique. Am J Gastroenterol. 2005;100:2100-2115. [PubMed] [DOI] [Cited in This Article: ] [Cited by in Crossref: 143] [Cited by in F6Publishing: 146] [Article Influence: 7.3] [Reference Citation Analysis (0)] |
22. | Schulz C, Schütte K, Mayerle J, Malfertheiner P. The role of the gastric bacterial microbiome in gastric cancer: Helicobacter pylori and beyond. Therap Adv Gastroenterol. 2019;12:1756284819894062. [PubMed] [DOI] [Cited in This Article: ] [Cited by in Crossref: 37] [Cited by in F6Publishing: 52] [Article Influence: 8.7] [Reference Citation Analysis (0)] |
23. | Liang W, Yang Y, Wang H, Yu X, Lu Y, Shen S, Teng L. Gut microbiota shifts in patients with gastric cancer in perioperative period. Medicine (Baltimore). 2019;98:e16626. [PubMed] [DOI] [Cited in This Article: ] [Cited by in Crossref: 45] [Cited by in F6Publishing: 61] [Article Influence: 10.2] [Reference Citation Analysis (0)] |
24. | Matson V, Fessler J, Bao R, Chongsuwat T, Zha Y, Alegre ML, Luke JJ, Gajewski TF. The commensal microbiome is associated with anti-PD-1 efficacy in metastatic melanoma patients. Science. 2018;359:104-108. [PubMed] [DOI] [Cited in This Article: ] [Cited by in Crossref: 1390] [Cited by in F6Publishing: 1967] [Article Influence: 281.0] [Reference Citation Analysis (0)] |
25. | Sudhakar P, Machiels K, Verstockt B, Korcsmaros T, Vermeire S. Computational Biology and Machine Learning Approaches to Understand Mechanistic Microbiome-Host Interactions. Front Microbiol. 2021;12:618856. [PubMed] [DOI] [Cited in This Article: ] [Cited by in Crossref: 20] [Cited by in F6Publishing: 9] [Article Influence: 2.3] [Reference Citation Analysis (0)] |