1
|
Costa MVL, de Aguiar EJ, Rodrigues LS, Traina C, Traina AJM. DEELE-Rad: exploiting deep radiomics features in deep learning models using COVID-19 chest X-ray images. Health Inf Sci Syst 2025; 13:11. [PMID: 39741501 PMCID: PMC11683036 DOI: 10.1007/s13755-024-00330-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2024] [Accepted: 12/17/2024] [Indexed: 01/03/2025] Open
Abstract
Purpose Deep learning-based radiomics techniques have the potential to aid specialists and physicians in performing decision-making in COVID-19 scenarios. Specifically, a Deep Learning (DL) ensemble model is employed to classify medical images when addressing the diagnosis during the classification tasks for COVID-19 using chest X-ray images. It also provides feasible and reliable visual explicability concerning the results to support decision-making. Methods Our DEELE-Rad approach integrates DL and Machine Learning (ML) techniques. We use deep learning models to extract deep radiomics features and evaluate its performance regarding end-to-end classifiers. We avoid successive radiomics approach steps by employing these models with transfer learning techniques from ImageNet, such as VGG16, ResNet50V2, and DenseNet201 architectures. We extract 100 and 500 deep radiomics features from each DL model. We also placed these features into well-established ML classifiers and applied automatic parameter tuning and a cross-validation strategy. Besides, we exploit insights into the decision-making behavior by applying a visual explanation method. Results Experimental evaluation on our proposed approach achieved 89.97% AUC when using 500 deep radiomics features from the DenseNet201 end-to-end classifier. Besides, our ensemble DEELE-Rad method improves the results up to 96.19% AUC for the 500 dimensions. To outperform, ML DEELE-Rad reached the best results with an Accuracy of 98.39% and 99.19% AUC for the same setup. Our visual assessment employs additional possibilities for specialists and physicians to decision-making. Conclusion The results reflect that the DEELE-Rad approach provides robustness and confidence to the images' analysis. Our approach can benefit healthcare specialists when employed at clinical routines and respective decision-making procedures. For reproducibility, our code is available at https://github.com/usmarcv/deele-rad.
Collapse
Affiliation(s)
- Márcus V. L. Costa
- Institute of Mathematics and Computer Science, University of São Paulo, São Carlos, São Paulo 13566-590 Brazil
| | - Erikson J. de Aguiar
- Institute of Mathematics and Computer Science, University of São Paulo, São Carlos, São Paulo 13566-590 Brazil
| | - Lucas S. Rodrigues
- Institute of Mathematics and Computer Science, University of São Paulo, São Carlos, São Paulo 13566-590 Brazil
| | - Caetano Traina
- Institute of Mathematics and Computer Science, University of São Paulo, São Carlos, São Paulo 13566-590 Brazil
| | - Agma J. M. Traina
- Institute of Mathematics and Computer Science, University of São Paulo, São Carlos, São Paulo 13566-590 Brazil
| |
Collapse
|
2
|
Straub J, Estrada Lobato E, Paez D, Langs G, Prosch H. Artificial intelligence in respiratory pandemics-ready for disease X? A scoping review. Eur Radiol 2025; 35:1583-1593. [PMID: 39570367 PMCID: PMC11835992 DOI: 10.1007/s00330-024-11183-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2024] [Revised: 08/02/2024] [Accepted: 09/26/2024] [Indexed: 11/22/2024]
Abstract
OBJECTIVES This study aims to identify repeated previous shortcomings in medical imaging data collection, curation, and AI-based analysis during the early phase of respiratory pandemics. Based on the results, it seeks to highlight essential steps for improving future pandemic preparedness. MATERIALS AND METHODS We searched PubMed/MEDLINE, Scopus, and Cochrane Reviews for articles published from January 1, 2000, to December 31, 2021, using the terms "imaging" or "radiology" or "radiography" or "CT" or "x-ray" combined with "SARS," "MERS," "H1N1," or "COVID-19." WHO and CDC Databases were searched for case definitions. RESULTS Over the last 20 years, the world faced several international health emergencies caused by respiratory diseases such as SARS, MERS, H1N1, and COVID-19. During the same period, major technological advances enabled the analysis of vast amounts of imaging data and the continual development of artificial intelligence algorithms to support radiological diagnosis and prognosis. Timely availability of data proved critical, but so far, data collection attempts were initialized only as individual responses to each outbreak, leading to long delays and hampering unified guidelines and data-driven technology to support the management of pandemic outbreaks. Our findings highlight the multifaceted role of imaging in the early stages of SARS, MERS, H1N1, and COVID-19, and outline possible actions for advancing future pandemic preparedness. CONCLUSIONS Advancing international cooperation and action on these topics is essential to create a functional, effective, and rapid counteraction system to future respiratory pandemics exploiting state of the art imaging and artificial intelligence. KEY POINTS Question What has been the role of radiological data for diagnosis and prognosis in early respiratory pandemics and what challenges were present? Findings International cooperation is essential to developing an effective rapid response system for future respiratory pandemics using advanced imaging and artificial intelligence. Clinical relevance Strengthening global collaboration and leveraging cutting-edge imaging and artificial intelligence are crucial for developing rapid and effective response systems. This approach is essential for improving patient outcomes and managing future respiratory pandemics more effectively.
Collapse
Affiliation(s)
- Jennifer Straub
- Computational Imaging Research Lab, Department of Biomedical Imaging and Image-guided Therapy, Medical University of Vienna, 1090, Vienna, Austria
| | - Enrique Estrada Lobato
- Nuclear Medicine and Diagnostic Imaging Section, Division of Human Health, Department of Nuclear Sciences and Applications, International Atomic Energy Agency (IAEA), 1220, Vienna, Austria
| | - Diana Paez
- Nuclear Medicine and Diagnostic Imaging Section, Division of Human Health, Department of Nuclear Sciences and Applications, International Atomic Energy Agency (IAEA), 1220, Vienna, Austria
| | - Georg Langs
- Computational Imaging Research Lab, Department of Biomedical Imaging and Image-guided Therapy, Medical University of Vienna, 1090, Vienna, Austria.
- Christian Doppler Laboratory for Machine Learning Driven Precision Imaging, Department of Biomedical Imaging and Image-guided Therapy, Medical University of Vienna, 1090, Vienna, Austria.
| | - Helmut Prosch
- Christian Doppler Laboratory for Machine Learning Driven Precision Imaging, Department of Biomedical Imaging and Image-guided Therapy, Medical University of Vienna, 1090, Vienna, Austria
- Division of General and Paediatric Radiology, Department of Biomedical Imaging and Image-guided Therapy, Medical University of Vienna, 1090, Vienna, Austria
| |
Collapse
|
3
|
Yang L, Wan Y, Pan F. Enhancing Chest X-ray Diagnosis with a Multimodal Deep Learning Network by Integrating Clinical History to Refine Attention. JOURNAL OF IMAGING INFORMATICS IN MEDICINE 2025:10.1007/s10278-025-01446-1. [PMID: 39971817 DOI: 10.1007/s10278-025-01446-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/01/2024] [Revised: 01/26/2025] [Accepted: 02/10/2025] [Indexed: 02/21/2025]
Abstract
The rapid advancements of deep learning technology have revolutionized medical imaging diagnosis. However, training these models is often challenged by label imbalance and the scarcity of certain diseases. Most models fail to recognize multiple coexisting diseases, which are common in real-world clinical scenarios. Moreover, most radiological models rely solely on image data, which contrasts with radiologists' comprehensive approach, incorporating both images and other clinical information such as clinical history and laboratory results. In this study, we introduce a Multimodal Chest X-ray Network (MCX-Net) that integrates chest X-ray images and clinical history texts for multi-label disease diagnosis. This integration is achieved by combining a pretrained text encoder, a pretrained image encoder, and a pretrained image-text cross-modal encoder, fine-tuned on the public MIMIC-CXR-JPG dataset, to diagnose 13 diverse lung diseases on chest X-rays. As a result, MCX-Net achieved the highest macro AUROC of 0.816 on the test set, significantly outperforming unimodal baselines such as ViT-base and ResNet152, which scored 0.747 and 0.749, respectively (p < 0.001). This multimodal approach represents a substantial advancement over existing image-based deep-learning diagnostic systems for chest X-rays.
Collapse
Affiliation(s)
- Lian Yang
- Department of Radiology, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Jiefang Avenue 1277, Wuhan, 430022, China
- Hubei Province Key Laboratory of Molecular Imaging, Wuhan, 430000, China
| | - Yiliang Wan
- Neusoft Medical Systems Co., Ltd, Shenyang, 110167, China
| | - Feng Pan
- Department of Radiology, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Jiefang Avenue 1277, Wuhan, 430022, China.
- Hubei Province Key Laboratory of Molecular Imaging, Wuhan, 430000, China.
| |
Collapse
|
4
|
Misra UK. Specialty grand challenge in neuroinfectious diseases. Front Neurol 2025; 16:1557610. [PMID: 40027168 PMCID: PMC11867942 DOI: 10.3389/fneur.2025.1557610] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2025] [Accepted: 01/27/2025] [Indexed: 03/05/2025] Open
Affiliation(s)
- U. K. Misra
- T.S. Misra Medical College and Hospital, Apollo Medics Super Speciality Hospital and Vivekanand Polyclinic and Institute of Medical Sciences, Lucknow, India
| |
Collapse
|
5
|
Emah I, Bennett SJ. Algorithmic emergence? Epistemic in/justice in AI-directed transformations of healthcare. FRONTIERS IN SOCIOLOGY 2025; 10:1520810. [PMID: 39990252 PMCID: PMC11843219 DOI: 10.3389/fsoc.2025.1520810] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/31/2024] [Accepted: 01/16/2025] [Indexed: 02/25/2025]
Abstract
Moves toward integration of Artificial Intelligence (AI), particularly deep learning and generative AI-based technologies, into the domains of healthcare and public health have recently intensified, with a growing body of literature tackling the ethico-political implications of this. This paper considers the interwoven epistemic, sociopolitical and technical ramifications of healthcare-AI entanglements, examining how AI materialities shape emergence of particular modes of healthcare organization, governance and roles, and reflecting on how to embed participatory engagement within these entanglements. We discuss the implications of socio-technical entanglements between AI and Evidence-Based Medicine (EBM) for equitable development and governance of health AI. AI applications invariably center on the domains of medical knowledge and practice that are amenable to computational workings. This, in turn, intensifies the prioritization of these medical domains and furthers the assumptions which support the development of AI, a move which decontextualizes the qualitative nuances and complexities of healthcare while simultaneously advancing infrastructure to support these medical domains. We sketch the material and ideological reconfiguration of healthcare which is being shaped by the move toward embedding health AI assemblages in real-world contexts. We then consider the implications of this, how AI might be best employed in healthcare, and how to tackle the algorithmic injustices which become reproduced within health AI assemblages.
Collapse
Affiliation(s)
- Imo Emah
- Faculty of Health, Social Care and Medicine, School of Medicine, Edge Hill University, Ormskirk, United Kingdom
| | - SJ Bennett
- Department of Geography, Lower Mountjoy, Durham University, Durham, United Kingdom
| |
Collapse
|
6
|
Liu F, Li Z, Yin Q, Huang J, Luo J, Thakur A, Branson K, Schwab P, Yin B, Wu X, Zheng Y, Clifton DA. A multimodal multidomain multilingual medical foundation model for zero shot clinical diagnosis. NPJ Digit Med 2025; 8:86. [PMID: 39915635 PMCID: PMC11802893 DOI: 10.1038/s41746-024-01339-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2024] [Accepted: 11/11/2024] [Indexed: 02/09/2025] Open
Abstract
Radiology images are one of the most commonly used in daily clinical diagnosis. Typically, clinical diagnosis using radiology images involves disease reporting and classification, where the former is a multimodal task whereby textual reports are generated to describe clinical findings in images, as are common in various domains, e.g., chest X-ray or computed tomography. Existing approaches are mainly supervised, the quality of which heavily depends on the volume and quality of available labeled data. However, for rarer or more novel diseases, enrolling patients to collect data is both time-consuming and expensive. For non-English languages, sufficient quantities of labeled data are typically not available. We propose the Multimodal Multidomain Multilingual Foundation Model. It is useful for rare diseases and non-English languages, where the labeled data are frequently much more scarce, and may even be absent. Our approach achieves encouraging performances on nine datasets, including 2 infectious and 14 non-infectious diseases.
Collapse
Affiliation(s)
- Fenglin Liu
- Institute of Biomedical Engineering, Department of Engineering Science, University of Oxford, Oxford, UK.
| | | | | | - Jinfa Huang
- Department of Computer Science, University of Rochester, Rochester, NY, USA
| | - Jiebo Luo
- Department of Computer Science, University of Rochester, Rochester, NY, USA
| | - Anshul Thakur
- Institute of Biomedical Engineering, Department of Engineering Science, University of Oxford, Oxford, UK
| | | | | | | | - Xian Wu
- Jarvis Research Center, Tencent YouTu Lab, Beijing, China.
| | - Yefeng Zheng
- Medical Artificial Intelligence Laboratory, Westlake University, Hangzhou, China.
| | - David A Clifton
- Institute of Biomedical Engineering, Department of Engineering Science, University of Oxford, Oxford, UK.
- Oxford-Suzhou Centre for Advanced Research, Suzhou, China.
| |
Collapse
|
7
|
van Timmeren JE, Bussink J, Koopmans P, Smeenk RJ, Monshouwer R. Longitudinal Image Data for Outcome Modeling. Clin Oncol (R Coll Radiol) 2025; 38:103610. [PMID: 39003124 DOI: 10.1016/j.clon.2024.06.053] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2023] [Revised: 04/15/2024] [Accepted: 06/24/2024] [Indexed: 07/15/2024]
Abstract
In oncology, medical imaging is crucial for diagnosis, treatment planning and therapy execution. Treatment responses can be complex and varied and are known to involve factors of treatment, patient characteristics and tumor microenvironment. Longitudinal image analysis is able to track temporal changes, aiding in disease monitoring, treatment evaluation, and outcome prediction. This allows for the enhancement of personalized medicine. However, analyzing longitudinal 2D and 3D images presents unique challenges, including image registration, reliable segmentation, dealing with variable imaging intervals, and sparse data. This review presents an overview of techniques and methodologies in longitudinal image analysis, with a primary focus on outcome modeling in radiation oncology.
Collapse
Affiliation(s)
- J E van Timmeren
- Department of Radiation Oncology, Radboud University Medical Center, Nijmegen, the Netherlands.
| | - J Bussink
- Department of Radiation Oncology, Radboud University Medical Center, Nijmegen, the Netherlands.
| | - P Koopmans
- Department of Radiation Oncology, Radboud University Medical Center, Nijmegen, the Netherlands.
| | - R J Smeenk
- Department of Radiation Oncology, Radboud University Medical Center, Nijmegen, the Netherlands.
| | - R Monshouwer
- Department of Radiation Oncology, Radboud University Medical Center, Nijmegen, the Netherlands.
| |
Collapse
|
8
|
Li W, Zhang Y, Zhou H, Yang W, Xie Z, He Y. CLMS: Bridging domain gaps in medical imaging segmentation with source-free continual learning for robust knowledge transfer and adaptation. Med Image Anal 2025; 100:103404. [PMID: 39616943 DOI: 10.1016/j.media.2024.103404] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2024] [Revised: 10/01/2024] [Accepted: 11/19/2024] [Indexed: 12/16/2024]
Abstract
Deep learning shows promise for medical image segmentation but suffers performance declines when applied to diverse healthcare sites due to data discrepancies among the different sites. Translating deep learning models to new clinical environments is challenging, especially when the original source data used for training is unavailable due to privacy restrictions. Source-free domain adaptation (SFDA) aims to adapt models to new unlabeled target domains without requiring access to the original source data. However, existing SFDA methods face challenges such as error propagation, misalignment of visual and structural features, and inability to preserve source knowledge. This paper introduces Continual Learning Multi-Scale domain adaptation (CLMS), an end-to-end SFDA framework integrating multi-scale reconstruction, continual learning, and style alignment to bridge domain gaps across medical sites using only unlabeled target data or publicly available data. Compared to the current state-of-the-art methods, CLMS consistently and significantly achieved top performance for different tasks, including prostate MRI segmentation (improved Dice of 10.87 %), colonoscopy polyp segmentation (improved Dice of 17.73 %), and plus disease classification from retinal images (improved AUC of 11.19 %). Crucially, CLMS preserved source knowledge for all the tasks, avoiding catastrophic forgetting. CLMS demonstrates a promising solution for translating deep learning models to new clinical imaging domains towards safe, reliable deployment across diverse healthcare settings.
Collapse
Affiliation(s)
- Weilu Li
- State Key Laboratory of Ophthalmology, Guangdong Provincial Key Laboratory of Ophthalmology and Visual Science, Zhongshan Ophthalmic Center, Sun Yat-sen University, Guangzhou, China
| | - Yun Zhang
- State Key Laboratory of Ophthalmology, Guangdong Provincial Key Laboratory of Ophthalmology and Visual Science, Zhongshan Ophthalmic Center, Sun Yat-sen University, Guangzhou, China
| | - Hao Zhou
- State Key Laboratory of Ophthalmology, Guangdong Provincial Key Laboratory of Ophthalmology and Visual Science, Zhongshan Ophthalmic Center, Sun Yat-sen University, Guangzhou, China
| | - Wenhan Yang
- State Key Laboratory of Ophthalmology, Guangdong Provincial Key Laboratory of Ophthalmology and Visual Science, Zhongshan Ophthalmic Center, Sun Yat-sen University, Guangzhou, China
| | - Zhi Xie
- State Key Laboratory of Ophthalmology, Guangdong Provincial Key Laboratory of Ophthalmology and Visual Science, Zhongshan Ophthalmic Center, Sun Yat-sen University, Guangzhou, China.
| | - Yao He
- State Key Laboratory of Ophthalmology, Guangdong Provincial Key Laboratory of Ophthalmology and Visual Science, Zhongshan Ophthalmic Center, Sun Yat-sen University, Guangzhou, China.
| |
Collapse
|
9
|
Jeon K, Park WY, Kahn CE, Nagy P, You SC, Yoon SH. Advancing Medical Imaging Research Through Standardization: The Path to Rapid Development, Rigorous Validation, and Robust Reproducibility. Invest Radiol 2025; 60:1-10. [PMID: 38985896 DOI: 10.1097/rli.0000000000001106] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/12/2024]
Abstract
ABSTRACT Artificial intelligence (AI) has made significant advances in radiology. Nonetheless, challenges in AI development, validation, and reproducibility persist, primarily due to the lack of high-quality, large-scale, standardized data across the world. Addressing these challenges requires comprehensive standardization of medical imaging data and seamless integration with structured medical data.Developed by the Observational Health Data Sciences and Informatics community, the OMOP Common Data Model enables large-scale international collaborations with structured medical data. It ensures syntactic and semantic interoperability, while supporting the privacy-protected distribution of research across borders. The recently proposed Medical Imaging Common Data Model is designed to encompass all DICOM-formatted medical imaging data and integrate imaging-derived features with clinical data, ensuring their provenance.The harmonization of medical imaging data and its seamless integration with structured clinical data at a global scale will pave the way for advanced AI research in radiology. This standardization will enable federated learning, ensuring privacy-preserving collaboration across institutions and promoting equitable AI through the inclusion of diverse patient populations. Moreover, it will facilitate the development of foundation models trained on large-scale, multimodal datasets, serving as powerful starting points for specialized AI applications. Objective and transparent algorithm validation on a standardized data infrastructure will enhance reproducibility and interoperability of AI systems, driving innovation and reliability in clinical applications.
Collapse
Affiliation(s)
- Kyulee Jeon
- From the Department of Biomedical Systems Informatics, Yonsei University, Seoul, South Korea (K.J., S.C.Y.); Institution for Innovation in Digital Healthcare, Yonsei University, Seoul, South Korea (K.J., S.C.Y.); Biomedical Informatics and Data Science, Johns Hopkins University, Baltimore, MD (W.Y.P., P.N.); Department of Radiology, University of Pennsylvania, Philadelphia, PA (C.E.K.); and Department of Radiology, Seoul National University College of Medicine, Seoul National University Hospital, Seoul, South Korea (S.H.Y.)
| | | | | | | | | | | |
Collapse
|
10
|
Rauscher FG, Bernardes R. Retinal OCT biomarkers and their association with cognitive function-clinical and AI approaches. DIE OPHTHALMOLOGIE 2025; 122:20-28. [PMID: 38381373 DOI: 10.1007/s00347-024-01988-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Accepted: 01/17/2024] [Indexed: 02/22/2024]
Abstract
Retinal optical coherence tomography (OCT) biomarkers have the potential to serve as early, noninvasive, and cost-effective markers for identifying individuals at risk for cognitive impairments and neurodegenerative diseases. They may also aid in monitoring disease progression and evaluating the effectiveness of interventions targeting cognitive decline. The association between retinal OCT biomarkers and cognitive performance has been demonstrated in several studies, and their importance in cognitive assessment is increasingly being recognized. Machine learning (ML) is a branch of artificial intelligence (AI) with an exponential number of applications in the medical field, particularly its deep learning (DL) subset, which is widely used for the analysis of medical images. These techniques efficiently deal with novel biomarkers when their outcome for the applications of interest is unclear, e.g., for diagnosis, prognosis prediction, disease staging, or any other relevance to clinical practice. However, using AI-based tools for medical purposes must be approached with caution, despite the many efforts to address the black-box nature of such approaches, especially due to the general underperformance in datasets other than those used for their development. Retinal OCT biomarkers are promising as potential indicators for decline in cognitive function. The underlying mechanisms are currently being explored to gain deeper insights into this relationship linking retinal health and cognitive function. Insights from neurovascular coupling and retinal microvascular changes play an important role. Further research is needed to establish the validity and utility of retinal OCT biomarkers as early indicators of cognitive decline and neurodegenerative diseases in routine clinical practice. Retinal OCT biomarkers could then provide a new avenue for early detection, monitoring and intervention in cognitive impairment with the potential to improve patient care and outcomes.
Collapse
Affiliation(s)
- Franziska G Rauscher
- Leipzig Research Centre for Civilisation Diseases (LIFE), Leipzig University, Leipzig, Germany.
- Institute for Medical Informatics, Statistics, and Epidemiology, Leipzig University, Haertelstraße 16-18, 04107, Leipzig, Germany.
- Centre for Medical Informatics - Department of Medical Data Science, Leipzig University Medical Center, Leipzig, Germany.
| | - Rui Bernardes
- Coimbra Institute for Biomedical Imaging and Translational Research (CIBIT), Institute for Nuclear Sciences Applied to Health (ICNAS), University of Coimbra, Azinhaga de Santa Comba, 3000-548, Coimbra, Portugal
- Clinical Academic Center of Coimbra (CACC), Faculty of Medicine (FMUC), University of Coimbra, Azinhaga de Santa Comba, 3000-548, Coimbra, Portugal
| |
Collapse
|
11
|
Galanty M, Luitse D, Noteboom SH, Croon P, Vlaar AP, Poell T, Sanchez CI, Blanke T, Išgum I. Assessing the documentation of publicly available medical image and signal datasets and their impact on bias using the BEAMRAD tool. Sci Rep 2024; 14:31846. [PMID: 39738436 DOI: 10.1038/s41598-024-83218-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2024] [Accepted: 12/12/2024] [Indexed: 01/02/2025] Open
Abstract
Medical datasets are vital for advancing Artificial Intelligence (AI) in healthcare. Yet biases in these datasets on which deep-learning models are trained can compromise reliability. This study investigates biases stemming from dataset-creation practices. Drawing on existing guidelines, we first developed a BEAMRAD tool to assess the documentation of public Magnetic Resonance Imaging (MRI); Color Fundus Photography (CFP), and Electrocardiogram (ECG) datasets. In doing so, we provide an overview of the biases that may emerge due to inadequate dataset documentation. Second, we examine the current state of documentation for public medical images and signal data. Our research reveals that there is substantial variance in the documentation of image and signal datasets, even though guidelines have been developed in medical imaging. This indicates that dataset documentation is subject to individual discretionary decisions. Furthermore, we find that aspects such as hardware and data acquisition details are commonly documented, while information regarding data annotation practices, annotation error quantification, or data limitations are not consistently reported. This risks having considerable implications for the abilities of data users to detect potential sources of bias through these respective aspects and develop reliable and robust models that can be adapted for clinical practice.
Collapse
Affiliation(s)
- Maria Galanty
- Informatics Institute, University of Amsterdam, Amsterdam, The Netherlands.
- Department of Biomedical Engineering and Physics, Amsterdam UMC location University of Amsterdam, Amsterdam, The Netherlands.
| | - Dieuwertje Luitse
- Department of Media Studies, Faculty of Humanities, University of Amsterdam, Amsterdam, The Netherlands
| | - Sijm H Noteboom
- Department of Intensive Care, Amsterdam UMC, University of Amsterdam, Amsterdam, The Netherlands
| | - Philip Croon
- Department of Cardiology, Amsterdam UMC location University of Amsterdam, Amsterdam, The Netherlands
- Section of Cardiovascular Medicine, Department of Internal Medicine, Yale School of Medicine, New Haven, CT, United States
| | - Alexander P Vlaar
- Department of Intensive Care, Amsterdam UMC, University of Amsterdam, Amsterdam, The Netherlands
| | - Thomas Poell
- Department of Media Studies, Faculty of Humanities, University of Amsterdam, Amsterdam, The Netherlands
| | - Clara I Sanchez
- Informatics Institute, University of Amsterdam, Amsterdam, The Netherlands
- Department of Biomedical Engineering and Physics, Amsterdam UMC location University of Amsterdam, Amsterdam, The Netherlands
| | - Tobias Blanke
- Department of Media Studies, Faculty of Humanities, University of Amsterdam, Amsterdam, The Netherlands
| | - Ivana Išgum
- Informatics Institute, University of Amsterdam, Amsterdam, The Netherlands
- Department of Biomedical Engineering and Physics, Amsterdam UMC location University of Amsterdam, Amsterdam, The Netherlands
- Department of Radiology and Nuclear Medicine, Amsterdam UMC location University of Amsterdam, Amsterdam, The Netherlands
| |
Collapse
|
12
|
Hoang TH, Fuhrman J, Klarqvist M, Li M, Chaturvedi P, Li Z, Kim K, Ryu M, Chard R, Huerta E, Giger M, Madduri R. Enabling end-to-end secure federated learning in biomedical research on heterogeneous computing environments with APPFLx. Comput Struct Biotechnol J 2024; 28:29-39. [PMID: 39896264 PMCID: PMC11782895 DOI: 10.1016/j.csbj.2024.12.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2024] [Revised: 10/18/2024] [Accepted: 12/01/2024] [Indexed: 02/04/2025] Open
Abstract
Facilitating large-scale, cross-institutional collaboration in biomedical machine learning (ML) projects requires a trustworthy and resilient federated learning (FL) environment to ensure that sensitive information such as protected health information is kept confidential. Specifically designed for this purpose, this work introduces APPFLx - a low-code, easy-to-use FL framework that enables easy setup, configuration, and running of FL experiments. APPFLx removes administrative boundaries of research organizations and healthcare systems while providing secure end-to-end communication, privacy-preserving functionality, and identity management. Furthermore, it is completely agnostic to the underlying computational infrastructure of participating clients, allowing an instantaneous deployment of this framework into existing computing infrastructures. Experimentally, the utility of APPFLx is demonstrated in two case studies: (1) predicting participant age from electrocardiogram (ECG) waveforms, and (2) detecting COVID-19 disease from chest radiographs. Here, ML models were securely trained across heterogeneous computing resources, including a combination of on-premise high-performance computing and cloud computing facilities. By securely unlocking data from multiple sources for training without directly sharing it, these FL models enhance generalizability and performance compared to centralized training models while ensuring data remains protected. In conclusion, APPFLx demonstrated itself as an easy-to-use framework for accelerating biomedical studies across organizations and healthcare systems on large datasets while maintaining the protection of private medical data.
Collapse
Affiliation(s)
- Trung-Hieu Hoang
- Department of Electrical and Computer Engineering and Coordinated Science Laboratory, University of Illinois at Urbana-Champaign, Urbana, 61801, IL, USA
| | | | | | - Miao Li
- Data Science and Learning Division, Argonne National Laboratory, Lemont, IL, USA
- School of Industrial and Systems Engineering Georgia Institute of Technology, Atlanta, GA, USA
| | - Pranshu Chaturvedi
- Data Science and Learning Division, Argonne National Laboratory, Lemont, IL, USA
- Department of Computer Science, University of Illinois at Urbana-Champaign, Street, Urbana, 61801, IL, USA
| | - Zilinghan Li
- Data Science and Learning Division, Argonne National Laboratory, Lemont, IL, USA
- Department of Computer Science, University of Illinois at Urbana-Champaign, Street, Urbana, 61801, IL, USA
| | - Kibaek Kim
- Data Science and Learning Division, Argonne National Laboratory, Lemont, IL, USA
| | - Minseok Ryu
- School of Computing and Augmented Intelligence Arizona State University, Tempe, AZ, USA
| | - Ryan Chard
- Data Science and Learning Division, Argonne National Laboratory, Lemont, IL, USA
| | - E.A. Huerta
- Data Science and Learning Division, Argonne National Laboratory, Lemont, IL, USA
| | | | - Ravi Madduri
- Data Science and Learning Division, Argonne National Laboratory, Lemont, IL, USA
| |
Collapse
|
13
|
Maertens A, Brykman S, Hartung T, Gafita A, Bai H, Hoelzer D, Skoudis E, Paller CJ. Navigating the unseen peril: safeguarding medical imaging in the age of AI. Front Artif Intell 2024; 7:1400732. [PMID: 39717840 PMCID: PMC11665297 DOI: 10.3389/frai.2024.1400732] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2024] [Accepted: 11/13/2024] [Indexed: 12/25/2024] Open
Abstract
In response to the increasing significance of artificial intelligence (AI) in healthcare, there has been increased attention - including a Presidential executive order to create an AI Safety Institute - to the potential threats posed by AI. While much attention has been given to the conventional risks AI poses to cybersecurity, and critical infrastructure, here we provide an overview of some unique challenges of AI for the medical community. Above and beyond obvious concerns about vetting algorithms that impact patient care, there are additional subtle yet equally important things to consider: the potential harm AI poses to its own integrity and the broader medical information ecosystem. Recognizing the role of healthcare professionals as both consumers and contributors to AI training data, this article advocates for a proactive approach in understanding and shaping the data that underpins AI systems, emphasizing the need for informed engagement to maximize the benefits of AI while mitigating the risks.
Collapse
Affiliation(s)
- Alexandra Maertens
- Center for Alternatives to Animal Testing (CAAT), Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, United States
| | - Steve Brykman
- Independent Creative Technologist, Boston, MA, United States
| | - Thomas Hartung
- Center for Alternatives to Animal Testing (CAAT), Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, United States
- CAAT Europe, University of Konstanz, Konstanz, Germany
| | - Andrei Gafita
- Radiology and Radiological Science, Johns Hopkins University School of Medicine, Baltimore, MD, United States
| | - Harrison Bai
- Radiology and Radiological Science, Johns Hopkins University School of Medicine, Baltimore, MD, United States
| | - David Hoelzer
- SANS Technology Institute, Rockville, MD, United States
| | - Ed Skoudis
- SANS Technology Institute, Rockville, MD, United States
| | - Channing Judith Paller
- Department of Oncology, Johns Hopkins University School of Medicine, Baltimore, MD, United States
| |
Collapse
|
14
|
Garrobé Fonollosa L, Webber T, Brotons JM, Cerdà M, Gillespie D, Pirotta E, Rendell L. Comparing neural networks against click train detectors to reveal temporal trends in passive acoustic sperm whale detections. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2024; 156:4073-4084. [PMID: 39692862 DOI: 10.1121/10.0034602] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/10/2024] [Accepted: 11/18/2024] [Indexed: 12/19/2024]
Abstract
Passive acoustic monitoring (PAM) is an increasingly popular tool to study vocalising species. The amount of data generated by PAM studies calls for robust automatic classifiers. Deep learning (DL) techniques have been proven effective in identifying acoustic signals in challenging datasets, but due to their black-box nature their underlying biases are hard to quantify. This study compares human analyst annotations, a multi-hypothesis tracking (MHT) click train classifier and a DL-based acoustic classifier to classify acoustic recordings based on the presence or absence of sperm whale (Physeter macrocephalus) click trains and study the temporal and spatial distributions of the Mediterranean sperm whale subpopulation around the Balearic Islands. The MHT and DL classifiers showed agreements with human labels of 85.7% and 85.0%, respectively, on data from sites they were trained on, but both saw a drop in performance when deployed on a new site. Agreement rates between classifiers surpassed those between human experts. Modeled seasonal and diel variations in sperm whale detections for both classifiers showed compatible results, revealing an increase in occurrence and diurnal activity during the summer and autumn months. This study highlights the strengths and limitations of two automatic classification algorithms to extract biologically useful information from large acoustic datasets.
Collapse
Affiliation(s)
- Laia Garrobé Fonollosa
- Sea Mammal Research Unit, School of Biology, University of St Andrews, KY16 9TH, St Andrews, United Kingdom
| | - Thomas Webber
- The Scottish Association for Marine Science (SAMS), Oban, PA37 1QA, Scotland, United Kingdom
| | | | - Margalida Cerdà
- Asociación Tursiops, Palma de Mallorca, Balearic Islands, Spain
| | - Douglas Gillespie
- Sea Mammal Research Unit, School of Biology, University of St Andrews, KY16 9TH, St Andrews, United Kingdom
| | - Enrico Pirotta
- Centre for Research into Ecological and Environmental Modelling, University of St Andrews, St Andrews, United Kingdom
| | - Luke Rendell
- Sea Mammal Research Unit, School of Biology, University of St Andrews, KY16 9TH, St Andrews, United Kingdom
| |
Collapse
|
15
|
Chau M. Ethical, legal, and regulatory landscape of artificial intelligence in Australian healthcare and ethical integration in radiography: A narrative review. J Med Imaging Radiat Sci 2024; 55:101733. [PMID: 39111223 DOI: 10.1016/j.jmir.2024.101733] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2024] [Revised: 07/15/2024] [Accepted: 07/16/2024] [Indexed: 12/02/2024]
Abstract
This narrative review explores the ethical, legal, and regulatory landscape of AI integration in Australian healthcare, focusing on radiography. It examines the current legislative framework, assesses the trust and reliability of AI tools, and proposes future directions for ethical AI integration in radiography. AI systems significantly enhance diagnostic radiography by improving diagnostic accuracy and efficiency in stroke detection, brain imaging, and chest reporting. However, AI raises substantial ethical concerns due to its 'black-box' nature and potential biases in training data. The Therapeutic Goods Administration's reforms in Australia, though comprehensive, fall short of fully addressing issues related to the trustworthiness and legal liabilities of AI tools. Adopting a comprehensive research strategy that includes doctrinal, comparative, and public policy analyses will facilitate an understanding of international practices, particularly from countries with similar legal systems, and help guide Australia in refining its regulatory framework. For an ethical future in radiography, a robust, multi-disciplinary approach is required to prioritize patient safety, data privacy, and equitable AI use. A framework that balances technological innovation with ethical and legal integrity is essential for advancing healthcare while preserving trust and transparency. Healthcare professionals, policymakers, and AI developers must collaborate to establish a resilient, equitable, and transparent healthcare system. Future research should focus on multi-disciplinary methodologies, combining doctrinal, comparative, and public policy research to provide comprehensive insights. This approach will guide Australia in creating a more inclusive and ethically sound legal framework for AI in healthcare, ensuring its ethical and beneficial integration into radiography.
Collapse
Affiliation(s)
- Minh Chau
- Faculty of Science and Health, Charles Sturt University, Level 5, 250 Boorooma St, Charles Sturt University NSW 2678, Australia; South Australia Medical Imaging, Flinders Medical Centre, 1 Flinders Drive, Bedford Park, SA 5042, Australia.
| |
Collapse
|
16
|
Mosquera C, Ferrer L, Milone DH, Luna D, Ferrante E. Class imbalance on medical image classification: towards better evaluation practices for discrimination and calibration performance. Eur Radiol 2024; 34:7895-7903. [PMID: 38861161 DOI: 10.1007/s00330-024-10834-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2024] [Revised: 04/08/2024] [Accepted: 04/17/2024] [Indexed: 06/12/2024]
Abstract
PURPOSE This work aims to assess standard evaluation practices used by the research community for evaluating medical imaging classifiers, with a specific focus on the implications of class imbalance. The analysis is performed on chest X-rays as a case study and encompasses a comprehensive model performance definition, considering both discriminative capabilities and model calibration. MATERIALS AND METHODS We conduct a concise literature review to examine prevailing scientific practices used when evaluating X-ray classifiers. Then, we perform a systematic experiment on two major chest X-ray datasets to showcase a didactic example of the behavior of several performance metrics under different class ratios and highlight how widely adopted metrics can conceal performance in the minority class. RESULTS Our literature study confirms that: (1) even when dealing with highly imbalanced datasets, the community tends to use metrics that are dominated by the majority class; and (2) it is still uncommon to include calibration studies for chest X-ray classifiers, albeit its importance in the context of healthcare. Moreover, our systematic experiments confirm that current evaluation practices may not reflect model performance in real clinical scenarios and suggest complementary metrics to better reflect the performance of the system in such scenarios. CONCLUSION Our analysis underscores the need for enhanced evaluation practices, particularly in the context of class-imbalanced chest X-ray classifiers. We recommend the inclusion of complementary metrics such as the area under the precision-recall curve (AUC-PR), adjusted AUC-PR, and balanced Brier score, to offer a more accurate depiction of system performance in real clinical scenarios, considering metrics that reflect both, discrimination and calibration performance. CLINICAL RELEVANCE STATEMENT This study underscores the critical need for refined evaluation metrics in medical imaging classifiers, emphasizing that prevalent metrics may mask poor performance in minority classes, potentially impacting clinical diagnoses and healthcare outcomes. KEY POINTS Common scientific practices in papers dealing with X-ray computer-assisted diagnosis (CAD) systems may be misleading. We highlight limitations in reporting of evaluation metrics for X-ray CAD systems in highly imbalanced scenarios. We propose adopting alternative metrics based on experimental evaluation on large-scale datasets.
Collapse
Affiliation(s)
- Candelaria Mosquera
- Hospital Italiano de Buenos Aires, Buenos Aires, Argentina.
- Universidad Tecnológica Nacional, Buenos Aires, Argentina.
| | - Luciana Ferrer
- Instituto de Ciencias de la Computación, UBA-CONICET, Buenos Aires, Argentina
| | - Diego H Milone
- Institute for Signals, Systems, and Computational Intelligence, sinc(i) CONICET-UNL, Santa Fe, Argentina
| | - Daniel Luna
- Hospital Italiano de Buenos Aires, Buenos Aires, Argentina
| | - Enzo Ferrante
- Institute for Signals, Systems, and Computational Intelligence, sinc(i) CONICET-UNL, Santa Fe, Argentina.
| |
Collapse
|
17
|
Lepoittevin M, Remaury QB, Lévêque N, Thille AW, Brunet T, Salaun K, Catroux M, Pellerin L, Hauet T, Thuillier R. Advantages of Metabolomics-Based Multivariate Machine Learning to Predict Disease Severity: Example of COVID. Int J Mol Sci 2024; 25:12199. [PMID: 39596265 PMCID: PMC11594300 DOI: 10.3390/ijms252212199] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2024] [Revised: 11/07/2024] [Accepted: 11/08/2024] [Indexed: 11/28/2024] Open
Abstract
The COVID-19 outbreak caused saturations of hospitals, highlighting the importance of early patient triage to optimize resource prioritization. Herein, our objective was to test if high definition metabolomics, combined with ML, can improve prognostication and triage performance over standard clinical parameters using COVID infection as an example. Using high resolution mass spectrometry, we obtained metabolomics profiles of patients and combined them with clinical parameters to design machine learning (ML) algorithms predicting severity (herein determined as the need for mechanical ventilation during patient care). A total of 64 PCR-positive COVID patients at the Poitiers CHU were recruited. Clinical and metabolomics investigations were conducted 8 days after the onset of symptoms. We show that standard clinical parameters could predict severity with good performance (AUC of the ROC curve: 0.85), using SpO2, first respiratory rate, Horowitz quotient and age as the most important variables. However, the performance of the prediction was substantially improved by the use of metabolomics (AUC = 0.92). Our small-scale study demonstrates that metabolomics can improve the performance of diagnosis and prognosis algorithms, and thus be a key player in the future discovery of new biological signals. This technique is easily deployable in the clinic, and combined with machine learning, it can help design the mathematical models needed to advance towards personalized medicine.
Collapse
Affiliation(s)
- Maryne Lepoittevin
- Inserm Unit Ischémie Reperfusion, Métabolisme et Inflammation Stérile en Transplantation (IRMETIST), UMR U1313, F-86073 Poitiers, France; (M.L.); (L.P.); (T.H.)
- Faculty of Medicine and Pharmacy, University of Poitiers, F-86073 Poitiers, France
| | - Quentin Blancart Remaury
- UMR CNRS 7285, Institut de Chimie des Milieux et Matériaux de Poitiers (IC2MP), University of Poitiers, 4 rue Michel-Brunet, TSA 51106, F-86073 Poitiers cedex 9, France;
| | - Nicolas Lévêque
- LITEC, CHU de Poitiers, Laboratoire de Virologie et Mycobactériologie, Université de Poitiers, 2 r Milétrie, F-86000 Poitiers, France;
| | - Arnaud W. Thille
- Intensive Care Medicine Department, CHU Poitiers, F-86021 Poitiers, France; (A.W.T.); (K.S.)
| | - Thomas Brunet
- Geriatric Medicine Department, CHU Poitiers, F-86021 Poitiers, France;
| | - Karine Salaun
- Intensive Care Medicine Department, CHU Poitiers, F-86021 Poitiers, France; (A.W.T.); (K.S.)
| | - Mélanie Catroux
- Internal Medicine and Infectious Disease Department, CHU Poitiers, F-86021 Poitiers, France;
| | - Luc Pellerin
- Inserm Unit Ischémie Reperfusion, Métabolisme et Inflammation Stérile en Transplantation (IRMETIST), UMR U1313, F-86073 Poitiers, France; (M.L.); (L.P.); (T.H.)
- Faculty of Medicine and Pharmacy, University of Poitiers, F-86073 Poitiers, France
- Biochemistry Department, CHU Poitiers, F-86021 Poitiers, France
| | - Thierry Hauet
- Inserm Unit Ischémie Reperfusion, Métabolisme et Inflammation Stérile en Transplantation (IRMETIST), UMR U1313, F-86073 Poitiers, France; (M.L.); (L.P.); (T.H.)
- Faculty of Medicine and Pharmacy, University of Poitiers, F-86073 Poitiers, France
- Biochemistry Department, CHU Poitiers, F-86021 Poitiers, France
| | - Raphael Thuillier
- Inserm Unit Ischémie Reperfusion, Métabolisme et Inflammation Stérile en Transplantation (IRMETIST), UMR U1313, F-86073 Poitiers, France; (M.L.); (L.P.); (T.H.)
- Faculty of Medicine and Pharmacy, University of Poitiers, F-86073 Poitiers, France
- Biochemistry Department, CHU Poitiers, F-86021 Poitiers, France
| |
Collapse
|
18
|
Ahmed S, Raza B, Hussain L, Sadiq T, Dutta AK. Enhancing multiclass COVID-19 prediction with ESN-MDFS: Extreme smart network using mean dropout feature selection technique. PLoS One 2024; 19:e0310011. [PMID: 39531465 PMCID: PMC11556731 DOI: 10.1371/journal.pone.0310011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2024] [Accepted: 08/22/2024] [Indexed: 11/16/2024] Open
Abstract
Deep learning and artificial intelligence offer promising tools for improving the accuracy and efficiency of diagnosing various lung conditions using portable chest x-rays (CXRs). This study explores this potential by leveraging a large dataset containing over 6,000 CXR images from publicly available sources. These images encompass COVID-19 cases, normal cases, and patients with viral or bacterial pneumonia. The research proposes a novel approach called "Enhancing COVID Prediction with ESN-MDFS" that utilizes a combination of an Extreme Smart Network (ESN) and a Mean Dropout Feature Selection Technique (MDFS). This study aimed to enhance multi-class lung condition detection in portable chest X-rays by combining static texture features with dynamic deep learning features extracted from a pre-trained VGG-16 model. To optimize performance, preprocessing, data imbalance, and hyperparameter tuning were meticulously addressed. The proposed ESN-MDFS model achieved a peak accuracy of 96.18% with an AUC of 1.00 in a six-fold cross-validation. Our findings demonstrate the model's superior ability to differentiate between COVID-19, bacterial pneumonia, viral pneumonia, and normal conditions, promising significant advancements in diagnostic accuracy and efficiency.
Collapse
Affiliation(s)
- Saghir Ahmed
- Department of Computer Science, COMSATS University, Islamabad Capital Territory, Islamabad, Pakistan
| | - Basit Raza
- Department of Computer Science, COMSATS University, Islamabad Capital Territory, Islamabad, Pakistan
| | - Lal Hussain
- Department of Computer Science & IT, Neelum Campus, The University of Azad Jammu and Kashmir, Athmuqam, Azad Kashmir, Pakistan
- Department of Computer Science & IT, King Abdullah Campus, The University of Azad Jammu and Kashmir, Muzaffarabad, Azad Kashmir, Pakistan
| | - Touseef Sadiq
- Department of Information and Communication Technology, Centre for Artificial Intelligence Research (CAIR), University of Agder, Grimstad, Norway
| | - Ashit Kumar Dutta
- Department of Computer Science and Information Systems, College of Applied Sciences, AlMaarefa University, Ad Diriyah, Riyadh, Kingdom of Saudi Arabia
| |
Collapse
|
19
|
Marek S, Laumann TO. Replicability and generalizability in population psychiatric neuroimaging. Neuropsychopharmacology 2024; 50:52-57. [PMID: 39215207 PMCID: PMC11526127 DOI: 10.1038/s41386-024-01960-w] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/25/2024] [Revised: 07/19/2024] [Accepted: 07/29/2024] [Indexed: 09/04/2024]
Abstract
Studies linking mental health with brain function in cross-sectional population-based association studies have historically relied on small, underpowered samples. Given the small effect sizes typical of such brain-wide associations, studies require samples into the thousands to achieve the statistical power necessary for replicability. Here, we detail how small sample sizes have hampered replicability and provide sample size targets given established association strength benchmarks. Critically, while replicability will improve with larger samples, it is not guaranteed that observed effects will meaningfully apply to target populations of interest (i.e., be generalizable). We discuss important considerations related to generalizability in psychiatric neuroimaging and provide an example of generalizability failure due to "shortcut learning" in brain-based predictions of mental health phenotypes. Shortcut learning is a phenomenon whereby machine learning models learn an association between the brain and an unmeasured construct (the shortcut), rather than the intended target of mental health. Given the complex nature of brain-behavior interactions, the future of epidemiological approaches to brain-based studies of mental health will require large, diverse samples with comprehensive assessment.
Collapse
Affiliation(s)
- Scott Marek
- Mallinckrodt Institute of Radiology, Washington University in St. Louis School of Medicine, St. Louis, MO, USA.
- Department of Psychiatry, Washington University in St. Louis School of Medicine, St. Louis, MO, USA.
- Neuroimaging Labs Research Center, Washington University in St. Louis School of Medicine, St. Louis, MO, USA.
- AI Institute for Health, Washington University in St. Louis School of Medicine, St. Louis, MO, USA.
| | - Timothy O Laumann
- Department of Psychiatry, Washington University in St. Louis School of Medicine, St. Louis, MO, USA
- Neuroimaging Labs Research Center, Washington University in St. Louis School of Medicine, St. Louis, MO, USA
| |
Collapse
|
20
|
Williams MC, Weir-McCall JR, Baldassarre LA, De Cecco CN, Choi AD, Dey D, Dweck MR, Isgum I, Kolossvary M, Leipsic J, Lin A, Lu MT, Motwani M, Nieman K, Shaw L, van Assen M, Nicol E. Artificial Intelligence and Machine Learning for Cardiovascular Computed Tomography (CCT): A White Paper of the Society of Cardiovascular Computed Tomography (SCCT). J Cardiovasc Comput Tomogr 2024; 18:519-532. [PMID: 39214777 DOI: 10.1016/j.jcct.2024.08.003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/29/2023] [Revised: 08/03/2024] [Accepted: 08/05/2024] [Indexed: 09/04/2024]
Affiliation(s)
| | | | - Lauren A Baldassarre
- Section of Cardiovascular Medicine and Department of Radiology and Biomedical Imaging, Yale School of Medicine, New Haven, CT, USA
| | | | - Andrew D Choi
- The George Washington University School of Medicine, Washington, USA
| | - Damini Dey
- Biomedical Imaging Research Institute, Cedars-Sinai Medical Center, Los Angeles, USA
| | - Marc R Dweck
- Centre for Cardiovascular Science, University of Edinburgh, Edinburgh, UK
| | - Ivana Isgum
- Amsterdam University Medical Center, University of Amsterdam, Netherlands
| | - Márton Kolossvary
- Gottsegen National Cardiovascular Center, Budapest, Hungary, and Physiological Controls Research Center, University Research and Innovation Center, Óbuda University, Budapest, Hungary
| | | | - Andrew Lin
- Victorian Heart Institute and Monash Health Heart, Victorian Heart Hospital, Monash University, Australia
| | - Michael T Lu
- Massachusetts General Hospital Cardiovascular Imaging Research Center/Harvard Medical School, USA
| | | | | | - Leslee Shaw
- Icahn School of Medicine at Mount Sinai, New York, USA
| | | | - Edward Nicol
- Royal Brompton Hospital, Guys and St Thomas' NHS Foundation Trust, London, UK; School of Biomedical Engineering and Imaging Sciences, King's College London, UK
| |
Collapse
|
21
|
Santos FAO, Zanchettin C, Lei W, Nunes Amaral LA. Adversarial training and attribution methods enable evaluation of robustness and interpretability of deep learning models for image classification. Phys Rev E 2024; 110:054310. [PMID: 39690595 DOI: 10.1103/physreve.110.054310] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2023] [Accepted: 09/19/2024] [Indexed: 12/19/2024]
Abstract
Deep learning models have achieved high performance in a wide range of applications. Recently, however, there have been increasing concerns about the fragility of many of those models to adversarial approaches and out-of-distribution inputs. A way to investigate and potentially address model fragility is to develop the ability to provide interpretability to model predictions. To this end, input attribution approaches such as Grad-CAM and integrated gradients have been introduced to address model interpretability. Here, we combine adversarial and input attribution approaches in order to achieve two goals. The first is to investigate the impact of adversarial approaches on input attribution. The second is to benchmark competing input attribution approaches. In the context of the image classification task, we find that models trained with adversarial approaches yield dramatically different input attribution matrices from those obtained using standard techniques for all considered input attribution approaches. Additionally, by evaluating the signal-(typical input attribution of the foreground)-to-noise (typical input attribution of the background) ratio and correlating it to model confidence, we are able to identify the most reliable input attribution approaches and demonstrate that adversarial training does increase prediction robustness. Our approach can be easily extended to contexts other than the image classification task and enables users to increase their confidence in the reliability of deep learning models.
Collapse
Affiliation(s)
| | | | | | - Luís A Nunes Amaral
- Department of Chemical and Biological Engineering, Northwestern University, Evanston, Illinois 60208, USA
- Department of Physics and Astronomy, Northwestern University, Evanston, Illinois 60208, USA
- Northwestern Institute on Complex Systems, Northwestern University, Evanston, Illinois 60208, USA
- NSF-Simons National Institute for Theory and Mathematics in Biology, Northwestern University, Chicago, Illinois 60611, USA
| |
Collapse
|
22
|
Ball Dunlap PA, Michalowski M. Advancing AI Data Ethics in Nursing: Future Directions for Nursing Practice, Research, and Education. JMIR Nurs 2024; 7:e62678. [PMID: 39453630 PMCID: PMC11529373 DOI: 10.2196/62678] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2024] [Revised: 09/08/2024] [Accepted: 09/13/2024] [Indexed: 09/15/2024] Open
Abstract
Unlabelled The ethics of artificial intelligence (AI) are increasingly recognized due to concerns such as algorithmic bias, opacity, trust issues, data security, and fairness. Specifically, machine learning algorithms, central to AI technologies, are essential in striving for ethically sound systems that mimic human intelligence. These technologies rely heavily on data, which often remain obscured within complex systems and must be prioritized for ethical collection, processing, and usage. The significance of data ethics in achieving responsible AI was first highlighted in the broader context of health care and subsequently in nursing. This viewpoint explores the principles of data ethics, drawing on relevant frameworks and strategies identified through a formal literature review. These principles apply to real-world and synthetic data in AI and machine-learning contexts. Additionally, the data-centric AI paradigm is briefly examined, emphasizing its focus on data quality and the ethical development of AI solutions that integrate human-centered domain expertise. The ethical considerations specific to nursing are addressed, including 4 recommendations for future directions in nursing practice, research, and education and 2 hypothetical nurse-focused ethical case studies. The primary objectives are to position nurses to actively participate in AI and data ethics, thereby contributing to creating high-quality and relevant data for machine learning applications.
Collapse
Affiliation(s)
- Patricia A Ball Dunlap
- School of Nursing, University of Minnesota, 5-140 Weaver-Densford Hall, 308 Harvard Street SE, Minneapolis, MN, 55455, United States, 16126245959
- Center for Digital Health, Mayo Clinic, Rochester, MN, United States
| | - Martin Michalowski
- School of Nursing, University of Minnesota, 5-140 Weaver-Densford Hall, 308 Harvard Street SE, Minneapolis, MN, 55455, United States, 16126245959
| |
Collapse
|
23
|
Wells S. Can AI shake-up translational research? Nature 2024:10.1038/d41586-024-03318-0. [PMID: 39414966 DOI: 10.1038/d41586-024-03318-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2024]
|
24
|
Lones MA. Avoiding common machine learning pitfalls. PATTERNS (NEW YORK, N.Y.) 2024; 5:101046. [PMID: 39569205 PMCID: PMC11573893 DOI: 10.1016/j.patter.2024.101046] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/22/2024]
Abstract
Mistakes in machine learning practice are commonplace and can result in loss of confidence in the findings and products of machine learning. This tutorial outlines common mistakes that occur when using machine learning and what can be done to avoid them. While it should be accessible to anyone with a basic understanding of machine learning techniques, it focuses on issues that are of particular concern within academic research, such as the need to make rigorous comparisons and reach valid conclusions. It covers five stages of the machine learning process: what to do before model building, how to reliably build models, how to robustly evaluate models, how to compare models fairly, and how to report results.
Collapse
Affiliation(s)
- Michael A Lones
- School of Mathematical and Computer Sciences, Heriot-Watt University, Edinburgh, UK
| |
Collapse
|
25
|
Sanchez T, Esteban O, Gomez Y, Pron A, Koob M, Dunet V, Girard N, Jakab A, Eixarch E, Auzias G, Bach Cuadra M. FetMRQC: A robust quality control system for multi-centric fetal brain MRI. Med Image Anal 2024; 97:103282. [PMID: 39053168 DOI: 10.1016/j.media.2024.103282] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2023] [Revised: 06/28/2024] [Accepted: 07/15/2024] [Indexed: 07/27/2024]
Abstract
Fetal brain MRI is becoming an increasingly relevant complement to neurosonography for perinatal diagnosis, allowing fundamental insights into fetal brain development throughout gestation. However, uncontrolled fetal motion and heterogeneity in acquisition protocols lead to data of variable quality, potentially biasing the outcome of subsequent studies. We present FetMRQC, an open-source machine-learning framework for automated image quality assessment and quality control that is robust to domain shifts induced by the heterogeneity of clinical data. FetMRQC extracts an ensemble of quality metrics from unprocessed anatomical MRI and combines them to predict experts' ratings using random forests. We validate our framework on a pioneeringly large and diverse dataset of more than 1600 manually rated fetal brain T2-weighted images from four clinical centers and 13 different scanners. Our study shows that FetMRQC's predictions generalize well to unseen data while being interpretable. FetMRQC is a step towards more robust fetal brain neuroimaging, which has the potential to shed new insights on the developing human brain.
Collapse
Affiliation(s)
- Thomas Sanchez
- CIBM - Center for Biomedical Imaging, Switzerland; Department of Diagnostic and Interventional Radiology, Lausanne University Hospital and University of Lausanne, Lausanne, Switzerland.
| | - Oscar Esteban
- Department of Diagnostic and Interventional Radiology, Lausanne University Hospital and University of Lausanne, Lausanne, Switzerland
| | - Yvan Gomez
- BCNatal Fetal Medicine Research Center (Hospital Clínic and Hospital Sant Joan de Déu), Universitat de Barcelona, Spain; Department Woman-Mother-Child, CHUV, Lausanne, Switzerland
| | - Alexandre Pron
- Aix-Marseille Université, CNRS, Institut de Neurosciences de La Timone, Marseilles, France
| | - Mériam Koob
- Department of Diagnostic and Interventional Radiology, Lausanne University Hospital and University of Lausanne, Lausanne, Switzerland
| | - Vincent Dunet
- Department of Diagnostic and Interventional Radiology, Lausanne University Hospital and University of Lausanne, Lausanne, Switzerland
| | - Nadine Girard
- Aix-Marseille Université, CNRS, Institut de Neurosciences de La Timone, Marseilles, France; Service de Neuroradiologie Diagnostique et Interventionnelle, Hôpital Timone, AP-HM, Marseilles, France
| | - Andras Jakab
- Center for MR Research, University Children's Hospital Zurich, University of Zurich, Zurich, Switzerland; Neuroscience Center Zurich, University of Zurich, Zurich, Switzerland; Research Priority Project Adaptive Brain Circuits in Development and Learning (AdaBD), University of Zürich, Zurich, Switzerland
| | - Elisenda Eixarch
- BCNatal Fetal Medicine Research Center (Hospital Clínic and Hospital Sant Joan de Déu), Universitat de Barcelona, Spain; IDIBAPS and CIBERER, Barcelona, Spain
| | - Guillaume Auzias
- Aix-Marseille Université, CNRS, Institut de Neurosciences de La Timone, Marseilles, France
| | - Meritxell Bach Cuadra
- CIBM - Center for Biomedical Imaging, Switzerland; Department of Diagnostic and Interventional Radiology, Lausanne University Hospital and University of Lausanne, Lausanne, Switzerland
| |
Collapse
|
26
|
Matos J, Gallifant J, Chowdhury A, Economou-Zavlanos N, Charpignon ML, Gichoya J, Celi LA, Nazer L, King H, Wong AKI. A Clinician's Guide to Understanding Bias in Critical Clinical Prediction Models. Crit Care Clin 2024; 40:827-857. [PMID: 39218488 DOI: 10.1016/j.ccc.2024.05.011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/04/2024]
Abstract
This narrative review focuses on the role of clinical prediction models in supporting informed decision-making in critical care, emphasizing their 2 forms: traditional scores and artificial intelligence (AI)-based models. Acknowledging the potential for both types to embed biases, the authors underscore the importance of critical appraisal to increase our trust in models. The authors outline recommendations and critical care examples to manage risk of bias in AI models. The authors advocate for enhanced interdisciplinary training for clinicians, who are encouraged to explore various resources (books, journals, news Web sites, and social media) and events (Datathons) to deepen their understanding of risk of bias.
Collapse
Affiliation(s)
- João Matos
- University of Porto (FEUP), Porto, Portugal; Institute for Systems and Computer Engineering, Technology and Science (INESC TEC), Porto, Portugal; Laboratory for Computational Physiology, Institute for Medical Engineering and Science, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Jack Gallifant
- Laboratory for Computational Physiology, Institute for Medical Engineering and Science, Massachusetts Institute of Technology, Cambridge, MA, USA; Department of Critical Care, Guy's and St Thomas' NHS Trust, London, UK
| | - Anand Chowdhury
- Division of Pulmonary, Allergy, and Critical Care Medicine, Department of Medicine, Duke University, Durham, NC, USA
| | | | - Marie-Laure Charpignon
- Institute for Data Systems and Society, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Judy Gichoya
- Department of Radiology, Emory University, Atlanta, GA, USA
| | - Leo Anthony Celi
- Laboratory for Computational Physiology, Institute for Medical Engineering and Science, Massachusetts Institute of Technology, Cambridge, MA, USA; Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, USA; Department of Medicine, Beth Israel Deaconess Medical Center, Boston, MA, USA
| | - Lama Nazer
- Department of Pharmacy, King Hussein Cancer Center, Amman, Jordan
| | - Heather King
- Durham VA Health Care System, Health Services Research and Development, Center of Innovation to Accelerate Discovery and Practice Transformation (ADAPT), Durham, NC, USA; Department of Population Health Sciences, Duke University, Durham, NC, USA; Division of General Internal Medicine, Duke University, Duke University School of Medicine, Durham, NC, USA
| | - An-Kwok Ian Wong
- Division of Pulmonary, Allergy, and Critical Care Medicine, Department of Medicine, Duke University, Durham, NC, USA; Department of Biostatistics and Bioinformatics, Duke University, Division of Translational Biomedical Informatics, Durham, NC, USA.
| |
Collapse
|
27
|
Nolin-Lapalme A, Corbin D, Tastet O, Avram R, Hussin JG. Advancing Fairness in Cardiac Care: Strategies for Mitigating Bias in Artificial Intelligence Models Within Cardiology. Can J Cardiol 2024; 40:1907-1921. [PMID: 38735528 DOI: 10.1016/j.cjca.2024.04.026] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2024] [Revised: 04/03/2024] [Accepted: 04/22/2024] [Indexed: 05/14/2024] Open
Abstract
In the dynamic field of medical artificial intelligence (AI), cardiology stands out as a key area for its technological advancements and clinical application. In this review we explore the complex issue of data bias, specifically addressing those encountered during the development and implementation of AI tools in cardiology. We dissect the origins and effects of these biases, which challenge their reliability and widespread applicability in health care. Using a case study, we highlight the complexities involved in addressing these biases from a clinical viewpoint. The goal of this review is to equip researchers and clinicians with the practical knowledge needed to identify, understand, and mitigate these biases, advocating for the creation of AI solutions that are not just technologically sound, but also fair and effective for all patients.
Collapse
Affiliation(s)
- Alexis Nolin-Lapalme
- Department of Medicine, Montreal Heart Institute, Montreal, Quebec, Canada; Faculté de Médecine, Université de Montréal, Montreal, Quebec, Canada; Mila - Québec AI Institute, Montreal, Quebec, Canada; Heartwise (heartwise.ai), Montreal Heart Institute, Montreal, Quebec, Canada.
| | - Denis Corbin
- Department of Medicine, Montreal Heart Institute, Montreal, Quebec, Canada
| | - Olivier Tastet
- Department of Medicine, Montreal Heart Institute, Montreal, Quebec, Canada
| | - Robert Avram
- Department of Medicine, Montreal Heart Institute, Montreal, Quebec, Canada; Faculté de Médecine, Université de Montréal, Montreal, Quebec, Canada; Heartwise (heartwise.ai), Montreal Heart Institute, Montreal, Quebec, Canada
| | - Julie G Hussin
- Department of Medicine, Montreal Heart Institute, Montreal, Quebec, Canada; Faculté de Médecine, Université de Montréal, Montreal, Quebec, Canada; Mila - Québec AI Institute, Montreal, Quebec, Canada
| |
Collapse
|
28
|
Nakayama LF, Matos J, Quion J, Novaes F, Mitchell WG, Mwavu R, Hung CJYJ, Santiago APD, Phanphruk W, Cardoso JS, Celi LA. Unmasking biases and navigating pitfalls in the ophthalmic artificial intelligence lifecycle: A narrative review. PLOS DIGITAL HEALTH 2024; 3:e0000618. [PMID: 39378192 PMCID: PMC11460710 DOI: 10.1371/journal.pdig.0000618] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/10/2024]
Abstract
Over the past 2 decades, exponential growth in data availability, computational power, and newly available modeling techniques has led to an expansion in interest, investment, and research in Artificial Intelligence (AI) applications. Ophthalmology is one of many fields that seek to benefit from AI given the advent of telemedicine screening programs and the use of ancillary imaging. However, before AI can be widely deployed, further work must be done to avoid the pitfalls within the AI lifecycle. This review article breaks down the AI lifecycle into seven steps-data collection; defining the model task; data preprocessing and labeling; model development; model evaluation and validation; deployment; and finally, post-deployment evaluation, monitoring, and system recalibration-and delves into the risks for harm at each step and strategies for mitigating them.
Collapse
Affiliation(s)
- Luis Filipe Nakayama
- Department of Ophthalmology, Sao Paulo Federal University, Sao Paulo, Sao Paulo, Brazil
- Laboratory for Computational Physiology, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America
| | - João Matos
- Laboratory for Computational Physiology, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America
- Faculty of Engineering (FEUP), University of Porto, Porto, Portugal
- Institute for Systems and Computer Engineering (INESC TEC), Technology and Science, Porto, Portugal
| | - Justin Quion
- Laboratory for Computational Physiology, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America
| | - Frederico Novaes
- Department of Ophthalmology, Sao Paulo Federal University, Sao Paulo, Sao Paulo, Brazil
| | | | - Rogers Mwavu
- Department of Information Technology, Mbarara University of Science and Technology, Mbarara, Uganda
| | - Claudia Ju-Yi Ji Hung
- Department of Ophthalmology, Byers Eye Institute at Stanford, California, United States of America
- Department of Computer Science and Information Engineering, National Taiwan University, Taiwan
| | - Alvina Pauline Dy Santiago
- University of the Philippines Manila College of Medicine, Manila, Philippines
- Division of Pediatric Ophthalmology, Department of Ophthalmology & Visual Sciences, Philippine General Hospital, Manila, Philippines
- Section of Pediatric Ophthalmology, Eye and Vision Institute, The Medical City, Pasig, Philippines
- Section of Pediatric Ophthalmology, International Eye and Institute, St. Luke’s Medical Center, Quezon City, Philippines
| | - Warachaya Phanphruk
- Department of Ophthalmology, Faculty of Medicine, Khon Kaen University, Khon Kaen, Thailand
| | - Jaime S. Cardoso
- Faculty of Engineering (FEUP), University of Porto, Porto, Portugal
- Institute for Systems and Computer Engineering (INESC TEC), Technology and Science, Porto, Portugal
| | - Leo Anthony Celi
- Laboratory for Computational Physiology, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America
- Department of Biostatistics, Harvard TH Chan School of Public Health, Boston, Massachusetts, United States of America
- Department of Medicine, Beth Israel Deaconess Medical Center, Boston, Massachusetts, United States of America
| |
Collapse
|
29
|
Tai K, Zhao R, Rameau A. Artificial Intelligence in Otolaryngology: Topics in Epistemology & Ethics. Otolaryngol Clin North Am 2024; 57:863-870. [PMID: 38839555 PMCID: PMC11374503 DOI: 10.1016/j.otc.2024.04.008] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/07/2024]
Abstract
To fuel artificial intelligence (AI) potential in clinical practice in otolaryngology, researchers must understand its epistemic limitations, which are tightly linked to ethical dilemmas requiring careful consideration. AI tools are fundamentally opaque systems, though there are methods to increase explainability and transparency. Reproducibility and replicability limitations can be overcomed by sharing computing code, raw data, and data processing methodology. The risk of bias can be mitigated via algorithmic auditing, careful consideration of the training data, and advocating for a diverse AI workforce to promote algorithmic pluralism, reflecting our population's diverse values and preferences.
Collapse
Affiliation(s)
- Katie Tai
- New York Presbyterian Hospital, 1300 York Avenue, New York, NY 10065, USA
| | - Robin Zhao
- Department of Otolaryngology-Head & Neck Surgery, Sean Parker Institute for the Voice, Weill Cornell Medical College, 240 East 59th Street, New York, NY 10022, USA
| | - Anaïs Rameau
- Department of Otolaryngology-Head & Neck Surgery, Sean Parker Institute for the Voice, Weill Cornell Medical College, 240 East 59th Street, New York, NY 10022, USA.
| |
Collapse
|
30
|
Turrisi R, Verri A, Barla A. Deep learning-based Alzheimer's disease detection: reproducibility and the effect of modeling choices. Front Comput Neurosci 2024; 18:1360095. [PMID: 39371524 PMCID: PMC11451303 DOI: 10.3389/fncom.2024.1360095] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2023] [Accepted: 09/03/2024] [Indexed: 10/08/2024] Open
Abstract
Introduction Machine Learning (ML) has emerged as a promising approach in healthcare, outperforming traditional statistical techniques. However, to establish ML as a reliable tool in clinical practice, adherence to best practices in data handling, and modeling design and assessment is crucial. In this work, we summarize and strictly adhere to such practices to ensure reproducible and reliable ML. Specifically, we focus on Alzheimer's Disease (AD) detection, a challenging problem in healthcare. Additionally, we investigate the impact of modeling choices, including different data augmentation techniques and model complexity, on overall performance. Methods We utilize Magnetic Resonance Imaging (MRI) data from the ADNI corpus to address a binary classification problem using 3D Convolutional Neural Networks (CNNs). Data processing and modeling are specifically tailored to address data scarcity and minimize computational overhead. Within this framework, we train 15 predictive models, considering three different data augmentation strategies and five distinct 3D CNN architectures with varying convolutional layers counts. The augmentation strategies involve affine transformations, such as zoom, shift, and rotation, applied either concurrently or separately. Results The combined effect of data augmentation and model complexity results in up to 10% variation in prediction accuracy. Notably, when affine transformation are applied separately, the model achieves higher accuracy, regardless the chosen architecture. Across all strategies, the model accuracy exhibits a concave behavior as the number of convolutional layers increases, peaking at an intermediate value. The best model reaches excellent performance both on the internal and additional external testing set. Discussions Our work underscores the critical importance of adhering to rigorous experimental practices in the field of ML applied to healthcare. The results clearly demonstrate how data augmentation and model depth-often overlooked factors- can dramatically impact final performance if not thoroughly investigated. This highlights both the necessity of exploring neglected modeling aspects and the need to comprehensively report all modeling choices to ensure reproducibility and facilitate meaningful comparisons across studies.
Collapse
Affiliation(s)
- Rosanna Turrisi
- Department of Informatics, Bioengineering, Robotics and System Engineering (DIBRIS), University of Genoa, Genoa, Italy
- Machine Learning Genoa (MaLGa) Center, University of Genoa, Genoa, Italy
| | - Alessandro Verri
- Department of Informatics, Bioengineering, Robotics and System Engineering (DIBRIS), University of Genoa, Genoa, Italy
- Machine Learning Genoa (MaLGa) Center, University of Genoa, Genoa, Italy
| | - Annalisa Barla
- Department of Informatics, Bioengineering, Robotics and System Engineering (DIBRIS), University of Genoa, Genoa, Italy
- Machine Learning Genoa (MaLGa) Center, University of Genoa, Genoa, Italy
| |
Collapse
|
31
|
Garfinkle J. Automation to approximate the Gestalt: applying machine learning to the general movement assessment. Pediatr Res 2024:10.1038/s41390-024-03558-w. [PMID: 39251880 DOI: 10.1038/s41390-024-03558-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/17/2024] [Accepted: 08/24/2024] [Indexed: 09/11/2024]
Affiliation(s)
- Jarred Garfinkle
- Division of Neonatology, Department of Pediatrics, McGill University, Montreal Children's Hospital, Montreal, QC, Canada.
| |
Collapse
|
32
|
Habli I, Sujan M, Lawton T. Moving beyond the AI sales pitch - Empowering clinicians to ask the right questions about clinical AI. Future Healthc J 2024; 11:100179. [PMID: 39371536 PMCID: PMC11452827 DOI: 10.1016/j.fhj.2024.100179] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2024] [Accepted: 08/29/2024] [Indexed: 10/08/2024]
Abstract
We challenge the dominant technology-centric narrative around clinical AI. To realise the true potential of the technology, clinicians must be empowered to take a whole-system perspective and assess the suitability of AI-supported tasks for their specific complex clinical setting. Key factors include the AI's capacity to augment human capabilities, evidence of clinical safety beyond general performance metrics and equitable clinical decision-making by the human-AI team. Proactively addressing these issues could pave the way for an accountable clinical buy-in and a trustworthy deployment of the technology.
Collapse
Affiliation(s)
- Ibrahim Habli
- Centre for Assuring Autonomy, University of York, United Kingdom
| | - Mark Sujan
- Centre for Assuring Autonomy, University of York, United Kingdom
| | - Tom Lawton
- Improvement Academy, Bradford Institute for Health Research, United Kingdom
| |
Collapse
|
33
|
Romoli M, Caliandro P. Artificial intelligence, machine learning, and reproducibility in stroke research. Eur Stroke J 2024; 9:518-520. [PMID: 39169773 PMCID: PMC11418546 DOI: 10.1177/23969873241275863] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2024] [Accepted: 08/01/2024] [Indexed: 08/23/2024] Open
Affiliation(s)
- Michele Romoli
- Department of Neurosciences, Bufalini Hospital, AUSL Romagna, Cesena, Italy
| | | |
Collapse
|
34
|
Ahmed TM, Lopez-Ramirez F, Fishman EK, Chu L. Artificial Intelligence Applications in Pancreatic Cancer Imaging. ADVANCES IN CLINICAL RADIOLOGY 2024; 6:41-54. [DOI: 10.1016/j.yacr.2024.04.003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/05/2025]
|
35
|
Goh WWB, Kabir MN, Yoo S, Wong L. Ten quick tips for ensuring machine learning model validity. PLoS Comput Biol 2024; 20:e1012402. [PMID: 39298376 DOI: 10.1371/journal.pcbi.1012402] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/21/2024] Open
Abstract
Artificial Intelligence (AI) and Machine Learning (ML) models are increasingly deployed on biomedical and health data to shed insights on biological mechanism, predict disease outcomes, and support clinical decision-making. However, ensuring model validity is challenging. The 10 quick tips described here discuss useful practices on how to check AI/ML models from 2 perspectives-the user and the developer.
Collapse
Affiliation(s)
- Wilson Wen Bin Goh
- Lee Kong Chian School of Medicine, Nanyang Technological University, Singapore, Singapore
- School of Biological Sciences, Nanyang Technological University, Singapore, Singapore
- Center for Biomedical Informatics, Nanyang Technological University, Singapore, Singapore
- Center of AI in Medicine, Nanyang Technological University, Singapore, Singapore
- Division of Neurology, Department of Brain Sciences, Faculty of Medicine, Imperial College London, London, United Kingdom
| | - Mohammad Neamul Kabir
- Lee Kong Chian School of Medicine, Nanyang Technological University, Singapore, Singapore
- Center for Biomedical Informatics, Nanyang Technological University, Singapore, Singapore
| | - Sehwan Yoo
- Lee Kong Chian School of Medicine, Nanyang Technological University, Singapore, Singapore
- Center for Biomedical Informatics, Nanyang Technological University, Singapore, Singapore
| | - Limsoon Wong
- School of Computing, National University of Singapore, Singapore, Singapore
- Yong Loo Lin School of Medicine, National University of Singapore, Singapore, Singapore
| |
Collapse
|
36
|
C. Pereira S, Rocha J, Campilho A, Mendonça AM. Distribution-based detection of radiographic changes in pneumonia patterns: A COVID-19 case study. Heliyon 2024; 10:e35677. [PMID: 39677970 PMCID: PMC11639430 DOI: 10.1016/j.heliyon.2024.e35677] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2024] [Revised: 06/28/2024] [Accepted: 08/01/2024] [Indexed: 12/17/2024] Open
Abstract
Although the classification of chest radiographs has long been an extensively researched topic, interest increased significantly with the onset of the COVID-19 pandemic. Existing results are promising; however, the radiological similarities between COVID-19 and other types of respiratory diseases limit the success of conventional image classification approaches that focus on single instances. This study proposes a novel perspective that conceptualizes COVID-19 pneumonia as a deviation from a normative distribution of typical pneumonia patterns. Using a population-based approach, our approach utilizes distributional anomaly detection. This method diverges from traditional instance-wise approaches by focusing on sets of scans instead of individual images. Using an autoencoder to extract feature representations, we present instance-based and distribution-based assessments of the separability between COVID-positive and COVID-negative pneumonia radiographs. The results demonstrate that the proposed distribution-based methodology outperforms conventional instance-based techniques in identifying radiographic changes associated with COVID-positive cases. This underscores its potential as an early warning system capable of detecting significant distributional shifts in radiographic data. By continuously monitoring these changes, this approach offers a mechanism for early identification of emerging health trends, potentially signaling the onset of new pandemics and enabling prompt public health responses.
Collapse
Affiliation(s)
- Sofia C. Pereira
- Institute for Systems and Computer Engineering, Technology and Science (INESC-TEC), Portugal
- Faculty of Engineering of the University of Porto, Portugal
| | - Joana Rocha
- Institute for Systems and Computer Engineering, Technology and Science (INESC-TEC), Portugal
- Faculty of Engineering of the University of Porto, Portugal
| | - Aurélio Campilho
- Institute for Systems and Computer Engineering, Technology and Science (INESC-TEC), Portugal
- Faculty of Engineering of the University of Porto, Portugal
| | - Ana Maria Mendonça
- Institute for Systems and Computer Engineering, Technology and Science (INESC-TEC), Portugal
- Faculty of Engineering of the University of Porto, Portugal
| |
Collapse
|
37
|
Lorde N, Mahapatra S, Kalaria T. Machine Learning for Patient-Based Real-Time Quality Control (PBRTQC), Analytical and Preanalytical Error Detection in Clinical Laboratory. Diagnostics (Basel) 2024; 14:1808. [PMID: 39202296 PMCID: PMC11354140 DOI: 10.3390/diagnostics14161808] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2024] [Revised: 08/14/2024] [Accepted: 08/16/2024] [Indexed: 09/03/2024] Open
Abstract
The rapidly evolving field of machine learning (ML), along with artificial intelligence in a broad sense, is revolutionising many areas of healthcare, including laboratory medicine. The amalgamation of the fields of ML and patient-based real-time quality control (PBRTQC) processes could improve the traditional PBRTQC and error detection algorithms in the laboratory. This narrative review discusses published studies on using ML for the detection of systematic errors, non-systematic errors, and combinations of different types of errors in clinical laboratories. The studies discussed used ML for detecting bias, the requirement for re-calibration, samples contaminated with intravenous fluid or EDTA, delayed sample analysis, wrong-blood-in-tube errors, interference or a combination of different types of errors, by comparing the performance of ML models with human validators or traditional PBRTQC algorithms. Advantages, limitations, the creation of standardised ML models, ethical and regulatory aspects and potential future developments have also been discussed in brief.
Collapse
Affiliation(s)
- Nathan Lorde
- Blood Sciences, Black Country Pathology Services, The Royal Wolverhampton NHS Trust, Wolverhampton WV10 0QP, UK
| | | | | |
Collapse
|
38
|
Bontempi D, Nuernberg L, Pai S, Krishnaswamy D, Thiriveedhi V, Hosny A, Mak RH, Farahani K, Kikinis R, Fedorov A, Aerts HJWL. End-to-end reproducible AI pipelines in radiology using the cloud. Nat Commun 2024; 15:6931. [PMID: 39138215 PMCID: PMC11322541 DOI: 10.1038/s41467-024-51202-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2023] [Accepted: 07/30/2024] [Indexed: 08/15/2024] Open
Abstract
Artificial intelligence (AI) algorithms hold the potential to revolutionize radiology. However, a significant portion of the published literature lacks transparency and reproducibility, which hampers sustained progress toward clinical translation. Although several reporting guidelines have been proposed, identifying practical means to address these issues remains challenging. Here, we show the potential of cloud-based infrastructure for implementing and sharing transparent and reproducible AI-based radiology pipelines. We demonstrate end-to-end reproducibility from retrieving cloud-hosted data, through data pre-processing, deep learning inference, and post-processing, to the analysis and reporting of the final results. We successfully implement two distinct use cases, starting from recent literature on AI-based biomarkers for cancer imaging. Using cloud-hosted data and computing, we confirm the findings of these studies and extend the validation to previously unseen data for one of the use cases. Furthermore, we provide the community with transparent and easy-to-extend examples of pipelines impactful for the broader oncology field. Our approach demonstrates the potential of cloud resources for implementing, sharing, and using reproducible and transparent AI pipelines, which can accelerate the translation into clinical solutions.
Collapse
Affiliation(s)
- Dennis Bontempi
- Artificial Intelligence in Medicine (AIM) Program, Mass General Brigham, Harvard Medical School, Boston, MA, USA
- Radiology and Nuclear Medicine, CARIM & GROW, Maastricht University, Maastricht, The Netherlands
- Department of Radiation Oncology, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA, USA
| | - Leonard Nuernberg
- Artificial Intelligence in Medicine (AIM) Program, Mass General Brigham, Harvard Medical School, Boston, MA, USA
- Radiology and Nuclear Medicine, CARIM & GROW, Maastricht University, Maastricht, The Netherlands
- Department of Radiation Oncology, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA, USA
| | - Suraj Pai
- Artificial Intelligence in Medicine (AIM) Program, Mass General Brigham, Harvard Medical School, Boston, MA, USA
- Radiology and Nuclear Medicine, CARIM & GROW, Maastricht University, Maastricht, The Netherlands
- Department of Radiation Oncology, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA, USA
| | - Deepa Krishnaswamy
- Department of Radiology, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA
| | - Vamsi Thiriveedhi
- Department of Radiology, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA
| | - Ahmed Hosny
- Artificial Intelligence in Medicine (AIM) Program, Mass General Brigham, Harvard Medical School, Boston, MA, USA
- Department of Radiation Oncology, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA, USA
| | - Raymond H Mak
- Artificial Intelligence in Medicine (AIM) Program, Mass General Brigham, Harvard Medical School, Boston, MA, USA
- Department of Radiation Oncology, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA, USA
| | - Keyvan Farahani
- National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, MD, USA
| | - Ron Kikinis
- Department of Radiology, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA
| | - Andrey Fedorov
- Department of Radiology, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA
| | - Hugo J W L Aerts
- Artificial Intelligence in Medicine (AIM) Program, Mass General Brigham, Harvard Medical School, Boston, MA, USA.
- Radiology and Nuclear Medicine, CARIM & GROW, Maastricht University, Maastricht, The Netherlands.
- Department of Radiation Oncology, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA, USA.
| |
Collapse
|
39
|
Kocak B, Akinci D'Antonoli T, Ates Kus E, Keles A, Kala A, Kose F, Kadioglu M, Solak S, Sunman S, Temiz ZH. Self-reported checklists and quality scoring tools in radiomics: a meta-research. Eur Radiol 2024; 34:5028-5040. [PMID: 38180530 DOI: 10.1007/s00330-023-10487-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2023] [Revised: 11/11/2023] [Accepted: 11/24/2023] [Indexed: 01/06/2024]
Abstract
OBJECTIVE To evaluate the use of reporting checklists and quality scoring tools for self-reporting purposes in radiomics literature. METHODS Literature search was conducted in PubMed (date, April 23, 2023). The radiomics literature was sampled at random after a sample size calculation with a priori power analysis. A systematic assessment for self-reporting, including the use of documentation such as completed checklists or quality scoring tools, was conducted in original research papers. These eligible papers underwent independent evaluation by a panel of nine readers, with three readers assigned to each paper. Automatic annotation was used to assist in this process. Then, a detailed item-by-item confirmation analysis was carried out on papers with checklist documentation, with independent evaluation of two readers. RESULTS The sample size calculation yielded 117 papers. Most of the included papers were retrospective (94%; 110/117), single-center (68%; 80/117), based on their private data (89%; 104/117), and lacked external validation (79%; 93/117). Only seven papers (6%) had at least one self-reported document (Radiomics Quality Score (RQS), Transparent Reporting of a multivariable prediction model for Individual Prognosis Or Diagnosis (TRIPOD), or Checklist for Artificial Intelligence in Medical Imaging (CLAIM)), with a statistically significant binomial test (p < 0.001). Median rate of confirmed items for all three documents was 81% (interquartile range, 6). For quality scoring tools, documented scores were higher than suggested scores, with a mean difference of - 7.2 (standard deviation, 6.8). CONCLUSION Radiomic publications often lack self-reported checklists or quality scoring tools. Even when such documents are provided, it is essential to be cautious, as the accuracy of the reported items or scores may be questionable. CLINICAL RELEVANCE STATEMENT Current state of radiomic literature reveals a notable absence of self-reporting with documentation and inaccurate reporting practices. This critical observation may serve as a catalyst for motivating the radiomics community to adopt and utilize such tools appropriately, thereby fostering rigor, transparency, and reproducibility of their research, moving the field forward. KEY POINTS • In radiomics literature, there has been a notable absence of self-reporting with documentation. • Even if such documents are provided, it is critical to exercise caution because the accuracy of the reported items or scores may be questionable. • Radiomics community needs to be motivated to adopt and appropriately utilize the reporting checklists and quality scoring tools.
Collapse
Affiliation(s)
- Burak Kocak
- Department of Radiology, University of Health Sciences, Basaksehir Cam and Sakura City Hospital, Basaksehir, Istanbul, 34480, Turkey.
| | - Tugba Akinci D'Antonoli
- Institute of Radiology and Nuclear Medicine, Cantonal Hospital Baselland, Liestal, Switzerland
| | - Ece Ates Kus
- Department of Neuroradiology, Klinikum Lippe, Lemgo, Germany
| | - Ali Keles
- Department of Radiology, University of Health Sciences, Basaksehir Cam and Sakura City Hospital, Basaksehir, Istanbul, 34480, Turkey
| | - Ahmet Kala
- Department of Radiology, University of Health Sciences, Basaksehir Cam and Sakura City Hospital, Basaksehir, Istanbul, 34480, Turkey
| | - Fadime Kose
- Department of Radiology, University of Health Sciences, Basaksehir Cam and Sakura City Hospital, Basaksehir, Istanbul, 34480, Turkey
| | - Mehmet Kadioglu
- Department of Radiology, University of Health Sciences, Basaksehir Cam and Sakura City Hospital, Basaksehir, Istanbul, 34480, Turkey
| | - Sila Solak
- Department of Radiology, University of Health Sciences, Basaksehir Cam and Sakura City Hospital, Basaksehir, Istanbul, 34480, Turkey
| | - Seyma Sunman
- Department of Radiology, University of Health Sciences, Basaksehir Cam and Sakura City Hospital, Basaksehir, Istanbul, 34480, Turkey
| | - Zisan Hayriye Temiz
- Department of Radiology, University of Health Sciences, Basaksehir Cam and Sakura City Hospital, Basaksehir, Istanbul, 34480, Turkey
| |
Collapse
|
40
|
Imagawa K, Shiomoto K. Evaluation of Effectiveness of Self-Supervised Learning in Chest X-Ray Imaging to Reduce Annotated Images. JOURNAL OF IMAGING INFORMATICS IN MEDICINE 2024; 37:1618-1624. [PMID: 38459399 PMCID: PMC11300406 DOI: 10.1007/s10278-024-00975-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/07/2023] [Revised: 11/17/2023] [Accepted: 11/17/2023] [Indexed: 03/10/2024]
Abstract
A significant challenge in machine learning-based medical image analysis is the scarcity of medical images. Obtaining a large number of labeled medical images is difficult because annotating medical images is a time-consuming process that requires specialized knowledge. In addition, inappropriate annotation processes can increase model bias. Self-supervised learning (SSL) is a type of unsupervised learning method that extracts image representations. Thus, SSL can be an effective method to reduce the number of labeled images. In this study, we investigated the feasibility of reducing the number of labeled images in a limited set of unlabeled medical images. The unlabeled chest X-ray (CXR) images were pretrained using the SimCLR framework, and then the representations were fine-tuned as supervised learning for the target task. A total of 2000 task-specific CXR images were used to perform binary classification of coronavirus disease 2019 (COVID-19) and normal cases. The results demonstrate that the performance of pretraining on task-specific unlabeled CXR images can be maintained when the number of labeled CXR images is reduced by approximately 40%. In addition, the performance was significantly better than that obtained without pretraining. In contrast, a large number of pretrained unlabeled images are required to maintain performance regardless of task specificity among a small number of labeled CXR images. In summary, to reduce the number of labeled images using SimCLR, we must consider both the number of images and the task-specific characteristics of the target images.
Collapse
Affiliation(s)
- Kuniki Imagawa
- Faculty of Information Technology, Tokyo City University, 1-28-1 Tamazutsumi, Setagaya-ku, Tokyo, 158-8557, Japan.
| | - Kohei Shiomoto
- Faculty of Information Technology, Tokyo City University, 1-28-1 Tamazutsumi, Setagaya-ku, Tokyo, 158-8557, Japan
| |
Collapse
|
41
|
Lawrence ND, Montgomery J. Accelerating AI for science: open data science for science. ROYAL SOCIETY OPEN SCIENCE 2024; 11:231130. [PMID: 39169971 PMCID: PMC11336680 DOI: 10.1098/rsos.231130] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/03/2023] [Revised: 05/16/2024] [Accepted: 07/03/2024] [Indexed: 08/23/2024]
Abstract
Aspirations for artificial intelligence (AI) as a catalyst for scientific discovery are growing. High-profile successes deploying AI in domains such as protein folding have highlighted AI's potential to unlock new frontiers of scientific knowledge. However, the pathway from AI innovation to deployment in research is not linear. Those seeking to drive a new wave of scientific progress through the application of AI require a diffusion engine that can enhance AI adoption across disciplines. Lessons from previous waves of technology change, experiences of deploying AI in real-world contexts and an emerging research agenda from the AI for science community suggest a framework for accelerating AI adoption. This framework requires action to build supply chains of ideas between disciplines; rapidly transfer technological capabilities through open research; create AI tools that empower researchers; and embed effective data stewardship. Together, these interventions can cultivate an environment of open data science that deliver the benefits of AI across the sciences.
Collapse
Affiliation(s)
- Neil D. Lawrence
- Department of Computer Science and Technology, University of Cambridge, Cambridge, UK
| | - Jessica Montgomery
- Department of Computer Science and Technology, University of Cambridge, Cambridge, UK
| |
Collapse
|
42
|
Chang JY, Makary MS. Evolving and Novel Applications of Artificial Intelligence in Thoracic Imaging. Diagnostics (Basel) 2024; 14:1456. [PMID: 39001346 PMCID: PMC11240935 DOI: 10.3390/diagnostics14131456] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2024] [Revised: 07/01/2024] [Accepted: 07/06/2024] [Indexed: 07/16/2024] Open
Abstract
The advent of artificial intelligence (AI) is revolutionizing medicine, particularly radiology. With the development of newer models, AI applications are demonstrating improved performance and versatile utility in the clinical setting. Thoracic imaging is an area of profound interest, given the prevalence of chest imaging and the significant health implications of thoracic diseases. This review aims to highlight the promising applications of AI within thoracic imaging. It examines the role of AI, including its contributions to improving diagnostic evaluation and interpretation, enhancing workflow, and aiding in invasive procedures. Next, it further highlights the current challenges and limitations faced by AI, such as the necessity of 'big data', ethical and legal considerations, and bias in representation. Lastly, it explores the potential directions for the application of AI in thoracic radiology.
Collapse
Affiliation(s)
- Jin Y Chang
- Department of Radiology, The Ohio State University College of Medicine, Columbus, OH 43210, USA
| | - Mina S Makary
- Department of Radiology, The Ohio State University College of Medicine, Columbus, OH 43210, USA
- Division of Vascular and Interventional Radiology, Department of Radiology, The Ohio State University Wexner Medical Center, Columbus, OH 43210, USA
| |
Collapse
|
43
|
Makarov V, Chabbert C, Koletou E, Psomopoulos F, Kurbatova N, Ramirez S, Nelson C, Natarajan P, Neupane B. Good machine learning practices: Learnings from the modern pharmaceutical discovery enterprise. Comput Biol Med 2024; 177:108632. [PMID: 38788373 DOI: 10.1016/j.compbiomed.2024.108632] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2024] [Revised: 05/07/2024] [Accepted: 05/18/2024] [Indexed: 05/26/2024]
Abstract
Machine Learning (ML) and Artificial Intelligence (AI) have become an integral part of the drug discovery and development value chain. Many teams in the pharmaceutical industry nevertheless report the challenges associated with the timely, cost effective and meaningful delivery of ML and AI powered solutions for their scientists. We sought to better understand what these challenges were and how to overcome them by performing an industry wide assessment of the practices in AI and Machine Learning. Here we report results of the systematic business analysis of the personas in the modern pharmaceutical discovery enterprise in relation to their work with the AI and ML technologies. We identify 23 common business problems that individuals in these roles face when they encounter AI and ML technologies at work, and describe best practices (Good Machine Learning Practices) that address these issues.
Collapse
Affiliation(s)
- Vladimir Makarov
- The Pistoia Alliance, 401 Edgewater Place, Suite 600, Wakefield, MA, 01880, USA.
| | | | | | | | | | | | | | | | | |
Collapse
|
44
|
Al-Kadi OS, Al-Emaryeen R, Al-Nahhas S, Almallahi I, Braik R, Mahafza W. Empowering brain cancer diagnosis: harnessing artificial intelligence for advanced imaging insights. Rev Neurosci 2024; 35:399-419. [PMID: 38291768 DOI: 10.1515/revneuro-2023-0115] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2023] [Accepted: 12/10/2023] [Indexed: 02/01/2024]
Abstract
Artificial intelligence (AI) is increasingly being used in the medical field, specifically for brain cancer imaging. In this review, we explore how AI-powered medical imaging can impact the diagnosis, prognosis, and treatment of brain cancer. We discuss various AI techniques, including deep learning and causality learning, and their relevance. Additionally, we examine current applications that provide practical solutions for detecting, classifying, segmenting, and registering brain tumors. Although challenges such as data quality, availability, interpretability, transparency, and ethics persist, we emphasise the enormous potential of intelligent applications in standardising procedures and enhancing personalised treatment, leading to improved patient outcomes. Innovative AI solutions have the power to revolutionise neuro-oncology by enhancing the quality of routine clinical practice.
Collapse
Affiliation(s)
- Omar S Al-Kadi
- King Abdullah II School for Information Technology, University of Jordan, Amman, 11942, Jordan
| | - Roa'a Al-Emaryeen
- King Abdullah II School for Information Technology, University of Jordan, Amman, 11942, Jordan
| | - Sara Al-Nahhas
- King Abdullah II School for Information Technology, University of Jordan, Amman, 11942, Jordan
| | - Isra'a Almallahi
- Department of Diagnostic Radiology, Jordan University Hospital, Amman, 11942, Jordan
| | - Ruba Braik
- Department of Diagnostic Radiology, Jordan University Hospital, Amman, 11942, Jordan
| | - Waleed Mahafza
- Department of Diagnostic Radiology, Jordan University Hospital, Amman, 11942, Jordan
| |
Collapse
|
45
|
Straiton J. Artificial intelligence: help or hindrance in solving the reproducibility crisis? Biotechniques 2024; 76:291-294. [PMID: 38899492 DOI: 10.1080/07366205.2024.2355776] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2024] [Accepted: 05/13/2024] [Indexed: 06/21/2024] Open
Abstract
Science is in the midst of a reproducibility crisis and the integration of artificial intelligence into scientific research has only compounded the problem; yet could the technology hold the solution to its own problems?[Formula: see text].
Collapse
|
46
|
Zhang F, Kreuter D, Chen Y, Dittmer S, Tull S, Shadbahr T, Preller J, Rudd JH, Aston JA, Schönlieb CB, Gleadall N, Roberts M. Recent methodological advances in federated learning for healthcare. PATTERNS (NEW YORK, N.Y.) 2024; 5:101006. [PMID: 39005485 PMCID: PMC11240178 DOI: 10.1016/j.patter.2024.101006] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 07/16/2024]
Abstract
For healthcare datasets, it is often impossible to combine data samples from multiple sites due to ethical, privacy, or logistical concerns. Federated learning allows for the utilization of powerful machine learning algorithms without requiring the pooling of data. Healthcare data have many simultaneous challenges, such as highly siloed data, class imbalance, missing data, distribution shifts, and non-standardized variables, that require new methodologies to address. Federated learning adds significant methodological complexity to conventional centralized machine learning, requiring distributed optimization, communication between nodes, aggregation of models, and redistribution of models. In this systematic review, we consider all papers on Scopus published between January 2015 and February 2023 that describe new federated learning methodologies for addressing challenges with healthcare data. We reviewed 89 papers meeting these criteria. Significant systemic issues were identified throughout the literature, compromising many methodologies reviewed. We give detailed recommendations to help improve methodology development for federated learning in healthcare.
Collapse
Affiliation(s)
- Fan Zhang
- Department of Applied Mathematics and Theoretical Physics, University of Cambridge, Cambridge, UK
| | - Daniel Kreuter
- Department of Applied Mathematics and Theoretical Physics, University of Cambridge, Cambridge, UK
| | - Yichen Chen
- Department of Applied Mathematics and Theoretical Physics, University of Cambridge, Cambridge, UK
| | - Sören Dittmer
- Department of Applied Mathematics and Theoretical Physics, University of Cambridge, Cambridge, UK
- ZeTeM, University of Bremen, Bremen, Germany
| | - Samuel Tull
- Department of Applied Mathematics and Theoretical Physics, University of Cambridge, Cambridge, UK
| | - Tolou Shadbahr
- Research Program in Systems Oncology, Faculty of Medicine, University of Helsinki, Helsinki, Finland
| | - Jacobus Preller
- Addenbrooke’s Hospital, Cambridge University Hospitals NHS Trust, Cambridge, UK
| | - James H.F. Rudd
- Department of Medicine, University of Cambridge, Cambridge, UK
| | - John A.D. Aston
- Department of Pure Mathematics and Mathematical Statistics, University of Cambridge, Cambridge, UK
| | - Carola-Bibiane Schönlieb
- Department of Applied Mathematics and Theoretical Physics, University of Cambridge, Cambridge, UK
| | | | - Michael Roberts
- Department of Applied Mathematics and Theoretical Physics, University of Cambridge, Cambridge, UK
- Department of Medicine, University of Cambridge, Cambridge, UK
| |
Collapse
|
47
|
Alami H, Lehoux P, Papoutsi C, Shaw SE, Fleet R, Fortin JP. Understanding the integration of artificial intelligence in healthcare organisations and systems through the NASSS framework: a qualitative study in a leading Canadian academic centre. BMC Health Serv Res 2024; 24:701. [PMID: 38831298 PMCID: PMC11149257 DOI: 10.1186/s12913-024-11112-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2023] [Accepted: 05/14/2024] [Indexed: 06/05/2024] Open
Abstract
BACKGROUND Artificial intelligence (AI) technologies are expected to "revolutionise" healthcare. However, despite their promises, their integration within healthcare organisations and systems remains limited. The objective of this study is to explore and understand the systemic challenges and implications of their integration in a leading Canadian academic hospital. METHODS Semi-structured interviews were conducted with 29 stakeholders concerned by the integration of a large set of AI technologies within the organisation (e.g., managers, clinicians, researchers, patients, technology providers). Data were collected and analysed using the Non-Adoption, Abandonment, Scale-up, Spread, Sustainability (NASSS) framework. RESULTS Among enabling factors and conditions, our findings highlight: a supportive organisational culture and leadership leading to a coherent organisational innovation narrative; mutual trust and transparent communication between senior management and frontline teams; the presence of champions, translators, and boundary spanners for AI able to build bridges and trust; and the capacity to attract technical and clinical talents and expertise. Constraints and barriers include: contrasting definitions of the value of AI technologies and ways to measure such value; lack of real-life and context-based evidence; varying patients' digital and health literacy capacities; misalignments between organisational dynamics, clinical and administrative processes, infrastructures, and AI technologies; lack of funding mechanisms covering the implementation, adaptation, and expertise required; challenges arising from practice change, new expertise development, and professional identities; lack of official professional, reimbursement, and insurance guidelines; lack of pre- and post-market approval legal and governance frameworks; diversity of the business and financing models for AI technologies; and misalignments between investors' priorities and the needs and expectations of healthcare organisations and systems. CONCLUSION Thanks to the multidimensional NASSS framework, this study provides original insights and a detailed learning base for analysing AI technologies in healthcare from a thorough socio-technical perspective. Our findings highlight the importance of considering the complexity characterising healthcare organisations and systems in current efforts to introduce AI technologies within clinical routines. This study adds to the existing literature and can inform decision-making towards a judicious, responsible, and sustainable integration of these technologies in healthcare organisations and systems.
Collapse
Affiliation(s)
- Hassane Alami
- Department of Health Management, Evaluation and Policy, School of Public Health, University of Montreal, P.O. Box 6128, Branch Centre-Ville, Montreal, QC, H3C 3J7, Canada.
- Center for Public Health Research of the University of Montreal, Montreal, QC, Canada.
- Institute for Data Valorization (IVADO), Montreal, QC, Canada.
- Nuffield Department of Primary Care Health Sciences, University of Oxford, Oxford, UK.
| | - Pascale Lehoux
- Department of Health Management, Evaluation and Policy, School of Public Health, University of Montreal, P.O. Box 6128, Branch Centre-Ville, Montreal, QC, H3C 3J7, Canada
- Center for Public Health Research of the University of Montreal, Montreal, QC, Canada
| | - Chrysanthi Papoutsi
- Nuffield Department of Primary Care Health Sciences, University of Oxford, Oxford, UK
| | - Sara E Shaw
- Nuffield Department of Primary Care Health Sciences, University of Oxford, Oxford, UK
| | - Richard Fleet
- Faculty of Medicine, Laval University, Quebec, QC, Canada
- VITAM Research Centre on Sustainable Health, Faculty of Medicine, Laval University, Quebec, QC, Canada
| | - Jean-Paul Fortin
- Faculty of Medicine, Laval University, Quebec, QC, Canada
- VITAM Research Centre on Sustainable Health, Faculty of Medicine, Laval University, Quebec, QC, Canada
| |
Collapse
|
48
|
Winder AJ, Stanley EA, Fiehler J, Forkert ND. Challenges and Potential of Artificial Intelligence in Neuroradiology. Clin Neuroradiol 2024; 34:293-305. [PMID: 38285239 DOI: 10.1007/s00062-024-01382-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2023] [Accepted: 01/03/2024] [Indexed: 01/30/2024]
Abstract
PURPOSE Artificial intelligence (AI) has emerged as a transformative force in medical research and is garnering increased attention in the public consciousness. This represents a critical time period in which medical researchers, healthcare providers, insurers, regulatory agencies, and patients are all developing and shaping their beliefs and policies regarding the use of AI in the healthcare sector. The successful deployment of AI will require support from all these groups. This commentary proposes that widespread support for medical AI must be driven by clear and transparent scientific reporting, beginning at the earliest stages of scientific research. METHODS A review of relevant guidelines and literature describing how scientific reporting plays a central role at key stages in the life cycle of an AI software product was conducted. To contextualize this principle within a specific medical domain, we discuss the current state of predictive tissue outcome modeling in acute ischemic stroke and the unique challenges presented therein. RESULTS AND CONCLUSION Translating AI methods from the research to the clinical domain is complicated by challenges related to model design and validation studies, medical product regulations, and healthcare providers' reservations regarding AI's efficacy and affordability. However, each of these limitations is also an opportunity for high-impact research that will help to accelerate the clinical adoption of state-of-the-art medical AI. In all cases, establishing and adhering to appropriate reporting standards is an important responsibility that is shared by all of the parties involved in the life cycle of a prospective AI software product.
Collapse
Affiliation(s)
- Anthony J Winder
- Department of Radiology, University of Calgary, Calgary, Canada.
- Hotchkiss Brain Institute, University of Calgary, Calgary, Canada.
| | - Emma Am Stanley
- Department of Radiology, University of Calgary, Calgary, Canada
- Hotchkiss Brain Institute, University of Calgary, Calgary, Canada
- Biomedical Engineering Graduate Program, University of Calgary, Calgary, Canada
- Alberta Children's Hospital Research Institute, University of Calgary, Calgary, Canada
| | - Jens Fiehler
- Department of Diagnostic and Interventional Neuroradiology, University Medical Center Hamburg-Eppendorf, Hamburg, Germany
| | - Nils D Forkert
- Department of Radiology, University of Calgary, Calgary, Canada
- Hotchkiss Brain Institute, University of Calgary, Calgary, Canada
- Alberta Children's Hospital Research Institute, University of Calgary, Calgary, Canada
- Department of Clinical Neuroscience, University of Calgary, Calgary, Canada
- Department of Electrical and Software Engineering, University of Calgary, Calgary, Canada
| |
Collapse
|
49
|
Harkness R, Frangi AF, Zucker K, Ravikumar N. Multi-centre benchmarking of deep learning models for COVID-19 detection in chest x-rays. FRONTIERS IN RADIOLOGY 2024; 4:1386906. [PMID: 38836218 PMCID: PMC11148230 DOI: 10.3389/fradi.2024.1386906] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/16/2024] [Accepted: 05/07/2024] [Indexed: 06/06/2024]
Abstract
Introduction This study is a retrospective evaluation of the performance of deep learning models that were developed for the detection of COVID-19 from chest x-rays, undertaken with the goal of assessing the suitability of such systems as clinical decision support tools. Methods Models were trained on the National COVID-19 Chest Imaging Database (NCCID), a UK-wide multi-centre dataset from 26 different NHS hospitals and evaluated on independent multi-national clinical datasets. The evaluation considers clinical and technical contributors to model error and potential model bias. Model predictions are examined for spurious feature correlations using techniques for explainable prediction. Results Models performed adequately on NHS populations, with performance comparable to radiologists, but generalised poorly to international populations. Models performed better in males than females, and performance varied across age groups. Alarmingly, models routinely failed when applied to complex clinical cases with confounding pathologies and when applied to radiologist defined "mild" cases. Discussion This comprehensive benchmarking study examines the pitfalls in current practices that have led to impractical model development. Key findings highlight the need for clinician involvement at all stages of model development, from data curation and label definition, to model evaluation, to ensure that all clinical factors and disease features are appropriately considered during model design. This is imperative to ensure automated approaches developed for disease detection are fit-for-purpose in a clinical setting.
Collapse
Affiliation(s)
- Rachael Harkness
- School of Computing, University of Leeds, Leeds, United Kingdom
- Centre for Computational Imaging and Simulation Technologies in Biomedicine, Leeds, United Kingdom
| | - Alejandro F Frangi
- Division of Informatics, Imaging, and Data Sciences, School of Health Sciences, University of Manchester, Manchester, United Kingdom
- Department of Computer Science, School of Engineering, University of Manchester, Manchester, United Kingdom
| | - Kieran Zucker
- Leeds Institute of Medical Research, School of Medicine, University of Leeds, Leeds, United Kingdom
| | - Nishant Ravikumar
- School of Computing, University of Leeds, Leeds, United Kingdom
- Centre for Computational Imaging and Simulation Technologies in Biomedicine, Leeds, United Kingdom
| |
Collapse
|
50
|
Steinert S, Ruf V, Dzsotjan D, Großmann N, Schmidt A, Kuhn J, Küchemann S. A refined approach for evaluating small datasets via binary classification using machine learning. PLoS One 2024; 19:e0301276. [PMID: 38771767 PMCID: PMC11108166 DOI: 10.1371/journal.pone.0301276] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2023] [Accepted: 03/13/2024] [Indexed: 05/23/2024] Open
Abstract
Classical statistical analysis of data can be complemented or replaced with data analysis based on machine learning. However, in certain disciplines, such as education research, studies are frequently limited to small datasets, which raises several questions regarding biases and coincidentally positive results. In this study, we present a refined approach for evaluating the performance of a binary classification based on machine learning for small datasets. The approach includes a non-parametric permutation test as a method to quantify the probability of the results generalising to new data. Furthermore, we found that a repeated nested cross-validation is almost free of biases and yields reliable results that are only slightly dependent on chance. Considering the advantages of several evaluation metrics, we suggest a combination of more than one metric to train and evaluate machine learning classifiers. In the specific case that both classes are equally important, the Matthews correlation coefficient exhibits the lowest bias and chance for coincidentally good results. The results indicate that it is essential to avoid several biases when analysing small datasets using machine learning.
Collapse
Affiliation(s)
- Steffen Steinert
- Chair of Physics Education, Ludwig-Maximilians-Universität München (LMU Munich), Munich, Germany
- Department of Electrical and Computer Engineering, RPTU Kaiserslautern-Landau, Germany
| | - Verena Ruf
- Chair of Physics Education, Ludwig-Maximilians-Universität München (LMU Munich), Munich, Germany
| | - David Dzsotjan
- Chair of Physics Education, Ludwig-Maximilians-Universität München (LMU Munich), Munich, Germany
| | - Nicolas Großmann
- Smart Data & Knowledge Services, German Research Center for Artificial Intelligence, Kaiserslautern, Germany
| | - Albrecht Schmidt
- Human-Centered Ubiquitous Media, Ludwig-Maximilians-Universität München (LMU Munich), Munich, Germany
| | - Jochen Kuhn
- Chair of Physics Education, Ludwig-Maximilians-Universität München (LMU Munich), Munich, Germany
| | - Stefan Küchemann
- Chair of Physics Education, Ludwig-Maximilians-Universität München (LMU Munich), Munich, Germany
| |
Collapse
|