Letter to the Editor
Copyright ©The Author(s) 2024. Published by Baishideng Publishing Group Inc. All rights reserved.
World J Gastroenterol. Aug 7, 2024; 30(29): 3538-3540
Published online Aug 7, 2024. doi: 10.3748/wjg.v30.i29.3538
Evaluating the role of large language models in inflammatory bowel disease patient information
Eun Jeong Gong, Chang Seok Bang
Eun Jeong Gong, Chang Seok Bang, Department of Internal Medicine, Hallym University College of Medicine, Chuncheon 24253, Gangwon-do, South Korea
Author contributions: Gong EJ and Bang CS contributed to conceptualization, methodology, investigation and wrote the original draft; Bang CS reviewed and edited the draft, and contributed to supervision; All authors have read and agreed to the published version of the manuscript.
Conflict-of-interest statement: The authors declare that they have no conflict of interest.
Open-Access: This article is an open-access article that was selected by an in-house editor and fully peer-reviewed by external reviewers. It is distributed in accordance with the Creative Commons Attribution NonCommercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited and the use is non-commercial. See: https://creativecommons.org/Licenses/by-nc/4.0/
Corresponding author: Chang Seok Bang, MD, PhD, Associate Professor, Doctor, Department of Internal Medicine, Hallym University College of Medicine, Sakju-ro 77, Chuncheon 24253, Gangwon-do, South Korea. csbang@hallym.ac.kr
Received: May 27, 2024
Revised: July 15, 2024
Accepted: July 22, 2024
Published online: August 7, 2024
Processing time: 62 Days and 24 Hours
Core Tip

Core Tip: This commentary evaluates the article by Gravina et al on ChatGPT’s potential in providing medical information for inflammatory bowel disease patients. While promising, it highlights the need for advanced techniques like reasoning + action and retrieval-augmented generation to improve accuracy, emphasizing that simple question-and-answer testing is insufficient for evaluating large language models’ true capabilities.