Comparison of ChatGPT-3.5 and GPT-4 as potential tools in artificial intelligence-assisted clinical practice in renal and liver transplantation

Advanced Search

BPG is committed to discovery and dissemination of knowledge

Home / Archive / Volume 15, Issue 3

This Article

Academic Content and Language Evaluation of This Article

CrossCheck and Google Search of This Article

Academic Rules and Norms of This Article

Supplementary Materials of This Article

Citation of this article

Corresponding Author of This Article

Research Domain of This Article

Article-Type of This Article

Open-Access Policy of This Article

Times Cited Counts in Google of This Article

Number of Hits and Downloads for This Article

All Articles published online

Item

Count

PDF

HTML

207

Tables (1-7)

Sum=263

Publishing Process of This Article

Item

Count

Browse

Download

Sum=67

Sep 18, 2025 (publication date) through Apr 27, 2025

Times Cited of This Article

Journal Information of This Article

Publication Name

World Journal of Transplantation

ISSN

2220-3230

Publisher of This Article

Baishideng Publishing Group Inc, 7041 Koll Center Parkway, Suite 160, Pleasanton, CA 94566, USA

Observational Study

World J Transplant. Sep 18, 2025; 15(3): 103536
Published online Sep 18, 2025. doi: 10.5500/wjt.v15.i3.103536

Table 7 Aggregated performance of ChatGPT and GPT-4 in clinical scenarios across published and unpublished cases, categorized by task type, n (%)

Type of task	Overall chatGPT agreement level	Overall GPT-4 agreement level	chatGPT renal transplantation agreement level	GPT-4 renal transplantation agreement level	chatGPT liver transplantation agreement level	GPT-4 liver transplantation agreement level
DD that includes final diagnosis	A: 22/30 (73.3)	A: 27/30 (90)	A: 13/16 (81.3)	A: 15/16 (93.8)	A: 9/14 (64.3)	A: 12/14 (85.7)
DD that includes final diagnosis	PA: 1/30 (3.33)	PA: 1/30 (3.3)	PA: 1/16 (6.3)	PA: 1/16 (6.2)	PA: 0/14 (0)	PA: 0/14 (0)
Final diagnosis prediction	A: 11/31 (35.5)	A: 20/31 (64.5)	A: 7/17 (41.2)	A: 13/17 (76.5)	A: 4/14 (28.6)	A: 7/14 (50)
Final diagnosis prediction	PA: 2/31 (6.45)	PA: 2/31 (6.5)	PA: 1/17 (5.9)	PA: 1/17 (5.9)	PA: 1/14 (7.1)	PA: 1/14 (7.1)
Appropriate next diagnostic test	A: 8/19 (42.1)	A: 15/19 (78.9)	A: 6/13 (46.2)	A: 11/13 (84.6)	A: 2/6 (33.6)	A: 4/6 (66.7)
Appropriate next diagnostic test	PA: 8/19 (42.1)	PA: 2/19 (10.5)	PA: 5/13 (38.5)	PA: 1/13 (7.7)	PA: 3/6 (50)	PA: 1/6 (16.7)
Appropriate treatment	A: 11/21 (52.4)	A: 15/21 (71.4)	A: 5/8 (62.5)	A: 7/8 (87.5)	A: 6/13 (46.2)	A: 4/6 (66.7)
Appropriate treatment	PA: 9/21 (42.9)	PA: 4/21 (19)	PA: 2/8(25%)	PA: 1/8 (12.5)	PA: 7/13 (53.8)	PA: 1/6 (16.7)
Prediction of prognosis	A: 3/5 (60)	A: 5/5 (100)	A: 1/1 (100%)	A: 1/1 (100)	A: 2/4 (50)	A: 4/4 (100)
Prediction of prognosis	PA: 1/5 (20)	PA: 0/5 (0)	PA: 0/0 (0%)	PA: 0/0 (0)	PA: 1/4 (25)	PA: 0/4 (0)

DD: Differential diagnosis; A: Agreement; PA: Partial agreement.

Citation: Christou CD, Sitsiani O, Boutos P, Katsanos G, Papadakis G, Tefas A, Papalois V, Tsoulfas G. Comparison of ChatGPT-3.5 and GPT-4 as potential tools in artificial intelligence-assisted clinical practice in renal and liver transplantation. World J Transplant 2025; 15(3): 103536
URL: https://www.wjgnet.com/2220-3230/full/v15/i3/103536.htm
DOI: https://dx.doi.org/10.5500/wjt.v15.i3.103536