Copyright
©The Author(s) 2024.
World J Methodol. Dec 20, 2024; 14(4): 92802
Published online Dec 20, 2024. doi: 10.5662/wjm.v14.i4.92802
Published online Dec 20, 2024. doi: 10.5662/wjm.v14.i4.92802
Table 1 Total count and percentage of 'Yes' responses for each Large Language Model
System | Neutral | No | Yes | Total |
GPT 4 | 1 | 74 | 387 (83.77) | 462 |
GPT 3.5 | 5 | 101 | 356 (77.06) | 462 |
Bard | 129 | 80 | 253 (54.76) | 462 |
Table 2 Weighted accuracy comparison across the Large Language Model
Model | Weighted accuracy |
ChatGPT 4 | 0.6775 |
ChatGPT 3.5 | 0.5519 |
Bard | 0.3745 |
Table 3 Disease accuracy comparison
Name of disease | ChatGPT 4 | ChatGPT 3.5 | Bard |
Acromegaly | 1.0 | 1.0 | 1.0 |
Orthostatic hypotension | 1.0 | 1.0 | 0.0 |
Myasthenia gravis | 1.0 | 1.0 | 0.5 |
Myoclonus | 1.0 | 1.0 | 1.0 |
Myotonic dystrophy | 1.0 | -1.0 | 1.0 |
Neonatal onset multisystem inflammatory disease | 1.0 | 1.0 | 1.0 |
Neoplastic spinal cord compression | 1.0 | 1.0 | 1.0 |
Nephrolithiasis | 1.0 | 1.0 | 1.0 |
Neurological infections | 1.0 | 1.0 | 0.0 |
Neuromyelitis optica | 1.0 | 0.0 | 1.0 |
Thiamine deficiency | -1.0 | 1.0 | 1.0 |
Anaphylactic reaction | -1.0 | -1.0 | 0.0 |
Reactive arthritis | -1.0 | -1.0 | 0.0 |
Fibrous dysplasia | -1.0 | -1.0 | 1.0 |
Hypothyroidism | -1.0 | -1.0 | -1.0 |
Multiple sclerosis | -1.0 | -1.0 | -1.0 |
Hypophosphatemia | -1.0 | -1.0 | 1.0 |
Hypomagnesemia | -1.0 | -1.0 | 0.0 |
Alcohol intoxication | -1.0 | -1.0 | 0.0 |
Post-concussive state | -1.0 | -1.0 | 0.0 |
Table 4 Detailed accuracy values for each organ system across the three Large Language Model
Organ system | ChatGPT 4 | ChatGPT 3.5 | Bard |
Cardio vascular system, respiratory system | 1.0000 | 0.6667 | 0.6667 |
Hematology | 1.0000 | 1.0000 | -1.0000 |
Respiratory | 1.0000 | 0.3333 | 0.3333 |
Respiratory system | 1.0000 | 1.0000 | 0.5000 |
Infectious diseases | 0.8039 | 0.7451 | 0.2059 |
Immune system | 0.6752 | 0.4188 | 0.2650 |
Central nervous system | 0.6585 | 0.6220 | 0.5610 |
Hematological malignancies | 0.6429 | 0.5714 | 0.4286 |
Cardio vascular system | 0.6000 | 0.6667 | 0.3333 |
Endocrine system | 0.5556 | 0.4444 | 0.5714 |
Renal | 0.5556 | 0.3704 | 0.5185 |
Gastrointestinal tract | 0.5385 | 0.2308 | 0.2308 |
- Citation: Ramasubramanian S, Balaji S, Kannan T, Jeyaraman N, Sharma S, Migliorini F, Balasubramaniam S, Jeyaraman M. Comparative evaluation of artificial intelligence systems' accuracy in providing medical drug dosages: A methodological study. World J Methodol 2024; 14(4): 92802
- URL: https://www.wjgnet.com/2222-0682/full/v14/i4/92802.htm
- DOI: https://dx.doi.org/10.5662/wjm.v14.i4.92802