Artificial Intelligence in Emergency Response: Which Model Offers More Reliable Solutions?

Authors

  • Özcan Yıldız Isparta City Hospital

DOI:

https://doi.org/10.5281/zenodo.17038499

Keywords:

Gemini, ChatGPT, Readibility

Abstract

Background: The increasing integration of artificial intelligence (AI) and large language models (LLMs) into medical education highlights the need to evaluate their performance in specialized fields like emergency medicine. This study addresses a gap in the literature by comparing the performance of ChatGPT and GEMINI models in a clinical context.

Objective: This study aimed to compare the readability and interobserver consistency of texts generated by Open AI (GPT) and Google GEMINI in response to emergency medicine cardiovascular questions.

Materials and Methods: Text samples were generated by both models (n=20 per model) in response to a predefined set of questions. The interobserver consistency was analyzed by two independent observers using an Intraclass Correlation Coefficient (ICC). Readability was assessed using multiple quantitative formulas, including Flesch Reading Ease, Fog Scale, and Flesch-Kincaid Grade Level, and the results were statistically compared.

Results: The interobserver correlation for the GEMINI group (ICC = 0.636) was notably higher than for the GPT group (ICC = 0.048), indicating better consistency in GEMINI's output, although neither correlation was statistically significant. Readability analysis revealed that GEMINI texts were significantly more readable than GPT texts across most formulas (e.g., Flesch Reading Ease: GEMINI M=57.00 vs. GPT M=47.00, p<0.001). This suggests GEMINI's output is characterized by simpler sentence structures and more accessible language.

Conclusion: The findings suggest that the GEMINI model outperforms GPT in both interobserver consistency and text readability. These results have important implications for the selection of AI tools in medical education and clinical practice, where the clarity and reliability of information are critical.

References

1. Yau JY, Saadat S, Hsu E, Murphy LS, Roh JS, Suchard J, Tapia A, Wiechmann W, Langdorf MI. Accuracy of Prospective Assessments of 4 Large Language Model Chatbot Responses to Patient Questions About Emergency Care: Experimental Comparative Study. J Med Internet Res. 2024 Nov 4;26:e60291. doi: 10.2196/60291. PMID: 39496149; PMCID: PMC11574488.

2. Carlson JA, Cheng RZ, Lange A, Nagalakshmi N, Rabets J, Shah T, Sindhwani P. Accuracy and Readability of Artificial Intelligence Chatbot Responses to Vasectomy-Related Questions: Public Beware. Cureus. 2024 Aug 28;16(8):e67996. doi: 10.7759/cureus.67996. PMID: 39347335; PMCID: PMC11427961.

3. Mani Z, Albagawi B. AI frontiers in emergency care: the next evolution of nursing interventions. Front Public Health. 2024 Dec 11;12:1439412. doi: 10.3389/fpubh.2024.1439412. PMID: 39722724; PMCID: PMC11669251.

4. Huo B, Boyle A, Marfo N, Tangamornsuksan W, Steen JP, McKechnie T, Lee Y, Mayol J, Antoniou SA, Thirunavukarasu AJ, Sanger S, Ramji K, Guyatt G. Large Language Models for Chatbot Health Advice Studies: A Systematic Review. JAMA Netw Open. 2025 Feb 3;8(2):e2457879. doi: 10.1001/jamanetworkopen.2024.57879. Erratum in: JAMA Netw Open. 2025 Mar 3;8(3):e256511. doi: 10.1001/jamanetworkopen.2025.6511. PMID: 39903463; PMCID: PMC11795331.

5. De Simone B, Abu-Zidan FM, Gumbs AA, Chouillard E, Di Saverio S, Sartelli M, Coccolini F, Ansaloni L, Collins T, Kluger Y, Moore EE, Litvin A, Leppaniemi A, Mascagni P, Milone L, Piccoli M, Abu-Hilal M, Sugrue M, Biffl WL, Catena F. Knowledge, attitude, and practice of artificial intelligence in emergency and trauma surgery, the ARIES project: an international web-based survey. World J Emerg Surg. 2022 Feb 10;17(1):10. doi: 10.1186/s13017-022-00413-3. PMID: 35144645; PMCID: PMC8832812.

6. Moparthi KP, Javed H, Kumari M, Pavani P, Paladini A, Saleem A, Ram R, Varrassi G. Acute Care Surgery: Navigating Recent Developments, Protocols, and Challenges in the Comprehensive Management of Surgical Emergencies. Cureus. 2024 Jan 14;16(1):e52269. doi: 10.7759/cureus.52269. PMID: 38352101; PMCID: PMC10864012.

7. Miller J, McNaughton C, Joyce K, Binz S, Levy P. Hypertension Management in Emergency Departments. Am J Hypertens. 2020 Oct 21;33(10):927-934. doi: 10.1093/ajh/hpaa068. PMID: 32307541; PMCID: PMC7577644.

8. Chen E, Prakash S, Janapa Reddi V, Kim D, Rajpurkar P. A framework for integrating artificial intelligence for clinical care with continuous therapeutic monitoring. Nat Biomed Eng. 2025 Apr;9(4):445-454. doi: 10.1038/s41551-023-01115-0. Epub 2023 Nov 6. PMID: 37932379.

9. Khan M, Ahuja K, Tsirikos AI. AI and machine learning in paediatric spine deformity surgery. Bone Jt Open. 2025 May 23;6(5):569-581. doi: 10.1302/2633-1462.65.BJO-2024-0089.R1. PMID: 40407025; PMCID: PMC12100669.

10. Kalanjiyam GP, Chandramohan T, Raman M, Kalyanasundaram H. Artificial intelligence: a new cutting-edge tool in spine surgery. Asian Spine J. 2024 Jun;18(3):458-471. doi: 10.31616/asj.2023.0382. Epub 2024 Jun 25. PMID: 38917854; PMCID: PMC11222879.

11. Liang F, Yang X, Peng W, Zhen S, Cao W, Li Q, Xiao Z, Gong M, Wang Y, Gu D. Applications of digital health approaches for cardiometabolic diseases prevention and management in the Western Pacific region. Lancet Reg Health West Pac. 2023 Dec 1;43:100817. doi: 10.1016/j.lanwpc.2023.100817. PMID: 38456090; PMCID: PMC10920052.

12. Aerts A, Bogdan-Martin D. Leveraging data and AI to deliver on the promise of digital health. Int J Med Inform. 2021 Jun;150:104456. doi: 10.1016/j.ijmedinf.2021.104456. Epub 2021 Apr 10. PMID: 33866232.

13. Kachman MM, Brennan I, Oskvarek JJ, Waseem T, Pines JM. How artificial intelligence could transform emergency care. Am J Emerg Med. 2024 Jul;81:40-46. doi: 10.1016/j.ajem.2024.04.024. Epub 2024 Apr 16. PMID: 38663302.

14. Seo JW, Park SJ, Kim YJ, Kim JY, Kim KG, Yoon YH. Artificial intelligence for severity triage based on conversations in an emergency department in Korea. Sci Rep. 2025 May 15;15(1):16870. doi: 10.1038/s41598-025-99874-0. PMID: 40374942; PMCID: PMC12081756.

15. Da'Costa A, Teke J, Origbo JE, Osonuga A, Egbon E, Olawade DB. AI-driven triage in emergency departments: A review of benefits, challenges, and future directions. Int J Med Inform. 2025 May;197:105838. doi: 10.1016/j.ijmedinf.2025.105838. Epub 2025 Feb 15. PMID: 39965433.,

16. Da'Costa A, Teke J, Origbo JE, Osonuga A, Egbon E, Olawade DB. AI-driven triage in emergency departments: A review of benefits, challenges, and future directions. Int J Med Inform. 2025 May;197:105838. doi: 10.1016/j.ijmedinf.2025.105838. Epub 2025 Feb 15. PMID: 39965433.

17. Peng HT, Siddiqui MM, Rhind SG, Zhang J, da Luz LT, Beckett A. Artificial intelligence and machine learning for hemorrhagic trauma care. Mil Med Res. 2023 Feb 16;10(1):6. doi: 10.1186/s40779-023-00444-0. PMID: 36793066; PMCID: PMC9933281.

18. Almulihi QA, Alquraini AA, Almulihi FAA, Alzahid AA, Al Qahtani SSAJ, Almulhim M, Alqhtani SHS, Alnafea FMN, Mushni SAS, Alaqil NA, Assiri MIF, Maghraby NH. Applications of Artificial Intelligence and Machine Learning in Emergency Medicine Triage - A Systematic Review. Med Arch. 2024;78(3):198-206. doi: 10.5455/medarh.2024.78.198-206. PMID: 39944197; PMCID: PMC11813208.

19. Kang DY, Cho KJ, Kwon O, Kwon JM, Jeon KH, Park H, Lee Y, Park J, Oh BH. Artificial intelligence algorithm to predict the need for critical care in prehospital emergency medical services. Scand J Trauma Resusc Emerg Med. 2020 Mar 4;28(1):17. doi: 10.1186/s13049-020-0713-4. PMID: 32131867; PMCID: PMC7057604.

20. Masoumian Hosseini M, Masoumian Hosseini ST, Qayumi K, Ahmady S, Koohestani HR. The Aspects of Running Artificial Intelligence in Emergency Care; a Scoping Review. Arch Acad Emerg Med. 2023 May 11;11(1):e38. doi: 10.22037/aaem.v11i1.1974. PMID: 37215232; PMCID: PMC10197918.

21. Tyler S, Olis M, Aust N, Patel L, Simon L, Triantafyllidis C, Patel V, Lee DW, Ginsberg B, Ahmad H, Jacobs RJ. Use of Artificial Intelligence in Triage in Hospital Emergency Departments: A Scoping Review. Cureus. 2024 May 8;16(5):e59906. doi: 10.7759/cureus.59906. PMID: 38854295; PMCID: PMC11158416.

22. Rodrigues Alessi M, Gomes HA, Lopes de Castro M, Terumy Okamoto C. Performance of ChatGPT in Solving Questions From the Progress Test (Brazilian National Medical Exam): A Potential Artificial Intelligence Tool in Medical Practice. Cureus. 2024 Jul 19;16(7):e64924. doi: 10.7759/cureus.64924. PMID: 39156244; PMCID: PMC11330648.

23. Ventura CAI, Denton EE, David JA. Artificial Intelligence in Emergency Trauma Care: A Preliminary Scoping Review. Med Devices (Auckl). 2024 May 23;17:191-211. doi: 10.2147/MDER.S467146. PMID: 38803707; PMCID: PMC11129754.

Published

2025-09-03

How to Cite

Yıldız, Özcan. (2025). Artificial Intelligence in Emergency Response: Which Model Offers More Reliable Solutions?. Acta Medica Young Doctors, 1(1). https://doi.org/10.5281/zenodo.17038499

Similar Articles

You may also start an advanced similarity search for this article.