ChatGPT ve Gemini Modellerinin Ortodonti Alan�ndaki T�rk�e Yeterlili�i: Hasta Sorular�na Verilen Yan�tlar�n Do�ruluk, Eksiksizlik ve Okunabilirlik De�erlendirilmesi

Bozta� Demir, Gizem; G�rg�l�, Serkan

pdf

Cilt : 21 Say� : 3

22/1Son Say� Ar�iv En �ok �ndirilen Makaleler Online Makale G�nder

YAZAR KATKI FORMU

�IKAR �ATI�MASI BEYAN FORMU

YAYIN HAKKI DEV�R FORMU

ChatGPT ve Gemini Modellerinin Ortodonti Alan�ndaki T�rk�e Yeterlili�i: Hasta Sorular�na Verilen Yan�tlar�n Do�ruluk, Eksiksizlik ve Okunabilirlik De�erlendirilmesi [Yeditepe J Dent]

Yeditepe J Dent. 2025; 21(3): 151-158 | DOI: 10.5505/yeditepe.2025.15010

ChatGPT ve Gemini Modellerinin Ortodonti Alan�ndaki T�rk�e Yeterlili�i: Hasta Sorular�na Verilen Yan�tlar�n Do�ruluk, Eksiksizlik ve Okunabilirlik De�erlendirilmesi

Gizem Bozta� Demir, Serkan G�rg�l�
Sa�l�k Bilimleri �niversitesi, G�lhane Di� Hekimli�i Fak�ltesi, Ortodonti Anabilim Dal�, Ankara

G�R�� ve AMA�: Bu �al��ma, ChatGPT ve Gemini geni� dil modellerinin ortodontik tedaviye y�nelik s�k�a sorulan sorulara verdikleri yan�tlar�n eksiksizlik, do�ruluk ve okunabilirlik d�zeylerini kar��la�t�rmay� ama�lamaktad�r.
Y�NTEM ve GERE�LER: Sorular, genel, tedavi ile ilgili ve bak�m ve hijyen olmak �zere �� kategoriye ayr�lm��t�r. ChatGPT ve Gemini modellerinden elde edilen yan�tlar, ��l� Likert �l�e�i ile eksiksizlik, alt�l� Likert �l�e�i ile do�ruluk a��s�ndan de�erlendirilmi�; okunabilirlik ise T�rk�e�ye uyarlanm�� Ate�man Okunabilirlik Form�l� kullan�larak analiz edilmi�tir. �statistiksel analizler, a��k kaynakl� Jamovi yaz�l�m� (The Jamovi Project 2022, s�r�m 2.3.21.0) ile ger�ekle�tirilmi�tir.
BULGULAR: ChatGPT, eksiksizlik a��s�ndan t�m sorularda (p=0,042) ve tedavi ile ilgili sorularda (p=0,037) Gemini�ye g�re istatistiksel olarak anlaml� d�zeyde �st�n performans g�stermi�tir. Ancak, do�ruluk a��s�ndan iki model aras�nda anlaml� bir fark tespit edilmemi�tir. Okunabilirlik a��s�ndan ise Gemini, ChatGPT�ye k�yasla t�m sorular (p=0,001), genel kategori (p=0,013) ve bak�m ve temizlik kategorisi (p=0,01) de�erlendirmelerinde istatistiksel olarak anlaml� derecede daha y�ksek skorlar elde etmi�tir.
TARTI�MA ve SONU�: Bu �al��ma, ChatGPT�nin eksiksizlik a��s�ndan �st�n performans sergiledi�ini, ancak Gemini�nin okunabilirlik d�zeyinde daha iyi sonu�lar verdi�ini g�stermi�tir. Her iki model de do�ruluk a��s�ndan yeterli performans sergilemi� olup, T�rk�e dilinde hasta bilgilendirme s�re�lerinde potansiyel ara�lar olarak de�erlendirilmektedir. Bununla birlikte, modellerin eksiksizlik ve okunabilirlik aras�nda bir denge sa�layacak �ekilde geli�tirilmesi, hasta ileti�iminin etkinli�ini art�rmak i�in �nemlidir.

Anahtar Kelimeler: Ortodonti, geni� dil modelleri, jeneratif yapay zeka.

The Turkish Proficiency of ChatGPT and Gemini in Orthodontics: An Evaluation of the Accuracy, Completeness, and Readability of Responses to Patient Questions

Gizem Bozta� Demir, Serkan G�rg�l�
Department of Orthodontics, Gulhane Faculty of Dental Medicine, University of Health Sciences, Ankara, T�rkiye

INTRODUCTION: This study aimed to compare the performance of ChatGPT and Gemini large language models (LLMs) in answering frequently asked questions regarding orthodontic treatment. The evaluation was based on the completeness, accuracy, and readability of the responses in Turkish.
METHODS: Frequently asked questions related to orthodontic treatment were categorized into general, treatment- related, and care and hygiene groups. Responses from ChatGPT and Gemini models were assessed for completeness using a three-point Likert scale, accuracy using a six-point Likert scale, and readability using the Turkish-adapted Ate�man Readability Formula. Statistical analyses were conducted using open-source Jamovi software (The Jamovi Project 2022, version 2.3.21.0, www.jamovi.org).
RESULTS: ChatGPT demonstrated statistically significant superiority over Gemini in completeness across all questions (p=0,042) and treatment-related questions (p=0,037). However, no statistically significant difference was observed between the two models in terms of accuracy. In terms of readability, Gemini achieved significantly higher scores compared to ChatGPT across all questions (p=0,001), the general category (p=0,013), and the care and hygiene category (p=0,01), indicating responses that were easier to understand.
DISCUSSION AND CONCLUSION: This study revealed that ChatGPT outperformed Gemini in terms of completeness, particularly for all questions and treatment-related questions, while both models performed similarly in accuracy. On the other hand, Gemini provided responses with higher readability, making them more accessible for patients. Both models hold promise as patient information tools in orthodontics, but achieving a balance between completeness and readability remains essential for enhancing their effectiveness.

Keywords: Orthodontics, large language models, generative artificial intelligence.

Sorumlu Yazar: Gizem Bozta� Demir, T�rkiye
Makale Dili: T�rk�e

ATIF KOPYALA

Tam Metin PDF At�f dosyas� indir RIS EndNote BibTex Medlars Procite Reference Manager Yazara e-posta g�nder Benzer makaleler PubMed Google Scholar

H�zl� Arama

ChatGPT ve Gemini Modellerinin Ortodonti Alan�ndaki T�rk�e Yeterlili�i: Hasta Sorular�na Verilen Yan�tlar�n Do�ruluk, Eksiksizlik ve Okunabilirlik De�erlendirilmesi

The Turkish Proficiency of ChatGPT and Gemini in Orthodontics: An Evaluation of the Accuracy, Completeness, and Readability of Responses to Patient Questions