Comparative Performance of the Leading Large Language Models in Answering Complex Rhinoplasty Consultation Questions.

Facial plastic surgery & aesthetic medicine 2025 Vol.27(4) p. 378-383

Goshtasbi K, Best C, Powers B, Ching H, Pastorek NJ, Altman D, Adamson P, Krugman M, Wong BJF

관련 도메인

Abstract

Various large language models (LLMs) can provide human-level medical discussions, but they have not been compared regarding rhinoplasty knowledge. To compare the leading LLMs in answering complex rhinoplasty consultation questions as evaluated by plastic surgeons. Ten open-ended rhinoplasty consultation questions were presented to ChatGPT-4o, Google Gemini, Claude, and Meta-AI LLMs. The responses were randomized and ranked by seven rhinoplasty-specializing plastic surgeons (1 = worst, 4 = best) considering their quality. Textual readability was analyzed via Flesch Reading Ease (FRE) and Flesch-Kincaid Grade (FKG). Claude provided the top answers for seven questions while ChatGPT provided the top answers for three questions. In overall collective scoring, Claude provided the best answers with 224 points, followed by ChatGPT's 200, Meta's 138, and Gemini's 138 scores. Claude (mean score/question 3.20 ± 1.00) significantly outperformed all the other models ( < 0.05), while ChatGPT (mean score/question 2.86 ± 0.94) outperformed Meta and Gemini. Meta and Gemini performed similarly. Meta had a significantly lower FKG than Claude and ChatGPT and a significantly lower FRE than ChatGPT. According to ratings by seven rhinoplasty-specializing surgeons, Claude provided the best answers for a set of complex rhinoplasty consultation questions, followed by ChatGPT. Future studies are warranted to continue comparing these models as they evolve.

추출된 의학 개체 (NER)

유형영어 표현한국어 / 풀이UMLS CUI출처등장
시술 rhinoplasty 코성형술 dict 7
약물 Meta-AI LLMs. scispacy 1
약물 ChatGPT scispacy 1
약물 Gemini scispacy 1
질환 Language scispacy 1
기타 Gemini scispacy 1

MeSH Terms

Humans; Rhinoplasty; Language; Referral and Consultation; Comprehension; Large Language Models

🔗 함께 등장하는 도메인

이 논문이 속한 카테고리와 같은 논문에서 자주 함께 다뤄지는 카테고리들

관련 논문