Performance of ChatGPT-4 on the French Board of Plastic Reconstructive and Aesthetic Surgery written exam: a descriptive study.

Emma Dejean-Bouyer; Anoujat Kanlagna; François Thuau; Pierre Perrot; Ugo Lancien

doi:10.3352/jeehp.2025.22.27

← 뒤로

Performance of ChatGPT-4 on the French Board of Plastic Reconstructive and Aesthetic Surgery written exam: a descriptive study.

Journal of educational evaluation for health professions 2025 Vol.22() p. 27 🔓 OA Artificial Intelligence in Healthcar

TL;DR ChatGPT-4 performs satisfactorily on the French Board of Plastic, Reconstructive, and Aesthetic Surgery written examination and meets the minimum passing standards for the exam, while responses generally align with expected knowledge.

OpenAlex 토픽 · Artificial Intelligence in Healthcare and Education Meta-analysis and systematic reviews Cardiac, Anesthesia and Surgical Outcomes

Dejean-Bouyer E, Kanlagna A, Thuau F, Perrot P, Lancien U

🔓 OA 전문 ↗ 원문 ↗ DOI ↗ BibTeX ↓ RIS ↓

📝 환자 설명용 한 줄

【연구 목적】 본 연구는 생성형 인공지능 모델인 ChatGPT-4가 프랑스 성형·재건·미용외과 전문의 자격 필기시험에서 수행하는 능력을 평가하고, 이를 의학도들의 학습 보조 자료로서의 유용성을 검증하는 것을 목적으로 한다.

이 논문을 인용하기

BibTeX ↓ RIS ↓

APA Emma Dejean-Bouyer, Anoujat Kanlagna, et al. (2025). Performance of ChatGPT-4 on the French Board of Plastic Reconstructive and Aesthetic Surgery written exam: a descriptive study.. Journal of educational evaluation for health professions, 22, 27. https://doi.org/10.3352/jeehp.2025.22.27

MLA Emma Dejean-Bouyer, et al.. "Performance of ChatGPT-4 on the French Board of Plastic Reconstructive and Aesthetic Surgery written exam: a descriptive study.." Journal of educational evaluation for health professions, vol. 22, 2025, pp. 27.

PMID 41022586

DOI 10.3352/jeehp.2025.22.27

Abstract

[PURPOSE] This study aims to evaluate the performance of Chat Generative Pre-Trained Transformer 4 (ChatGPT-4) on the French Board of Plastic, Reconstructive, and Aesthetic Surgery written examination and to assess its role as a supplementary resource in helping medical students prepare for the qualification examination in plastic surgery.

[METHODS] This descriptive study evaluated ChatGPT-4's performance on 213 items from the October 2024 French Board of Plastic, Reconstructive, and Aesthetic Surgery written examination. Responses were assessed for accuracy, logical reasoning, internal and external information use, and were categorized for fallacies by independent reviewers. Statistical analyses included chi-square tests and Fisher's exact test for significance.

[RESULTS] ChatGPT-4 answered all questions across the 10 modules, achieving an overall accuracy rate of 77.5%. The model applied logical reasoning in 98.1% of the questions, utilized internal information in 94.4%, and incorporated external information in 91.1%.

[CONCLUSION] ChatGPT-4 performs satisfactorily on the French Board of Plastic, Reconstructive, and Aesthetic Surgery written examination. Its accuracy met the minimum passing standards for the exam. While responses generally align with expected knowledge, careful verification remains necessary, particularly for questions involving image interpretation. As artificial intelligence continues to evolve, ChatGPT-4 is expected to become an increasingly reliable tool for medical education. At present, it remains a valuable resource for assisting plastic surgery residents in their training.

MeSH Terms

Humans; Educational Measurement; Surgery, Plastic; France; Plastic Surgery Procedures; Students, Medical; Education, Medical, Undergraduate; Clinical Competence; Generative Artificial Intelligence