Performance of ChatGPT-4 on the French Board of Plastic Reconstructive and Aesthetic Surgery written exam: a descriptive study.
TL;DR
ChatGPT-4 performs satisfactorily on the French Board of Plastic, Reconstructive, and Aesthetic Surgery written examination and meets the minimum passing standards for the exam, while responses generally align with expected knowledge.
OpenAlex Topics
Artificial Intelligence in Healthcare and Education
Meta-analysis and systematic reviews
Cardiac, Anesthesia and Surgical Outcomes
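[STUDY PURPOSE] This study aims to evaluate how the generative artificial intelligence model ChatGPT-4 performs on the written board certification examination in plastic, reconstructive, and aesthetic surgery in France, and to assess its usefulness as a learning aid for medical students.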
APA
Emma Dejean-Bouyer, Anoujat Kanlagna, et al. (2025). Performance of ChatGPT-4 on the French Board of Plastic Reconstructive and Aesthetic Surgery written exam: a descriptive study. Journal of Educational Evaluation for Health Professions, 22, 27. https://doi.org/10.3352/jeehp.2025.22.27
MLA
Emma Dejean-Bouyer, et al. "Performance of ChatGPT-4 on the French Board of Plastic Reconstructive and Aesthetic Surgery written exam: a descriptive study." Journal of Educational Evaluation for Health Professions, vol. 22, 2025, pp. 27.
PMID
41022586
Abstract
[PURPOSE] This study aims to evaluate the performance of Chat Generative Pre-Trained Transformer 4 (ChatGPT-4) on the French Board of Plastic, Reconstructive, and Aesthetic Surgery written examination and to assess its role as a supplementary resource in helping medical students prepare for the qualification examination in plastic surgery.
[METHODS] This descriptive study evaluated ChatGPT-4's performance on 213 items from the October 2024 French Board of Plastic, Reconstructive, and Aesthetic Surgery written examination. Independent reviewers assessed each response for accuracy, logical reasoning, and use of internal and external information, and categorized any fallacies. Statistical significance was tested with chi-square tests and Fisher's exact test.
[RESULTS] ChatGPT-4 answered all questions across the 10 modules, achieving an overall accuracy rate of 77.5%. The model applied logical reasoning in 98.1% of the questions, utilized internal information in 94.4%, and incorporated external information in 91.1%.
[CONCLUSION] ChatGPT-4 performs satisfactorily on the French Board of Plastic, Reconstructive, and Aesthetic Surgery written examination. Its accuracy met the minimum passing standards for the exam. While responses generally align with expected knowledge, careful verification remains necessary, particularly for questions involving image interpretation. As artificial intelligence continues to evolve, ChatGPT-4 is expected to become an increasingly reliable tool for medical education. At present, it remains a valuable resource for assisting plastic surgery residents in their training.
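As a rough illustration of the arithmetic behind the headline figure, the sketch below recovers the number of correct items implied by the reported 77.5% accuracy on 213 items and attaches a 95% Wilson score interval to it. Only the two input values come from the abstract. The implied count, the choice of a Wilson interval, and the function names `implied_correct` and `wilson_interval` are assumptions made for this example and are not part of the study's reported analysis.

```python
import math

# Values taken from the abstract.
TOTAL_ITEMS = 213
REPORTED_ACCURACY = 0.775

def implied_correct(total: int, accuracy: float) -> int:
    """Nearest whole number of correct items consistent with the reported rate.

    Assumption: the abstract does not state the raw count, so it is
    back-calculated here from the rounded percentage.
    """
    return round(total * accuracy)

def wilson_interval(successes: int, n: int, z: float = 1.96) -> tuple[float, float]:
    """Wilson score interval for a binomial proportion (z = 1.96 for ~95%)."""
    p = successes / n
    denom = 1 + z**2 / n
    centre = (p + z**2 / (2 * n)) / denom
    half_width = z * math.sqrt(p * (1 - p) / n + z**2 / (4 * n**2)) / denom
    return centre - half_width, centre + half_width

correct = implied_correct(TOTAL_ITEMS, REPORTED_ACCURACY)
low, high = wilson_interval(correct, TOTAL_ITEMS)

print(f"Implied correct items: {correct} of {TOTAL_ITEMS}")
print(f"Accuracy: {100 * correct / TOTAL_ITEMS:.1f}%")
print(f"Approximate 95% Wilson interval: {100 * low:.1f}% to {100 * high:.1f}%")
```

The interval is offered only to show how much sampling uncertainty a single 213-item exam carries. It treats items as independent, which may not hold within modules.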
MeSH Terms
Humans; Educational Measurement; Surgery, Plastic; France; Plastic Surgery Procedures; Students, Medical; Education, Medical, Undergraduate; Clinical Competence; Generative Artificial Intelligence