Initial Proof-of-Concept Study for a Plastic Surgery-Specific Artificial Intelligence Large Language Model: PlasticSurgeryGPT.

Aesthetic surgery journal 2025 Vol.45(8) p. 860-864

Ozmen BB, Berber I, Schwarz GS

Abstract

[BACKGROUND] The advent of general-purpose large language models (LLMs) like ChatGPT (OpenAI, San Francisco, CA) has revolutionized natural language processing, but their applicability in specialized medical fields, like plastic surgery, remains limited because of a lack of domain-specific knowledge.

[OBJECTIVES] This study aims to develop and evaluate PlasticSurgeryGPT, a dedicated LLM fine-tuned on plastic surgery literature, to enhance performance in clinical decision support, surgical education, and research within the field.

[METHODS] A comprehensive dataset of 25,389 plastic surgery research abstracts published between January 1, 2010, and January 1, 2024, was retrieved from PubMed. The abstracts underwent rigorous preprocessing, including text cleaning and tokenization. The authors of this study fine-tuned the pretrained GPT-2 model on this dataset using the PyTorch and HuggingFace frameworks. The performance of PlasticSurgeryGPT was evaluated against the default GPT-2 model using BLEU, METEOR, and ROUGE-1 metrics.

[RESULTS] Using the fine-tuned model, named PlasticSurgeryGPT, the authors demonstrated substantial improvements over the generic GPT-2 model in capturing the semantic nuances of plastic surgery text. PlasticSurgeryGPT outperformed GPT-2 across BLEU, METEOR, and ROUGE-1 metrics, with scores of 0.135519, 0.583554, and 0.216813, respectively, compared with GPT-2's scores of 0.130179, 0.550498, and 0.215494.

[CONCLUSIONS] PlasticSurgeryGPT represents the first plastic surgery-specific LLM, demonstrating enhanced performance in generating relevant and accurate content compared with a general-purpose model. This work underscores the potential of domain-specific LLMs in improving clinical practice, surgical education, and research in plastic surgery. Future studies should focus on incorporating full-text articles, multimodal data, and larger models to further enhance performance and applicability.

추출된 의학 개체 (NER)

유형영어 표현한국어 / 풀이UMLS CUI출처등장
해부 OpenAI scispacy 1
약물 [BACKGROUND] scispacy 1
약물 [OBJECTIVES] scispacy 1
약물 [CONCLUSIONS] PlasticSurgeryGPT scispacy 1
질환 Language scispacy 1
질환 LLM scispacy 1
질환 GPT-2 scispacy 1
기타 HuggingFace frameworks scispacy 1
기타 GPT-2 scispacy 1

MeSH Terms

Humans; Surgery, Plastic; Natural Language Processing; Proof of Concept Study; Artificial Intelligence; Plastic Surgery Procedures; Large Language Models