Initial Proof-of-Concept Study for a Plastic Surgery-Specific Artificial Intelligence Large Language Model: PlasticSurgeryGPT.
Abstract
[BACKGROUND] The advent of general-purpose large language models (LLMs) like ChatGPT (OpenAI, San Francisco, CA) has revolutionized natural language processing, but their applicability in specialized medical fields, like plastic surgery, remains limited because of a lack of domain-specific knowledge.
[OBJECTIVES] This study aims to develop and evaluate PlasticSurgeryGPT, a dedicated LLM fine-tuned on plastic surgery literature, to enhance performance in clinical decision support, surgical education, and research within the field.
[METHODS] A comprehensive dataset of 25,389 plastic surgery research abstracts published between January 1, 2010, and January 1, 2024, was retrieved from PubMed. The abstracts underwent rigorous preprocessing, including text cleaning and tokenization. The authors of this study fine-tuned the pretrained GPT-2 model on this dataset using the PyTorch and HuggingFace frameworks. The performance of PlasticSurgeryGPT was evaluated against the default GPT-2 model using BLEU, METEOR, and ROUGE-1 metrics.
[RESULTS] Using the fine-tuned model, named PlasticSurgeryGPT, the authors demonstrated substantial improvements over the generic GPT-2 model in capturing the semantic nuances of plastic surgery text. PlasticSurgeryGPT outperformed GPT-2 across BLEU, METEOR, and ROUGE-1 metrics, with scores of 0.135519, 0.583554, and 0.216813, respectively, compared with GPT-2's scores of 0.130179, 0.550498, and 0.215494.
[CONCLUSIONS] PlasticSurgeryGPT represents the first plastic surgery-specific LLM, demonstrating enhanced performance in generating relevant and accurate content compared with a general-purpose model. This work underscores the potential of domain-specific LLMs in improving clinical practice, surgical education, and research in plastic surgery. Future studies should focus on incorporating full-text articles, multimodal data, and larger models to further enhance performance and applicability.
[OBJECTIVES] This study aims to develop and evaluate PlasticSurgeryGPT, a dedicated LLM fine-tuned on plastic surgery literature, to enhance performance in clinical decision support, surgical education, and research within the field.
[METHODS] A comprehensive dataset of 25,389 plastic surgery research abstracts published between January 1, 2010, and January 1, 2024, was retrieved from PubMed. The abstracts underwent rigorous preprocessing, including text cleaning and tokenization. The authors of this study fine-tuned the pretrained GPT-2 model on this dataset using the PyTorch and HuggingFace frameworks. The performance of PlasticSurgeryGPT was evaluated against the default GPT-2 model using BLEU, METEOR, and ROUGE-1 metrics.
[RESULTS] Using the fine-tuned model, named PlasticSurgeryGPT, the authors demonstrated substantial improvements over the generic GPT-2 model in capturing the semantic nuances of plastic surgery text. PlasticSurgeryGPT outperformed GPT-2 across BLEU, METEOR, and ROUGE-1 metrics, with scores of 0.135519, 0.583554, and 0.216813, respectively, compared with GPT-2's scores of 0.130179, 0.550498, and 0.215494.
[CONCLUSIONS] PlasticSurgeryGPT represents the first plastic surgery-specific LLM, demonstrating enhanced performance in generating relevant and accurate content compared with a general-purpose model. This work underscores the potential of domain-specific LLMs in improving clinical practice, surgical education, and research in plastic surgery. Future studies should focus on incorporating full-text articles, multimodal data, and larger models to further enhance performance and applicability.
추출된 의학 개체 (NER)
| 유형 | 영어 표현 | 한국어 / 풀이 | UMLS CUI | 출처 | 등장 |
|---|---|---|---|---|---|
| 해부 | OpenAI
|
scispacy | 1 | ||
| 약물 | [BACKGROUND]
|
scispacy | 1 | ||
| 약물 | [OBJECTIVES]
|
scispacy | 1 | ||
| 약물 | [CONCLUSIONS] PlasticSurgeryGPT
|
scispacy | 1 | ||
| 질환 | Language
|
scispacy | 1 | ||
| 질환 | LLM
|
scispacy | 1 | ||
| 질환 | GPT-2
|
scispacy | 1 | ||
| 기타 | HuggingFace frameworks
|
scispacy | 1 | ||
| 기타 | GPT-2
|
scispacy | 1 |
MeSH Terms
Humans; Surgery, Plastic; Natural Language Processing; Proof of Concept Study; Artificial Intelligence; Plastic Surgery Procedures; Large Language Models