Evaluating Large Language Model (LLM) Performance on Established Breast Classification Systems.
Abstract
Medical researchers are increasingly utilizing advanced LLMs like ChatGPT-4 and Gemini to enhance diagnostic processes in the medical field. This research focuses on their ability to comprehend and apply complex medical classification systems for breast conditions, which can significantly aid plastic surgeons in making informed decisions for diagnosis and treatment, ultimately leading to improved patient outcomes. Fifty clinical scenarios were created to evaluate the classification accuracy of each LLM across five established breast-related classification systems. Scores from 0 to 2 were assigned to LLM responses to denote incorrect, partially correct, or completely correct classifications. Descriptive statistics were employed to compare the performances of ChatGPT-4 and Gemini. Gemini exhibited superior overall performance, achieving 98% accuracy compared to ChatGPT-4's 71%. While both models performed well in the Baker classification for capsular contracture and UTSW classification for gynecomastia, Gemini consistently outperformed ChatGPT-4 in other systems, such as the Fischer Grade Classification for gender-affirming mastectomy, Kajava Classification for ectopic breast tissue, and Regnault Classification for breast ptosis. With further development, integrating LLMs into plastic surgery practice will likely enhance diagnostic support and decision making.
추출된 의학 개체 (NER)
| 유형 | 영어 표현 | 한국어 / 풀이 | UMLS CUI | 출처 | 등장 |
|---|---|---|---|---|---|
| 해부 | breast
|
유방 | dict | 5 | |
| 해부 | LLM
→ Large Language Model
|
scispacy | 1 | ||
| 해부 | breast tissue
|
scispacy | 1 | ||
| 합병증 | capsular contracture
|
피막구축 | dict | 1 | |
| 약물 | Gemini
|
scispacy | 1 | ||
| 질환 | gynecomastia
|
C0018418
Gynecomastia
|
scispacy | 1 | |
| 질환 | breast ptosis
|
C2233848
Ptosis of breast
|
scispacy | 1 | |
| 질환 | Language
|
scispacy | 1 | ||
| 기타 | patient
|
scispacy | 1 | ||
| 기타 | Gemini
|
scispacy | 1 | ||
| 기타 | capsular
|
scispacy | 1 |
🔗 함께 등장하는 도메인
이 논문이 속한 카테고리와 같은 논문에서 자주 함께 다뤄지는 카테고리들
관련 논문
- The impact of three-dimensional simulation and virtual reality technologies on surgical decision-making and postoperative satisfaction in aesthetic surgery: a preliminary study.
- Cutaneous fistula of the breast: A complication of cosmetic autologous fat transfer.
- Epidermal inclusion cyst after breast reduction mammoplasty.
- The Plastic Surgery In-Service Examination: A Scoping Review.
- Clinical outcomes of synthetic absorbable mesh use in breast surgery: First case series in reconstruction and aesthetic mastopexy.