Assessing the Informational Value of Large Language Models Responses in Aesthetic Surgery: A Comparative Analysis with Expert Opinions.
Abstract
[BACKGROUND] The increasing popularity of Large Language Models (LLMs) in various healthcare settings has raised questions about their ability to provide accurate and reliable information. This study aimed to evaluate the informational value of LLM responses in aesthetic plastic surgery by comparing them with the opinions of experienced surgeons.
[METHODS] Thirty patients undergoing three common aesthetic procedures (dermal fillers, botulinum toxin injections, and aesthetic blepharoplasty) were selected. The questions these patients asked most frequently were recorded and submitted to ChatGPT 3.5 and Google Bard v.1.53. The answers provided by the LLMs were then rated by 13 experienced aesthetic plastic surgeons on a Likert scale for accessibility, accuracy, and overall usefulness.
[RESULTS] The overall ratings of the chatbot responses were moderate, with surgeons generally finding them to be accurate and clear. However, the lack of transparency regarding the sources of the information provided by the LLMs made it impossible to fully evaluate their credibility.
[CONCLUSIONS] While chatbots have the potential to provide patients with convenient access to information about aesthetic plastic surgery, their current limitations in terms of transparency and comprehensiveness warrant caution in their use as a primary source of information. Further research is needed to develop more robust and reliable LLMs for healthcare applications.
[LEVEL OF EVIDENCE I] This journal requires that authors assign a level of evidence to each article. For a full description of these Evidence-Based Medicine ratings, please refer to the Table of Contents or the online Instructions to Authors at www.springer.com/00266.
Extracted Medical Entities (NER)
| Type | English term | Korean term | UMLS CUI | Source | Occurrences |
|---|---|---|---|---|---|
| Procedure | blepharoplasty | 안검성형술 | | dict | 1 |
| Procedure | botulinum toxin | 보툴리눔독소 주사 | | dict | 1 |
| Drug | [BACKGROUND] | | | scispacy | 1 |
| Drug | [CONCLUSIONS] | | | scispacy | 1 |
| Disease | Language | | | scispacy | 1 |
| Other | patients | | | scispacy | 1 |
| Other | ChatGpt | | | scispacy | 1 |
MeSH Terms
Humans; Female; Male; Adult; Surgery, Plastic; Blepharoplasty; Middle Aged; Language; Esthetics; Dermal Fillers; Surveys and Questionnaires; Cosmetic Techniques; Large Language Models