ChatGPT 4.0 and Algor in generating concept maps: an observational study.
Abstract
[BACKGROUND] This study evaluated the performance of two AI systems, ChatGPT 4.0 and Algor, in generating concept maps from validated otolaryngology clinical practice guidelines.
[METHODS] Concept maps were generated by ChatGPT 4.0 and Algor from four American Academy of Otolaryngology-Head and Neck Surgery Foundation (AAO-HNSF) clinical practice guidelines. Eight otolaryngology specialists evaluated the generated concept maps using the AI-Map questionnaire, covering concept identification, relationship establishment, hierarchical structure representation, and visual presentation. Chi-square tests and Kendall's tau coefficient were used for statistical analysis.
[RESULTS] Neither AI system was consistently superior across all guidelines; each showed distinct strengths. ChatGPT performed better at representing cross-connections between concepts and at layout optimization, particularly for the Rhinoplasty guidelines (χ² = 6.000, p = 0.050 for cross-connections). Algor was stronger at capturing main themes and distinguishing general/abstract concepts, especially in the BPPV and Tympanostomy Tube guidelines (χ² = 8.000, p = 0.046 for main themes in BPPV). For the H&N Masses guidelines, statistically significant differences were found in representing dynamic nature (favouring H&NMass-GPT, χ² = 7.571, p = 0.023) and in overall value and usefulness (favouring H&NMass-Algor, χ² = 7.905, p = 0.019).
[CONCLUSION] AI systems showed potential in automating concept map creation from otolaryngology guidelines, with performance varying across medical topics and evaluation criteria. The findings highlight both the promise and the current limitations of these systems; further research is required to optimize them for medical education and knowledge representation.
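As a rough consistency check on the statistics reported in the Results, the sketch below recomputes chi-square p-values from the χ² values given there. The abstract does not state the degrees of freedom; the values 2 and 3 used here are assumptions, chosen only because they reproduce the reported p-values to three decimals. `chi2_sf` is a hypothetical helper written for this illustration, not the authors' analysis code.

```python
import math


def chi2_sf(x: float, df: int) -> float:
    """Upper-tail probability P(X >= x) for a chi-square variable.

    Only df = 2 and df = 3 are handled, using their closed forms.
    """
    if df == 2:
        return math.exp(-x / 2)
    if df == 3:
        return (math.erfc(math.sqrt(x / 2))
                + math.sqrt(2 * x / math.pi) * math.exp(-x / 2))
    raise ValueError("this sketch only handles df = 2 or df = 3")


# (label, chi-square, ASSUMED degrees of freedom, p-value reported in the abstract)
reported = [
    ("Rhinoplasty, cross-connections", 6.000, 2, 0.050),
    ("BPPV, main themes", 8.000, 3, 0.046),
    ("H&N Masses, dynamic nature", 7.571, 2, 0.023),
    ("H&N Masses, value and usefulness", 7.905, 2, 0.019),
]

for label, chi2, df, p_reported in reported:
    p = chi2_sf(chi2, df)
    print(f"{label}: chi2={chi2:.3f}, assumed df={df}, "
          f"p={p:.3f} (reported {p_reported:.3f})")
```

Under these assumed degrees of freedom each recomputed p-value rounds to the reported one, which suggests, but does not confirm, that three comparisons used 2 degrees of freedom and the BPPV comparison used 3.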
Extracted medical entities (NER)
| Type | English term | Korean / gloss | UMLS CUI | Source | Occurrences |
|---|---|---|---|---|---|
| Procedure | rhinoplasty | 코성형술 | | dict | 1 |
| Anatomy | BPVV | | | scispacy | 1 |
| Anatomy | χ²=7.571 | | | scispacy | 1 |
| Drug | χ²=8.000, p | | | scispacy | 1 |
| Drug | ChatGPT | | | scispacy | 1 |
| Drug | [BACKGROUND] | | | scispacy | 1 |
| Disease | algor | | | scispacy | 1 |
| Disease | Tympanostomy Tube | | | scispacy | 1 |
| Disease | H&NMass-GPT | | | scispacy | 1 |
| Disease | Masses | | | scispacy | 1 |
MeSH Terms
Humans; Otolaryngology; Practice Guidelines as Topic; Algorithms; Artificial Intelligence; Surveys and Questionnaires; Generative Artificial Intelligence