Explicit Frailty Integration Reveals Cognitive Differences in ASA Classification Between Anesthesiologists and Large Language Models
概览
- 阶段
- 不适用
- 状态
- 尚未招募
- 发起方
- Fatih Sultan Mehmet Training and Research Hospital
- 入组人数
- 200
- 试验地点
- 1
- 主要终点
- Interrater Agreement in ASA Physical Status Classification Between Anesthesiologists and Large Language Models
概览
简要总结
The American Society of Anesthesiologists (ASA) Physical Status Classification System is widely used to assess perioperative risk, but it does not explicitly include frailty as a standardized variable. In daily clinical practice, anesthesiologists may implicitly incorporate frailty-related information into ASA classification based on individual clinical judgment, which may lead to variability between evaluators.
In recent years, large language models (LLMs), a type of artificial intelligence, have been increasingly used in medical decision-support research. Unlike human clinicians, these models process information in a structured and explicit manner, without relying on intuition or implicit reasoning.
The primary objective of this study is to compare ASA Physical Status classifications assigned by anesthesiologists and by two different large language models using standardized preoperative clinical data from adult patients undergoing elective surgery. A secondary objective is to evaluate how the addition of a frailty index influences ASA classification decisions made by human experts and artificial intelligence models.
This prospective observational study aims to improve understanding of differences in clinical reasoning between anesthesiologists and artificial intelligence systems and to explore the role of frailty in perioperative risk assessment.
详细描述
This study is designed as a prospective, observational, comparative, multirater investigation evaluating differences in ASA Physical Status Classification between anesthesiologists and large language models (LLMs).
The ASA Physical Status Classification System is a cornerstone of perioperative risk assessment; however, it lacks explicit incorporation of frailty, a multidimensional concept reflecting reduced physiological reserve and vulnerability. In clinical practice, anesthesiologists often integrate frailty-related information implicitly into ASA assessments, potentially contributing to interobserver variability.
Large language models process clinical information using explicit, structured inputs and do not rely on experiential or intuitive reasoning. This characteristic provides a unique opportunity to explore cognitive differences between human experts and artificial intelligence in clinical classification tasks.
Adult patients (≥18 years) scheduled for elective surgery will be included. Emergency cases, pediatric patients, and individuals with insufficient clinical data to allow ASA classification will be excluded. For each patient, standardized preoperative clinical data will be collected, including demographic characteristics, body mass index, comorbidities, regular medications, and type of planned surgical procedure.
ASA Physical Status Classification will be independently assigned by four board-certified anesthesiologists with at least five years of clinical experience, as well as by two large language models developed by different organizations. All evaluations will be conducted using the same standardized dataset, and evaluators will be blinded to each other's assessments.
The study will be conducted in two sequential phases. In the first phase, ASA classification will be performed using standard clinical data alone. In the second phase, a validated frailty index will be added to the same patient dataset, and the evaluation process will be repeated. This design will allow assessment of how frailty information affects ASA classification decisions in human and artificial intelligence evaluators.
Large language models will be prompted using a predefined, standardized prompt that remains unchanged throughout the study. Models will be instructed to generate a single ASA Physical Status category (I-V) without providing explanations or additional commentary, and no iterative prompting or feedback will be allowed.
Interrater agreement among anesthesiologists, between artificial intelligence models, and between human and artificial intelligence evaluators will be analyzed using Cohen's Kappa and Fleiss' Kappa statistics, as appropriate. Changes in ASA classification following the addition of frailty information will be evaluated using paired statistical methods. Statistical significance will be defined as p < 0.05.
By comparing ASA classification patterns between anesthesiologists and large language models, both with and without frailty data, this study aims to clarify the role of implicit and explicit reasoning in perioperative risk assessment and to contribute to the development of future artificial intelligence-assisted clinical decision-support systems.
研究设计
- 研究类型
- Observational
- 观察模型
- Other
- 时间视角
- Prospective
入排标准
- 年龄范围
- 18 Years 至 —(Adult, Older Adult)
- 性别
- All
- 接受健康志愿者
- 是
入选标准
- •Adult patients aged 18 years and older
- •Scheduled for elective surgical procedures
- •Availability of standardized preoperative clinical data sufficient for ASA Physical -Status Classification
排除标准
- •Emergency surgical procedures
- •Pediatric patients
- •Patients with insufficient or incomplete clinical data preventing ASA Physical --Status assessment
研究组 & 干预措施
Adult Elective Surgery Patients
Adult patients aged 18 years and older scheduled for elective surgical procedures. Standardized preoperative clinical data will be evaluated by anesthesiologists and large language models for ASA Physical Status Classification, with and without the addition of frailty information.
干预措施: Observational Assessment Only (Other)
结局指标
主要结局
Interrater Agreement in ASA Physical Status Classification Between Anesthesiologists and Large Language Models
时间窗: At the time of preoperative evaluation, prior to surgery
Agreement in ASA Physical Status Classification (ASA I-V) between four anesthesiologists and two large language models based on standardized preoperative clinical data, assessed using interrater agreement statistics.
次要结局
- Effect of Frailty Information on ASA Physical Status Classification(At the time of preoperative evaluation, prior to surgery)
- Agreement Between Large Language Models in ASA Physical Status Classification(At the time of preoperative evaluation, prior to surgery)
- Agreement Among Anesthesiologists in ASA Physical Status Classification(At the time of preoperative evaluation, prior to surgery)
研究者
Cansu Ofluoğlu
Specialist in Anesthesiology and Reanimation
Fatih Sultan Mehmet Training and Research Hospital