Optimizing Large Language Models for Discharge Prediction: Best Practices in Leveraging Electronic Health Record Audit Logs
Presentation Time: 04:45 PM - 05:00 PM
Abstract Keywords: Large Language Models (LLMs), Machine Learning, Knowledge Representation and Information Modeling
Primary Track: Applications
Programmatic Theme: Clinical Research Informatics
Electronic Health Record (EHR) audit logs are increasingly utilized for clinical tasks, from workflow modeling to predictive analyses of discharge events, adverse kidney outcomes, and hospital readmissions. These logs encapsulate user-EHR interactions, reflecting both healthcare professionals' behavior and patients' health statuses. To harness this temporal information effectively, this study explores the application of Large Language Models (LLMs) in leveraging audit log data for clinical prediction tasks, specifically focusing on discharge predictions. Utilizing a year's worth of EHR data from Vanderbilt University Medical Center, we fine-tuned LLMs with randomly selected 10,000 training examples. Our findings reveal that LLaMA-2 70B, with an AUROC of 0.80 [0.77-0.82], outperforms both GPT-4 128K in a zero-shot, with an AUROC of 0.68 [0.65-0.71], and DeBERTa, with an AUROC of 0.78 [0.75-0.82]. Among various serialization methods, the first-occurrence approach - wherein only the initial appearance of each event in a sequence is retained - showed superior performance. Furthermore, for the fine-tuned LLaMA-2 70B, logit outputs yielded a higher AUROC of 0.80 [0.77-0.82] compared to text outputs, with an AUROC of 0.69 [0.67-0.72]. This study underscores the potential of fine-tuned LLMs, particularly when combined with strategic sequence serialization, in advancing clinical prediction tasks.
Speaker(s):
Xinmeng Zhang, BS
Vanderbilt University
Author(s):
Xinmeng Zhang, BS - Vanderbilt University; Chao Yan, PhD - Vanderbilt University Medical Center; Yuyang Yang - Northwestern University; Zhuohang Li, MS - Vanderbilt University; Yubo Feng, MS - Vanderbilt University; Bradley Malin, PhD - Vanderbilt University Medical Center; You Chen, PhD - Vanderbilt University;
Presentation Time: 04:45 PM - 05:00 PM
Abstract Keywords: Large Language Models (LLMs), Machine Learning, Knowledge Representation and Information Modeling
Primary Track: Applications
Programmatic Theme: Clinical Research Informatics
Electronic Health Record (EHR) audit logs are increasingly utilized for clinical tasks, from workflow modeling to predictive analyses of discharge events, adverse kidney outcomes, and hospital readmissions. These logs encapsulate user-EHR interactions, reflecting both healthcare professionals' behavior and patients' health statuses. To harness this temporal information effectively, this study explores the application of Large Language Models (LLMs) in leveraging audit log data for clinical prediction tasks, specifically focusing on discharge predictions. Utilizing a year's worth of EHR data from Vanderbilt University Medical Center, we fine-tuned LLMs with randomly selected 10,000 training examples. Our findings reveal that LLaMA-2 70B, with an AUROC of 0.80 [0.77-0.82], outperforms both GPT-4 128K in a zero-shot, with an AUROC of 0.68 [0.65-0.71], and DeBERTa, with an AUROC of 0.78 [0.75-0.82]. Among various serialization methods, the first-occurrence approach - wherein only the initial appearance of each event in a sequence is retained - showed superior performance. Furthermore, for the fine-tuned LLaMA-2 70B, logit outputs yielded a higher AUROC of 0.80 [0.77-0.82] compared to text outputs, with an AUROC of 0.69 [0.67-0.72]. This study underscores the potential of fine-tuned LLMs, particularly when combined with strategic sequence serialization, in advancing clinical prediction tasks.
Speaker(s):
Xinmeng Zhang, BS
Vanderbilt University
Author(s):
Xinmeng Zhang, BS - Vanderbilt University; Chao Yan, PhD - Vanderbilt University Medical Center; Yuyang Yang - Northwestern University; Zhuohang Li, MS - Vanderbilt University; Yubo Feng, MS - Vanderbilt University; Bradley Malin, PhD - Vanderbilt University Medical Center; You Chen, PhD - Vanderbilt University;
Optimizing Large Language Models for Discharge Prediction: Best Practices in Leveraging Electronic Health Record Audit Logs
Category
Paper - Student
Description
Date: Monday (11/11)
Time: 04:45 PM to 05:00 PM
Room: Continental Ballroom 8-9
Time: 04:45 PM to 05:00 PM
Room: Continental Ballroom 8-9