Matching Patients to Clinical Trials using LLaMA 2 Embeddings and Siamese Neural Network.
Shaika ChowdhurySivaraman RajaganapathyYue YuCui TaoMaria VassilakiNansu ZongPublished in: medRxiv : the preprint server for health sciences (2024)
Patient recruitment is a key desideratum for the success of a clinical trial that entails identifying eligible patients that match the selection criteria for the trial. However, the complexity of criteria information and heterogeneity of patient data render manual analysis a burdensome and time-consuming task. In an attempt to automate patient recruitment, this work proposes a Siamese Neural Network-based model, namely Siamese-PTM. Siamese-PTM employs the pretrained LLaMA 2 model to derive contextual representations of the EHR and criteria inputs and jointly encodes them using two weight-sharing identical subnetworks. We evaluate Siamese-PTM on structured and unstructured EHR to analyze their predictive informativeness as standalone and collective feature sets. We explore a variety of deep models for Siamese-PTM's encoders and compare their performance against the Single-encoder counterparts. We develop a baseline rule-based classifier, compared to which Siamese-PTM improved performance by 40%. Furthermore, visualization of Siamese-PTM's learned embedding space reinforces its predictive robustness.
Keyphrases
- clinical trial
- neural network
- end stage renal disease
- chronic kidney disease
- case report
- prognostic factors
- electronic health record
- phase ii
- randomized controlled trial
- study protocol
- peritoneal dialysis
- health information
- healthcare
- open label
- phase iii
- social media
- patient reported outcomes
- artificial intelligence
- patient reported
- double blind
- data analysis