Systematic multi-trait AAV capsid engineering for efficient gene delivery.
Fatma-Elzahraa EidAlbert Tian ChenKen Y ChanQin HuangQingxia ZhengIsabelle G TobeySimon PacouretPamela P BrauerCasey KeyesMegan PowellJencilin JohnstonBinhui ZhaoKasper LageAlice F TarantalYujia Alina ChanBenjamin E DevermanPublished in: Nature communications (2024)
Broadening gene therapy applications requires manufacturable vectors that efficiently transduce target cells in humans and preclinical models. Conventional selections of adeno-associated virus (AAV) capsid libraries are inefficient at searching the vast sequence space for the small fraction of vectors possessing multiple traits essential for clinical translation. Here, we present Fit4Function, a generalizable machine learning (ML) approach for systematically engineering multi-trait AAV capsids. By leveraging a capsid library that uniformly samples the manufacturable sequence space, reproducible screening data are generated to train accurate sequence-to-function models. Combining six models, we designed a multi-trait (liver-targeted, manufacturable) capsid library and validated 88% of library variants on all six predetermined criteria. Furthermore, the models, trained only on mouse in vivo and human in vitro Fit4Function data, accurately predicted AAV capsid variant biodistribution in macaque. Top candidates exhibited production yields comparable to AAV9, efficient murine liver transduction, up to 1000-fold greater human hepatocyte transduction, and increased enrichment relative to AAV9 in a screen for liver transduction in macaques. The Fit4Function strategy ultimately makes it possible to predict cross-species traits of peptide-modified AAV capsids and is a critical step toward assembling an ML atlas that predicts AAV capsid performance across dozens of traits.
Keyphrases
- gene therapy
- genome wide
- machine learning
- endothelial cells
- dna methylation
- induced apoptosis
- electronic health record
- stem cells
- gene expression
- induced pluripotent stem cells
- copy number
- high throughput
- mesenchymal stem cells
- cell death
- mass spectrometry
- cancer therapy
- computed tomography
- signaling pathway
- drug delivery
- cell cycle arrest
- deep learning
- resistance training
- endoplasmic reticulum stress