Reinforcement learning-guided control strategies for CAR T-cell activation and expansion.
Sakib FerdousIbne Farabi ShihabRatul ChowdhuryNigel Forest ReuelPublished in: Biotechnology and bioengineering (2024)
Reinforcement learning (RL), a subset of machine learning (ML), could optimize and control biomanufacturing processes, such as improved production of therapeutic cells. Here, the process of CAR T-cell activation by antigen-presenting beads and their subsequent expansion is formulated in silico. The simulation is used as an environment to train RL-agents to dynamically control the number of beads in culture to maximize the population of robust effector cells at the end of the culture. We make periodic decisions of incremental bead addition or complete removal. The simulation is designed to operate in OpenAI Gym, enabling testing of different environments, cell types, RL-agent algorithms, and state inputs to the RL-agent. RL-agent training is demonstrated with three different algorithms (PPO, A2C, and DQN), each sampling three different state input types (tabular, image, mixed); PPO-tabular performs best for this simulation environment. Using this approach, training of the RL-agent on different cell types is demonstrated, resulting in unique control strategies for each type. Sensitivity to input-noise (sensor performance), number of control step interventions, and advantages of pre-trained RL-agents are also evaluated. Therefore, we present an RL framework to maximize the population of robust effector cells in CAR T-cell therapy production.
Keyphrases
- cell therapy
- machine learning
- induced apoptosis
- cell cycle arrest
- single cell
- virtual reality
- deep learning
- stem cells
- dendritic cells
- mesenchymal stem cells
- cell death
- endoplasmic reticulum stress
- regulatory t cells
- oxidative stress
- mass spectrometry
- big data
- artificial intelligence
- cell proliferation
- immune response
- high resolution
- high intensity