Advancing plant biology through deep learning-powered natural language processing.
Shuang PengLoïc RajjouPublished in: Plant cell reports (2024)
The application of deep learning methods, specifically the utilization of Large Language Models (LLMs), in the field of plant biology holds significant promise for generating novel knowledge on plant cell systems. The LLM framework exhibits exceptional potential, particularly with the development of Protein Language Models (PLMs), allowing for in-depth analyses of nucleic acid and protein sequences. This analytical capacity facilitates the discernment of intricate patterns and relationships within biological data, encompassing multi-scale information within DNA or protein sequences. The contribution of PLMs extends beyond mere sequence patterns and structure--function recognition; it also supports advancements in genetic improvements for agriculture. The integration of deep learning approaches into the domain of plant sciences offers opportunities for major breakthroughs in basic research across multi-scale plant traits. Consequently, the strategic application of deep learning methodologies, particularly leveraging the potential of LLMs, will undoubtedly play a pivotal role in advancing plant sciences, plant production, plant uses and propelling the trajectory toward sustainable agroecological and agro-food transitions.
Keyphrases
- deep learning
- nucleic acid
- autism spectrum disorder
- machine learning
- convolutional neural network
- healthcare
- artificial intelligence
- cell wall
- protein protein
- climate change
- big data
- gene expression
- human health
- dna methylation
- risk assessment
- small molecule
- cell therapy
- social media
- optical coherence tomography
- circulating tumor cells