Uncertainties in synthetic DNA-based data storage.
Chengtao XuChao ZhaoBiao MaHong LiuPublished in: Nucleic acids research (2021)
Deoxyribonucleic acid (DNA) has evolved to be a naturally selected, robust biomacromolecule for gene information storage, and biological evolution and various diseases can find their origin in uncertainties in DNA-related processes (e.g. replication and expression). Recently, synthetic DNA has emerged as a compelling molecular media for digital data storage, and it is superior to the conventional electronic memory devices in theoretical retention time, power consumption, storage density, and so forth. However, uncertainties in the in vitro DNA synthesis and sequencing, along with its conjugation chemistry and preservation conditions can lead to severe errors and data loss, which limit its practical application. To maintain data integrity, complicated error correction algorithms and substantial data redundancy are usually required, which can significantly limit the efficiency and scale-up of the technology. Herein, we summarize the general procedures of the state-of-the-art DNA-based digital data storage methods (e.g. write, read, and preservation), highlighting the uncertainties involved in each step as well as potential approaches to correct them. We also discuss challenges yet to overcome and research trends in the promising field of DNA-based data storage.
Keyphrases
- circulating tumor
- single molecule
- electronic health record
- cell free
- big data
- emergency department
- nucleic acid
- data analysis
- patient safety
- poor prognosis
- working memory
- healthcare
- deep learning
- long non coding rna
- early onset
- circulating tumor cells
- social media
- genome wide
- copy number
- transcription factor
- genome wide identification