Probing the physical limits of reliable DNA data retrieval.
Lee OrganickYuan-Jyue ChenSiena Dumas AngRandolph LopezXiaomeng LiuKarin StraussLuis CezePublished in: Nature communications (2020)
Synthetic DNA is gaining momentum as a potential storage medium for archival data storage. In this process, digital information is translated into sequences of nucleotides and the resulting synthetic DNA strands are then stored for later retrieval. Here, we demonstrate reliable file recovery with PCR-based random access when as few as ten copies per sequence are stored, on average. This results in density of about 17 exabytes/gram, nearly two orders of magnitude greater than prior work has shown. We successfully retrieve the same data in a complex pool of over 1010 unique sequences per microliter with no evidence that we have begun to approach complexity limits. Finally, we also investigate the effects of file size and sequencing coverage on successful file retrieval and look for systematic DNA strand drop out. These findings substantiate the robustness and high data density of the process examined here.
Keyphrases
- circulating tumor
- single molecule
- electronic health record
- cell free
- big data
- healthcare
- nucleic acid
- data analysis
- mental health
- single cell
- physical activity
- machine learning
- risk assessment
- room temperature
- molecular dynamics simulations
- social media
- artificial intelligence
- human health
- ionic liquid
- affordable care act