Login / Signup

Low cost DNA data storage using photolithographic synthesis and advanced information reconstruction and error correction.

Philipp L AntkowiakJory LietardMohammad Zalbagi DarestaniMark Manuel SomozaWendelin J StarkReinhard HeckelRobert N Grass
Published in: Nature communications (2020)
Due to its longevity and enormous information density, DNA is an attractive medium for archival storage. The current hamstring of DNA data storage systems-both in cost and speed-is synthesis. The key idea for breaking this bottleneck pursued in this work is to move beyond the low-error and expensive synthesis employed almost exclusively in today's systems, towards cheaper, potentially faster, but high-error synthesis technologies. Here, we demonstrate a DNA storage system that relies on massively parallel light-directed synthesis, which is considerably cheaper than conventional solid-phase synthesis. However, this technology has a high sequence error rate when optimized for speed. We demonstrate that even in this high-error regime, reliable storage of information is possible, by developing a pipeline of algorithms for encoding and reconstruction of the information. In our experiments, we store a file containing sheet music of Mozart, and show perfect data recovery from low synthesis fidelity DNA.
Keyphrases
  • circulating tumor
  • electronic health record
  • single molecule
  • health information
  • healthcare
  • big data
  • social media
  • deep learning
  • artificial intelligence