R&D: Demonstration of End-to-End Automation of DNA Storage
Device encodes data into DNA sequence, then written to DNA oligonucleotide using custom DNA synthesizer, pooled for liquid storage, and read using nanopore sequencer and novel, minimal preparation protocol.
This is a Press Release edited by StorageNewsletter.com on March 26, 2019 at 2:19 pmNature Scientific Reports has published an article written by Christopher N. Takahashi, School of Computer Science and Engineering, University of Washington, Seattle, Washington, USA, Bichlien H. Nguyen, Karin Strauss, School of Computer Science and Engineering, University of Washington, Seattle, Washington, USA, and Microsoft Research, Redmond, Washington, USA, and Luis Ceze, School of Computer Science and Engineering, University of Washington, Seattle, Washington, USA.
An overview of the write-store-read process. Data is encoded, with error correction, into DNA bases, which are synthesized into physical DNA molecules and stored. When a user wishes to read the data, the stored DNA is read by a DNA sequencer into bases and the decoding software corrects any errors retrieving the original data. (a) The logical flow from bits to bases to DNA and back. (b) A block diagram representation of the system hardware’s three modules: synthesis, storage, and sequencing. (c) A photograph showing the completed system. Highlighted are the storage vessel and the nanopore loading fixture. The majority of the remaining hardware is responsible for synthesis. (d) Overview of enzymatic preparation for DNA sequencing. An arbitrary 1 kilobase ‘extension segment’ of DNA is PCR-amplified with TAQ polymerase, and a Bsa-I restriction site is added by the primer, leaving an A-tail and a TCGC sticky end after digestion. The extension segment is then T/A ligated to the standard Oxford Nanopore Technology (ONT) LSK-108 kit sequencing adapter, creating the ‘extended adapter,’ which ensures that sufficient bases are read for successful base calling. For sequencing, the payload hairpin and extended adapter are ligated, forming a sequence-ready construct that does not require purification.
Click to enlarge
Abstract: “Synthetic DNA has emerged as a novel substrate to encode computer data with the potential to be orders of magnitude denser than contemporary cutting edge techniques. However, even with the help of automated synthesis and sequencing devices, many intermediate steps still require expert laboratory technicians to execute. We have developed an automated end-to-end DNA data storage device to explore the challenges of automation within the constraints of this unique application. Our device encodes data into a DNA sequence, which is then written to a DNA oligonucleotide using a custom DNA synthesizer, pooled for liquid storage, and read using a nanopore sequencer and a novel, minimal preparation protocol. We demonstrate an automated 5-byte write, store, and read cycle with a modular design enabling expansion as new technology becomes available.“