Article by Claire L. Evans: “..Regardless of the medium, memory is survival. As a consequence, biology, with its billions of years of beta-testing in the rearview, has already produced the most powerful storage medium for information in the universe: DNA. Every nucleus of every cell in the human body holds 800 MB of information. In theory, DNA can store up to a billion gigabytes of data per cubic millimeter; with this efficiency, the 180-odd Zettabytes of information our global civilization produces each year would fit in a tennis ball. More importantly, it wouldn’t consume any energy—and it would be preserved for millennia.
This may all sound science-fictional, but over the last decade, technology companies and research institutions have successfully encoded all manner of precious cultural information into the double-helix: the works of Shakespeare, all 16GB of Wikipedia, an anthology of biotechnology essays and science fiction stories, the UN Declaration on the Rights of the Child, the Svalbard Global Seed Vault database, the private key of a single bitcoin, and the 1998 album Mezzanine by Massive Attack. Of course, these are PR gimmicks—snazzy proofs of concept for a nascent industry.
Could life, with its onboard resilience against entropic forces, provide a workable solution to the problem of the data center?
But beyond the hype, DNA data storage technology is evolving quickly, and biotech companies have pushed their offerings to the brink of commercial viability. Their approaches are diverse. Catalog, a Boston-based startup, has created a “printer” that can write synthetic DNA directly onto sheets of clear plastic; the French startup Biomemory stores data in credit-card sized “DNA Cards”; Atlas Data Storage, a spinoff of the biotechnology giant Twist Bioscience, encodes data onto synthetic DNA and then dehydrates it into a shelf-stable powder to be reconstituted at will. These propositions should be enticing to anyone tasked with maintaining the integrity of the cloud: plastic sheets, cards, and DNA powder, stashed in metal capsules the size of a AAA battery, don’t require air-conditioning.
This makes DNA storage the perfect storage medium for what experts call “cold” data: things like municipal and medical records, backups, research data, and archives that don’t need to be accessed on demand (“hot” data, in contrast, is the kind found on Instagram, YouTube, or your banking app). Some 60–80% of all data stored is accessed infrequently enough to be classified as cold, and is currently stored in magnetic tape libraries. Tape, by virtue of its physical nature, is secure and requires minimal power to maintain. But even under perfect environmental conditions, cooled to a precise 20–25°C temperature range, it only lasts for a few decades, and the technology for playing back magnetic tape is likely to go obsolete before the tape itself degrades.
The oldest DNA sample to be successfully read, on the other hand, was over two million years old. And given its importance in the life sciences, it’s not likely we’ll ever forget how to sequence DNA. So long as the relevant metadata—instructions for translating the four-letter code of DNA back into binary—is encoded alongside the data itself, information preserved in DNA will almost certainly outlast the technology companies encoding it. This is why Microsoft, Western Digital, and a small concern of biotech companies cofounded, in 2020, the DNA Data Storage Alliance, an organization to define industry-wide standards for the technology. As with all software, the interoperability of genetic technology will be key to its longevity…(More)”.