Genome: The Autobiography of a Species in 23 Chapters. Matt Ridley
high information content – information injected into it by its designer. So does a human body. Aristotle’s information theory meets Newton’s physics in Shannon’s brain. Like Turing, Shannon has no thoughts about biology. But his insight is of more relevance to the question of what is life than a mountain of chemistry and physics. Life, too, is digital information written in DNA.6
In the beginning was the word. The word was not DNA. That came afterwards, when life was already established, and when it had divided the labour between two separate activities: chemical work and information storage, metabolism and replication. But DNA contains a record of the word, faithfully transmitted through all subsequent aeons to the astonishing present.
Imagine the nucleus of a human egg beneath the microscope. Arrange the twenty-three chromosomes, if you can, in order of size, the biggest on the left and the smallest on the right. Now zoom in on the largest chromosome, the one called, for purely arbitrary reasons, chromosome 1. Every chromosome has a long arm and a short arm separated by a pinch point known as a centromere. On the long arm of chromosome 1, close to the centromere, you will find, if you read it carefully, that there is a sequence of 120 letters – As, Cs, Gs and Ts – that repeats over and over again. Between each repeat there lies a stretch of more random text, but the 120-letter paragraph keeps coming back like a familiar theme tune, in all more than 100 times. This short paragraph is perhaps as close as we can get to an echo of the original word.
This ‘paragraph’ is a small gene, probably the single most active gene in the human body. Its 120 letters are constantly being copied into a short filament of RNA. The copy is known as 5S RNA. It sets up residence with a lump of proteins and other RNAs, carefully intertwined, in a ribosome, a machine whose job is to translate DNA recipes into proteins. And it is proteins that enable DNA to replicate. To paraphrase Samuel Butler, a protein is just a gene’s way of making another gene; and a gene is just a protein’s way of making another protein. Cooks need recipes, but recipes also need cooks. Life consists of the interplay of two kinds of chemicals: proteins and DNA.
Protein represents chemistry, living, breathing, metabolism and behaviour – what biologists call the phenotype. DNA represents information, replication, breeding, sex – what biologists call the genotype. Neither can exist without the other. It is the classic case of chicken and egg: which came first, DNA or protein? It cannot have been DNA, because DNA is a helpless, passive piece of mathematics, which catalyses no chemical reactions. It cannot have been protein, because protein is pure chemistry with no known way of copying itself accurately. It seems impossible either that DNA invented protein or vice versa. This might have remained a baffling and strange conundrum had not the word left a trace of itself faintly drawn on the filament of life. Just as we now know that eggs came long before chickens (the reptilian ancestors of all birds laid eggs), so there is growing evidence that RNA came before proteins.
RNA is a chemical substance that links the two worlds of DNA and protein. It is used mainly in the translation of the message from the alphabet of DNA to the alphabet of proteins. But in the way it behaves, it leaves little doubt that it is the ancestor of both. RNA was Greece to DNA’s Rome: Homer to her Virgil.
RNA was the word. RNA left behind five little clues to its priority over both protein and DNA. Even today, the ingredients of DNA are made by modifying the ingredients of RNA, not by a more direct route. Also DNA’s letter Ts are made from RNA’s letter Us. Many modern enzymes, though made of protein, rely on small molecules of RNA to make them work. Moreover, RNA, unlike DNA and protein, can copy itself without assistance: give it the right ingredients and it will stitch them together into a message. Wherever you look in the cell, the most primitive and basic functions require the presence of RNA. It is an RNA-dependent enzyme that takes the message, made of RNA, from the gene. It is an RNA-containing machine, the ribosome, that translates that message, and it is a little RNA molecule that fetches and carries the amino acids for the translation of the gene’s message. But above all, RNA – unlike DNA – can act as a catalyst, breaking up and joining other molecules including RNAs themselves. It can cut them up, join the ends together, make some of its own building blocks, and elongate a chain of RNA. It can even operate on itself, cutting out a chunk of text and splicing the free ends together again.7
The discovery of these remarkable properties of RNA in the early 1980s, made by Thomas Cech and Sidney Altman, transformed our understanding of the origin of life. It now seems probable that the very first gene, the ‘ur-gene’, was a combined replicator–catalyst, a word that consumed the chemicals around it to duplicate itself. It may well have been made of RNA. By repeatedly selecting random RNA molecules in the test tube based on their ability to catalyse reactions, it is possible to ‘evolve’ catalytic RNAs from scratch – almost to rerun the origin of life. And one of the most surprising results is that these synthetic RNAs often end up with a stretch of RNA text that reads remarkably like part of the text of a ribosomal RNA gene such as the 5S gene on chromosome 1.
Back before the first dinosaurs, before the first fishes, before the first worms, before the first plants, before the first fungi, before the first bacteria, there was an RNA world – probably somewhere around four billion years ago, soon after the beginning of planet earth’s very existence and when the universe itself was only ten billion years old. We do not know what these ‘ribo-organisms’ looked like. We can only guess at what they did for a living, chemically speaking. We do not know what came before them. We can be pretty sure that they once existed, because of the clues to RNA’s role that survive in living organisms today.8
These ribo-organisms had a big problem. RNA is an unstable substance, which falls apart within hours. Had these organisms ventured anywhere hot, or tried to grow too large, they would have faced what geneticists call an error catastrophe – a rapid decay of the message in their genes. One of them invented by trial and error a new and tougher version of RNA called DNA and a system for making RNA copies from it, including a machine we’ll call the proto-ribosome. It had to work fast and it had to be accurate. So it stitched together genetic copies three letters at a time, the better to be fast and accurate. Each threesome came flagged with a tag to make it easier for the proto-ribosome to find, a tag that was made of amino acid. Much later, those tags themselves became joined together to make proteins and the three-letter word became a form of code for the proteins – the genetic code itself. (Hence to this day, the genetic code consists of three-letter words, each spelling out a particular one of twenty amino acids as part of a recipe for a protein.) And so was born a more sophisticated creature that stored its genetic recipe on DNA, made its working machines of protein and used RNA to bridge the gap between them.
Her name was Luca, the Last Universal Common Ancestor. What did she look like, and where did she live? The conventional answer is that she looked like a bacterium and she lived in a warm pond, possibly by a hot spring, or in a marine lagoon. In the last few years it has been fashionable to give her a more sinister address, since it became clear that the rocks beneath the land and sea are impregnated with billions of chemical-fuelled bacteria. Luca is now usually placed deep underground, in a fissure in hot igneous rocks, where she fed on sulphur, iron, hydrogen and carbon. To this day, the surface life on earth is but a veneer. Perhaps ten times as much organic carbon as exists in the whole biosphere is in thermophilic bacteria deep beneath the surface, where they are possibly responsible for generating what we call natural gas.9
There is, however, a conceptual difficulty about trying to identify the earliest forms of life. These days it is impossible for most creatures to acquire genes except from their parents, but that may not always have been so. Even today, bacteria can acquire genes from other bacteria merely by ingesting them. There might once have been widespread trade, even burglary, of genes. In the deep past chromosomes were probably numerous and short, containing just one gene each, which could be lost or gained quite easily. If this was so, Carl Woese points out, the organism was not yet an enduring entity. It was a temporary team of genes. The genes that ended up in all of us may therefore have come from lots of different ‘species’ of creature and it is futile to try to sort them into different lineages. We are descended not from one ancestral Luca, but