JOVANA
Library Glossary Getting Started Three Levels Fields How it works Mission
Join the mission
All guides

Molecular Cloning Step by Step

Cut the gene and the vector with the same enzyme, glue them together, slip the result into a bacterium, and let one cell grow into millions of identical copies. Walk the classic cut-paste-copy-and-pick workflow that turned biology into an engineering science.

What "cloning a gene" actually means

By now you have met the two halves of the toolkit on this rung. A restriction enzyme is a molecular scissors that cuts double-stranded DNA only where it finds a specific short sequence; many of them cut the two strands at staggered points and leave short single-stranded overhangs. A vector is a small, self-contained DNA circle — a plasmid — that a bacterium will happily copy along with its own genome. Molecular cloning is simply the act of putting a piece of DNA you care about *into* such a vector, getting that vector into a living cell, and letting the cell multiply. "Cloning" here has nothing to do with cloning sheep. It means making many faithful copies of one specific stretch of DNA.

Why was this revolutionary? Before cloning, a gene was a needle lost in a haystack of billions of bases — present in vanishingly few copies, impossible to study or read in isolation. Cloning solved that at a single stroke. Once your gene of interest sits in a plasmid inside a bacterium, every time the cell divides it copies your gene too, for free, overnight. By morning a few bacteria have become billions, each carrying an identical recombinant DNA molecule, and you can harvest pure milligrams of one chosen sequence. A vanishingly rare gene becomes an inexhaustible, identical supply. That is the leap that turned biology from a science you only observe into one you can build with.

Cut with the same scissors, so the ends fit

Here is the elegant idea at the heart of classic cloning. You cut the insert and the vector with the *same* restriction enzyme. Because the enzyme always cuts its recognition site the same staggered way, every fragment it produces ends in the *same* short single-stranded overhang — a few unpaired bases hanging off the end. These are sticky ends, and the name is literal: an overhang on the insert can base-pair with the complementary overhang on the cut vector, A reaching across to T and G to C, exactly as the two strands of a helix do. The two pieces find and hold onto each other on their own, guided by nothing but the rules of base pairing.

EcoRI cuts the sequence GAATTC the same staggered way everywhere:

   5'-...G     A A T T C...-3'
   3'-...C T T A A     G...-5'
           ^^^^^
        the overhang AATT is left single-stranded on BOTH cut pieces

VECTOR (cut)            INSERT (cut, same enzyme)
   ...G        AATT-...     ...-AATT        C...
   ...CTTAA        ...     ...        TTAA...G

   the AATT overhangs are complementary -> they pair up and snap together
Cutting both molecules with one enzyme (here EcoRI) leaves identical, complementary overhangs, so the insert and vector base-pair together like two puzzle pieces.

But base pairing alone only makes the two ends *touch*; the sugar-phosphate backbone is still broken on both strands, held together by nothing stronger than a handful of hydrogen bonds. To make the join permanent you need a second enzyme, DNA ligase — the same molecular welder the cell uses to seal the gaps between Okazaki fragments during replication, which you met on an earlier rung. Ligase seals the nicks in the backbone, forging the covalent bonds that turn two loose pieces into one continuous, unbroken DNA circle. Cut to create matching ends, then ligate to make them one: that is the whole trick of joining DNA at will.

Getting the construct into a living cell

You now have a tube of recombinant plasmids — but a plasmid in a tube does nothing. It cannot copy itself; only a living cell can do that. So the next move is transformation: persuading bacteria to take up your plasmid from the surrounding liquid. Bacteria do not normally swallow loose DNA from their environment, so first you make them *competent* — temporarily leaky — usually by bathing them in ice-cold calcium chloride, which neutralises the repulsion between the negatively charged DNA and the negatively charged cell surface. Then comes a brief, sharp heat shock, a jump to about 42 degrees for some seconds and straight back onto ice. That thermal jolt opens transient pores in the membrane, and a lucky few cells slurp a plasmid inside.

Be honest about how inefficient this is: only a tiny fraction of the cells ever take up a plasmid, and most of those take up the wrong kind. Crucially, a plasmid is built to survive once it is inside — it carries its own origin of replication, the start signal the cell's copying machinery recognises, so the bacterium duplicates the plasmid every generation and passes a copy to every daughter. Spread the transformed cells on a nutrient plate, leave them overnight, and each single successful cell grows into a visible mound of millions of genetically identical descendants — a colony, a clone you can see with the naked eye. The trouble is that the plate is now a mix: cells with your construct, cells with no plasmid at all, and cells with plasmids that closed back up empty. You need a way to find the winners.

Select, then screen: finding the right colony

Two clever filters separate the winners from the crowd, and it pays to keep them apart in your head. Selection asks the cruder question — does this cell have *any* plasmid at all? — and answers it by killing everyone who does not. Screening asks the finer question — of the cells that kept a plasmid, which ones got a plasmid carrying our *insert*? — and answers it with a visible signal rather than death. Selection clears the field; screening picks the champion. Do selection first, screening second.

Selection rides on antibiotic resistance. The vector is engineered to carry a selectable marker — typically a gene encoding an enzyme that destroys an antibiotic such as ampicillin. Pour that antibiotic into the agar, and only cells that took up the plasmid can survive on the plate; every cell that missed the plasmid is poisoned and never forms a colony. So every colony you *see* on the plate is guaranteed to carry a plasmid. That is a beautiful filter, but it is not enough on its own: it cannot tell a plasmid that swallowed your insert from one that simply re-sealed empty, because both kinds still carry the resistance gene.

That second question is what blue-white screening answers, and it is a small masterpiece of design. The vector's cloning site — the spot where you insert your gene — sits *inside* a marker gene, often a fragment of the gene for an enzyme called beta-galactosidase. Plate the bacteria with a colourless sugar mimic called X-gal: if the marker gene is intact, the enzyme is made, it cleaves X-gal, and the colony turns blue. But when your insert lands in the cloning site, it interrupts that marker gene, so no working enzyme is made, X-gal stays uncut, and the colony stays white. The logic flips the usual instinct: the *white* colonies are the ones you want, because white means the insert broke the gene. You pick white colonies off the plate and ignore the blue.

The whole workflow, and why it changed everything

  1. Cut. Digest the insert and the vector with the same restriction enzyme, so both come away with identical, complementary sticky ends.
  2. Paste. Mix the two and add DNA ligase; the matching overhangs base-pair, and ligase seals the backbone into one recombinant circle.
  3. Transform. Make bacteria competent with cold calcium and a heat shock, and let a few of them take up the recombinant plasmid.
  4. Copy. Spread on a plate overnight; each successful cell, with its built-in origin of replication, multiplies into a colony of millions of identical clones.
  5. Select. Add an antibiotic to the plate so only cells carrying a plasmid (and its resistance marker) can survive and form colonies.
  6. Screen. Use blue-white screening to spot, among the survivors, the white colonies whose plasmid actually carries the insert — then confirm by sequencing.

Step back and look at what those six steps deliver: take any DNA sequence you like, make a limitless supply of it, and put it to work. Once a gene is cloned you can read it letter by letter, mutate one base on purpose with site-directed mutagenesis to learn what that base does, or place it behind a strong promoter so the bacterium pours out the protein it encodes — the route by which recombinant proteins like human insulin first came out of a vat instead of an animal pancreas. The same cut-paste-copy-and-pick logic underlies gene therapy, engineered crops, and the antibody drugs in a modern pharmacy. Biology had become an engineering science: you could now build with genes, not merely catalogue them.

One honest caveat before you climb on. This restriction-and-ligation method is the classic that defined an era, and it is still the clearest way to *understand* cloning — but it is no longer the only way, nor always the easiest. Its limits are real: you are at the mercy of which restriction sites happen to sit in your sequence, sticky-end joins can go in backwards or not at all, and a re-sealed empty vector wastes your time. Newer techniques sidestep some of these by amplifying ends with PCR or by joining pieces without any restriction cut at all. The principle you just learned, though — cut, join, put it in a cell, then copy and pick — is the bedrock under every one of them.