JOVANA
Library Glossary Getting Started Three Levels Fields How it works Mission
Join the mission
All guides

Vectors & Plasmids

Restriction enzymes cut a gene loose — but a naked fragment of DNA, dropped into a cell, simply rots. A vector is the vehicle that carries your gene in, gets it copied billions of times, and lets you find the cells that took it. Meet the plasmid and its three essential parts, the bigger vehicles for bigger cargo, and the expression vector that turns a cloned gene into protein.

Why a cut gene needs a ride

In the last guide you watched a restriction enzyme do something almost magical: scan along a double helix, find one short palindrome, and snip both strands at precisely the same place, leaving a fragment with neat overhanging ends. Suppose that fragment is the gene you care about — say, the human gene for insulin. You now hold it as a tiny, free-floating piece of DNA in a tube. What can you do with it? On its own, almost nothing. There is far too little of it to study, you cannot read a single molecule, and the instant you put it inside a living cell, the cell's clean-up enzymes chew it to pieces. A bare gene is information with no way to be kept, copied, or read.

The fix is to stop treating the gene as cargo to study directly, and instead glue it into a piece of DNA that a cell already knows how to keep and copy. That carrier is a cloning vector — a small, well-behaved DNA molecule engineered to be a vehicle. You paste your fragment into the vector to make a single combined molecule of recombinant DNA, slip the whole thing into a bacterium, and let the bacterium do the rest. As the cell grows and divides, it copies the vector — and your passenger gene along with it — over and over. From one cell you grow billions of identical cells overnight, each carrying a perfect copy of your gene. That, in one sentence, is what cloning means: not making a sheep, but making a population of cells that all hold the same chosen piece of DNA.

The plasmid and its three essential parts

The workhorse vector is the plasmid: a small ring of double-stranded DNA, typically a few thousand base pairs around, that lives inside bacteria entirely separate from the main chromosome. Plasmids are not a human invention — bacteria carry them naturally, often passing antibiotic-resistance genes around on them. Molecular biologists simply borrowed the idea and rebuilt the ring, trimming away what they did not want and adding what they did. A modern cloning plasmid is a tiny engineered circle, and almost every one carries the same three working parts. Learn these three and you understand the heart of the whole toolkit.

First, the origin of replication, often written *ori*. This is a short stretch of sequence the cell's replication machinery recognises as a place to start copying — the same kind of origin you met when you learned how chromosomes are duplicated. Without it the plasmid would just sit there inert and be diluted away as the cell divides. With it, the plasmid gets copied by the host's own enzymes every generation, so it persists and multiplies. The origin also quietly sets the copy number: some origins keep just one or two copies of the plasmid per cell, while others let it run away to hundreds, which is what you want when the goal is to harvest as much DNA as possible.

Second, the selectable marker — the answer to a brutal practical problem. When you mix plasmids with bacteria, only a tiny fraction of the cells actually take one up; the rest get nothing. If you simply spread the mixture on a dish, the cells that took up your plasmid are hopelessly outnumbered by cells that did not, and you could never pick them out. The classic selectable marker is an antibiotic-resistance gene built into the plasmid. You grow the cells on a plate laced with that antibiotic, and the logic becomes deadly simple: any cell without the plasmid has no resistance and dies, while every survivor must be carrying your plasmid. The poison does the sorting for you. A field of colonies on the plate is a field of cells you know are holding your gene.

Third, the multiple cloning site, also called a polylinker. This is the loading dock where your gene goes in. It is a short, carefully designed stretch of DNA packed with the recognition sites for many different restriction enzymes, each appearing *only once* in the entire plasmid. That uniqueness is the whole point. Cut the plasmid with one of those enzymes and it opens at exactly one place — the cloning site — leaving the rest of the ring untouched, with overhanging ends that match the ends on your insert. Your fragment and the opened plasmid now have complementary sticky ends, so they snap together by base-pairing, and a DNA ligase seals the joins into one continuous circle. The plasmid is, in effect, a ready-made slot waiting for any gene you can cut to fit.

How a vector turns a fragment into something you can grow

Watch the three parts work together in a single run. The result is the central trick of recombinant DNA: a fragment that was useless on its own becomes a living, growing, harvestable colony of identical copies. Notice that nothing here required exotic chemistry — restriction enzymes cut, ligase joins, and the bacterium supplies the copying and the growth. You are mostly choosing the right pieces and letting biology run.

  1. Cut both. Open the plasmid at its cloning site with a restriction enzyme, and cut your gene out of its source DNA with the same enzyme, so both pieces end up with matching sticky ends.
  2. Join them. Mix the opened plasmid with your fragment; the complementary overhangs base-pair, and DNA ligase seals the backbone, producing one recombinant circle carrying your gene.
  3. Get it into a cell. Mix the circles with bacteria under conditions that briefly make the cells take up DNA — this uptake is called transformation. Only a minority of cells succeed.
  4. Select. Spread the cells on a plate containing the antibiotic. Only cells carrying the plasmid (and its resistance marker) survive, so every colony that grows holds your gene.
  5. Grow and harvest. Pick a colony, grow it overnight into billions of cells, and each copies the plasmid as it divides. Burst the cells open and you recover your gene in vast, pure quantity.

Bigger cargo needs bigger vehicles

Plasmids are wonderful, but they have a load limit. Push past roughly fifteen thousand base pairs of insert and a plasmid becomes unstable — it copies poorly, and cells tend to spit it out. That is fine for a single gene, but a human gene with all its introns, or a whole region of a chromosome you want to study, can be far too big to fit. So a family of high-capacity vectors was built, each trading some convenience for the ability to carry a larger passenger. The trick across all of them is the same: borrow a natural system that already moves big pieces of DNA around, and reshape it into a vehicle.

The first step up borrows a virus. A bacteriophage is a virus that infects bacteria, and the well-studied phage lambda comes with a protein shell that efficiently injects its DNA into a cell. Strip out the phage's own middle and you can slot in around twenty thousand base pairs of foreign DNA, then let the phage package and deliver it — a far more efficient way in than coaxing bacteria to swallow naked plasmids. Push the idea further and you get the cosmid, a clever hybrid that keeps the phage's packaging signal but otherwise behaves like a plasmid once inside, carrying up to roughly forty-five thousand base pairs. Each rung up this ladder buys you more room for cargo.

For the truly enormous inserts that genome projects needed, two giants top out the range. The BAC — bacterial artificial chromosome — is built from a natural bacterial plasmid that copies at just one or two copies per cell, which keeps even a huge insert stable; a BAC routinely carries a few hundred thousand base pairs, and BACs were the backbone of the Human Genome Project. Bigger still is the YAC — yeast artificial chromosome — which is not grown in bacteria at all but in yeast, and is assembled from the bare essentials of a real chromosome: an origin, a centromere so it gets pulled correctly when the cell divides, and telomeres to cap its ends. A YAC can hold a million base pairs or more, though it is fiddlier and more prone to rearranging its insert. As a rough rule, the bigger the cargo, the more you give up in convenience and stability to carry it.

VECTOR              TYPICAL INSERT SIZE        GROWN IN
-------------------------------------------------------------
plasmid             up to ~15 kb               bacteria
bacteriophage (lambda)  ~9-23 kb               bacteria (phage)
cosmid              ~30-45 kb                  bacteria
BAC                 ~100-300 kb                bacteria
YAC                 ~100 kb - >1,000 kb        yeast

   ( kb = kilobases = thousand base pairs )
   bigger cargo  ->  bigger, fiddlier vehicle
A rough ladder of vectors by how much DNA they carry — pick the smallest vehicle that fits your insert, because the bigger ones trade away convenience and stability.

From copying a gene to making its protein

Everything so far has been about *copying* a gene — keeping it, multiplying it, harvesting pure DNA. But often you want something more: the actual protein the gene encodes, made in bulk. A plain cloning vector will not do this on its own. Recall the central dogma from early in this ladder — DNA -> RNA -> protein. A gene only becomes protein if the cell transcribes it and then translates it, and that requires the right signals around the gene. A bacterium will not automatically read a human gene you have simply parked in its cell; the human gene's own on-switches mean nothing to bacterial machinery.

The answer is the expression vector — a cloning vector with extra parts that hand the host cell explicit instructions to make protein. Just upstream of the cloning site sits a strong promoter the host's RNA polymerase recognises, plus the signals a ribosome needs to start translating, so once your gene is dropped in, the cell transcribes and translates it at full tilt. The best expression vectors also put the promoter under a switch you control — typically a small molecule you add to the culture — so the cell grows up peacefully first and only then is told to pour its energy into churning out your protein, which can otherwise be toxic in large amounts. This is recombinant protein expression, and it is how essentially all therapeutic insulin is now made: the human insulin gene, placed in an expression vector, grown in vats of bacteria or yeast that secrete the human protein.

Two honest caveats keep this from sounding like magic. First, a bacterium reads a gene a little differently from a human cell — it cannot splice out introns and it skips many of the chemical decorations a human protein needs to fold and work, so for some proteins you must instead use yeast, insect, or mammalian cells as the host, with vectors built for them. (This is one reason biologists often clone the intron-free cDNA version of a gene rather than the raw genomic copy — but that is a story for the next guides.) Second, foreign protein forced out at high speed can clump into useless insoluble lumps, so coaxing a good yield of properly folded, active protein is a craft in itself. An expression vector gives you the machinery to make protein; getting *usable* protein out of it is where much of the real work still lives.