分子生物学 1977

用链终止抑制剂进行 DNA 测序

弗雷德里克·桑格、史蒂夫·尼克伦与艾伦·库尔森

让 DNA 自我复制，却给每个碱基掺进一个「停」——长度便拼出序列。

Choose your version

In depth · the introduction

要读出一条 DNA 上字母的顺序，桑格的妙招是：让它自我复制——却悄悄掺进一些「停止」字母，于是复制品在每一个位置上堆叠起来，它们的长度便拼出序列。

核心想法

DNA 是一串由四个字母写成的密码——A、C、G、T。几十年里，我们看得见字母在那儿，却读不出它们的顺序。弗雷德里克·桑格找到了办法。你取来想读的那条链，让一台「复制酶」逐字搭出它的互补链。聪明之处在于你往混合液里加了什么：几个被「做了手脚」的字母——「双脱氧」版本——复制酶会乐意把它接上，可一旦接上，就再也不许任何字母加进来。每一个，都是一个句号。

把这件事做四遍，每种字母一遍，每遍只对 A、或只对 C……做手脚。在「遇 A 即停」的那一批里，你得到的是在序列中每一个 A 处都停下的复制品；在「遇 C 即停」的那一批里，得到的是在每一个 C 处停下的复制品。如今，分装在四堆里的，是一份「在每一个位置都停过」的复制品——而每一份复制品的长度，恰好告诉你它那个「停止字母」坐落在何处。

它是如何诞生的

到 1970 年代中期，DNA 的化学已经为人所知，但读出一段长序列依旧慢得令人发指。桑格在剑桥的医学研究理事会分子生物学实验室里默默工作，他早已因测出蛋白质——胰岛素——的序列而获过一次诺贝尔奖。他把同样的耐心转向了 DNA。他先与艾伦·库尔森做出一个较为笨拙的「加减」法；随后，1977 年，那个优雅的方法来了——双脱氧法，又叫链终止法。也正是这一年，一对美国人，阿兰·马克萨姆与沃尔特·吉尔伯特，发表了一种全然不同的化学方法。有一阵子两者都在用，但桑格的那个更省事，也正是机器最终能学会去跑的那一个。1980 年，它为他带来第二座诺贝尔奖。

它为何重要

在此之前，基因组是一本合着的书。桑格的方法把它翻开了。一旦能读出字母的顺序，你就能找出某种疾病背后的基因，把一个物种的 DNA 与另一个物种相比较，并核对一次编辑是否如你所愿。把它提速、交给机器，正是这同一个想法读出了整部人类基因组——三十亿个字母——奠定了现代遗传学与医学的根基。

一个可以想象的画面

想象你在复印一句很长的话，但你的复印机被动了手脚：它会随机地、在复印完某个特定字母——比方说每遇到一个「的」——之后卡住。印上一大叠，你就会得到停在第一个「的」、第二个「的」、第三个「的」……的一张张纸。把它们从短到长排好，每一张停下的位置，恰好告诉你那个字在整句话里落在哪儿。对每一个字都这么做一遍，你单凭这些卡住的复印件的长度，就能把整句话重建出来。那些「停止」字母，就是双脱氧碱基；而长度，是从一块凝胶上读出来的。

它的位置

沃森与克里克（1953）表明 DNA 是一种配对在双螺旋里的四字母密码；桑格的复制酶所利用的，正是这种配对。孟德尔那些抽象的「因子」，到此已成为可以读出的一段段字母。桑格测序随后与 PCR（1985）联手——后者把一个基因复制到足够读出的数量——一路通向人类基因组计划，通向今天能编辑基因的医学：在那里，一次 CRISPR 编辑，正是靠测出它所改动的那些字母来核对的。

The original document

Original source text

F. Sanger, S. Nicklen & A. R. Coulson · Proc. Natl. Acad. Sci. USA 74 (1977): 5463–5467

Sanger's group describes a way to read the order of bases along a DNA strand by copying it with a polymerase and making the copy stop, deliberately, at chosen letters. The abstract states the idea in full; the body of the paper is mapped in structure below, with its complete text at the source.

From the abstract

A new method for determining nucleotide sequences in DNA is described.

It is similar to the "plus and minus" method but makes use of the 2′,3′-dideoxy and arabinonucleoside analogues of the normal deoxynucleoside triphosphates, which act as specific chain-terminating inhibitors of DNA polymerase.

The technique has been applied to the DNA of bacteriophage ϕX174 and is more rapid and more accurate than either the plus or the minus method.

How the dideoxy method works (structural map)

A short primer is annealed to a single-stranded template and extended by DNA polymerase. The reaction is split four ways; each tube holds all four normal deoxynucleotides plus a trace of one 2′,3′-dideoxynucleotide (ddNTP). The dideoxy analogue is incorporated normally but has no 3′-hydroxyl, so the chain cannot be extended past it: synthesis stops specifically at that base.

Because only a fraction of chains terminate at each matching position, each tube yields a nested set of fragments — ending at every A, or every C, every G, every T — all sharing the primer's 5′ end and labelled radioactively. Run side by side on a denaturing polyacrylamide gel, separated to single-base resolution, the fragments form a ladder; read from the bottom (shortest) up, it gives the synthesized strand 5′→3′.

[ … ]

The dideoxy method built on the earlier "plus and minus" method of Sanger and Coulson (1975) and on the M13 single-stranded cloning system. A different chemical-cleavage method by Maxam and Gilbert appeared the same year; the dideoxy approach prevailed because it proved simpler and, later, automatable.

MRC Laboratory of Molecular Biology, Cambridge · 1977