经济学 1950

n 人博弈中的均衡点

约翰·福布斯·纳什

任何博弈中，都存在这样一种局面：没有人能靠单方面改变策略而获益。

Choose your version

In depth · the introduction

一场博弈，总能僵持成这样一种局面：只盯着自己得失的玩家，谁都没有理由再动一步——哪怕换个地方，所有人本可以都过得更好。

核心想法

大多数要紧的处境——企业定价、国家谈判、司机选路——都是博弈：什么对你最好，取决于其他每个人怎么做。约翰·纳什对这类博弈问了一个简单的问题：是否总存在一个稳定的歇脚点？他证明了：存在。它如今叫作纳什均衡——一种选择的组合，在其中，只要别人都不改，任何单独一位玩家都无法靠改变自己的选择而变得更好。

玄机在于：「稳定」并不等于「对所有人都好」。在著名的囚徒困境里，两名嫌犯无论对方怎么做，自己背叛都更划算——于是双方都背叛，结果两人都比保持沉默时更糟。这个结局，就是均衡：不是因为它最好，而是因为谁也无法靠一己之力把自己的处境改善。

它是如何诞生的

博弈论有一位巍然的奠基者——约翰·冯·诺伊曼，他与经济学家奥斯卡·摩根斯特恩在 1944 年写下了这门学问的「圣经」。但他们最锋利的结果，是关于两人零和博弈的：一方所得，恰是另一方所失。可大多数现实中的冲突，并非如此。1949 年，一位 21 岁的普林斯顿研究生约翰·纳什，定义了一种对任意人数、任意「合作与冲突之混合」都适用的均衡，并证明它总是存在。

他的宣告，1950 年仅占一页篇幅，完整的博士论文则在 1951 年发表。承认来得很慢：纳什的事业被精神分裂症中断了数十年，这段经历后来被电影《美丽心灵》讲述。1994 年，他凭这项工作，分享了诺贝尔经济学纪念奖。

它为何重要

它交给经济学与社会科学一件单一而普遍的工具，用以预测「人们的利益部分冲突、又部分一致」时的结局——而这几乎是一切处境的常态。纳什之前，这门理论多半只能应付纯粹的对决。纳什之后，同一个想法便能对准市场、拍卖、军备竞赛、演化与交通——凡是独立的决策者彼此塑造对方最佳应对之处，皆可。

一个可以想象的画面

想象在繁忙的超市挑一条结账队伍。一旦各队排得差不多平了，就没有哪一位顾客能换到另一条队而更快通过——若真有谁能，他早就挪过去了。这种平衡的排布，谁都无法靠单独换队而变好，就是一个均衡。它未必是对所有人而言最快的；它只是这样一个点：每个人只为自己打算，却已再也找不到挪动的理由。

它的位置

亚当·斯密的「看不见的手」（1776）设想自利会汇总成集体的善；而囚徒困境，正是那记尖锐的提醒——它并不总是如此：自利会把所有人一同锁进一个更糟的结局。纳什均衡，是这两者共同的精确语言。它的逻辑后来在生物学中重现，约翰·梅纳德·史密斯把它改铸为「演化稳定策略」；也在计算中重现，今天的人工智能系统，常常就是被当作「博弈到均衡」来训练的。

The original document

Original source text

J. F. Nash, Jr. · Proc. Natl. Acad. Sci. USA 36 (1950): 48–49 · communicated by S. Lefschetz, Nov. 16, 1949

One may define a concept of an n-person game in which each player has a finite set of pure strategies and in which a definite set of payments to the n players corresponds to each n-tuple of pure strategies, one strategy being taken for each player.

For mixed strategies, which are probability distributions over the pure strategies, the pay-off functions are the expectations of the players, thus becoming polylinear forms in the probabilities with which the various players play their various pure strategies.

Any n-tuple of strategies, one for each player, may be regarded as a point in the product space obtained by multiplying the n strategy spaces of the players. One such n-tuple counters another if the strategy of each player in the countering n-tuple yields the highest obtainable expectation for its player against the n − 1 strategies of the other players in the countered n-tuple.

A self-countering n-tuple is called an equilibrium point.

[ … ]

By using the continuity of the pay-off functions we see that the graph of the mapping is closed. Since the graph is closed and since the image of each point under the mapping is convex, we infer from Kakutani's theorem that the mapping has a fixed point (i.e., point contained in its image).

[ … ]

The author is indebted to Dr. David Gale for suggesting the use of Kakutani's theorem to simplify the proof and to the A. E. C. for financial support.

Department of Mathematics, Princeton University · November 16, 1949