經濟學 1950

n 人賽局中的均衡點

約翰·福布斯·納什

任何賽局中，都存在這樣一種局面：沒有人能靠單方面改變策略而獲益。

Choose your version

In depth · the introduction

一場賽局，總能僵持成這樣一種局面：只盯著自己得失的玩家，誰都沒有理由再動一步——哪怕換個地方，所有人本可以都過得更好。

核心想法

大多數要緊的處境——企業定價、國家談判、司機選路——都是賽局：什麼對你最好，取決於其他每個人怎麼做。約翰·納什對這類賽局問了一個簡單的問題：是否總存在一個穩定的歇腳點？他證明了：存在。它如今叫作納什均衡——一種選擇的組合，在其中，只要別人都不改，任何單獨一位玩家都無法靠改變自己的選擇而變得更好。

玄機在於：「穩定」並不等於「對所有人都好」。在著名的囚徒困境裡，兩名嫌犯無論對方怎麼做，自己背叛都更划算——於是雙方都背叛，結果兩人都比保持沉默時更糟。這個結局，就是均衡：不是因為它最好，而是因為誰也無法靠一己之力把自己的處境改善。

它是如何誕生的

賽局理論有一位巍然的奠基者——約翰·馮·紐曼，他與經濟學家奧斯卡·摩根斯特恩在 1944 年寫下了這門學問的「聖經」。但他們最鋒利的結果，是關於兩人零和賽局的：一方所得，恰是另一方所失。可大多數現實中的衝突，並非如此。1949 年，一位 21 歲的普林斯頓研究生約翰·納什，定義了一種對任意人數、任意「合作與衝突之混合」都適用的均衡，並證明它總是存在。

他的宣告，1950 年僅佔一頁篇幅，完整的博士論文則在 1951 年發表。承認來得很慢：納什的事業被思覺失調症中斷了數十年，這段經歷後來被電影《美麗境界》講述。1994 年，他憑這項工作，分享了諾貝爾經濟學紀念獎。

它為何重要

它交給經濟學與社會科學一件單一而普遍的工具，用以預測「人們的利益部分衝突、又部分一致」時的結局——而這幾乎是一切處境的常態。納什之前，這門理論多半只能應付純粹的對決。納什之後，同一個想法便能對準市場、拍賣、軍備競賽、演化與交通——凡是獨立的決策者彼此塑造對方最佳應對之處，皆可。

一個可以想像的畫面

想像在繁忙的超市挑一條結帳隊伍。一旦各隊排得差不多平了，就沒有哪一位顧客能換到另一條隊而更快通過——若真有誰能，他早就挪過去了。這種平衡的排布，誰都無法靠單獨換隊而變好，就是一個均衡。它未必是對所有人而言最快的；它只是這樣一個點：每個人只為自己打算，卻已再也找不到挪動的理由。

它的位置

亞當·斯密的「看不見的手」（1776）設想自利會匯總成集體的善；而囚徒困境，正是那記尖銳的提醒——它並不總是如此：自利會把所有人一同鎖進一個更糟的結局。納什均衡，是這兩者共同的精確語言。它的邏輯後來在生物學中重現，約翰·梅納德·史密斯把它改鑄為「演化穩定策略」；也在計算中重現，今天的人工智慧系統，常常就是被當作「賽局到均衡」來訓練的。

The original document

Original source text

J. F. Nash, Jr. · Proc. Natl. Acad. Sci. USA 36 (1950): 48–49 · communicated by S. Lefschetz, Nov. 16, 1949

One may define a concept of an n-person game in which each player has a finite set of pure strategies and in which a definite set of payments to the n players corresponds to each n-tuple of pure strategies, one strategy being taken for each player.

For mixed strategies, which are probability distributions over the pure strategies, the pay-off functions are the expectations of the players, thus becoming polylinear forms in the probabilities with which the various players play their various pure strategies.

Any n-tuple of strategies, one for each player, may be regarded as a point in the product space obtained by multiplying the n strategy spaces of the players. One such n-tuple counters another if the strategy of each player in the countering n-tuple yields the highest obtainable expectation for its player against the n − 1 strategies of the other players in the countered n-tuple.

A self-countering n-tuple is called an equilibrium point.

[ … ]

By using the continuity of the pay-off functions we see that the graph of the mapping is closed. Since the graph is closed and since the image of each point under the mapping is convex, we infer from Kakutani's theorem that the mapping has a fixed point (i.e., point contained in its image).

[ … ]

The author is indebted to Dr. David Gale for suggesting the use of Kakutani's theorem to simplify the proof and to the A. E. C. for financial support.

Department of Mathematics, Princeton University · November 16, 1949