博弈论导论 2

格式：pdf
大小：398.77 KB
文档页数：15

下载文档原格式

博弈论PPT课件

有i si 0, i si 1 si Si
这就是混合策略。
混合策略的纳什均衡定义
如果对于博弈中所有的游戏者i，对于所有的 σi∈Mi，都有ui﹙σ*﹚≥ui﹙σi，σ-i*﹚，则称 σ*就是一个混合策略的纳什均。
如何求混合策略的纳什均衡
猜硬币的博弈中解：设猜方猜正方的概率为p，猜反方的概率则为1－
无名氏（大众）定理
无名氏定理：在无穷次重复的由n个游戏者参与的博弈里，如果在每一次重复中博弈的行动集是有限的，则在满足下列三个条件时，在任何有限次重复中所观察到的任何行动组合都是某个子博弈完美均衡的惟一结果：
条件1：贴现因子接近于1；条件2：在每一次重复中，博弈结束的概率或等于0，或为非常小的一个正值；条件3：严格占优于一次性博弈中的最小最大收益组合的那个收益组合集是n维的。
博弈方
博弈方：独立决策、独立承担博弈结果的个人或组织
博弈规则面前博弈方之间平等，不因博弈方之间权利、地位的差异而改变
博弈方数量对博弈结果和分析有影响根据博弈方数量分单人博弈、两人博弈、多人
博弈等。最常见的是两人博弈，单人博弈是退化的博弈
策略
策略：博弈中各博弈方的选择内容策略有定性定量、简单复杂之分不同博弈方之间不仅可选策略不同，而且可
游戏和经济等决策竞争较量的共同特征：规则、结果、策略选择，策略和利益相互依存，策略的关键作用
游戏——下棋、猜大小经济——寡头产量决策、市场阻入、投标拍卖政治、军事——美国和伊朗、以色列和巴勒斯坦、中国和日本等等。
博弈的基本要素
博弈的参加者(Player)——博弈方各博弈方的策略(Strategies)或行动(Actions) 博弈的次序(Order) 博弈方的收益(Payoffs) （或称支付，或得益）

lecture_2（博弈论讲义GameTheory（MIT））

Last Time:Defined knowledge, common knowledge, meet (of partitions), and reachability.Reminders:• E is common knowledge at ω if ()I K E ω∞∈.• “Reachability Lemma” :'()M ωω∈ if there is a chain of states 01,,...m 'ωωωωω== such that for each k ω there is a player i(k) s.t. ()()1()(i k k i k k h h )ωω+=:• Theorem: Event E is common knowledge at ωiff ()M E ω⊆.How does set of NE change with information structure?Suppose there is a finite number of payoff matrices 1,...,L u u for finite strategy sets 1,...,I S SState space Ω, common prior p, partitions , and a map i H λso that payoff functions in state ω are ()(.)u λω; the strategy spaces are maps from into . i H i SWhen the state space is finite, this is a finite game, and we know that NE is u.h.c. and generically l.h.c. in p. In particular, it will be l.h.c. at strict NE.The “coordinated attack” game8,810,11,100,0A B A B-- 0,010,11,108,8A B A B--a ub uΩ= 0,1,2,….In state 0: payoff functions are given by matrix ; bu In all other states payoff functions are given by . a upartitions of Ω1H : (0), (1,2), (3,4),… (2n-1,2n)... 2H (0,1),(2,3). ..(2n,2n+1)…Prior p : p(0)=2/3, p(k)= for k>0 and 1(1)/3k e e --(0,1)ε∈.Interpretation: coordinated attack/email:Player 1 observes Nature’s choice of payoff matrix, sends a message to player 2.Sending messages isn’t a strategic decision, it’s hard-coded.Suppose state is n=2k >0. Then 1 knows the payoffs, knows 2 knows them. Moreover 2 knows that 1knows that 2 knows, and so on up to strings of length k: . 1(0n I n K n -Î>)But there is no state at which n>0 is c.k. (to see this, use reachability…).When it is c.k. that payoff are given by , (A,A) is a NE. But.. auClaim: the only NE is “play B at every information set.”.Proof: player 1 plays B in state 0 (payoff matrix ) since it strictly dominates A. b uLet , and note that .(0|(0,1))q p =1/2q >Now consider player 2 at information set (0,1).Since player 1 plays B in state 0, and the lowest payoff 2 can get to B in state 1 is 0, player 2’s expected payoff to B at (0,1) is at least 8. qPlaying A gives at most 108(1)q q −+−, and since , playing B is better. 1/2q >Now look at player 1 at 1(1,2)h =. Let q'=p(1|1,2), and note that '1(1)q /2εεεε=>+−.Since 2 plays B in state 1, player 1's payoff to B is at least 8q';1’s payoff to A is at most -10q'+8(1-q) so 1 plays B Now iterate..Conclude that the unique NE is always B- there is no NE in which at some state the outcome is (A,A).But (A,A ) is a strict NE of the payoff matrix . a u And at large n, there is mutual knowledge of the payoffs to high order- 1 knows that 2 knows that …. n/2 times. So “mutual knowledge to large n” has different NE than c.k.Also, consider "expanded games" with state space . 0,1,....,...n Ω=∞For each small positive ε let the distribution p ε be as above: 1(0)2/3,()(1)/3n p p n ee e e -==- for 0 and n <<∞()0p ε∞=.Define distribution by *p *(0)2/3p =,. *()1/3p ∞=As 0ε→, probability mass moves to higher n, andthere is a sense in which is the limit of the *p p εas 0ε→.But if we do say that *p p ε→ we have a failure of lower hemi continuity at a strict NE.So maybe we don’t want to say *p p ε→, and we don’t want to use mutual knowledge to large n as a notion of almost common knowledge.So the questions:• When should we say that one information structure is close to another?• What should we mean by "almost common knowledge"?This last question is related because we would like to say that an information structure where a set of events E is common knowledge is close to another information structure where these events are almost common knowledge.Monderer-Samet: Player i r-believes E at ω if (|())i p E h r ω≥.()r i B E is the set of all ω where player i r- believesE; this is also denoted 1.()ri B ENow do an iterative definition in the style of c.k.: 11()()rr I i i B E B E =Ç (everyone r-believes E) 1(){|(()|())}n r n ri i I B E p B E h r w w -=³ ()()n r n rI i i B E B =ÇEE is common r belief at ω if ()rI B E w ¥ÎAs with c.k., common r-belief can be characterized in terms of public events:• An event is a common r-truism if everyone r -believes it when it occurs.• An event is common r -belief at ω if it is implied by a common r-truism at ω.Now we have one version of "almost ck" : An event is almost ck if it is common r-belief for r near 1.MS show that if two player’s posteriors are common r-belief, they differ by at most 2(1-r): so Aumann's result is robust to almost ck, and holds in the limit.MS also that a strict NE of a game with knownpayoffs is still a NE when payoffs are "almost ck” - a form of lower hemi continuity.More formally:As before consider a family of games with fixed finite action spaces i A for each player i. a set of payoff matrices ,:l I u A R ->a state space W , that is now either finite or countably infinite, a prior p, a map such that :1,,,L l W®payoffs at ω are . ()(,)()w u a u a l w =Payoffs are common r-belief at ω if the event {|()}w l w l = is common r belief at ω.For each λ let λσ be a NE for common- knowledgepayoffs u .lDefine s * by *(())s l w w s =.This assigns each w a NE for the corresponding payoffs.In the email game, one such *s is . **(0)(,),()(,)s B B s n A A n ==0∀>If payoffs are c.k. at each ω, then s* is a NE of overall game G. (discuss)Theorem: Monder-Samet 1989Suppose that for each l , l s is a strict equilibrium for payoffs u λ.Then for any there is 0e >1r < and 1q < such that for all [,1]r r Î and [,1]q q Î,if there is probability q that payoffs are common r- belief, then there is a NE s of G with *(|()())1p s s ωωω=>ε−.Note that the conclusion of the theorem is false in the email game:there is no NE with an appreciable probability of playing A, even though (A,A) is a strict NE of the payoffs in every state but state 0.This is an indirect way of showing that the payoffs are never ACK in the email game.Now many payoff matrices don’t have strictequilibria, and this theorem doesn’t tell us anything about them.But can extend it to show that if for each state ω, *(s )ω is a Nash (but not necessarily strict Nash) equilibrium, then for any there is 0e >1r < and 1q < such that for all [,1]r r Î and [,1]q q Î, if payoffs are common r-belief with probability q, there is an “interim ε equilibria” of G where s * is played with probability 1ε−.Interim ε-equilibria:At each information set, the actions played are within epsilon of maxing expected payoff(((),())|())((',())|())i i i i i i i i E u s s h w E u s s h w w w w e-->=-Note that this implies the earlier result when *s specifies strict equilibria.Outline of proof:At states where some payoff function is common r-belief, specify that players follow s *. The key is that at these states, each player i r-believes that all other players r-believe the payoffs are common r-belief, so each expects the others to play according to s *.*ΩRegardless of play in the other states, playing this way is a best response, where k is a constant that depends on the set of possible payoff functions.4(1)k −rTo define play at states in */ΩΩconsider an artificial game where players are constrained to play s * in - and pick a NE of this game.*ΩThe overall strategy profile is an interim ε-equilibrium that plays like *s with probability q.To see the role of the infinite state space, consider the"truncated email game"player 2 does not respond after receiving n messages, so there are only 2n states.When 2n occurs: 2 knows it occurs.That is, . {}2(0,1),...(22,21,)(2)H n n =−−n n {}1(0),(1,2),...(21,2)H n =−.()2|(21,2)1p n n n ε−=−, so 2n is a "1-ε truism," and thus it is common 1-ε belief when it occurs.So there is an exact equilibrium where players playA in state 2n.More generally: on a finite state space, if the probability of an event is close to 1, then there is high probability that it is common r belief for r near 1.Not true on infinite state spaces…Lipman, “Finite order implications of the common prior assumption.”His point: there basically aren’t any!All of the "bite" of the CPA is in the tails.Set up: parameter Q that people "care about" States s S ∈,:f S →Θ specifies what the payoffs are at state s. Partitions of S, priors .i H i pPlayer i’s first order beliefs at s: the conditional distribution on Q given s.For B ⊆Θ,1()()i s B d =('|(')|())i i p s f s B h s ÎPlayer i’s second order beliefs: beliefs about Q and other players’ first order beliefs.()21()(){'|(('),('))}|()i i j i s B p s f s s B h d d =Îs and so on.The main point can be seen in his exampleTwo possible values of an unknown parameter r .1q q = o 2qStart with a model w/o common prior, relate it to a model with common prior.Starting model has only two states 12{,}S s s =. Each player has the trivial partition- ie no info beyond the prior.1122()()2/3p s p s ==.example: Player 1 owns an asset whose value is 1 at 1θ and 2 at 2θ; ()i i f s θ=.At each state, 1's expected value of the asset 4/3, 2's is 5/3, so it’s common knowledge that there are gains from trade.Lipman shows we can match the players’ beliefs, beliefs about beliefs, etc. to arbitrarily high order in a common prior model.Fix an integer N. construct the Nth model as followsState space'S ={1,...2}N S ´Common prior is that all states equally likely.The value of θ at (s,k) is determined by the s- component.Now we specify the partitions of each player in such a way that the beliefs, beliefs about beliefs, look like the simple model w/o common prior.1's partition: events112{(,1),(,2),(,1)}...s s s 112{(,21),(,2),(,)}s k s k s k -for k up to ; the “left-over” 12N -2s states go into 122{(,21),...(,2)}N N s s -+.At every event but the last one, 1 thinks the probability of is 2/3.1qThe partition for player 2 is similar but reversed: 221{(,21),(,2),(,)}s k s k s k - for k up to . 12N -And at all info sets but one, player 2 thinks the prob. of is 1/3.1qNow we look at beliefs at the state 1(,1)s .We matched the first-order beliefs (beliefs about θ) by construction)Now look at player 1's second-order beliefs.1 thinks there are 3 possible states 1(,1)s , 1(,2)s , 2(,1)s .At 1(,1)s , player 2 knows {1(,1)s ,2(,1)s ,(,}. 22)s At 1(,2)s , 2 knows . 122{(,2),(,3),(,4)}s s s At 2(,1)s , 2 knows {1(,2)s , 2(,1)s ,(,}. 22)sThe support of 1's second-order beliefs at 1(,1)s is the set of 2's beliefs at these info sets.And at each of them 2's beliefs are (1/3 1θ, 2/3 2θ). Same argument works up to N:The point is that the N-state models are "like" the original one in that beliefs at some states are the same as beliefs in the original model to high but finite order.(Beliefs at other states are very different- namely atθ or 2 is sure the states where 1 is sure that state is2θ.)it’s1Conclusion: if we assume that beliefs at a given state are generated by updating from a common prior, this doesn’t pin down their finite order behavior. So the main force of the CPA is on the entire infinite hierarchy of beliefs.Lipman goes on from this to make a point that is correct but potentially misleading: he says that "almost all" priors are close to a common. I think its misleading because here he uses the product topology on the set of hierarchies of beliefs- a.k.a topology of pointwise convergence.And two types that are close in this product topology can have very different behavior in a NE- so in a sense NE is not continuous in this topology.The email game is a counterexample. “Product Belief Convergence”:A sequence of types converges to if thesequence converges pointwise. That is, if for each k,, in t *i t ,,i i k n k *δδ→.Now consider the expanded version of the email game, where we added the state ∞.Let be the hierarchy of beliefs of player 1 when he has sent n messages, and let be the hierarchy atthe point ∞, where it is common knowledge that the payoff matrix is .in t ,*i t a uClaim: the sequence converges pointwise to . in t ,*i t Proof: At , i’s zero-order beliefs assignprobability 1 to , his first-order beliefs assignprobability 1 to ( and j knows it is ) and so onup to level n-1. Hence as n goes to infinity, thehierarchy of beliefs converges pointwise to common knowledge of .in t a u a u a u a uIn other words, if the number of levels of mutual knowledge go to infinity, then beliefs converge to common knowledge in the product topology. But we know that mutual knowledge to high order is not the same as almost common knowledge, and types that are close in the product topology can play very differently in Nash equilibrium.Put differently, the product topology on countably infinite sequences is insensitive to the tail of the sequence, but we know that the tail of the belief hierarchy can matter.Next : B-D JET 93 "Hierarchies of belief and Common Knowledge”.Here the hierarchies of belief are motivated by Harsanyi's idea of modelling incomplete information as imperfect information.Harsanyi introduced the idea of a player's "type" which summarizes the player's beliefs, beliefs about beliefs etc- that is, the infinite belief hierarchy we were working with in Lipman's paper.In Lipman we were taking the state space Ω as given.Harsanyi argued that given any element of the hierarchy of beliefs could be summarized by a single datum called the "type" of the player, so that there was no loss of generality in working with types instead of working explicitly with the hierarchies.I think that the first proof is due to Mertens and Zamir. B-D prove essentially the same result, but they do it in a much clearer and shorter paper.The paper is much more accessible than MZ but it is still a bit technical; also, it involves some hard but important concepts. (Add hindsight disclaimer…)Review of math definitions:A sequence of probability distributions converges weakly to p ifn p n fdp fdp ®òò for every bounded continuous function f. This defines the topology of weak convergence.In the case of distributions on a finite space, this is the same as the usual idea of convergence in norm.A metric space X is complete if every Cauchy sequence in X converges to a point of X.A space X is separable if it has a countable dense subset.A homeomorphism is a map f between two spaces that is 1-1, and onto ( an isomorphism ) and such that f and f-inverse are continuous.The Borel sigma algebra on a topological space S is the sigma-algebra generated by the open sets. (note that this depends on the topology.)Now for Brandenburger-DekelTwo individuals (extension to more is easy)Common underlying space of uncertainty S ( this is called in Lipman)ΘAssume S is a complete separable metric space. (“Polish”)For any metric space, let ()Z D be all probability measures on Borel field of Z, endowed with the topology of weak convergence. ( the “weak topology.”)000111()()()n n n X S X X X X X X --=D =´D =´DSo n X is the space of n-th order beliefs; a point in n X specifies (n-1)st order beliefs and beliefs about the opponent’s (n-1)st order beliefs.A type for player i is a== 0012(,,,...)()n i i i i n t X d d d =¥=Î´D0T .Now there is the possibility of further iteration: what about i's belief about j's type? Do we need to add more levels of i's beliefs about j, or is i's belief about j's type already pinned down by i's type ?Harsanyi’s insight is that we don't need to iterate further; this is what B-D prove formally.Coherency: a type is coherent if for every n>=2, 21marg n X n n d d --=.So the n and (n-1)st order beliefs agree on the lower orders. We impose this because it’s not clear how to interpret incoherent hierarchies..Let 1T be the set of all coherent typesProposition (Brandenburger-Dekel) : There is a homeomorphism between 1T and . 0()S T D ´.The basis of the proposition is the following Lemma: Suppose n Z are a collection of Polish spaces and let021201...1{(,,...):(...)1, and marg .n n n Z Z n n D Z Z n d d d d d --´´-=ÎD ´"³=Then there is a homeomorphism0:(nn )f D Z ¥=®D ´This is basically the same as Kolmogorov'sextension theorem- the theorem that says that there is a unique product measure on a countable product space that corresponds to specified marginaldistributions and the assumption that each component is independent.To apply the lemma, let 00Z X =, and 1()n n Z X -=D .Then 0...n n Z Z X ´´= and 00n Z S T ¥´=´.If S is complete separable metric than so is .()S DD is the set of coherent types; we have shown it is homeomorphic to the set of beliefs over state and opponent’s type.In words: coherency implies that i's type determines i's belief over j's type.But what about i's belief about j's belief about i's type? This needn’t be determined by i’s type if i thinks that j might not be coherent. So B-D impose “common knowledge of coherency.”Define T T ´ to be the subset of 11T T ´ where coherency is common knowledge.Proposition (Brandenburger-Dekel) : There is a homeomorphism between T and . ()S T D ´Loosely speaking, this says (a) the “universal type space is big enough” and (b) common knowledge of coherency implies that the information structure is common knowledge in an informal sense: each of i’s types can calculate j’s beliefs about i’s first-order beliefs, j’s beliefs about i’s beliefs about j’s beliefs, etc.Caveats:1) In the continuity part of the homeomorphism the argument uses the product topology on types. The drawbacks of the product topology make the homeomorphism part less important, but theisomorphism part of the theorem is independent of the topology on T.2) The space that is identified as“universal” depends on the sigma-algebra used on . Does this matter?(S T D ´)S T ×Loose ideas and conjectures…• There can’t be an isomorphism between a setX and the power set 2X , so something aboutmeasures as opposed to possibilities is being used.• The “right topology” on types looks more like the topology of uniform convergence than the product topology. (this claim isn’t meant to be obvious. the “right topology” hasn’t yet been found, and there may not be one. But Morris’ “Typical Types” suggests that something like this might be true.)•The topology of uniform convergence generates the same Borel sigma-algebra as the product topology, so maybe B-D worked with the right set of types after all.。

《经济博弈论》期末考试复习

《经济博弈论》期末考试复习资料第一章导论1.博弈的概念：博弈即一些个人、队组或其他组织，面对一定的环境条件，在一定的规则下，同时或先后，一次或多次，从各自允许选择的行为或策略中进行选择并加以实施，并从中各自取得相应结果的过程。

它包括四个要素：参与者，策略，次序和得益。

2.一个博弈的构成要素：博弈模型有下列要素：(1)博弈方。

即博弈中决策并承但结果的参与者．包括个人或组织等：(2)策略。

即博弈方决策、选择的内容，包括行为取舍、经济活动水平或多种行为的特定组合等。

各博弈方的策略选择范围称策略空间。

每个博弈方各选一个策略构成一个策略组合。

(3)进行博弈的次序：次序不同一般就是不同的博弈，即使博弈的其他方面都相同。

(4)得益。

各策略组合对应的各博弈方获得的数值结果，可以是经济利益，也可以是非经济利益折算的效用等。

3.合作博弈和非合作博弈的区别：合作博弈：允许存在有约束力协议的博弈；非合作博弈：不允许存在有约束力协议的博弈。

主要区别:人们的行为互相作用时，当事人能否达成一个具有约束力的协议。

假设博弈方是两个寡头企业，如果他们之间达成一个协议，联合最大化垄断利润，并且各自按这个协议生产，就是合作博弈。

如果达不成协议，或不遵守协议，每个企业都只选择自己的最优产品(价格)，则是非合作博弈。

合作博弈：团体理性(效率高，公正，公平)非合作博弈：个人理性，个人最优决策(可能有效率，可能无效率)4.完全理性和有限理性:完全理性：有完美的分析判断能力和不会犯选择行为的错误。

有限理性：博弈方的判断选择能力有缺陷。

区分两者的重要性在于如果决策者是有限理性的，那么他们的策略行为和博弈结果通常与在博弈方有完全理想假设的基础上的预测有很大差距，以完全理性为基础的博弈分析可能会失效。

所以不能简单地假设各博弈方都完全理性。

5.个体理性和集体理性：个体理性：以个体利益最大为目标；集体理性：追求集体利益最大化。

第一章课后题：2、4、56.设定一个博弈模型必须确定哪几个方面?设定一个博弈必须确定的方面包括:(1)博弈方，即博弈中进行决策并承担结果的参与者;(2)策略(空间)，即博弈方选择的内容，可以是方向、取舍选择，也可以是连续的数量水平等;(3)得益或得益函数，即博弈方行为、策略选择的相应后果、结果，必须是数量或者能够折算成数量;(4)博弈次序，即博弈方行为、选择的先后次序或者重复次数等;(5)信息结构，即博弈方相互对其他博弈方行为或最终利益的了解程度;(6)行为逻辑和理性程度，即博弈方是依据个体理性还是集体理性行为，以及理性的程度等。

(完整word)耶鲁大学博弈论_精简版

第一讲导论—五个入门结论1。

通过成绩博弈模型可以知道，不选择严格劣势策略，因为每次博弈会得到更好的收益.2。

通过囚徒的困境博弈模型可以知道，理性选择导致次优的结果（协商难以达成目的的原因不是因为缺少沟通，而是没有强制力）。

3。

通过愤怒天使博弈模型可以知道，汝欲得之,必先知之；永远选择优势策略，选择非劣势策略，损失小，如果对手有优势策略则应以此作为选择策略的指导.4.如果想要赢，就应该站在别人的立场去分析他们会怎么做.第二讲学会换位思考1.构成博弈要素包括，参与人,参与人的策略以及收益.2。

所谓严格优势策略，就是指不论对方采取什么策略，采取的这个策略总比采取其他任何策略都好的策略。

3。

在博弈中剔出某些选择时需要站在别人的角度去思考结果,因为对手不会选择劣势策略；同时要考虑到对手也是一个理性的参与人。

4.在博弈中剔除某些选择是一种直接思考，同时也是作为一个理性参与人的选择。

第三讲迭代剔除和中位选民定理1。

在选民投票博弈模型中,通过不断地迭代以及剔除来决定策略，由此,我们得到了一种新的选择策略的方法：迭代剔除法。

2.选民投票博弈模型的结果与现实存在偏差，主要是因为：现实中选民并不是均匀分布的;选民通常根据候选人的性格而非政治立场来进行投票，而政治立场只是单一维度；只适用于只有两个候选人的情况;④同时存在弃权票；⑤选民未必相信候选人所声明的立场。

3.建立模型，是为了更好的描述事实以激发灵感，模型是有重要的事是抽象而来，逐步增加约束条件完善模型观察结果，比较分析结果的变化。

第四节足球比赛与商业合作之最佳对策1。

点球博弈模型告诉我们，不要选择一个在任何情况或信念下都不是最佳对策的策略。

2.最佳对策：参与人针对对手策略的定义：参与人i的策略s^i（简写成BR）是对手策略S—i的最佳对策，如果参与人i在对手的策略S-i下选S^i的收益弱优于其它对策Si`，这对参与人i的所有Si`都适用，则策略S^i是其它参与人策略S—i的最佳对策。

博弈论最全完整ppt-讲解

模型
导论
二、博弈论与诺贝尔经济学奖获得者
1994年诺贝尔经济学奖获得者
美国人约翰-海萨尼(John C. Harsanyi) 和美国人约翰-纳什(John F. Nash Jr.)以及德国人莱因哈德-泽尔腾(Reinhard Selten)
获奖理由：在非合作博弈的均衡分析理论方面做出了开创性的贡献，对博弈论和经济学产生了重大影响。
如果一个博弈在所有各种对局下全体参与人之得益总和总是保持为一个常数，这个博弈就叫常和博弈；
相反，如果一个博弈在所有各种对局下全体参与人之得益总和不总是保持为一个常数，这个博弈就叫非常和博弈。
常和博弈也是利益对抗程度最高的博弈。非常和（变和）博弈蕴含双赢或多赢。
导论
四、主要参考文献
课程主要内容
第一章完全信息静态博弈第二章完全信息动态博弈第三章不完全信息静态博弈第四章不完全信息动态博弈第五章委托-代理理论第六章逆向选择与信号传递
第一章完全信息静态博弈
博弈论的基本概念及战略式表述纳什均衡
纳什均衡应用举例混合战略纳什均衡纳什均衡的存在性与多重性
第一节博弈论的基本概念
与战略式表述
博弈论的基本概念与战略式表述
博弈论（game theory）是研究决策主体的行为发生直接相互作用时候的决策以及这种决策的均衡问题。
博弈的战略式表述：G={N,(Si)iN,(Ui)iN} 有三个基本要素：（1）参与人（players）iN={1,2,…,n} ；（2）战略（strategies）,siSi(战略空间)；（3）支付（payoffs）,ui=ui(s-i,si)。
Because We Had a Flat Tire”

1.2 导论：博弈论与经济学

，美政策策，
博弈的定义：
博弈是指决策主体人在相互对抗中，对抗双博弈方（或多方）相互依存的一系列策略和行动的过程集合。注意： 1、博弈中的参与者各自追求的利益具有冲突性。
2、博弈是一个过程的集合。 3、博弈的一个本质特征就是策略的相互依存性。
博弈论是专门研究博弈如何出现均衡的规律博弈论起步于冯•诺依曼（Von Neumann）与摩跟斯坦恩（Morgenstern）1944年出版的《博弈理论与经济行为》。普林斯顿 **2、塔克(Tucher）发展了“囚徒困境”（1950）（库恩） **3、纳什(Nash)发表了关于均衡的定义与存在性的文章。 4、纳什与夏普利（Sharpley）发展了关于“讨价还价博弈”文章（1953）。到了1953年，所有将被经济学家们在此后二十年中运用到的博弈理论事实上都已被发现了。但直到70 年代中期，博弈论还保持着独立领域的地位，之后经济学家开始发现将博弈论与复杂的经济问题结合起来可能会得到什么结果。在80年代，博弈论迅速成为主流经济学的重要组成部分。事实上，他几乎吞并了整个微观经济学。
导论：博弈论与经济学
一、博弈论与经济学的联系与区别
1、传统经济学主要研究经济市场的两个极端情况：、传统经济学主要研究经济市场的两个极端情况：（1）垄断市场）（2）完全竞争市场）价格制度(或称市场制度价格理论)是其集中的体现或称市场制度、是其集中的体现。价格制度或称市场制度、价格理论是其集中的体现。两个基本假设：1、市场的参与人较多；2、所有参与人对信息的把握是完全对称的；这两个假设在现实中是不完全存在的。两个决策主体：1、经济主体人；2、市场。但市场是千千万万个消费者的消费意愿和消费能力的总和，所以它已不再具有人格化的面貌。

博弈论导论-中国经济学堂

1.1 背景 1.2 基本概念 1.3 基本框架
授课进度
第 2 章完全信息静态博弈
2.1 优势策略均衡 2.2 重复剔除的优势均衡 2.3 纳什均衡 2.4 混合策略 2.5 连续策略 2.6 纳什均衡的在性和多重性
第 3 章完全信息动态博弈
3.1 扩展式表述与信息分割 3.2 子博弈完美纳什均衡：逆向归纳法 3.3 应用 I：要挟诉讼模型 3.4 应用 II：斯塔克博格模型 3.5 承诺 3.6 有限重复博弈：连锁店悖论 3.7 无限重复博弈：无名氏定理
课程网站：（中国经济学堂）
教材
（推荐）[美]艾里克·拉斯缪森（Eric Rasmusen），2009：《博弈与信息》（第四版），中国人民大学出版社，定价68元。全书电子版（4rd）：/GI/download.htm
考试分数
平时作业（含小组演示） 40%
思考题（非强制）
10%
期末考试（计算和分析题）50%
1
相关规则
作业不能缺漏，考试不能迟到。
中国的问题，归根结底都是政治经济学问题。
“聂氏政经评论”由聂辉华教授负责运营。喜欢我们的文章，请点击右上角“分享到朋友圈”，或者搜索微信号（ruc_nie）关注。
第 1 章导论
第 7 章拍卖
7.1 拍卖的分类 7.2 私人价值拍卖与收益等价原理 7.3 公共价值拍卖与“赢者的诅咒”
3
（推荐）聂辉华，《跟<西游记>学创业——一本人人都要读的管理秘籍》，中国人民大学出版社，2015年
（备选）[美]Steven Tadelis, 2012, Game Theory: An Introduction，Princeton University Press. （备选）[美]罗伯特·吉本斯（Robert Gibbons），1999：《博弈论基础》，中国社会科学出版

博弈论最全完整-讲解

Because We Had a Flat Tire”
“乘客侧前轮”看起来是一个合乎逻辑的选择。但真正起作用的是你的朋友是否使用同样的
逻辑，或者认为这一选择同样显然。并且是否你认为这一选择是否对他同样显然；反之，是否她认为这一选择对你同样显然。……以此类推。也就是说，需要的是对这样的情况下该选什么的预期的收敛。这一使得参与者能够成功合作的共同预期的策略被称为焦点。心有灵犀一点通。
例3：为什么教授如此苛刻？
问题是，一个好心肠的教授如何维持如此铁石心肠的承诺？
他必须找到某种使拒绝变得强硬和可信的方法。
拿行政程序或者学校政策来做挡箭牌在课程开始时做出明确和严格的宣布通过几次严打来获得“冷面杀手”的声
誉
导论
博弈均衡与一般均衡博弈论与诺贝尔经济学奖获得者
博弈论的基本概念与类型主要参考文献
即使决策或行动有先后，但只要局中人在决策时都还不知道对手的决策或者行动是什么，也算是静态博弈
完全信息博弈与不完全信息博弈
(games of complete information and games of incomplete information)
按照大家是否清楚对局情况下每个局中人的得益。
“各种对局情况下每个人的得益是多少” 是所有局中人的共同知识（common knowledge）。
据“共同知识”的掌握分为完全信息与不完全信息博弈。
完美信息博弈与不完美信息博弈
(games with perfect information and games with imperfect information)
了解自己行动的限制和约束，然后以精心策划的方式选择自己的行为，按照自己的标准做到最好。 • 博弈论对理性的行为又从新的角度赋予其新的含义— —与其他同样具有理性的决策者进行相互作用。 • 博弈论是关于相互作用情况下的理性行为的科学。

博弈论判断题

博弈论判断题第一章导论（1)单人博弈就是个人最优化决策，与典型的博弈问题有本质区别。

(2)博弈方的策略空问必须是数量空间，博弈的结果必须是数量或者能够数量化。

(3）囚徒的困境博弈中两个因徒之所以会处于困境，无法得到较理想的结果，是因为两囚徒都不在乎坐牢时间长短本身，只在乎不能比对方坐牢的时间更长.(4)因为零和博弈中博奔方之间的关系都是竞争性的、对立的，因此零和博弈就是非合作博弈。

（5）凡是博弈方的选择、行为有先后次序的一定是动态博弈。

(6)多人博弈中的“破坏者"会对所有博弈方的利益产生不利影响.（7）合作博弈就是博弈方采取相互合作态度的博弈。

参考答案：（1）正确。

因为单人博弈只有一个博弈方,因此不可能存在博弈方之间行为和利益的交互作用和制约．因此实际上就是个人最优化决策，与存在博弈方之间行为和利益交互作用和制约的典型博弈问题有本质的区别。

（2)前半句错误，后半句正确.博弈方的策略空间不一定是数量空间，因为博弈方的策略除了可以是数量水平（如产量、价格等）以外，也可以是各种定性的行为取舍和方向选择，甚至也可能是各种函数或者其他更复杂的内容。

但一个博弈的结果必须是数量或者可以数量化，因为博弈分析只能以数量关系的比较为基础.(3）错误.结论恰恰相反，也就是囚徒的困境博弈中两囚徒之所以处于困境,根源正是因为两囚徒很在乎坐牢的绝对时间长短.此外，我们一开始就假设两囚徒都是理性经济人，而理性经济人都是以自身的（绝对）利益，而不是相对利益为决策目标的。

（4）错误。

虽然零和博弈中博弈方的利益确实是对立的．但非合作博弈的含义并不是博弈力之间的关系是竞争性的、对立的,而是指博弈方是以个体理性、个体利益最大化为行为的逻辑和依据,是指博弈中不能包含有约束力的协议。

(5）错误。

其实并不是所有选择、行为有先后次序的博弈问题都是动态博弈.例如两个厂商先后确定自己的产量，但只要后确定产量的厂商在定产之前不知道另一厂商定的产量是多少，就是静态博弈问题而非动态博弈问题.（6)错误。

博弈论最全完整-讲解

问题是，大家都这么做。这样一来，所有人的成绩都不比大家遵守协议来得高。而且，大家还付出了更多的功夫。
正因为这样的博弈对所有参与者存在着或大或小的潜在成本，如何达成和维护互利的合作就成为一个值得探究的重要问题。
存在双赢的博弈吗？实用文档
6
例2：焦点博弈 “We Can’t Take the Exam,
获奖理由：在非合作博弈的均衡分析理论方面做出了开创性的贡献，对博弈论和经济学产生了重大影响。
实用文档
17
约翰·纳什 1928年生于美国
莱因哈德·泽尔腾， 1930 年生于德国
实用文档
约翰· 海萨尼 1920年生于美国
18
1996年诺贝尔经济学奖获得者
英国人詹姆斯·莫里斯 (James A. Mirrlees)和美国人威廉-维克瑞 (William Vickrey)
获奖理由：前者在信息经济学理论领域做出了重大贡献，尤其是不对称信息条件下的经济激励理论的论述；后者在信息经济学、激励理论、博弈论等方面都做出了重大贡献。
实用文档
19
威廉·维克瑞， 1914-1996，生于美国
詹姆斯·莫里斯 1936年生于英国
实用文档
20
2001年诺贝尔经济学奖获得者
实用文档
35
第一章完全信息静态博弈
博弈论的基本概念及战略式表述纳什均衡
纳什均衡应用举例混合战略纳什均衡纳什均衡的存在性与多重性
实用文档
36
第一节博弈论的基本概念
与战略式表述
Байду номын сангаас
实用文档
37
博弈论的基本概念与战略式表述
博弈论（game theory）是研究决策主体的行为发生直接相互作用时候的决策以及这种决策的均衡问题。

1、下载文档前请自行甄别文档内容的完整性，平台不提供额外的编辑、内容补充、找答案等附加服务。
2、"仅部分预览"的文档,不可在线预览部分如存在完整性等问题,可反馈申请退款(可完整预览的文档不适用该条件!)。
3、如文档侵犯您的权益，请联系客服反馈,我们会尽快为您处理(人工客服工作时间：9:00-18:30)。

博弈论导论 2

合集下载

博弈论PPT课件

lecture_2（博弈论讲义GameTheory（MIT））

《经济博弈论》期末考试复习

(完整word)耶鲁大学博弈论_精简版

博弈论最全完整ppt-讲解

1.2 导论：博弈论与经济学

博弈论导论-中国经济学堂

博弈论最全完整-讲解

博弈论判断题

博弈论最全完整-讲解

文档推荐

最新文档