Game Theory(2)

格式：ppt
大小：1.12 MB
文档页数：25

下载文档原格式

40_game_theory_ch_2

Chapter2Nim and combinatorial games2.1Aims of the chapterThis chapter•introduces the basics of combinatorial games,and explains the central role of the game nim.A detailed summary of the chapter is given in section2.5.Furthermore,this chapter•demonstrates the use of abstract mathematics in game theory.This chapter is written more formally than the other chapters,in parts in thetraditional mathematical style of deﬁnitions,theorems and proofs.One reason fordoing this,and why we start with combinatorial games,is that this topic and styleserves as a warning shot to those who think that game theory,and this unit inparticular,is‘easy’.If we started with the well-known‘prisoner’s dilemma’(whichmakes its due appearance in Chapter3),the less formally inclined student might belulled into a false sense of familiarity and‘understanding’.We therefore startdeliberately with an unfamiliar topic.This is a mathematics unit,with great emphasis on rigour and clarity,and on usingmathematical notions precisely.As mathematical prerequisites,game theory requiresonly the very basics of linear algebra,calculus and probability theory.However,gametheory provides its own conceptual tools that are used to model and analyseinteractive situations.This unit emphasises the mathematical structure of theseconcepts,which belong to‘discrete mathematics’.Learning a number of newmathematical concepts is exempliﬁed by combinatorial game theory,and it willcontinue in the study of classical game theory in the later chapters.2.2Learning objectivesAfter studying this chapter,you should be able to:•play nim optimally;•explain the concepts of game-sums,equivalent games,nim values and the mex rule;•apply these concepts to play other impartial games like those described in the exercises.40Game theory2.3Essential readingThis chapter of the guide.2.4Further readingVery few textbooks on game theory deal with combinatorial games.An exception ischapter1of the following book:•Mendelson,Elliot Introducing Game Theory and Its Applications.(Chapman& Hall/CRC,2004)[ISBN1584883006].The winning strategy for the game nim based on the binary system wasﬁrst describedin the following article,which is available electronically from the JSTOR archive:•Bouton,Charles‘Nim,a game with a complete mathematical theory.’The Annals of Mathematics,2nd Ser.,Vol.3,No.1/4(1902),pp.35–39.The deﬁnitive text on combinatorial game theory is the set of volumes‘WinningWays’by Berlekamp,Conway and Guy.The material of this chapter appears in theﬁrst volume:•Berlekamp,Elwyn R.,John H.Conway and Richard K.Guy Winning Ways for Your Mathematical Plays,Volume1,second edition.(A.K.Peters,2001)[ISBN1568811306].Some small pieces of that text have been copied here nearly verbatim,for example inSections2.7,2.9,and2.12below.The four volumes of‘Winning Ways’are beautiful books.However,they are notsuitable reading for a beginner,because the mathematics is hard,and the reader isconfronted with a wealth of material.The introduction to combinatorial game theorygiven here represents a very small fraction of that body of work,but may invite youto study it further.A very informative and entertaining mathematical tour of parlour games is•Bewersdorﬀ,J¨o rg Logic,Luck and White Lies.(A.K.Peters,2005)[ISBN 1568812108].Combinatorial games are treated in part II of that book.2.5What is combinatorial game theory?This chapter is on the topic of combinatorial games.These are games with twoplayers,perfect information,and no chance moves,speciﬁed by certain rules.Familiargames of this sort are chess,go,checkers,tic-tac-toe,dots-and-boxes,and nim.Suchgames can be played perfectly in the sense that either one player can force a win orboth can force a draw.In reality,games like chess and go are too complex toﬁnd anoptimal strategy,and they derive their attraction from the fact that(so far)it is notknown how to play them perfectly.We will,however,learn how to play nim perfectly.There is a‘classical’game theory with applications in economics which is verydiﬀerent from combinatorial game theory.The games in classical game theory aretypically formal models of conﬂict and co-operation which cannot only be lost orWhat is combinatorial game theory? won,and in which there is often no perfect information about past and future moves.To the economist,combinatorial games are not very interesting.Chapters3–6of theunit are concerned with classical game theory.Why,then,study combinatorial games at all in a unit that is mostly about classicalgame theory,and which aims to provide an insight into the theory of games as usedin economics?The reason is that combinatorial games have a rich and interesting mathematical theory.We will explain the basics of that theory,in particular thecentral role of the game nim for impartial games.It is non-trivial mathematics,it isfun,and you,the student,will have learned something that you would most likelynot have learned otherwise.Theﬁrst‘trick’from combinatorial game theory is how to win in the game nim,using the binary system.Historically,that winning strategy was discoveredﬁrst(published by Charles Bouton in1902).Only later did the central importance of nim,in what is known as the Sprague–Grundy theory of impartial games,becomeapparent.It also revealed why the binary system is important(and not,say,theternary system,where numbers are written in base three),and learning that is more satisfying than just learning how to use it.In this chapter,weﬁrst deﬁne the game nim and more general classes of games withperfect information.These are games where every player knows exactly the state ofthe game.We then deﬁne and study the concepts listed in the learning outcomesabove,which are the concepts of game-sums,equivalent games,nim values and themex rule.It is best to learn these concepts by following the chapter in detail.Wegive a brief summary here,which will make more sense,and should be re-consulted,after aﬁrst study of the chapter(so do not despair if you do not understand this summary).Mathematically,any game is deﬁned by other‘games’that a player can reach in hisﬁrst move.These games are called the options of the game.This seemingly circulardeﬁnition of a‘game’is sound because the options are simpler games,which needfewer moves in total until they end.The deﬁnition is therefore not circular,butrecursive,and the mathematical tool to argue about such games is that ofmathematical induction,which will be used extensively(it will also recur inchapter3as‘backward induction’for game trees).Here,it is very helpful to befamiliar with mathematical induction for proving statements about natural numbers.We focus here on impartial games,where the available moves are the same nomatter whether player I or player II is the player to make a move.Games are‘combined’by the simple rule that a player can make a move in exactly one of thegames,which deﬁnes a sum of these games.In a‘losing game’,theﬁrst player tomove loses(assuming,as always,that both players play as well as they can).Animpartial game added to itself is always losing,because any move can be copied inthe other game,so that the second player always has a move left.This is known asthe‘copycat’principle(lemma2.6).An important observation is that a losing gamecan be‘added’(via the game-sum operation)to any game without changing thewinning or losing properties of the original game.In section2.11,the central theorem2.10explains the winning strategy in nim.The importance of nim for impartial games is then developed in section2.12via thebeautiful mex rule.After the comparatively hard work of the earlier sections,wealmost instantly obtain that any impartial game is equivalent to a nim heap(corollary2.13).At the end of the chapter,the sizes of these equivalent nim heaps(called nim values)are computed for some examples of impartial games.Many other examples arestudied in the exercises.40Game theoryOur exposition is distinct from the classic text‘Winning Ways’in the followingrespects:First,we only consider impartial games,even though many aspects carryover to more general combinatorial games.Secondly,we use a precise deﬁnition ofequivalent games(see section2.10),because a game where you are bound to loseagainst a smart opponent is not the same as a game where you have already lost.Two such games are merely equivalent,and the notion of equivalent games is helpfulin understanding the theory.So this text is much more restricted,but to some extentmore precise than‘Winning Ways’,which should help make this topic accessible andenjoyable.2.6Nim–rulesThe game nim is played with heaps(or piles)of chips(or counters,beans,pebbles,matches).Players alternate in making a move,by removing some chips from one ofthe heaps(at least one chip,possibly the entire heap).Theﬁrst player who cannotmove any more loses the game.The players will be called,rather unimaginatively,player I and player II,with player Ito start the game.For example,consider three heaps of size1,1,2.What is a good move?Removingone of the chips from the heap with two chips will create the position1,1,1,thenplayer II must move to1,1,then player I to1,and then player II takes the last chipand wins.So this is not a good opening move.The winning move is to remove allchips from the heap of size2,to reach position1,1,and then player I will win.Hence we call1,1,2a winning position,and1,1a losing position.When moving in a winning position,the player to move can win by playing well,bymoving to a losing position of the other player.In a losing position,the player tomove will lose no matter what move she chooses,if her opponent plays well.Thismeans that all moves from a losing position lead to a winning position of theopponent.In contrast,one needs only one good move from a winning position thatgoes to a losing position of the next player.Another winning position consists of three nim heaps of sizes1,1,1.Here all movesresult in the same position and player I always wins.In general,a player in a winningposition must play well by picking the right move.We assume that players play well,forcing a win if they can.Suppose nim is played with only two heaps.If the two heaps have equal size,forexample in position4,4,then theﬁrst player to move loses(so this is a losingposition),because player II can always copy player I’s move by equalising the twoheaps.If the two heaps have diﬀerent sizes,then player I can equalise them byremoving an appropriate number of chips from the larger heap,putting player II in alosing position.The rule for2-heap nim is therefore:Lemma2.1The nim position m,n is winning if and only if m=n,otherwise losing,for all m,n≥0.This lemma applies also when m=0or n=0,and thus includes the cases that oneor both heap sizes are zero(meaning only one heap or no heap at all).With three or more heaps,nim becomes more diﬃcult.For example,it is notimmediately clear if,say,positions1,4,5or2,3,6are winning or losing positions.⇒At this point,you should try exercise2.1(a)on page28.Combinatorial games,in particular impartial games 2.7Combinatorial games,in particular impartial gamesThe games we study in this chapter have,like nim,the following properties:1.There are just two players.2.There are several,usuallyﬁnitely many,positions,and sometimes a particularstarting position.3.There are clearly deﬁned rules that specify the moves that either player canmake from a given position to the possible new positions,which are called theoptions of that position.4.The two players move alternately,in the game as a whole.5.In the normal play convention a player unable to move loses.6.The rules are such that play will always come to an end because some player willbe unable to move.This is called the ending condition.So there can be nogames which are drawn by repetition of moves.7.Both players know what is going on,so there is perfect information.8.There are no chance moves such as rolling dice or shuﬄing cards.9.The game is impartial,that is,the possible moves of a player only depend onthe position but not on the player.As a negation of condition5,there is also the mis`e re play convention where a playerunable to move wins.In the surrealist(and unsettling)movie‘Last year atMarienbad’by Alain Resnais from1962,mis`e re nim is played,several times,withrows of matches of sizes1,3,5,7.If you have a chance,try to watch that movie andspot when the other player(not the guy who brought the matches)makes a mistake!Note that this is mis`e re nim,not nim,but you will be able toﬁnd out how to play itonce you know how to play nim.(For games other than nim,normal play and mis`e reversions are typically not so similar.)In contrast to condition9,games where the available moves depend on the player(as in chess where one player can only move white pieces and the other only blackpieces)are called partisan games.Much of combinatorial game theory is aboutpartisan games,which we do not consider to keep matters simple.Chess,and the somewhat simpler tic-tac-toe,also fail condition6because they mayend in a tie or draw.The card game poker does not have perfect information(asrequired in7)and would lose all its interest if it had.The analysis of poker,althoughit is also a win-or-lose game,leads to the‘classical’theory of zero-sum games(withimperfect information)that we will consider later.The board game backgammon is agame with perfect information but with chance moves(violating condition8)because dice are rolled.We will be relatively informal in style,but our notions are precise.In condition3above,for example,the term option refers to a position that is reachable in onemove from the current position;do not use‘option’when you mean‘move’.Similarly,we will later use the term strategy to deﬁne a plan of moves,one for everyposition that can occur in the game.Do not use‘strategy’when you mean‘move’.However,we will take some liberty in identifying a game with its starting positionwhen the rules of the game are clear.40Game theory⇒Try now exercises2.2and2.3starting on page28.2.8Simpler games and notation for nim heapsA game,like nim,is deﬁned by its rules,and a particular starting position.Let G besuch a particular instance of nim,say with the starting position1,1,2.Knowing therules,we can identify G with its starting position.Then the options of G are1,2,and1,1,1,and1,1.Here,position1,2is obtained by removing either theﬁrst or thesecond heap with one chip only,which gives the same result.Positions1,1,1and1,1are obtained by making a move in the heap of size two.It is useful to list the optionssystematically,considering one heap to move in at a time,so as not to overlook anyoption.Each of the options of G is the starting position of another instance of nim,deﬁningone of the new games H,J,K,say.We can also say that G is deﬁned by the movesto these games H,J,K,and we call these games also the options of G(byidentifying them with their starting positions;recall that the term‘option’has beendeﬁned in point3of section2.7).That is,we can deﬁne a game as follows:Either the game has no move,and theplayer to move loses,or a game is given by one or several possible moves to newgames,in which the other player makes the initial move.In our example,G isdeﬁned by the possible moves to H,J,or K.With this deﬁnition,the entire game iscompletely speciﬁed by listing the initial moves and what games they lead to,because all subsequent use of the rules is encoded in those games.This is a recursive deﬁnition because a‘game’is deﬁned in terms of‘game’itself.We have to add the ending condition that states that every sequence of moves in agame must eventually end,to make sure that a game cannot go on indeﬁnitely.This recursive condition is similar to deﬁning the set of natural numbers as follows:(a)0is a natural number;(b)if n is a natural number,then so is n+1;and(c)allnatural numbers are obtained in this way,starting from0.Condition(c)can beformalised by the principle of induction that says:if a property P(n)is true for n=0,and if the property P(n)implies P(n+1),then it is true for all natural numbers.We use the following notation for nim heaps.If G is a single nim heap with nchips,n≥0,then we denote this game by∗n.This game is completely speciﬁed byits options,and they are:options of∗n:∗0,∗1,∗2,...,∗(n−1).(2.1) Note that∗0is the empty heap with no chips,which allows no moves.It is invisiblewhen playing nim,but it is useful to have a notation for it because it deﬁnes themost basic losing position.(In combinatorial game theory,the game with no moves,which is the empty nim heap∗0,is often simply denoted as0.)We could use(2.1)as the deﬁnition of∗n;for example,the game∗4is deﬁned by itsoptions∗0,∗1,∗2,∗3.It is very important to include∗0in that list of options,because it means that∗4has a winning move.Condition(2.1)is a recursivedeﬁnition of the game∗n,because its options are also deﬁned by reference to suchgames∗k,for numbers k smaller than n.This game fulﬁls the ending conditionbecause the heap gets successively smaller in any sequence of moves.If G is a game and H is a game reachable by one or more successive moves from thestarting position of G,then the game H is called simpler than G.We will oftenprove a property of games inductively,using the assumption that the property appliesto all simpler games.An example is the–already stated and rather obvious–Sums of games property that one of the two players can force a win.(Note that this applies togames where winning or losing are the only two outcomes for a player,as implied bythe‘normal play’convention in5above.)Lemma2.2In any game G,either the starting player I can force a win,or player IIcan force a win.Proof.When the game has no moves,player I loses and player II wins.Now assumethat G does have options,which are simpler games.By inductive assumption,ineach of these games one of the two players can force a win.If,in all of them,thestarting player(which is player II in G)can force a win,then she will win in G byplaying accordingly.Otherwise,at least one of the starting moves in G leads to agame G where the second-moving player in G (which is player I in G)can force awin,and by making that move,player I will force a win in G.If in G,player I can force a win,its starting position is a winning position,and wecall G a winning game.If player II can force a win,G starts with a losing position,and we call G a losing game.2.9Sums of gamesWe continue our discussion of nim.Suppose the starting position has heap sizes1,5,5.Then the obvious good move is to option5,5,which is losing.What about nim with four heaps of sizes2,2,6,6?This is losing,because2,2and6,6independently are losing positions,and any move in a heap of size2can becopied in the other heap of size2,and similarly for the heaps of size6.There is asecond way of looking at this example,where it is not just two losing games puttogether:consider the game with heap sizes2,6.This is a winning game.However,two such winning games,put together to give the game2,6,2,6,result in a losinggame,because any move in one of the games2,6,for example to2,4,can be copiedin the other game,also to2,4,giving the new position2,4,2,4.So the secondplayer,who plays‘copycat’,always has a move left(the copying move)and hencecannot lose.Deﬁnition2.3The sum of two games G and H,written G+H,is deﬁned asfollows:The player may move in either G or H as allowed in that game,leaving theposition in the other game unchanged.Note that G+H is a notation that applies here to games and not to numbers,evenif the games are in some way deﬁned using numbers(for example as nim heaps).The result is a new game.More formally,assume that G and H are deﬁned in terms of their options(via movesfrom the starting position)G1,G2,...,G k and H1,H2,...,H m,respectively.Then theoptions of G+H are given asoptions of G+H:G1+H,...,G k+H,G+H1,...,G+H m.(2.2) Theﬁrst list of options G1+H,G2+H,...,G k+H in(2.2)simply means that theplayer makes his move in G,the second list G+H1,G+H2,...,G+H m that hemakes his move in H.We can deﬁne the game nim as a sum of nim heaps,where any single nim heap isrecursively deﬁned in terms of its options by(2.1).So the game nim with heaps ofsize1,4,6is written as∗1+∗4+∗6.40Game theoryThe‘addition’of games with the abstract+operation leads to an interestingconnection of combinatorial games with abstract algebra.If you are somewhatfamiliar with the concept of an abstract group,you will enjoy this connection;if not,you do not need to worry,because this connection it is not essential for ourdevelopment of the theory.A group is a set with a binary operation+that fulﬁls three properties:1.The operation+is associative,that is,G+(J+K)=(G+J)+K holds for allG,J,K.2.The operation+has a neutral element0,so that G+0=G and0+G=G forall G.3.Every element G has an inverse−G so that G+(−G)=0.Furthermore,4.The group is called commutative(or‘abelian’)if G+H=H+G holds for allG,H.Familiar groups in mathematics are,for example,the set of integers with addition,orthe set of positive real numbers with multiplication(where the multiplicationoperation is written as·,the neutral element is1,and the inverse of G is written asG−1).The games that we consider form a group as well.In the way the sum of two gamesG and H is deﬁned,G+H and H+G deﬁne the same game,so+is commutative.Moreover,when one of these games is itself a sum of games,for example H=J+K,then G+H is G+(J+K)which means the player can make a move in exactly one ofthe games G,J,or K.This means obviously the same as the sum of games(G+J)+K,that is,+is associative.The sum G+(J+K),which is the same as(G+J)+K,can therefore be written unambiguously as G+J+K.An obvious neutral element is the empty nim heap∗0,because it is‘invisible’(itallows no moves),and adding it to any game G does not change the game.However,there is no direct way to get an inverse operation because for any game Gwhich has some options,if one adds any other game H to it(the intention beingthat H is the inverse−G),then G+H will have some options(namely at least theoptions of moving in G and leaving H unchanged),so that G+H is not equal to theempty nim heap.The way out of this is to identify games that are‘equivalent’in a certain sense.Wewill see shortly that if G+H is a losing game(where theﬁrst player to move cannotforce a win),then that losing game is‘equivalent’to∗0,so that H fulﬁls the role ofan inverse of G.2.10Equivalent gamesThere is a neutral element that can be added to any game G without changing it.By deﬁnition,because it allows no moves,it is the empty nim heap∗0:G+∗0=G.(2.3)However,other games can also serve as neutral elements for the addition of games.We will see that any losing game can serve that purpose,provided we considercertain games as equivalent according to the following deﬁnition.Equivalent games Deﬁnition2.4Two games G,H are called equivalent,written G≡H,if and only iffor any other game J,the sum G+J is losing if and only if H+J is losing.In deﬁnition2.4,we can also say that G≡H if for any other game J,the sum G+Jis winning if and only if H+J is winning.In other words,G is equivalent to H if,whenever G appears in a sum G+J of games,then G can be replaced by H without changing whether G+J is winning or losing.One can verify easily that≡is indeed an equivalence relation,meaning it is reﬂexive(G≡G),symmetric(G≡H implies H≡G),and transitive(G≡H and H≡K implyG≡K;all these conditions hold for all games G,H,K).Using J=∗0in deﬁnition2.4and(2.3),G≡H implies that G is losing if and only ifH is losing.The converse is not quite true:just because two games are winning doesnot mean they are equivalent,as we will see shortly.However,any two losing gamesare equivalent,because they are all equivalent to∗0:Lemma2.5If G is a losing game(the second player to move can force a win),thenG≡∗0.Proof.Let G be a losing game.We want to show G≡∗0By deﬁnition2.4,this istrue if and only if for any other game J,the game G+J is losing if and only if∗0+Jis losing.According to(2.3),this holds if and only if J is losing.So let J be any other game;we want to show that G+J is losing if and only if J islosing.Intuitively,adding the losing game G to J does not change which player in Jcan force a win,because any intermediate move in G by his opponent is simplycountered by the winning player,until the moves in G are exhausted.Formally,weﬁrst prove by induction the simpler claim that for all games J,if J islosing,then G+J is losing.(So weﬁrst ignore the‘only if’part.)Our inductive assumptions for this simpler claim are:for all losing games G that are simplerthan G,if J is losing,then G +J is losing;and for all games J that are simplerthan J,if J is losing,then G+J is losing.So suppose that J is losing.We want to show that G+J is losing.Any initial movein J leads to an option J which is winning,which means that there is acorresponding option J of J (by player II’s reply)where J is losing.Hence,whenplayer I makes the corresponding initial move from G+J to G+J ,player II cancounter by moving to G+J .By inductive assumption,this is losing because J islosing.Alternatively,player I may move from G+J to G +J.Because G is a losinggame,there is a move by player II from G to G where G is again a losing game,and hence G +J is also losing,by inductive assumption,because J is losing.Thiscompletes the induction and proves the claim.What is missing is to show that if G+J is losing,so is J.If J was winning,then therewould be a winning move to some option J of J where J is losing,but then,by ourclaim(the‘if’part that we just proved),G+J is losing,which would be a winningoption in G+J for player I.But this is a contradiction.This completes the proof.The preceding lemma says that any losing game Z,say,can be added to a game Gwithout changing whether G is winning or losing(in lemma2.5,Z is called G).Thatis,extending(2.3),Z losing=⇒G+Z≡G.(2.4)As an example,consider Z=∗1+∗2+∗3,which is nim with three heaps of sizes1,2,3.To see that Z is losing,we examine the options of Z and show that all ofthem are winning games.Removing an entire heap leaves two unequal heaps,whichis a winning position by lemma2.1.Any other move produces three heaps,two of40Game theorywhich have equal size.Because two equal heaps deﬁne a losing nim game Z,they can be ignored by(2.4),meaning that all these options are like single nim heaps and therefore winning positions,too.So Z=∗1+∗2+∗3is losing.The game G=∗4+∗5is clearly winning.By(2.4), the game G+Z is equivalent to G and is also winning.However,verifying directly that∗1+∗2+∗3+∗4+∗5is winning would not be easy to see without using(2.4). It is an easy exercise to show that in sums of games,games can be replaced by equivalent games,resulting in an equivalent sum.That is,for all games G,H,J,G≡H=⇒G+J≡H+J.(2.5)Note that(2.5)is not merely a re-statement of deﬁnition2.4,because equivalence of the games G+J and H+J means more than just that the games are either both winning or both losing(see the comments before lemma2.9below).Lemma2.6(The copycat principle)G+G≡∗0for any impartial game G. Proof.Given G,we assume by induction that the claim holds for all simpler games G .Any option of G+G is of the form G +G for an option G of G.This is winning by moving to the game G +G which is losing,by inductive assumption.So G+G is indeed a losing game,and therefore equivalent to∗0by lemma2.5.We now come back to the issue of inverse elements in abstract groups,mentioned at the end of section2.9.If we identify equivalent games,then the addition+of games deﬁnes indeed a group operation.The neutral element is∗0,or any equivalent game (that is,a losing game).The inverse of a game G,written as the negative−G,fulﬁlsG+(−G)≡∗0.(2.6) Lemma2.6shows that for an impartial game,−G is simply G itself.Side remark:For games that are not impartial,that is,partisan games,−G exists also.It is G but with the roles of the two players exchanged,so that whatever move was available to player I is now available to player II and vice versa.As an example, consider the game checkers(with the rule that whoever can no longer make a move loses),and let G be a certain conﬁguration of pieces on the checkerboard.Then−G is the same conﬁguration with the white and black pieces interchanged.Then in the game G+(−G),player II(who can move the black pieces,say),can also play‘copycat’.Namely,if player I makes a move in either G or−G with a white piece, then player II copies that move with a black piece on the other board(−G or G, respectively).Consequently,player II always has a move available and will win the game,so that G+(−G)is indeed a losing game for the starting player I,that is,G+(−G)≡∗0.However,we only consider impartial games,where−G=G.The following condition is very useful to prove that two games are equivalent. Lemma2.7Two impartial games G,H are equivalent if and only if G+H≡∗0.Proof.If G≡H,then by(2.5)and lemma2.6,G+H≡H+H≡∗0.Conversely,G+H≡∗0implies G≡G+H+H≡∗0+H≡H.Sometimes,we want to prove equivalence inductively,where the following observation is useful.Lemma2.8Two games G and H are equivalent if all their options are equivalent, that is,for every option of G there is an equivalent option of H and vice versa.。

博弈论(第一、二章)

游戏2：摘柿子
甲跑
摇跑
乙
摇跑
甲
摇
乙跑
摇跑
甲
不跑（2,2）
（0,0）
（0,1）（2,0）
（0,3）（4,0）
游戏3：免费彩票博弈
每个人可以免费购买任意数量彩票，随机抽取1张彩票中奖，奖金总额为1000万元/n，n 为彩票数量。
博弈论：研究理性人行为选择的理论
博弈论作用：帮助个人、组织等决策主体深刻理解策略并明智的选择行动。
第二章完全信息静态博弈
� 基本分析思路和方法 � 纳什均衡 � 混合策略 � 纳什均衡的选择
第一节基本分析思路和方法
行动或策略（acቤተ መጻሕፍቲ ባይዱion or strategy）
si：局中人i的一个特定策略 Si：局中人i的策略集（strategy set）或策略空间（strategy space），可以是离散的或连续的。
纳什的基本贡献是证明了非合作博弈均衡解及其存在性，建立了作为博弈论基础的“纳什均衡”概念；海萨尼则把不完全信息纳入到博弈论方法体系中；泽尔腾的贡献在于将博弈论由静态向动态的扩展，建立了“子博弈精练纳什均衡”的概念。
1996莫里斯（James A.Mirrlees）和维克瑞（William Vickrey）
游戏1：军事游戏-进攻和防守
博弈结果表
守方
B 攻方 a -1 b -1 c +1 +1 +1 -1 +1 -1 -1
C -1 +1 +1
游戏1：军事游戏-进攻和防守
博弈结果表
守方
B 攻方 a -1 b -1 c +1 +1 +1 -1 +1 -1 -1

波恩大学博弈论讲义 GameT-2

Game Theory Lecture 2A reminderLet G be a finite, two player game of perfect information without chance moves. Theorem (Zermelo, 1913):Either player 1 can force an outcome in T or player 2 can force an outcome in T’A reminder Zermelo’s proof uses Backwards InductionA reminderA game G is strictly Competitive if for any twoterminal nodes a,bab b 2a1An application of Zermelo’s theorem toStrictly Competitive GamesLet a 1,a 2,….a n be the terminal nodes of a strictly competitive game (with no chance moves and with perfectinformation) and let:a n 1a n-1 1 …. 1a 2 1a 1(i.e. a n 2a n-1 2 …. 2a 2 2a 1).?Then there exists k ,n k 1 s.t. player 1 can forcean outcome in a n , a n-1……a kAnd player 2can force an outcome in a k , a k-1……a 1?a n 1a n-1 1.. 1 a k 1 .. 1a 2 1a 1G(s,t)=a kPlayer 1has a strategy swhich forces an outcomebetter or equal to a k ( 1)Player 2has a strategy twhich forces an outcomebetter or equal to a k ( 2)Let w j= a n, a n-1……,a j wn+1=an , an-1…aj…, a2, a1w1w2w jwn+1wnPlayer 1can force an outcome in W 1 = a n , a n-1…,a 1 ,and cannot force an outcome in w n+1= .Let w j = a n , a n-1……,a j w n+1= w 1, w 2, ….w n ,w n+1can force cannot forcecan force ??Let k be the maximal integer s.t. player 1can force an outcome in W kProof :w 1, w 2, … wk , w k+1...,w n+1Player 1 can force Player 1 cannot forceLet k be the maximal integer s.t. player 1can force an outcome in W ka n , a n-1…a k+1 ,a k …, a 2, a 1 w 1w k+1w k Player 2can force an outcome in T -w k+1by Zermelo’s theorem!!!!!a n 1a n-1 1.. 1 a k 1 .. 1a 2 1a 1G(s,t)=a k Player 1has a strategy s which forces an outcome better or equal to a k ( 1)Player 2has a strategy t which forces an outcome better or equal to a k ( 2)Now consider the implications of this result for thestrategic form game s ta kplayer 1’s strategy s guarantees at least a kplayer 2’s strategy t guarantees him at least a k------+++++i.e.at most a k for player 1stak------+++++The point (s,t)is a Saddle pointstak------+++++Given that player 2plays t,Player 1hasno better strategythan sstrategy s is player 1’sbest responseto player 2’s strategy tSimilarly, strategy t is player 2’sbest responseA pair of strategies (s,t)such that eachis a best response to the other isa Nash EquilibriumJohn F. Nash Jr.This definition holds for any game,not only for strict competitive ones12 221WW LWWrlRML121 2Example3R LRrL lbackwards Induction(Zermelo)r( l , r ) ( R, , )2 221WW LWWrlRML1223RLRrLl rAll thosestrategy pairs areNash equilibriaBut there are otherNash equilibria …….( l , r ) ( L,( l , r ) ( L, , )2 221WW LWWrlRML1223RLRrLl rThe strategies obtained bybackwards inductionAre Sub-Game Perfect equilibriain each sub-game they prescribe2 221WW LWWrlRML1223RLRrLl rWhereas, thenon Sub-Game PerfectNash equilibriumprescribes a non equilibrium( l , r ) ( L, , )A Sub-Game Perfect equilibriaprescribes a Nash equilibriumin each sub-gameR. SeltenAwarding the Nobel Prize in Economics -1994Chance MovesNature (player 0),chooses randomly, with known probabilities, among some actions.+ + + = 11/61111111/6123456information setN.S .S .N.S .S .N.S .S .N.S .S .N.S .S .N.S .S .Payoffs:W (when the other dies, or when the other chosenot shoot in his turn)D (when not shooting)L (when dead)1/61111111/6123456N.S .S .N.S .S .N.S .S .N.S .S .N.S .S .N.S .S .Payoffs:W (when the other dies, or when the other didnot shoot in his turn)D (when not shooting)L (when dead)W D L1/61111111/6123456N.S .S .N.S .S .N.S .S .N.S .S .N.S .S .N.S .S .DDDDDDL222221/61111111/6123456N.S .S .N.S .S .N.S .S .N.S .S .N.S .S .N.S .S .DDDDDDL22222N.S .S .DLN.S .S .DN.S .S .DN.S .S .DN.S .S .D。

博弈论 Game theory (全)

博弈论 Game Theory博弈论亦名“对策论”、“赛局理论”，属应用数学的一个分支, 目前在生物学，经济学，国际关系，计算机科学, 政治学，军事战略和其他很多学科都有广泛的应用。

在《博弈圣经》中写到：博弈论是二人在平等的对局中各自利用对方的策略变换自己的对抗策略，达到取胜的意义。

主要研究公式化了的激励结构间的相互作用。

是研究具有斗争或竞争性质现象的数学理论和方法。

也是运筹学的一个重要学科。

博弈论考虑游戏中的个体的预测行为和实际行为，并研究它们的优化策略。

表面上不同的相互作用可能表现出相似的激励结构（incentive structure），所以他们是同一个游戏的特例。

其中一个有名有趣的应用例子是囚徒困境（Prisoner's dilemma）。

具有竞争或对抗性质的行为称为博弈行为。

在这类行为中，参加斗争或竞争的各方各自具有不同的目标或利益。

为了达到各自的目标和利益，各方必须考虑对手的各种可能的行动方案，并力图选取对自己最为有利或最为合理的方案。

比如日常生活中的下棋，打牌等。

博弈论就是研究博弈行为中斗争各方是否存在着最合理的行为方案，以及如何找到这个合理的行为方案的数学理论和方法。

生物学家使用博弈理论来理解和预测演化（论）的某些结果。

例如，约翰·史密斯（John Maynard Smith）和乔治·普莱斯（George R. Price）在1973年发表于《自然》杂志上的论文中提出的“evolutionarily stable strategy”的这个概念就是使用了博弈理论。

其余可参见演化博弈理论（evolutionary game theory）和行为生态学（behavioral ecology）。

博弈论也应用于数学的其他分支，如概率，统计和线性规划等。

历史博弈论思想古已有之，我国古代的《孙子兵法》就不仅是一部军事著作，而且算是最早的一部博弈论专著。

博弈论最初主要研究象棋、桥牌、赌博中的胜负问题，人们对博弈局势的把握只停留在经验上，没有向理论化发展。

英语第一章阅读 game theory 原文及翻译

都具有相互依赖的共同特征。也就是说，每个参与者的结果取决于所有人的选择（策略）。在所谓的零和游戏中，玩家的利益是完全冲突的博弈论是战略的科学。它试图从数学和逻辑上确定 “玩家”应采取的行动，以确保他们在各种“游戏”中获得最佳成果。所研究的游戏包括从国际象棋到儿童饲养，从网球到收购。但是这些游戏，所以一个人的收益总是另一个人的损失。更典型的是有相互收益（正数）或相互伤害（负数）的博弈，以及一些冲突。
The essence of a game is the interdependence of player strategies. There are two distinct types of strategic interdependence: sequential and simultaneous. In the former the players move in sequence, each aware of the others’ previous actions. In the latter the players act at the same time, each ignorant of the others’ actions.
Game theory was pioneered by Princeton mathematician john von Neumann. In the early years the emphasis was on games of pure conflict (zero-sum games). Other games were considered in a cooperative form. That is, the participants were supposed to choose and implement their actions jointly. Recent research has focused on games that are neither zero sum nor purely cooperative. In these games the players choose their actions separately, but their links to others involve elements of both competition and cooperation.

game theory lecture2

• we cab use the common knowledge of rationality to • build an iterative process that takes this reasoning to the limit. • After employing this reasoning one time, we can eliminate all the strategies that are never a best response, resulting in a possibly “smaller” reduced game that includes only strategies that can be a best response in the original game. • Then we can employ this reasoning again and again, in a similar way that we did for IESDS, in order to eliminate outcomes that should not be played by players who share a common knowledge of rationality
A cournot Duopoly Game
• Two identical firms, players 1 and 2, produce some good. • Let the demand be given by p(q) = 100 − q, where q = q1+ q2. • Assume that there are no fixed costs of production, the firms have linear cost for producing quantity qi is given by ci(qi) =10qi for i ∈ {1, 2}. • What is the production plans of the firms?

博弈论Game Theory2

划线法

在具有策略和利益相互依存的博弈问题中，各个博弈方的得益既取决于自己选择的策略，还与其他策略方选择的策略有关。因此，博弈方在决策时必须考虑其他博弈方的存在和策略选择。依据这种思想，科学的决策思路应该是：找出自己针对其他博弈方每种策略和策略组合的最佳对策，即自己的可选策略与其他博弈方每种策略配合，给自己带来最大得益的策略，然后通过对其他博弈方策略选择的判断，预测博弈的可能结果和确定自己的最优策略。
举例

古诺的寡头模型设一市场有两家厂商生产同样的产品。如果厂商1 的产量为q1，厂商2的产量为q2，则市场总产量为 Q = q1 + q2 。设市场出清价格P（可以将产品全部卖出去的价格）是市场总产量的函数P = P（Q） = 8 -Q。再设两厂商的生产都无固定成本，且每增加一单位产量的边际成本相等，C1 = C2 = 2，即它们分别生产q1和q2单位产量的总成本分别为2 q1和2 q2 。最后强调两厂商同时决定各自的产量，即他们在决策之前都不知道另一方的产量。
求解纳什均衡

博弈方就是n个农户，他们各自的策略空间就是他们可能选择的羊群数目qi(i=1,2, …,n)，取值范围，当各户羊群数为q1, …qn时,在公共草地上放牧羊群的总数为Q= q1+ q2+…+ qn,,每只羊的产出应是羊群总数Q的函数V=v(Q)=v(q1+ q2+…+ qn).假设每只羊的成本是不变的常数c,则农户i养qi只羊的得益函数为:
u i q i V ( Q ) q i c q iV ( q 1 q 2 q n ) q i c
续
假设 n 3 , 即只有三个农户，每只羊的产出函数为 V 100 Q 100 ( q 1 q 2 q 3 ), 而成本 c 4 .这时，三个农户的得益函数分别为 u 1 q 1 [100 ( q 1 q 2 q 3 )] 4 q 1 u 2 q 2 [100 ( q 1 q 2 q 3 )] 4 q 2 u 3 q 3 [100 ( q 1 q 2 q 3 )] 4 q 3 把上述得益函数看作连续函数。

博弈论-game-theory-两人轮流进行游戏

g(a(k+1))=0 !
当k∞时 x 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 …… g(x) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 …… 这有啥用
游戏的联合
定义：对于n个给定的公平组合博弈G1, G2, …, Gn，定义他们集的合联；合对为于G一=个G1局+G面2+x…i属+G于n.X对i，于设游F戏i(xGi)i表Байду номын сангаас示设xXi的i为后它继的局局面面集合对。于G那的么一G个的局局面面x集=合{x1X,x=2,X…1*,xXn2}*，…它*X的n（后其继中局*为面笛集卡合儿积）；
gn(x1,x2,…,xn) = g(x1)⊕g(x2)⊕…⊕g(xn)
= x1⊕x2⊕…⊕xn
经典Nim游戏
图的游戏
3
0
2 0
1
3 ⊕0 ⊕0=3
0 0
1 0
1
Anti-Nim
有n堆石子，每堆ai个，两个人轮流游戏，每次游戏者取走某一石碓中至少1枚，至多k枚的石子。谁取走最后一颗石子算谁输。
一方算输无论游戏如何进行，总可以在有限步之内结束。(the
Ending Condition)
N局面，P局面
N局面——先手必胜局面
winning for the Next player
P局面——后手必胜局面
winning for the Previous player
定义：
每一个最终局面都是P局面对于一个局面，若至少有一种操作使它变成一个P局面，
还扩展
游戏4：游戏有n堆石子，第i堆有ai枚，两人轮流进行游戏，每次游戏者可以从任意一堆取走任意多枚石子，也可以将任意的一堆石子任意的分成两堆。谁取走最后一颗石子为胜。

博弈论讲义2

三重复剔除的占优均衡
重复剔除严格劣策略：
思路：首先找到某个参与人的劣策略（假定存在），把这个劣策略剔除掉，重新构造一个不包含已剔除策略的新的博弈，然后再剔除这个新的博弈中的某个参与人的劣策略，一直重复这个过程，直到只剩下唯一的策略组合为止。这个唯一剩下的策略组合就是这个博弈的均衡解，称为“重复剔除的占优均衡”。
独木桥
进
A
退
B
进退 -3，-3 2，0
0，2 0，0
纳什均衡：A进，B退；A退，B进
斗鸡博弈
村子里有两户富户，有两种可能：一家修，另一家就不修；一家不修，另一家就得修。
冷战期间美苏抢占地盘：一方抢占一块地盘，另一方就占另一块。
夫妻吵架，一方厉害，另一方就出去躲躲。
注意：在混合策略纳什均衡条件下，也可能两败俱伤。
注意：如果所有人都有（严格）占优策略存在，
那么占优策略均衡就是可以预测的唯一均衡。占优策略只要求每个参与人是理性的，而不要求每个参与人知道其他参与人是理性的（也就是说，不要求理性是共同知识）。为什么？
二占优策略均衡
案例-囚徒困境
囚徒A
囚徒 B
坦白
坦白 -8，-8
抵赖
0，-10 -8大于-10
相安无事；第二天，相安无事……；直到第100天，突然，每个妻子都把丈夫杀了。为什么会这样？
这是一个推理和行动的过程。如果她的丈夫不忠的话，她就杀死他；如果没有证据证明她的丈夫不忠的话，她便相信他，不杀死他。

如果村里只有一个男人是不忠的话，在老太太作了宣布之
后的第一天，这个男人的妻子在老太太宣布之后马上就能知道
两只猪一起去按，然后一起回槽边进食，由于大猪吃得快可吃下8个单位的食物，小猪只能吃到2个单位食物。

game theory博弈论

game theory博弈论
游戏理论，也被称为博弈论，是一种研究人类决策和行为的数学框架。

它旨在理解在人类决策中存在的不确定性和竞争条件下，每个参与者的决策如何影响整个系统的结果。

从二战后的经济学开始，游戏理论已经成为经济学、政治学、心理学、哲学和博弈理论的重要研究领域。

它也成为了解决现实生活中许多社会问题的一种有力工具，例如市场竞争、调解博弈、投票、拍卖、国际贸易等。

游戏理论中的核心概念包括博弈、策略、收益和均衡等。

博弈是指参与者之间的相互作用，策略是指参与者制定的行动计划，收益是指参与者对于结果的评价，均衡是指没有参与者有动机改变他们的策略的状态。

在游戏理论中，有许多不同的博弈模型，例如零和博弈、合作博弈、非合作博弈等。

在每种模型中，参与者的决策和行为都会受到不同的影响和限制。

通过了解游戏理论，我们可以更好地理解许多人类行为的原理和动机，同时也可以更好地理解和预测许多社会问题的发展趋势。

- 1 -。

1、下载文档前请自行甄别文档内容的完整性，平台不提供额外的编辑、内容补充、找答案等附加服务。
2、"仅部分预览"的文档,不可在线预览部分如存在完整性等问题,可反馈申请退款(可完整预览的文档不适用该条件!)。
3、如文档侵犯您的权益，请联系客服反馈,我们会尽快为您处理(人工客服工作时间：9:00-18:30)。

Game Theory(2)

合集下载

40_game_theory_ch_2

博弈论(第一、二章)

波恩大学博弈论讲义 GameT-2

博弈论 Game theory (全)

英语第一章阅读 game theory 原文及翻译

game theory lecture2

博弈论Game Theory2

博弈论-game-theory-两人轮流进行游戏

博弈论讲义2

game theory博弈论

文档推荐

最新文档

Game Theory(2)

合集下载

40_game_theory_ch_2

博弈论(第一、二章)

波恩大学博弈论 讲义 GameT-2

博弈论 Game theory (全)

英语第一章阅读 game theory 原文及翻译

game theory lecture2

博弈论Game Theory2

博弈论-game-theory-两人轮流进行游戏

博弈论讲义2

game theory博弈论

文档推荐

最新文档

波恩大学博弈论讲义 GameT-2