博弈论课文翻译
- 格式:doc
- 大小:33.50 KB
- 文档页数:3
Game Theory: Lecture 1 TranscriptProfessor Ben Polak: So this is Game Theory Economics 159. If you're here for art history, you're either in the wrong room or stay anyway, maybe this is the right room; but this is Game Theory, okay. You should have four handouts; everyone should have four handouts. There is a legal release form--we'll talk about it in a minute--about the videoing. There is a syllabus, which is a preliminary syllabus: it's also online. And there are two games labeled Game 1 and Game 2. Can I get you all to look at Game 1 and start thinking about it. And while you're thinking about it, I am hoping you can multitask a bit. I'll describe a bit about the class and we'll get a bit of admin under our belts. But please try and look at--somebody's not looking at it, because they're using it as a fan here--so look at Game 1 and fill out that form for me, okay?So while you're filling that out, let me tell you a little bit about what we're going to be doing here. So what is Game Theory? Game Theory is a method of studying strategic situations. So what's a strategic situation? Well let's start off with what's not a strategic situation. In your Economics - in your Intro Economics class in 115 or 110, you saw some pretty good examples of situations that were not strategic. You saw firms working in perfect competition. Firms in perfect competition are price takers: they don't particularly have to worry about the actions of their competitors. You also saw firms that were monopolists and monopolists don't have any competitors to worry about, so that's not a particularly strategic situation. They're not price takers but they take the demand curve. Is this looking familiar for some of you who can remember doing 115 last year or maybe two years ago for some of you? Everything in between is strategic. So everything that constitutes imperfect competition is a strategic setting. Think about the motor industry, the motor car industry. Ford has to worry about what GM is doing and what Toyota is doing, and for the moment at least what Chrysler is doing but perhaps not for long. So there's a small number of firms and their actions affect each other.So for a literal definition of what strategic means: it's a setting where the outcomes that affect you depend on actions, not just on your own actions, but on actions of others. All right, that's as much as I'm going to say for preview right now, we're going to come back and see plenty of this over the course of the next semester.So what I want to do is get on to where this applies. It obviously applies in Economics, but it also applies in politics, and in fact, this class will count as a Political Science class if you're a Political Science major. You should go check with the DUS in Political Science. It count - Game Theory is very important in law these days. So for those of you--for the half of you--that aregoing to end up in law school, this is pretty good training. Game Theory is also used in biology and towards the middle of the semester we're actually going to see some examples of Game Theory as applied to evolution. And not surprisingly, Game Theory applies to sport.So let's talk about a bit of admin. How are you doing on filling out those games? Everyone managing to multitask: filling in Game 1? Keep writing. I want to get some admin out of the way and I want to start by getting out of the way what is obviously the elephant in the room. Some of you will have noticed that there's a camera crew here, okay. So as some of you probably know, Yale is undergoing an open education project and they're videoing several classes, and the idea of this, is to make educational materials available beyond the walls of Yale. In fact, on the web, internationally, so people in places, maybe places in the U.S. or places miles away, maybe in Timbuktu or whatever, who find it difficult to get educational materials from the local university or whatever, can watch certain lectures from Yale on the web.Some of you would have been in classes that do that before. What's going to different about this class is that you're going to be participating in it. The way we teach this class is we're going to play games, we're going to have discussions, we're going to talk among the class, and you're going to be learning from each other, and I want you to help people watching at home to be able to learn too. And that means you're going to be on film, at the very least on mike.So how's that going to work? Around the room are three T.A.s holding mikes. Let me show you where they are: one here, one here, and one here. When I ask for classroom discussions, I'm going to have one of the T.A.s go to you with a microphone much like in "Donahue" or something, okay. At certain times, you're going to be seen on film, so the camera is actually going to come around and point in your direction.Now I really want this to happen. I had to argue for this to happen, cause I really feel that this class isn't about me. I'm part of the class obviously, but it's about you teaching each other and participating. But there's a catch, the catch is, that that means you have to sign that legal release form.So you'll see that you have in front of you a legal release form, you have to be able to sign it, and what that says is that we can use you being shown in class. Think of this as a bad hair day release form. All right, you can't sue Yale later if you had a bad hair day. For those of you who are on the run from the FBI, your Visa has run out, or you're sitting next to your ex-girlfriend, now would be a good time to put a paper bag over your head.All right, now just to get you used to the idea, in every class we're going to have I think the same two people, so Jude is the cameraman; why don't you all wave to Jude: this is Jude okay. And Wes is our audio guy: this is Wes. And I will try and remember not to include Jude and Wes in the classroom discussions, but you should be aware that they're there. Now, if this is making you nervous, if it's any consolation, it's making me very nervous. So, all right, we'll try and make this class work as smoothly as we can, allowing for this extra thing. Let me just say, no one's making any money off this--at least I'm hoping these guys are being paid--but me and the T.A.s are not being paid. The aim of this, that I think is a good aim, it's an educational project, and I'm hoping you'll help us with it. The one difference it is going to mean, is that at times I might hold some of the discussions for the class, coming down into this part of the room, here, to make it a little easier for Jude.All right, how are we doing now on filling out those forms? Has everyone filled in their strategy for the first game? Not yet. Okay, let's go on doing a bit more admin. The thing you mostly care about I'm guessing, is the grades. All right, so how is the grade going to work for this class? 30% of the class will be on problem sets, 30% of the grade; 30% on the mid-term, and 40% on the final; so 30/30/40.The mid-term will be held in class on October 17th; that is also in your syllabus. Please don't anybody tell me late - any time after today you didn't know when the mid-term was and therefore it clashes with 17 different things. The mid-term is on October 17th, which is a Wednesday, in class. All right, the problem sets: there will be roughly ten problem sets and I'll talk about them more later on when I hand them out. The first one will go out on Monday but it will be due ten days later. Roughly speaking they'll be every week.The grade distribution: all right, so this is the rough grade distribution. Roughly speaking, a sixth of the class are going to end up with A's, a sixth are going to end up with A-, a sixth are going to end up with B+, a sixth are going to end up with B, a sixth are going to end up with B-, and the remaining sixth, if I added that up right, are going to end up with what I guess we're now calling the presidential grade, is that right?That's not literally true. I'm going to squeeze it a bit, I'm going to curve it a bit, so actually slightly fewer than a sixth will get straight A's, and fewer than a sixth will get C's and below. We'll squeeze the middle to make them be more B's. One thing I can guarantee from past experience in this class, is that the median grade will be a B+. The median will fall somewhere in the B+'s. Just as forewarning for people who have forgotten what a median is,that means half of you--not approximately half, it means exactly half of you--will be getting something like B+ and below and half will get something like B+ and above.Now, how are you doing in filling in the forms? Everyone filled them in yet? Surely must be pretty close to getting everyone filled in. All right, so last things to talk about before I actually collect them in - textbooks. There are textbooks for this class. The main textbook is this one, Dutta'sbook Strategy and Games. If you want a slightly tougher book, more rigorous book, try Joel Watson's book, Strategies. Both of those books are available at the bookstore.But I want to warn everybody ahead of time, I will not be following the textbook. I regard these books as safety nets. If you don't understand something that happened in class, you want to reinforce an idea that came up in class, then you should read the relevant chapters in the book and the syllabus will tell you which chapters to read for each class, or for each week of class, all right. But I will not be following these books religiously at all. In fact, they're just there as back up.In addition, I strongly recommend people read, Thinking Strategically. This is good bedtime reading. Do any of you suffer from insomnia? It's very good bedtime reading if you suffer from insomnia. It's a good book and what's more there's going to be a new edition of this book this year and Norton have allowed us to get advance copies of it. So if you don't buy this book this week, I may be able to make the advance copy of the new edition available for some of you next week. I'm not taking a cut on that either, all right, there's no money changing hands.All right, sections are on the syllabus sign up - sorry on the website, sign up as usual. Put yourself down on the wait list if you don't get into the section you want. You probably will get into the section you want once we're done. All right, now we must be done with the forms. Are we done with the forms? All right, so why don't we send the T.A.s, with or without mikes, up and down the aisles and collect in your Game #1; not Game #2, just Game #1.Just while we're doing that, I think the reputation of this class--I think--if you look at the course evaluations online or whatever, is that this class is reasonably hard but reasonably fun. So I'm hoping that's what the reputation of the class is. If you think this class is going to be easy, I think it isn't actually an easy class. It's actually quite a hard class, but I think I can guarantee it's going to be a fun class. Now one reason it's a fun class, is the nice thing about teaching Game Theory - quieten down folks--one thing about teaching Game Theory is, you get to play games, and that's exactlywhat we've just been doing now. This is our first game and we're going to play games throughout the course, sometimes several times a week, sometimes just once a week.We got all these things in? Everyone handed them in? So I need to get those counted. Has anyone taken the Yale Accounting class? No one wants to - has aspirations to be - one person has. I'll have a T.A. do it, it's all right,we'll have a T.A. do it. So Kaj, can you count those for me? Is that right? Let me read out the game you've just played."Game 1, a simple grade scheme for the class. Read the following carefully. Without showing your neighbor what you are doing, put it in the box below either the letter Alpha or the letter Beta. Think of this as a grade bid. I will randomly pair your form with another form and neither you nor your pair will ever know with whom you were paired. Here's how the grades may be assigned for the class. [Well they won't be, but we can pretend.] If you put Alpha and you're paired with Beta, then you will get an A and your pair a C. If you and your pair both put Alpha, you'll both get B-. If you put Beta and you're paired with Alpha, you'll get a C and your pair an A. If you and your pair both put Beta, then you'll both get B+."So that's the thing you just filled in.Now before we talk about this, let's just collect this information in a more useful way. So I'm going to remove this for now. We'll discuss this in a second, but why don't we actually record what the game is, that we're playing, first. So this is our grade game, and what I'm going to do, since it's kind of hard to absorb all the information just by reading a paragraph of text, I'm going to make a table to record the information. So what I'm going to do is I'm going to put me here, and my pair, the person I'm randomly paired with here, and Alpha and Beta, which are the choices I'm going to make here and on the columns Alpha and Beta, the choices my pair is making.In this table, I'm going to put my grades. So my grade if we both put Alpha is B-, if we both put Beta, was B+. If I put Alpha and she put a Beta, I got an A, and if I put Beta and she put an Alpha, I got a C. Is that correct? That's more or less right? Yeah, okay while we're here, why don't we do the same for my pair? So this is my grades on the left hand table, but now let's look at what my pair will do, what my pair will get.So I should warn the people sitting at the back that my handwriting is pretty bad, that's one reason for moving forward. The other thing I should apologize at this stage of the class is my accent. I will try and improve the handwriting, there's not much I can do about the accent at this stage.So once again if you both put Alpha then my pair gets a B-. If we both put Beta, then we both get a B+; in particular, my pair gets a B+. If I put Alpha and my pair puts Beta, then she gets a C. And if I put Beta and she puts Alpha, then she gets an A. So I now have all the information that was on the sheet of paper that you just handed in.Now there's another way of organizing this that's standard in Game Theory, so we may as well get used to it now on the first day. Rather then drawing two different tables like this, what I'm going to do is I'm going to take the second table and super-impose it on top of the first table. Okay, so let me do that and you'll see what I mean. What I'm going to do is draw a larger table, the same basic structure: I'm choosing Alpha and Beta on the rows, my pair is choosing Alpha and Beta on the columns, but now I'm going to put both grades in. So the easy ones are on the diagonal: you both get B- if we both choose Alpha; we both get B+ if we both choose Beta. But if I choose Alpha and my pair chooses Beta, I get an A and she gets a C. And if I choose Beta and she chooses Alpha, then it's me who gets the C and it's her who gets the A.So notice what I did here. The first grade corresponds to the row player, me in this case, and the second grade in each box corresponds to the column player, my pair in this case. So this is a nice succinct way of recording what was in the previous two tables. This is an outcome matrix; this tells us everything that was in the game.Okay, so now seems a good time to start talking about what people did. So let's just have a show of hands. How many of you chose Alpha? Leave your hands up so that Jude can catch that, so people can see at home, okay. All right and how many of you chose Beta? There's far more Alphas - wave your hands the Beta's okay. All right, there's a Beta here, okay. So it looks like a lot of - well we're going to find out, we're going to count--but a lot more Alpha's than Beta's. Let me try and find out some reasons why people chose.So let me have the Alpha's up again. So, the woman who's in red here, can we get a mike to the - yeah, is it okay if we ask you? You're not on the run from the FBI? We can ask you why? Okay, so you chose Alpha right? So why did you choose Alpha?Student: [inaudible] realized that my partner chose Alpha, therefore I chose [inaudible].Professor Ben Polak: All right, so you wrote out these squares, you realized what your partner was going to do, and responded to that. Any otherreasons for choosing Alpha around the room? Can we get the woman here? Try not to be intimidated by these microphones, they're just mikes. It's okay.Student: The reason I chose Alpha, regardless of what my partner chose, I think there would be better outcomes than choosing Beta.Professor Ben Polak: All right, so let me ask your names for a second-so your name was?Student: Courtney.Professor Ben Polak: Courtney and your name was?Student: Clara Elise.Professor Ben Polak: Clara Elise. So slightly different reasons, same choice Alpha. Clara Elise's reason - what did Clara Elise say? She said, no matter what the other person does, she reckons she'd get a better grade if she chose Alpha. So hold that thought a second, we'll come back to - is it Clara Elise, is that right? We'll come back to Clara Elise in a second. Let's talk to the Beta's a second; let me just emphasize at this stage there are no wrong answers. Later on in the class there'll be some questions that have wrong answers. Right now there's no wrong answers. There may be bad reasons but there's no wrong answers. So let's have the Beta's up again. Let's see the Beta's. Oh come on! There was a Beta right here. You were a Beta right? You backed off the Beta, okay. So how can I get a mike into a Beta? Let' s stick in this aisle a bit. Is that a Beta right there? Are you a Beta right there? Can I get the Beta in here? Who was the Beta in here? Can we get the mike in there? Is that possible? In here - you can leave your hand so that - there we go. Just point towards - that's fine, just speak into it, that's fine. Student: So the reason right?Professor Ben Polak: Yeah, go ahead.Student: I personally don't like swings that much and it's the B-/B+ range, so I'd much rather prefer that to a swing from A to C, and that's my reason. Professor Ben Polak: All right, so you're saying it compresses the range.I'm not sure it does compress the range. I mean if you chose Alpha, you're swinging from A to B-; and from Beta, swinging from B+ to C. I mean those are similar kind of ranges but it certainly is a reason. Other reasons for choosing? Yeah, the guy in blue here, yep, good. That's all right. Don't hold the mike; just let it point at you, that's fine.Student: Well I guess I thought we could be more collusive and kind of work together, but I guess not. So I chose Beta.Professor Ben Polak: There's a siren in the background so I missed the answer. Stand up a second, so we can just hear you.Student: Sure.Professor Ben Polak: Sorry, say again.Student: Sure. My name is Travis. I thought we could work together, but I guess not.Professor Ben Polak: All right good. That's a pretty good reason. Student: If you had chosen Beta we would have all gotten B+'s but I guess not.Professor Ben Polak: Good, so Travis is giving us a different reason, right? He's saying that maybe, some of you in the room might actually care about each other's grades, right? I mean you all know each other in class. You all go to the same college. For example, if we played this game up in the business school--are there any MBA students here today? One or two. If we play this game up in the business school, I think it's quite likely we're going to get a lot of Alpha's chosen, right? But if we played this game up in let's say the Divinity School, all right and I'm guessing that Travis' answer is reflecting what you guys are reasoning here. If you played in the Divinity School, you might think that people in the Divinity School might care about other people's grades, right? There might be ethical reasons--perfectly good, sensible, ethical reasons--for choosing Beta in this game. There might be other reasons as well, but that's perhaps the reason to focus on. And perhaps, the lesson I want to draw out of this is that right now this is not a game. Right now we have actions, strategies for people to take, and we know what the outcomes are, but we're missing something that will make this a game. What are we missing here?Student: Objectives.Professor Ben Polak: We're missing objectives. We're missing payoffs. We're missing what people care about, all right. So we can't really start analyzing a game until we know what people care about, and until we know what the payoffs are. Now let's just say something now, which I'll probably forget to say in any other moment of the class, but today it's relevant.Game Theory, me, professors at Yale, cannot tell you what your payoff should be. I can't tell you in a useful way what it is that your goals in life should be or whatever. That's not what Game Theory is about. However, once we know what your payoffs are, once we know what your goals are, perhaps Game Theory can you help you get there.So we've had two different kinds of payoffs mentioned here. We had the kind of payoff where we care about our own grade, and Travis has mentioned the kind of payoff where you might care about other people's grades. And what we're going to do today is analyze this game under both those possible payoffs. To start that off, let's put up some possible payoffs for the game. And I promise we'll come back and look at some other payoffs later. We'll revisit the Divinity School later.All right, so here once again is our same matrix with me and my pair, choosing actions Alpha and Beta, but this time I'm going to put numbers in here. And some of you will perhaps recognize these numbers, but that's not really relevant for now. All right, so what's the idea here? Well the first idea is that these numbers represent utiles or utilities. They represent what these people are trying to maximize, what they're to achieve, their goals.The idea is - just to compare this to the outcome matrix - for the person who's me here, (A,C) yields a payoff of--(A,C) is this box--so (A,C) yields a payoff of three, whereas (B-,B-) yields a payoff of 0, and so on. So what's the interpretation? It's the first interpretation: the natural interpretation that a lot of you jumped to straight away. These are people--people with these payoffs are people--who only care about their own grades. They prefer an A to a B+, they prefer a B+ to a B-, and they prefer a B- to a C. Right, I'm hoping I the grades in order, otherwise it's going to ruin my curve at the end of the year. So these people only care about their own grades. They only care about their own grades.What do we call people who only care about their own grades? What's a good technical term for them? In England, I think we refer to these guys - whether it's technical or not - as "evil gits." These are not perhaps the most moral people in the universe. So now we can ask a different question. Suppose, whether these are actually your payoffs or not, pretend they are for now. Suppose these are all payoffs. Now we can ask, not what did you do, but what should you do? Now we have payoffs that can really switch the question to a normative question: what should you do? Let's come back to - was it Clara Elise--where was Clara Elise before? Let's get the mike on you again. So just explain what you did and why again.Student: Why I chose Alpha?Professor Ben Polak: Yeah, stand up a second, if that's okay.Student: Okay.Professor Ben Polak: You chose Alpha; I'm assuming these were roughly your payoffs, more or less, you were caring about your grades.Student: Yeah, I was thinking -Professor Ben Polak: Why did you choose Alpha?Student: I'm sorry?Professor Ben Polak: Why did you choose Alpha? Just repeat what you said before.Student: Because I thought the payoffs - the two different payoffs that I could have gotten--were highest if I chose Alpha.Professor Ben Polak: Good; so what Clara Elise is saying--it's an important idea--is this (and tell me if I'm paraphrasing you incorrectly but I think this is more or less what you're saying): is no matter what the other person does, no matter what the pair does, she obtains a higher payoff by choosing Alpha. Let's just see that. If the pair chooses Alpha and she chooses Alpha, then she gets 0. If the pair chooses Alpha and she chose Beta, she gets -1. 0 is bigger than -1. If the pair chooses Beta, then if she chooses Alpha she gets 3, Beta she gets 1, and 3 is bigger than 1. So in both cases, no matter what the other person does, she receives a higher payoff from choosing Alpha, so she should choose Alpha. Does everyone follow that line of reasoning? That's a stronger line of reasoning then the reasoning we had earlier. So the woman, I have immediately forgotten the name of, in the red shirt, whose name was -Student: Courtney.Professor Ben Polak: Courtney, so Courtney also gave a reason for choosing Alpha, and it was a perfectly good reason for choosing Alpha, nothing wrong with it, but notice that this reason's a stronger reason. It kind of implies your reason.So let's get some definitions down here. I think I can fit it in here. Let's try and fit it in here.Definition: We say that my strategy Alpha strictly dominates my strategy Beta, if my payoff from Alpha is strictly greater than that from Beta, [and this is the key part of the definition], regardless of what others do.Shall we just read that back? "We say that my strategy Alpha strictly dominates my strategy Beta, if my payoff from Alpha is strictly greater than that from Beta, regardless of what others do." Now it's by no means my main aim in this class to teach you jargon. But a few bits of jargon are going to be helpful in allowing the conversation to move forward and this is certainly one. "Evil gits" is maybe one too, but this is certainly one.Let's draw out some lessons from this. Actually, so you can still read that, let me bring down and clean this board. So the first lesson of the class, and there are going to be lots of lessons, is a lesson that emerges immediately from the definition of a dominated strategy and it's this. So Lesson One of the course is:do not play a strictly dominated strategy. So with apologies to Strunk and White, this is in the passive form, that's dominated, passive voice. Do not play a strictly dominated strategy. Why? Somebody want to tell me why? Do you want to get this guy? Stand up - yeah.Student: Because everyone's going to pick the dominant outcome and then everyone's going to get the worst result - the collectively worst result.Professor Ben Polak: Yeah, that's a possible answer. I'm looking for something more direct here. So we look at the definition of a strictly dominated strategy. I'm saying never play one. What's a possible reason for that? Let's - can we get the woman there?Student: [inaudible]Professor Ben Polak: "You'll always lose." Well, I don't know: it's not about winning and losing. What else could we have? Could we get this guy in the pink down here?Student: Well, the payoffs are lower.Professor Ben Polak: The payoffs are lower, okay. So here's an abbreviated version of that, I mean it's perhaps a little bit longer. The reason I don't want to play a strictly dominated strategy is, if instead, I play the strategy that dominates it, I do better in every case. The reason I never want to play a strictly dominated strategy is, if instead I play the strategy that dominates it, whatever anyone else does I'm doing better than I would have done. Now that's a pretty convincing argument. That sounds like a convincing argument. It sounds like too obvious even to be worth stating in class, so let me now try and shake your faith a little bit in this answer.。
搏弈论阿维纳什•迪克斯特&巴里•内尔巴夫1搏弈是有关策略的科学。
它试图以数学和逻辑的方法来帮助搏弈者作出决策,在一系列纷繁复杂的搏弈中应采取何种策略来保证自己获得最大利益。
搏弈论研究的搏弈的范围包括了从下棋到抚育儿童,从网球竞技到公司转手。
但是所有的博弈都具有一个共同的特征:相互作用。
也就是说,每一个博弈者的博弈结果取决于所有博弈参与者的策略选择。
在零和搏弈中,搏弈者的利益之间是完全冲突的,因此一方的得利必然导致另一方的损失。
更多具有代表性的例子还有会导致共同得利(正和)搏弈和共同损失(负和)搏弈,同样的情况还会发生在另外一些冲突中。
2搏弈论研究的先驱者是普林斯顿数学家约翰•冯•诺依曼。
在早先的一段时间里,研究的重点被放在了完全冲突(零和)搏弈(非合作搏弈)上,其他的搏弈当时被认为是以合作形式出现。
也就是说,搏弈要求参与者共同地选择和实施他们的行为.最近的研究则把重点放在了那些既不属于零和搏弈也不属于绝对合作搏弈的情况上,在这些搏弈中,搏弈者自主地选择搏弈行为,但他们之间的相互关系中充满了合作与竞争。
3搏弈行为与我们在中性环境中所作的各种决定有着根本性的不同。
要说明这一点,我们可以思考一下伐木工人和军队将军所作决定之间的不同。
当伐木工人决定要如何砍树时,他不会考虑树木本身会有什么反抗,他所处的环境为中性。
而当将军决定要消灭敌军时,他必须提前预料到并消除敌军的反抗。
与这一例子中的将军相类似,一个搏弈者必须认识到他与其他机智且怀有争胜之心的竞争者之间的相互作用,他自己所作的决定也必须能够同时应对可能出现的合作或冲突。
‖4搏弈的实质是搏弈者采取策略之间的相互依赖性。
这种策略性的相互依赖表现为两个不同的类别:连续策略之间的相互作用以及联立策略之间的相互作用。
就前者而言,搏弈者依次采取行动,每个人都会注意其他搏弈者先前的行为。
就后者而言,搏弈者同时采取行动,每个人都会忽略其他搏弈者当前的行为。
5对连续策略博弈中的某一博弈这来说,一个普遍的原则就是放眼前方,及时反思和总结。
博弈论 Game Theory博弈论亦名“对策论”、“赛局理论”,属应用数学的一个分支, 目前在生物学,经济学,国际关系,计算机科学, 政治学,军事战略和其他很多学科都有广泛的应用。
在《博弈圣经》中写到:博弈论是二人在平等的对局中各自利用对方的策略变换自己的对抗策略,达到取胜的意义。
主要研究公式化了的激励结构间的相互作用。
是研究具有斗争或竞争性质现象的数学理论和方法。
也是运筹学的一个重要学科。
博弈论考虑游戏中的个体的预测行为和实际行为,并研究它们的优化策略。
表面上不同的相互作用可能表现出相似的激励结构(incentive structure),所以他们是同一个游戏的特例。
其中一个有名有趣的应用例子是囚徒困境(Prisoner's dilemma)。
具有竞争或对抗性质的行为称为博弈行为。
在这类行为中,参加斗争或竞争的各方各自具有不同的目标或利益。
为了达到各自的目标和利益,各方必须考虑对手的各种可能的行动方案,并力图选取对自己最为有利或最为合理的方案。
比如日常生活中的下棋,打牌等。
博弈论就是研究博弈行为中斗争各方是否存在着最合理的行为方案,以及如何找到这个合理的行为方案的数学理论和方法。
生物学家使用博弈理论来理解和预测演化(论)的某些结果。
例如,约翰·史密斯(John Maynard Smith)和乔治·普莱斯(George R. Price)在1973年发表于《自然》杂志上的论文中提出的“evolutionarily stable strategy”的这个概念就是使用了博弈理论。
其余可参见演化博弈理论(evolutionary game theory)和行为生态学(behavioral ecology)。
博弈论也应用于数学的其他分支,如概率,统计和线性规划等。
历史博弈论思想古已有之,我国古代的《孙子兵法》就不仅是一部军事著作,而且算是最早的一部博弈论专著。
博弈论最初主要研究象棋、桥牌、赌博中的胜负问题,人们对博弈局势的把握只停留在经验上,没有向理论化发展。
博弈论(部分英文版翻译)博弈论托马斯·S.Ferguson/translator:·xly第一部分:公平组合游戏1.外卖游戏1.1简单的外卖游戏1.2什么是组合游戏?1.3 P状态和N状态1.4游戏1.5相关练习2.尼姆游戏初步分析尼姆和多堆尼姆游戏布顿理论证明守财奴版尼姆游戏相关练习3.图形游戏有向图形游戏SG函数相关例子的一般图的SG函数4.组合游戏和N图游戏及SG定理的相关应用与休息游戏相关的练习5.硬币游戏的例子二维空间中的硬币旋转游戏尼姆复杂的网格游戏练习6.绿色哈肯布什竹竿树木上的绿色哈肯布什普通根图练习的绿色引导参考材料第一部分:公平组合游戏1。
外卖游戏组合游戏是两人游戏。
如果有足够的条件,当一方不能继续经营时,游戏的结果就会出来。
这个游戏的结果取决于一系列的状态,包括初始状态和准备操作的玩家。
游戏双方轮流操作,直到达到最终状态。
最终状态意味着该状态不能再运行。
此时,结果已经出现分歧。
这里有两个关于组合游戏的主要材料。
一部是康威的《论数字与游戏》,学术出版社1976年出版。
这本书介绍了这一领域的许多基本思想,加速了这一领域今天的发展。
另一本更适合这门课的参考书是学术出版社于1982年出版的两卷本平装本,书名是《柏林坎普、康威和盖伊的数学游戏制胜之道》。
这本书介绍了许多有趣的游戏,学习数学的本科生可以理解。
这些理论可以分为两类。
公平游戏指的是任何给定的状态,游戏双方要采取的行动是相同的。
另一方面,游击队游戏意味着给定一个状态,游戏双方将采取不同的行动。
例如,国际象棋是一种游击队游戏。
在第一部分,我们只研究“公平竞争”。
公平组合游戏的介绍可以在理查德·盖伊写的公平游戏中找到(发表在1989年的COMAP数学探索系列中)。
让我们从一个简单的例子开始。
1.1一个简单的外卖游戏。
这是这个公平组合游戏的一些规则(从一堆筹码中取一些):(1)有两个玩家,我们分别将他们标记为1号和2号;(2)桌上有一堆筹码,总共21个筹码;(3)一次操作可以取1、2、3个筹码,至少要取一个筹码,最多要取3个筹码。
So this is Game Theory Economics 159.欢迎来到经济学159 博弈论If you're here for art history,如果你是来上艺术史的you're either in the wrong room or stay anyway,那你走错教室了不过你不妨留下来maybe this is the right room; but this is Game Theory, okay.没准你会喜欢这门课这里是博弈论You should have four handouts; everyone should have four handouts.你们每人都应该有四页讲义人手一份There is a legal release form--里面有一授权书we'll talk about it in a minute--about the videoing.是关于视频采集的我们一会再说这个There is a syllabus, which is a preliminary, it's also online.有一初步教学大纲上也有And there are two games labeled Game 1 and Game 2.有两页分别印有游戏1和游戏2的传单Can I get you all to look at Game 1 and start thinking about it.大家先浏览下游戏1 然后思考一下And while you're thinking about it, i am hoping you can multitask a bit. 你们一边思考一边听我说I'll describe a bit about the class我来简单介绍下这门课程and we'll get a bit of admin under our belts.还有这门课程的评分细则But please try and look at--somebody's not looking at it,请大家看下游戏1 我知道有人没看because they're using it as a fan here--他们用那纸扇风呢So look at Game 1 and fill out that form for me, okay?快浏览下游戏1 然后填完好吗So while you're filling that out, let me tell you a little你们一边写一边听我说bit about what we're going to be doing here.我来简单地介绍一下这门课So what is Game Theory?什么是博弈论Game Theory is a method of studying strategic situations.博弈论研究策略形势So what's a strategic situation?那什么是策略形势呢Well let's start off with what's not a strategic situation.我们先看看什么不属于策略形势In your Economics - in your Intro Economics class in 115 or 110,在你们经济学基础课115或者110中you saw some pretty good examples of situations that were not strategic. 应该讲过一些不是策略形势的案例You saw firms working in perfect competition.比如说自由竞争企业Firms in perfect competition are price takers:这些企业是价格承受者they don't particularly have to worry about他们不必担心the actions of their competitors.他们的竞争对手的策略You also saw firms that were monopolists又比如说垄断企业and monopolists don't have any competitors to worry about,垄断企业没有竞争对手so that's not a particularly strategic situation.所以这也不是策略形势They're not price takers but they take the demand curve.他们不是价格承受者但面对需求曲线Is this looking familiar for some of you who can remember对于你们这些学过经济学115的来说doing 115 last year or maybe two years ago for some of you?这些听起来都不陌生吧Everything in between is strategic.介于这两种情况之间的就是策略形势So everything that constitutes imperfect competition也就是说不完全竞争的情况is a strategic setting.就是策略形势Think about the motor industry, the motor car industry.比如汽车产业在汽车产业里Ford has to worry about what GM is doing and what Toyota is doing,福特关注通用和丰田的决策and for the moment at least现在可能暂时what Chrysler is doing but perhaps not for long.还得关注克莱斯勒的决策So there's a small number of firms and their actions affect each other. 少数几家公司的决策会互相影响So for a literal definition of what strategic means:策略形势书面定义是it's a setting where the outcomes that affect you depend on actions, 行为影响结果然而结果不仅not just on your own actions, but on actions of others.取决于你的行为还取决于其他人的行为All right, that's as much as I'm going to say for preview right now. 好了暂时就先介绍这么多We're going to come back and see plenty of this通过学习本学期的课程over the course of the next semester.你们会接触更多类似案例So what I want to do is get on to where this applies.下面我们讲讲博弈论适用于什么领域It obviously applies in Economics, but it also applies in politics, 显然适用于经济学但也适用于政治学and in fact, this class will count as a Political Science class实际上对政治专业的同学来说if you're a Political Science major.它也算作政治课You should go check with the DUS in Political Science. It counts.回去咨询下本科教学主管确实算的Game Theory is very important in law these days.近年来博弈论在法学上也举足轻重So for those of you--for the half of you我估计你们多数人可能会去读法学院--that are going to end up in law school, this is pretty good training. 博弈论对你们来说是门不错的基础课Game Theory is also used in biology博弈论同样适用于生物学and towards the middle of the semester在本学期期中的时候we're actually going to see some examples of我们会探讨一下Game Theory as applied to evolution.博弈论应用于生物进化论的案例And not surprisingly, Game Theory applies to sport.而且毫不奇怪博弈论适用于体育So let's talk about a bit of admin.我们接下来谈谈评分标准How are you doing on filling out those games?这两个游戏你们填好了没有Everyone managing to multitask.每个人都要学会一心多用啊Filling in Game 1? Keep writing.还在填游戏1 好吧继续填I want to get some admin out of the way我们继续讲评分标准and I want to start by getting out of the way不过我得先介绍一下what is obviously the elephant in the room.教室里的大家伙Some of you will have noticed that there's a camera crew here, okay. 有些人注意到了这几位摄像人员So as some of you probably know,可能你们已经知道了Yale is undergoing an open education project耶鲁大学正在开展一个教育项目and they're videoing several classes,我们在录制一些课程的录像and the idea of this,这样做的目的是让is to make educational materials available beyond the walls of Yale. 耶鲁外的人也能享受到这些教学资源In fact, on the web, internationally,课程将被发布到网络上so people in places, maybe places in the U.S.全球各地的人都能看到比如美国人or places miles away, maybe in Timbuktu or whatever,或者更遥远的国度像廷巴克图who find it difficult to get educational materials他们可能在当地大学from the local university or whatever,不容易得到这样的教育资源can watch certain lectures from Yale on the web.他们就可以到耶鲁观看这些视频Some of you would have been in classes that do that before.有些同学可能之前参加过这种课程What's going to different about this class is that这门课的不同之处在于you're going to be participating in it.大家都要参与到课程中来The way we teach this class is we're going to play games,课堂上我们通过做游戏来博弈we're going to have discussions, we're going to talk among the class, 课上会有随堂讨论课堂交流and you're going to be learning from each other,我们相互学习这样看视频的学生and I want you to help people watching at home to be able to learn too. 也能收到良好的教学效果And that means you're going to be on film, at the very least on mike. 但同时这意味着你要被拍到视频中来So how's that going to work?至少也会被录音课程怎么拍摄呢Around the room are three T.A.S holding mikes.教室里有三位助教持有麦克风Let me show you where they are: one here, one here, and one here.我来告诉大家助教坐在哪里When I ask for classroom discussions,当我们进行随堂讨论时I'm going to have one of the T.A.S go to you with a microphone我会请一位助教拿着麦克风到你那much like in "Donahue" Or something. Okay.就想多纳休那样At certain times, you're going to be seen on film,有时候你会被拍摄到so the camera is actually going to come around and point in your direction. 此时摄像机会转到你所在的方向Now I really want this to happen.我希望大家多多配合I had to argue for this to happen, cause I really feel that我来解释一下我的理由我觉得this class isn't about me.这门课不应该仅仅由我来讲I'm part of the class obviously,没错我是课程的一部分but it's about you teaching each other and participating.但你们也要参与到课堂中来But there's a catch, the catch is,但是这会有个条件that that means you have to sign that legal release form.你们都要签署授权书So you'll see that you have in front of you a legal release form,你们的面前应该有授权书you have to be able to sign it, and what that says is that授权书的大概容是we can use you being shown in class.允许我们把你拍摄进来Think of this as a bad hair day release form.大家考虑好日后你要是在视频里All right, you can't sue Yale later if you had a bad hair day.发现你糟糕的发型可别起诉耶鲁大学For those of you who are on the run from the FBI,被FBI通缉的签证失效的your Visa has run out, or you're sitting next to your ex-girlfriend, 还有前女友就坐在旁边的now would be a good time to put a paper bag over your head.赶快找个纸袋子把脸给蒙上All right, now just to get you used to the idea,还要说一下每节课我们都会in every class we're going to have I think the same two people,看到这两名工作人员so Jude is the cameraman; why don't you all wave to Jude.裘德是摄影师大家跟裘德打个招呼This is Jude okay. And Wes is our audio guy: this is Wes.这位就是裘德韦斯是录音师And I will try and remember not to我会尽量注意include Jude and Wes in the classroom discussions,不要让他们两位加入课堂讨论的but you should be aware that they're there.但是你们也要注意到他们的存在Now, if this is making you nervous,要是这令你很紧if it's any consolation, it's making me very nervous.它会让我更紧这权当安慰了吧So, all right, we'll try and make this class我会尽量让work as smoothly as we can, allowing for this extra thing.课堂顺利地进行下去还有一件事Let me just say, no one's making any money off this--让我说完没有人会从中得到好处at least I'm hoping these guys are being paid--虽然我也希望他们会得到酬劳but me and the T.A.S are not being paid.但至少我和助教们是没有酬劳的The aim of this, that I think is a good aim, it's an educational project, 这完全出于好意这是个教育项目and I'm hoping you'll help us with it.我也希望你们也能尽己所能把它做好The one difference it is going to mean, is that还有希望大家都能相互理解一下at times I might hold some of the discussions for the class,当我们举行课堂讨论的时候coming down into this part of the room, here,请你们靠前站出来一点to make it a little easier for Jude.这样便于裘德拍摄All right, how are we doing now on filling out those forms?言归正传你们填好了没Has everyone filled in their strategy for the first game?各位是否都填好第一个游戏了Not yet. Okay, let's go on doing a bit more admin.还没有那我们多介绍下评分标准The thing you mostly care about i'm guessing, is the grades.我猜你们最关心的应该是成绩了All right, so how is the grade going to work for this class?那么这门课是怎么评分的呢30% of the class will be on problem sets,课堂作业占总分数的30%30% of the grade; 30% on the mid-term,期中考试占30%and 40% on the final; so 30/30/40.期末考试占40% 即30/30/40The mid-term will be held in class on October 17;期中考试在十月十七日that is also in your syllabus.教学大纲上也写了Please don't anybody tell me late -所以以后千万别跟我说any time after today you didn't know when the mid-term你不知道什么时候期中考试was and therefore it clashes with 17 different things.然后还在考试那天安排大一堆事儿去做The mid-term is on October 17, which is a Wednesday, in class.期中考试在十月十七号周三随堂举行All right, the problem sets:关于课堂作业there will be roughly ten problem sets我应该会安排十次左右的课堂作业and I'll talk about them more later on when I hand them out.在留作业的时候我再细说The first one will go out on Monday but it will be due ten days later. 周一留第一份作业十天之交上来Roughly speaking they'll be every week.差不多每周都有课堂作业The grade distribution:有关分数的分布all right, so this is the rough grade distribution.我介绍下大体的分数的分布Roughly speaking, a sixth of the class are going to end up with A's, 总体来说六分之一的人会得到Aa sixth are going to end up with A-,六分之一的人会得到A-a sixth are going to end up with B+,六分之一的人会得到B+a sixth are going to end up with B,六分之一的人会得到Ba sixth are going to end up with B-,六分之一的人会得到B-and the remaining sixth, if I added that up right,剩下的六分之一如果我没算错are going to end up with what I guess we're now calling这些人将会得到the presidential grade, is that right?传说中的"超优"成绩That's not literally true. I'm going to squeeze it a bit,但我可能稍微压缩下分数段I'm going to curve it a bit,我会让分数段更有弹性so actually slightly fewer than a sixth will get straight A's,实际上略少于六分之一的人会得到Aand fewer than a sixth will get C's and below.略少于六分之一会得到CWe'll squeeze the middle to make them be more B's.我们这样压缩会使更多的人得到BOne thing I can guarantee from past experience in this class,不过从教学经验看is that the median grade will be a B+.中间成绩应该是B+The median will fall somewhere in the B+'s.分数中位数会在B+的附近Just as forewarning for people who have forgotten what a median is, 稍微提示下忘记中位数含义的同学that means half of you--not approximately half,这也就是说不是一半左右it means exactly half of you--是绝对一半的同学will be getting something like B+ and below会得到B+的成绩或更低的成绩and half will get something like B+ and above.另一半会到B+的成绩或更高的成绩Now, how are you doing in filling in the forms?那两表你们填得怎么样了Everyone filled them in yet?每个人都填好了吗Surely must be pretty close to getting everyone filled in.应该差不多每个人都填好了吧All right, so last things to talk about在我把表收回之前before I actually collect them in - textbooks.我再讲最后一件事课本There are textbooks for this class.这门课的课本The main textbook is this one, Dutta's book Strategy and Games.主要是这本杜塔的《策略与博弈》If you want a slightly tougher book, more rigorous book,要是你想要一本更难更有挑战的try Joel Watson's book, Strategies.试试乔尔?沃森的《策略》Both of those books are available at the bookstore.这两本书在书店都有卖的But I want to warn everybody ahead of time,不过我先提个醒I will not be following the textbook.我不会完全参照课本I regard these books as safety nets.课本只不过是救命稻草If you don't understand something that happened in class,要是你课堂上没听懂you want to reinforce an idea that came up in class,或者你课下想深入了解下课上的容then you should read the relevant chapters in the book你应该去读书上对应的章节and the syllabus will tell you which chapters to read for each class, 教学大纲会告诉你每堂课讲哪章or for each week of class, all right.或者每周讲哪几章But I will not be following these books religiously at all.但是我不会严格按照课本来讲课In fact, they're just there as back up.重申一下课本就是救命稻草In addition, I strongly recommend people read Thinking Strategically. 对了我强烈建议读一读《战略思想》This is good bedtime reading. Do any of you suffer from insomnia?这是本不错的睡前读物你们失眠吗It's very good bedtime reading if you suffer from insomnia.你要是失眠它绝对是最好的睡前读物It's a good book and what's more这也的确是本好书there's going to be a new edition of this book this year而且今年会有新版上市and Norton have allowed us to get advance copies of it.出版商会让我们提前得到新版的So if you don't buy this book this week,如果你这周不买I may be able to make the advance copy of the new可能下周你们edition available for some of you next week.就能买到新版的了I'm not taking a cut on that either, all right,我不会从中得到提成there's no money changing hands.这不是推销All right, sections are on the syllabus sign up小组讨论的安排在教学大纲上sorry on the website, sign up as usual.哦我说错了像平常一样在网上注册Put yourself down on the wait list如果你想换个小组if you don't get into the section you want.请在等候名录里预约You probably will get into the section you want once we're done.一般你只需要预约一次就可以了All right, now we must be done with the forms.你们现在应该填完了吧Are we done with the forms?都填完了吗All right, so why don't we send the T.A.S, with or without mikes,我们把表交给助教吧up and down the aisles and collect in your Game #1;让他们来帮我收一下游戏1not Game #2, just Game #1.不要游戏2 只要游戏1Just while we're doing that, I think the reputation of this class-- 收表的时候我来说说这门课的名声I think--if you look at the course evaluations online or whatever,你们要是在上看了教学评估is that this class is reasonably hard but reasonably fun.就会发现这门课有点难但也很有趣So I'm hoping that's what the reputation of the class is.我也希望这门课是这样的If you think this class is going to be easy,你要是认为这门课挺简单I think it isn't actually an easy class. It's actually quite a hard class, 这不是轻松的课事实上它有点难but I think I can guarantee it's going to be a fun class.我保证这门课一定很有趣Now one reason it's a fun class,为什么这门课很有趣is the nice thing about teaching Game Theory因为学习博弈论- quieten down folks--大家安静一下one thing about teaching Game Theory is, you get to play games,在课堂上我们通过一些游戏来学习博弈and that's exactly what we've just been doing now.我们不刚做完一个游戏吗This is our first game and we're going to play games throughout the course, 这是第一游戏整个课程充满了游戏sometimes several times a week, sometimes just once a week.有时候一周做几次有时一周一次We got all these things in?表收齐了吗Everyone handed them in?每个人都交了吧So I need to get those counted.下面要开始统计了Has anyone taken the Yale Accounting class?有人上过会计课吗No one wants to - has aspirations to be - one person has.有人想帮忙统计吗这有一个人想I'll have a T.A. do it, it's all right, we'll have a T.A. do it.我还是安排助教做吧So Kaj, can you count those for me? Is that right?卡茄你帮我统计一下好吗Let me read out the game you've just played.我们回顾一下刚才进行的游戏"Game 1, a simple grade scheme for the class."游戏1是一个简单的成绩博弈Read the following carefully.请仔细阅读以下条款Without showing your neighbor what you are doing,在不被你同桌看到的情况下put it in the box below either the letter Alpha or the letter Beta. 在方框中填写字母α或者字母βThink of this as a grade bid.把这看成成绩的赌注I will randomly pair your form with another form我会随机把你们分成两两一组and neither you nor your pair will ever know with whom you were paired. 你们不知道会跟谁分到一组Here's how the grades may be assigned for the class.按如下方法给出你们的成绩Well they won't be,纯属娱乐大家别当真If you put Alpha and you're paired with Beta,如果你选α而你对手选βthen you will get an A and your pair a C.那么你得A你对手得CIf you and your pair both put Alpha, you'll both get B-.如果你们都选α 那么你们都得B-If you put Beta and you're paired with Alpha,如果你选β你对手选αyou'll get a C and your pair an A.你得C你对手得AIf you and your pair both put Beta, then you'll both get B+."如果你们都选β 你们都得B+"So that's the thing you just filled in.这就是你们之前填的东西Now before we talk about this,在我们讨论之前let's just collect this information in a more useful way.先有效地整理一下这些信息So I'm going to remove this for now.我先把它们擦掉We'll discuss this in a second,一会再开始分析but why don't we actually record what the game is, that we're playing first.我们先记录一下这个游戏的容So this is our grade game, and what I'm going to do,就是我们的成绩博弈since it's kind of hard to absorb all the information因为这么长一段文字just by reading a paragraph of text,从中摘取信息有点困难I'm going to make a table to record the information.不如我们列表整理信息So what I'm going to do is I'm going to put me here,那么我把我写在这里and my pair, the person I'm randomly paired with here,我对手就是被随机分到的人写在这and Alpha and Beta, which are the choices I'm going to make here还有是α和β 就是我要做的选择and on the columns Alpha and Beta, the choices my pair is making.还有我对手要做的选择In this table, I'm going to put my grades.我把我的成绩填到表格里So my grade if we both put Alpha is B-, if we both put Beta, was B+. 如果我们都选α我得B- 都选β我得B+If I put Alpha and she put a Beta, I got an A,如果我选α她选β 我得Aand if I put Beta and she put an Alpha, I got a C.如果我选β她选α 那么我得CIs that correct? That's more or less right?这些都对吧Yeah, okay while we're here, why don't we do the same for my pair?好了我们再把对手的情况也写下来So this is my grades on the left hand table,我的成绩写在左边的表格里but now let's look at what my pair will do, what my pair will get.我们再看看我对手她会得到什么成绩So I should warn the people sitting at the back提醒一下坐在后面的人that my handwriting is pretty bad, that's one reason for moving forward. 我的板书写得很糟所以最好往前坐The other thing I should apologize at this stage of the class is my accent. 还有我得检讨的是我的口音I will try and improve the handwriting,我会努力写好板书there's not much I can do about the accent at this stage.但是口音暂时不太好改So once again if you both put Alpha then my pair gets a B-.如果我们都选α 我对手得B-If we both put Beta, then we both get a B+;如果我们都选β 我们都会得到B+in particular, my pair gets a B+.此情况下我对手得B+If I put Alpha and my pair puts Beta, then she gets a C.如果我选α我对手选β 那么她得CAnd if I put Beta and she puts Alpha, then she gets an A.如果我选β她选α 那么她得ASo I now have all the information由此我就把这纸上的that was on the sheet of paper that you just handed in.所有信息都包含进来了Now there's another way of organizing this that's standard in Game Theory,下面再用博弈论的标准方式来整理信息so we may as well get used to it now on the first day.从今天以后大家要习惯用这种标准Rather then drawing two different tables like this,与其画两个不同的表格what I'm going to do is I'm going to take the second table不如把第二个表格的容插入到and super-impose it on top of the first table.第一个表中Okay, so let me do that and you'll see what I mean.我来做个示你就明白我的意思了What I'm gonna do is draw a larger table, the same basic structure: 我要画个结构相同的大表格I'm choosing Alpha and Beta on the rows,用列来表示我选α还是βmy pair is choosing Alpha and Beta on the columns,用行来表示我对手选α还是βbut now I'm going to put both grades in.但我要把两人成绩写在一个单元格里So the easy ones are on the diagonal:这样就更直观了you both get B- if we both choose Alpha;如果我们都选α的话得B-we both get B+ if we both choose Beta.如果我们都选β的话得B+But if I choose Alpha and my pair chooses Beta,要是我选α我对手选βI get an A and she gets a C.我得A 她得CAnd if I choose Beta and she chooses Alpha,如果我选β她选αthen it's me who gets the C and it's her who gets the A.我得C而她得ASo notice what I did here.注意下我的表示方法The first grade corresponds to the row player, me in this case,单元格第一个成绩是我的成绩and the second grade in each box corresponds to the column player, 每个单元格第二个成绩是my pair in this case.我对手的成绩So this is a nice succinct way of这样就把前两表格的信息recording what was in the previous two tables.更简明地表述出来了This is an outcome matrix; this tells us everything that was in the game. 它是一个列出了博弈所有容的矩阵Okay, so now seems a good time to start talking about what people did. 现在我们来考虑一下我们的选择So let's just have a show of hands.我们举手表决吧How many of you chose Alpha?多少人选了α?Leave your hands up so that Jude can catch that,先不要放下手让裘德拍一下so people can see at home, okay.这样看视频的人也能看到All right and how many of you chose Beta?多少人选了β?There's far more Alphas - wave your hands the Beta's okay.远远少于选α的先不要放下手All right, there's a Beta here, okay.这里有一个选β的So it looks like a lot of - well we're going to find out,我们来数一下确认一下we're going to count--but a lot more Alpha's than Beta's.选α的人要远远多于选βLet me try and find out some reasons why people chose.我们来探讨下你们为什么这么选So let me have the Alpha's up again.来看看都谁选α了So, the woman who's in red here,穿红色衣服的女士can we get a mike to the -yeah, is it okay if we ask you?递给她一个麦克风You're not on the run from the FBI? We can ask you why?请问没被FBI通缉吧?Okay, so you chose Alpha right? So why did you choose Alpha?你选了α对吧你为什么要选α呢Realized that my partner chose alpha, therefore I chose .我感觉我对手会选α因此我也选αAll right, so you wrote out these squares,这么说你也写下了这些矩阵you realized what your partner was going to do, and responded to that. 你分析出了对手的选择并依此做出的选择Any other reasons for choosing Alpha around the room?还有别的原因使你们选α吗Can we get the woman here?把麦克风递给那位女士好吗Try not to be intimidated by these microphones,它们只不过是麦克风而已they're just mikes. It's okay.别紧The reason I chose Alpha, regardless of what my partner chose,我选α是因为不管我对手选什么I think there would be better outcomes than choosing Beta.我选α得到的成绩总会比选β的要好All right, so let me ask your names for a second-so your name was?请问你叫什么名字?Courtney.考特尼Courtney and your name was?考特尼另一位女士的名字是Clara Elise.克莱拉?埃莉斯So slightly different reasons, same choice Alpha.虽然原因不尽相同但她们都选了αClara Elise's reason - what did Clara Elise say?克莱拉?埃莉斯的理由是她怎么说She said, no matter what the other person does,她说无论别人怎么选she reckons she'd get a better grade if she chose Alpha.她觉得选α会使自己得到最优的结果So hold that thought a second, we'll come back to先保留这个说法我们稍后再做讨论is it Clara Elise, is that right?这是克莱拉?埃莉斯的理由We'll come back to Clara Elise in a second.我们一会再讨论这个理由Let's talk to the Beta's a second;我们先来讨论一下选β的理由let me just emphasize at this stage there are no wrong answers.我要强调一下这里没有错误的答案Later on in the class there'll be some questions that have wrong answers. 在以后在课上可能会有错误的答案Right now there's no wrong answers.但是现在没有错误的答案There may be bad reasons but there's no wrong answers.有不充分的理由但没有错误的理由So let's have the Beta's up again. Let's see the Beta's.请选β的再举下手看看有谁选了βOh come on! There was a Beta right here.之前这里有选β的啊You were a Beta right? You backed off the Beta, okay.你选过β对吗? 哦你改变主意了So how can I get a mike into a Beta?能不能把麦克风给一位选了β的同学Let' s stick in this aisle a bit.我们往过道这边来一点Is that a Beta right there? Are you a Beta right there?那有人选β对吗你是不是选了βCan I get the Beta in here? Who was the Beta in here?这边有人选β的吗Can we get the mike in there? Is that possible?能把麦克拿到那里吗In here - you can leave your hand so that - there we go.这有一个选β的先别放下手对了Just point towards - that's fine, just speak into it, that's fine. 把麦克对着他请讲So the reason right?为什么选β对吗Yeah, go ahead.没错继续说I personally don't like swings that much我不喜欢成绩波动很大的and it's the B-/B+ range,比如B-/B+这个围so I'd much rather prefer that to a swing from A to C,所以我还是喜欢像A到C这样小点的and that's my reason.这就是我的原因All right, so you're saying it compresses the range.你说的意思是压缩围I'm not sure it does compress the range.但是我不太确定这是否压缩了围I mean if you chose Alpha, you're swinging from A to B-;要是你选α 你的分数会在A到B-and from Beta, swinging from B+ to C.选β 分数围会在B+到CI mean those are similar kind of ranges这两个围大小差不多but it certainly is a reason.但这也算是个原因Other reasons for choosing?还有别的原因吗Yeah, the guy in blue here, yep, good.。
搏弈论
阿维纳什•迪克斯特&巴里•内尔巴夫
1搏弈是有关策略的科学。
它试图以数学和逻辑的方法来帮助搏弈者作出决策,在一系列纷繁复杂的搏弈中应采取何种策略来保证自己获得最大利益。
搏弈论研究的搏弈的范围包括了从下棋到抚育儿童,从网球竞技到公司转手。
但是所有的博弈都具有一个共同的特征:相互作用。
也就是说,每一个博弈者的博弈结果取决于所有博弈参与者的策略选择。
在零和搏弈中,搏弈者的利益之间是完全冲突的,因此一方的得利必然导致另一方的损失。
更多具有代表性的例子还有会导致共同得利(正和)搏弈和共同损失(负和)搏弈,同样的情况还会发生在另外一些冲突中。
2搏弈论研究的先驱者是普林斯顿数学家约翰•冯•诺依曼。
在早先的一段时间里,研究的重点被放在了完全冲突(零和)搏弈(非合作搏弈)上,其他的搏弈当时被认为是以合作形式出现。
也就是说,搏弈要求参与者共同地选择和实施他们的行为.最近的研究则把重点放在了那些既不属于零和搏弈也不属于绝对合作搏弈的情况上,在这些搏弈中,搏弈者自主地选择搏弈行为,但他们之间的相互关系中充满了合作与竞争。
3搏弈行为与我们在中性环境中所作的各种决定有着根本性的不同。
要说明这一点,我们可以思考一下伐木工人和军队将军所作决定之间的不同。
当伐木工人决定要如何砍树时,他不会考虑树木本身会有什么反抗,他所处的环境为中性。
而当将军决定要消灭敌军时,他必须提前预料到并消除敌军的反抗。
与这一例子中的将军相类似,一个搏弈者必须认识到他与其他机智且怀有争胜之心的竞争者之间的相互作用,他自己所作的决定也必须能够同时应对可能出现的合作或冲突。
‖
4搏弈的实质是搏弈者采取策略之间的相互依赖性。
这种策略性的相互依赖表现为两个不同的类别:连续策略之间的相互作用以及联立策略之间的相互作用。
就前者而言,搏弈者依次采取行动,每个人都会注意其他搏弈者先前的行为。
就后者而言,搏弈者同时采取行动,每个人都会忽略其他搏弈者当前的行为。
5对连续策略博弈中的某一博弈这来说,一个普遍的原则就是放眼前方,及时反思和总结。
每个博弈者应该弄清楚其他博弈者会对他当前的策略行为做出怎样的回应,他自己将如何应对等情况。
博弈者要预料到他的最初决定会最终导致何种结果,并且运用对形势的判断来计划好当前的最佳策略。
在考虑其他博弈者会如何应对时,博弈者必须能设身处地地换位思考,而不能把自己的主观判断强加与人。
6从理论上说,采取固定次序行动的任何连续博弈都可以圆满地完成。
我们可以通过预测每个可能的结果来决定各个博弈者的最佳策略。
例如象井字棋(tic-tac-toe)这样的简单游戏由于可以以这样的方式完成,因而并不具有挑战性。
但诸如象象棋等的其他博弈,即使是借助电脑的帮助,由于其本身的计算过程过于复杂而难以在实践中去实施。
因此,博弈者往往会会依据经验提前对形势作出判断并尽可能的评估最终的局面。
7与连续策略博弈的线形思维不同的是,采取共发性策略的博弈要求逻辑思维。
在忽略其他参与者当前策略的情况下,尽管博弈者们同时采取行动,每一个参与者必须清楚的意识同时还会有其他的参与者在依次关注整个博弈过程。
这时的思维模式可描述为:我想他认为我会这样考虑…。
因此,博弈者必须从全体博弈者的立场出发并努力判断出最终的博弈结果。
每个参与者的个人最佳行为都是全局谋划中不可或缺的一部分。
8运用普林斯顿数学家约翰•纳什提出的均衡概念,可以推导出这种逻辑思维的结论。
我们寻求一系列的策略组合,每个博弈者都会有自己的选择,当所有的对手们在实施他们决定的最佳策略时,我们所做的选择应该对自己是最有利的。
换句话说,每个博弈者都会对其他人的策略作出最优化的应对。
9有时, 无论其他博弈者如何行动,博弈的一方的最佳策略组合始终如一,这被称作这一博弈者的优策略。
在其他情况下,如果博弈者的策略始终于己不利,则被称作劣策略,其含义是指无论其他博弈者如何行动,对手的策略总是优于自己。
因此,谋求策略均衡应该从寻找优策略和消除劣策略开始。
10当我们把博弈的结果表述为一种均衡的时候,并不是基于以下的假设:即博弈的每个参与者的个人最佳策略将会带来共同的最优化结果。
确实也存在着一些糟糕的例子,比如囚徒困境(见下文),由于囚徒们都追求个人私利的最大化而导致了全体参与者的困境。
11纳什的关于均衡的定义还不能完全解决联立策略博弈中逻辑推理思考的问题,有些博弈包含多种此类的均衡,而有些博弈却并不包含这样的均衡。
纳什均衡也还没有清楚地说明关于导致均衡的动态过程。
尽管有这样的一些缺陷,纳什均衡的定义已被证明在分析策略性互动时具有重要作用。
‖
12以下策略性互动的实例可以说明博弈论的一些基本理论框架:
13囚徒困境。
两个嫌疑犯分别被审问,每个人都可以招供或保持沉默。
如果嫌犯A保持沉默,嫌犯B可以通过招供而获得较轻的发落。
如果嫌犯A招供,嫌犯B最好选择招供以免被从重处理。
这时招供就是嫌犯B的优策略。
同样的情况也适用与B。
因此,在均衡的情况下双方都选择招供。
如果双方均缄口不言也都可相安无事。
由于合作分裂所导致的长远性损失远比招供而获得的暂时性得利严重,所以这样的合作行为会在博弈中反复进行下去。
在这种情况下,一般推荐采取针锋相对的策略。
14混合性策略。
在一些冲突中,任何规律性的行为都会被对手发觉并利用。
因而,通过采取混合性策略来迷惑对手就显的非常重要。
我们可以从体育运动中找到一些典型的例子,比如足球运动中的特定情况下选择跑位还是传球,网球比赛中选择击打近网球还是底线球,博弈论都为提高洞察力和掌握混合性策略恰当的火候提供了参考。
15战略性行动。
博弈者可以通过运用威吓和许诺来改变其他对手对其行动的预测,从而诱使他们采取对其有利的行为或改变他们对己不利的行为。
为了取得成功,威吓和许诺必须是可信的。
这往往会导致出现一些问题,原因就在于当需要采取以上策略时,通常实施威吓和许诺而获益往往会付出较大的代价。
博弈论研究几种提高可信度的方法,一个基本的原则就是威吓和许诺要在符合博弈者利益的前提下减少自身未来行动的自由度。
通过这样的方式,博弈者可以避免自己违背承诺,或者避免产生对对手冒犯的纵容。
16例如,当柯台斯到墨西哥后,烧掉了战船,故意没有给自己留下撤军的退路。
由于没有返城的船只,柯台斯面对的只有战胜并征服对手或被对手消灭掉两种可能。
尽管他的士兵在数量上处于绝对劣势,但这种血战至死的威胁使得对手的士气变的低落,印第安军队面对这样意志坚定的对手时,他们选择了退却。
宝丽来公司在拒绝共享即时成像市场时也采取了类似的策略,当时它决定与任何挤占该市场的对手拼个你死我活。
当柯达公司试图染指即时成像业时,宝丽来调动了所有的资源进行反击。
14年后, 宝丽来公司在与柯达公司的诉讼中获胜,重新赢得了在即时成像市场的垄断地位。
17使威吓变得可信的另一个方法是运用冒险性的边缘化策略,这一策略的风险在于: 如果其他博弈者未能按照威吓的要求去做的话,结果对大家将是一损俱损。
托马斯•谢林在他的《冲突策略》一书中介绍说,边缘化策略就是故意使局势变的无法控制的策略,正是由于局势的无法收拾可能令其他对手难以接受,从而迫使对手作出妥协。
18讨价还价。
两个博弈者一起来分蛋糕,每个人都想要一个大块,双方都倾向于尽快达成一致。
当两个人依次开始提议分配时,放眼前方和反思总结的博弈原则决定了均衡的份额,双方这时可以立即达成一致,但时间的延误决定了双方的份额,竞争中缺乏耐心的一方只能分得较小的蛋糕。
19隐瞒和公开信息。
当博弈的一方掌握其他各方不了解的信息时,他会急于隐瞒这一
信息(例如纸牌游戏中抽到的牌),其他情况下他也可能想把信息令人信服的公布出来。
这两种情况下,一个基本的原则就是行胜于言。
要隐瞒信息,就要采取混合性策略,比如纸牌游戏中的作弊手法千变万化。
我们还可以回忆起温斯顿•丘吉尔关于把真相置于谎言保卫下的名言。
要公布信息,就要采用传递可信信息的策略,而且如果形势发生变化时策略也必须相应进行改变,例如,提供长期的质量保证是企业生产高质量产品信心的显示,对消费者来说也是一个可以信赖的信号。
‖
20最近博弈论研究的最新进展已经成功地解释和规范了在各种冲突和合作情况下应采取的恰当策略,然而博弈论研究仍有待进一步完善,从多角度设计成功策略也有待进一步去研究。
‖。