30 November 2010

Prisoner's Dilemma: Game Theory for Noobs

The Prisoner's Dilemma is a fairly famous scenario in game theory. The dilemma occurs when two alleged criminals are captured by the authorities. Both suspects are handled by some tough customers in their own holding cells, but the police do not have enough evidence to put either of them away for good just yet. In order for either of the suspects to go to jail for a significant time, the other has to deliver testimony. The authorities are stuck between a rock and a hard place, so they come up with a plea bargain.

Should you take the plea bargain?

The plea bargain gives each captured suspect a few options:
  • Deliver testimony while your partner keeps silent: get out of jail free and leave your partner rotting in jail for a full decade.

  • Both keep silent. This is the best joint outcome, as the police wouldn't have enough on you to put either of you away for good, but both of you would still spend 6 months in prison.

  • Spill the beans, have your partner spill the beans and share the decade between you. Both of you would thus spend 5 years imprisoned. Time to learn the harmonica and get used to orange jumpsuits.

  • Keep silent while your partner delivers testimony. This is the mirror image of the first option: your partner walks free and you are the one rotting in jail for a full decade.

Should you take the plea bargain, or hope that your partner in crime shuts up too?

Greed is good for you

It turns out that the best course of action is to cooperate with the police and leave your friend to maybe rot in jail. You might think that being selfish and greedy is bad and that you should consider other people's feelings. If you do think so, you are wrong in this case. Wrong because defecting from your partner and cooperating with the police is a dominant strategy. A dominant strategy is one that leaves you better off than any alternative no matter what the other player does, regardless of how you feel about it. Why is this?

To see why, it is often helpful to construct a normal form (or payoff matrix) representation of the game. This is the normal form of our prisoner's dilemma:

[Figure: payoff matrix (normal form) of the prisoner's dilemma]
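For those who prefer code to pictures, the same matrix can be sketched in Python. This is only a rough illustration built from the sentences in the plea bargain above: the names `YEARS`, `TESTIFY`, `SILENT` and `format_matrix` are made up for this example, and the 6 months are written as 0.5 years.

```python
# Years in prison as (you, partner), keyed by (your move, partner's move).
# The names here are illustrative, not taken from any library.
TESTIFY, SILENT = "testify", "silent"

YEARS = {
    (TESTIFY, SILENT): (0, 10),    # you walk, partner gets the full decade
    (SILENT, SILENT): (0.5, 0.5),  # six months each
    (TESTIFY, TESTIFY): (5, 5),    # the decade is shared
    (SILENT, TESTIFY): (10, 0),    # mirror image of the first entry
}


def format_matrix(years):
    """Return one readable line per cell of the payoff matrix."""
    lines = []
    for mine in (TESTIFY, SILENT):
        for theirs in (TESTIFY, SILENT):
            you, partner = years[(mine, theirs)]
            lines.append(
                f"you {mine:8} partner {theirs:8} -> "
                f"you {you} yrs, partner {partner} yrs"
            )
    return lines


for line in format_matrix(YEARS):
    print(line)
```

Each key is a pair of moves and each value is a pair of sentences, so swapping the two moves simply swaps the two sentences.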

I found the normal form of the prisoner's dilemma confusing, because it seems unnatural to me to label cooperation with the authorities as cooperation from the viewpoint of allegedly hardened criminals. Surely, cooperation should refer to honour amongst thieves and defection to cooperation with the authorities? (That is in fact how most textbooks label it.) Regardless, cooperation here means betraying your accomplice's friendship and signing whatever the police put in front of you. To defect means to keep your mouth shut lest you sleep with the fishes.

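To see why cooperation with the police is clearly a dominant strategy, compare the numbers for each player one case at a time. I have colour coded them for your benefit. If your partner keeps silent, talking gets you 0 years instead of 6 months. If your partner talks, talking gets you 5 years instead of 10. Since most people would like to spend the least amount of time possible in prison, the choice with the lower number in every case is the dominant strategy.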
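If you would rather let a computer do the comparing, here is a minimal Python sketch of that check. It assumes the sentences from the plea bargain above (with 6 months written as 0.5 years); `MY_YEARS`, `MOVES` and `is_dominant` are names invented for this example.

```python
# Hypothetical names for illustration only.
TESTIFY, SILENT = "testify", "silent"
MOVES = (TESTIFY, SILENT)

# Your own years in prison, keyed by (your move, partner's move).
MY_YEARS = {
    (TESTIFY, SILENT): 0,
    (SILENT, SILENT): 0.5,
    (TESTIFY, TESTIFY): 5,
    (SILENT, TESTIFY): 10,
}


def is_dominant(move, my_years=MY_YEARS, moves=MOVES):
    """True if `move` means strictly less prison time than every
    other move, whatever the partner chooses."""
    others = [m for m in moves if m != move]
    return all(
        my_years[(move, theirs)] < my_years[(other, theirs)]
        for theirs in moves
        for other in others
    )


for move in MOVES:
    print(move, "is dominant:", is_dominant(move))
```

The check never looks at what your partner is likely to do. It only asks whether one move beats the other in both cases, which is exactly what makes a strategy dominant.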

[Image: Danica McKellar]

Unless of course you are lucky enough to get locked up with Danica McKellar and she could explain mathematics to you.
