Optimal pure strategies of players. The concept of pure and mixed strategies

A mixed strategy SA of player A is the use of pure strategies A1, A2, ..., Am with probabilities p1, p2, ..., pi, ..., pm and the sum of the probabilities is equal to 1: Mixed strategies of player A are written in the form of a matrix or in the form of a string SA = (p1, p2, ..., pi, ..., pm) Similarly, mixed strategies of player B are denoted: , or, SB = (q1, q2, ..., qi, ..., qn ), where the sum of the probabilities of the appearance of strategies is equal to 1: Pure strategies can be considered a special case of mixed ones and can be specified by a line in which 1 corresponds to a pure strategy. Based on the minimax principle, the optimal solution (or solution) of the game is determined: this is a pair of optimal strategies S*A , S*B in the general case mixed, having the following property: if one of the players adheres to his optimal strategy, then it cannot be profitable for the other to deviate from his. The payoff corresponding to the optimal solution is called the game price v. The price of the game satisfies the inequality: ? ? v? ? (3.5) where? And? - lower and upper prices of the game. The following fundamental theorem of game theory is true - Neumann's theorem. Every finite game has at least one optimal solution, possibly among mixed strategies. Let S*A = (p*1, p*2, ..., p*i, ..., p*m) and S*B = (q*1, q*2, ..., q* i, ..., q*n) is a pair of optimal strategies. If a pure strategy is included in an optimal mixed strategy with a nonzero probability, then it is called active. The theorem about active strategies is true: if one of the players adheres to his optimal mixed strategy, then the payoff remains unchanged and equal to the cost of the game v, if the second player does not go beyond the limits of his active strategies. This theorem is of great practical importance - it provides specific models for finding optimal strategies in the absence of a saddle point. Consider a game of size 2x2, which is the simplest case of a finite game. If such a game has a saddle point, then the optimal solution is a pair of pure strategies corresponding to this point. A game in which there is no saddle point, in accordance with the main theorem of game theory, the optimal solution exists and is determined by a pair of mixed strategies S*A = (p*1, p*2) and S*B = (q*1, q*2) . In order to find them, we use the theorem on active strategies. If player A adheres to his optimal strategy S "A, then his average payoff will be equal to the cost of the game v, no matter what active strategy player B uses. For a 2x2 game, any pure strategy of the opponent is active if there is no saddle point. The gain of player A (loss of player B) is a random variable, the mathematical expectation (average value) of which is the price of the game. Therefore, the average payoff of player A (optimal strategy) will be equal to v for both the 1st and 2nd strategy of the opponent. Let the game be given by the payoff matrix. The average payoff of player A, if he uses the optimal mixed strategy, and player B uses the pure strategy B1 (this corresponds to the 1st column of the payoff matrix P), is equal to the price of the game v: a11 p*1+ a21 p*2 = v. Player A gets the same average payoff if player 2 uses strategy B2, i.e. a12 p*1+ a22 p*2= v. Considering that p*1+ p*2= 1, we obtain a system of equations for determining the optimal strategy S"A and the game price v: (3.6) Solving this system, we obtain the optimal strategy (3.7) and the game price (3.8) Applying the theorem active strategies when finding SВ* - the optimal strategy of player B, we find that for any pure strategy of player A (A1 or A2), the average loss of player B is equal to the price of the game v, i.e. (3.9) Then the optimal strategy is determined by the formulas: (3.10 )

A two-player zero-sum matrix game can be thought of as the following abstract two-player game.

The first player has m strategies i = 1,2,...,m, the second has n strategies j= 1,2,...,n. Each pair of strategies ( i, j) matched number A ij , expressing the gain of player 1 at the expense of player 2 if the first player accepts his i- yu strategy, and 2 – his own j th strategy.

Each player makes one move: player 1 chooses his i th strategy ( i= ), 2– yours j th strategy ( j=
), after which player 1 receives the winnings A ij at the expense of player 2 (if A ij < 0, then this means that player 1 pays the second amount | A ij|). This is where the game ends.

Each player strategy i=
;
j =
often called pure strategy.

If we consider the matrix

A=

then conducting each game of a matrix game with a matrix A comes down to player 1's choice i th row, and player 2 j th column and player 1 (at the expense of player 2) receives winnings a ij .

The main thing in the study of games is the concept of optimal strategies of players. This concept intuitively has the following meaning: a player’s strategy is optimal if the use of this strategy provides him with the largest guaranteed win for all possible strategies of the other player. Based on these positions, player 1 examines the payoff matrix A as follows: for each value i (i =
) the minimum winning value is determined depending on the strategies used by player 2

A ij (i=
)

those. the minimum winnings for player 1 are determined, provided that he accepts his i th pure strategy, then from these minimal payoffs such a strategy is found i = i O, at which this minimum gain will be maximum, i.e. located


A ij =
=(1)

Definition . Number , defined by formula (1) is called lower net price games and shows what minimum winnings player 1 can guarantee for himself by applying his pure strategies for all possible actions of player 2.

Player 2, with his optimal behavior, should strive, if possible, through his strategies, to minimize the payoff of player 1. Therefore, for player 2, we find

A ij

those. the max win of player 1 is determined, provided that player 2 uses his j th pure strategy, then player 2 looks for his own j = j 1 a strategy in which player 1 will receive a min win, i.e. finds


a ij =
=(2).

Definition . Number , determined by formula (2), is called net top price games and shows what maximum winnings player 1 can guarantee for himself due to his strategies.

In other words, using his pure strategies, player 1 can ensure a win no less than , and player 2, through the use of his pure strategies, can prevent player 1 from winning more than .

Definition . If in a game with matrix A =then they say that this game has saddle point in pure strategies and net price games

 = =.

Saddle point is a pair of pure strategies (i O , j O ) respectively, players 1 and 2, at which equality is achieved =. This concept has the following meaning: if one of the players adheres to the strategy corresponding to the saddle point, then the other player cannot do better than to adhere to the strategy corresponding to the saddle point. Mathematically, this can be written another way:


Where i, j– any pure strategies of players 1 and 2, respectively; (i O , j O ) – strategies that form a saddle point.

Thus, based on (3), the saddle element
is minimal in the i o -th row and maximum in the j o -th column in matrix A. Finding the saddle point of matrix A occurs as follows: in matrix A sequentially in each line find the minimum element and check whether this element is the maximum in its column. If yes, then it is a saddle element, and the pair of strategies corresponding to it forms a saddle point. A couple of pure strategies (i O , j O ) players 1 and 2, forming a saddle point and a saddle element
, called game solution . Wherein i O And j O are called optimal clean strategies respectively players 1 and 2.

Example 1

The saddle point is the pair ( i O = 3;j O= 1), at which = == 2.

Note that although the payoff in situation (3;3) is also equal to 2 = =, it is not a saddle point, because this winning is not the maximum among the winnings of the third column.

Example 2

From the analysis of the payoff matrix it is clear that
, i.e. this matrix does not have a saddle point. If player 1 chooses his pure maximin strategy i = 2, then player 2, choosing his minimax j= 2, only 20 will lose. In this case, it is profitable for player 1 to choose strategy i = 1, i.e. deviate from his pure maximin strategy and win 30. Then it will be profitable for player 2 to choose strategy j = 1, i.e. deviate from his pure minimax strategy and lose 10. In turn, player 1 must choose his 2nd strategy to win 40, and player 2 will respond by choosing his 2nd strategy, etc.

Let's look at an example. Let the game matrix (4) be given:

We need to find the lower price of the game α, the upper price of the game β and the minimax strategies and check whether they are stable. Solution. From the analysis of the additional column and row we get: α = 5, β = 5. Maximin is equal to minimax! This is a special case. What follows from this? Let's take a couple of minimax strategies: K 2 and C 3. If both stick to these strategies, then the payoff will be 5. Now, let's say we have learned about the enemy's behavior. What do we do? Nothing! We will continue to adhere to the K2 strategy, because any deviation from it is unprofitable for us. Whether we know or don’t know about the enemy’s behavior, we will still stick to the K 2 strategy! The same applies to the “blues” - there is no point in them changing their C 3 strategy. In this example, the pair of strategies K 2 and C 3 is stable, that is, it represents an equilibrium position and provides a solution to the game. Why did this happen? Because there is a special element 5 in the matrix; it is minimal in its row and at the same time maximal in its column. Such an element is called saddle point. If the matrix has a saddle point (that is, the lower price of the game is equal to the upper), then the game has a solution in pure strategies: this is a pair of strategies intersecting at the saddle point. The saddle point itself gives the price of the game - in our example it is equal to 5. The class of games that have a saddle point is of great importance in game theory. In particular, it has been proven that if, according to the rules of the game, each player knows the result of all previous moves, both his own and his opponent’s (the so-called game with complete information), then the game has a saddle point and, therefore, has a solution in pure strategies. Examples of games with complete information include: chess, checkers, tic-tac-toe, etc. Let us give an example of a game with complete information, the solution of which is easy to find. Two players - K and C - alternately place identical coins on the round table. The position of each coin is chosen arbitrarily, as long as it does not overlap with others. The last player to place a coin wins (when there is no room left for others). It is worth a little thought to make sure that the outcome of this game is always a foregone conclusion and that there is a well-defined strategy that guarantees a win for the player who puts in the coin first (let it be K). Namely, K must place the first coin in the center of the table, and then for each move C must respond with a move exactly symmetrical relative to the center of the table! Poor S can behave as he pleases, but there is still no escape for him... Obviously, such a game makes sense only for those who do not know the solution. It is curious that exactly the same thing is true with such a popular game as chess! This game only makes sense until a solution is found. It has been theoretically proven that a solution exists and the outcome of a chess game is essentially predetermined: if each side uses its own optimal strategy, then the game will either always end in a win for White, or always in a win for Black, or always in a draw! But what exactly? We don’t know this yet, since the number of possible strategies is too large to be able to construct a matrix of a chess game and find a saddle point in it... Probably, chess fans are interested in the fact that the chess game will not be solved soon. Let us note in conclusion that there can be not one, but several saddle points in the matrix; then there are as many solutions to the game in pure strategies as there are saddle points. Each of them gives a win equal to the price of the game.

Among finite games of practical importance, games with a saddle point are relatively rare; A more typical case is when the lower and upper price of the game are different. Analyzing the matrices of such games, we came to the conclusion that if each player is given a choice

one - only strategy., then, counting on a rationally acting enemy, this choice should be determined by the minimax principle. By adhering to our maximin strategy, regardless of the enemy’s behavior, we obviously guarantee ourselves a win equal to the lower price of game a. A natural question arises: is it possible to guarantee an average payoff greater than a if you use not just one “pure” strategy, but randomly alternate several strategies?

Such combined strategies, consisting of the use of several pure strategies, alternating according to a random law with a certain frequency ratio, are called mixed strategies in game theory.

Obviously, each pure strategy is a special case of a mixed one, in which all strategies except one are applied with zero frequencies, and this one with frequency 1.

It turns out that, using not only pure, but also mixed strategies, it is possible for each finite game to obtain a solution, i.e., a pair of such (in the general case mixed) strategies such that when both players use them, the payoff will be equal to the price of the game, and when Any one-sided deviation from the optimal strategy can only change the payoff in a direction unfavorable for the deviant.

The stated statement constitutes the content of the so-called fundamental theorem of game theory. This theorem was first proven by von Neumann in 1928. Known proofs of the theorem are relatively complex; Therefore, we will give only its formulation.

Every finite game has at least one solution (possibly in the realm of mixed strategies).

The payoff resulting from the decision is called the cost of the game. From the main theorem it follows that every finite game has a price. Obviously, the price of game v always lies between the lower price of game a and the upper price of game :

Indeed, there is a maximum guaranteed win that we can ensure for ourselves by using only our pure strategies. Since mixed strategies include, as a special case, all pure ones, then, allowing, in addition to pure ones, also mixed ones

strategy, we, in any case, do not worsen our capabilities; hence,

Similarly, considering the capabilities of the enemy, we will show that

whence the required inequality (3.1) follows.

Let us introduce a special notation for mixed strategies. If, for example, our mixed strategy consists of using AL strategies, with frequencies and we will denote this strategy

Similarly, we will denote the enemy’s mixed strategy:

where are the frequencies at which strategies are mixed

Let us assume that we have found a solution to the game consisting of two optimal mixed strategies S, S. In the general case, not all pure strategies available to a given player are included in his optimal mixed strategy, but only some. We will call the strategies included in a player’s optimal mixed strategy his “useful” strategies.

It turns out that the solution to the game has another remarkable property: if one of the players adheres to its optimal mixed strategy 5 (5). then the payoff remains unchanged and equal to the price of the game v, regardless of what the other player does, if he. just doesn’t go beyond its “useful” strategies. For example, he can use any of his “useful” strategies in its pure form, and can also mix them in any proportions.

Let's prove this statement. Let there be a solution to the game. To be specific, we will assume that the optimal mixed strategy consists of a mixture of three

"useful" strategies accordingly consists of a mixture of three "useful" strategies

Moreover, it is stated that if we adhere to strategy S, then the opponent can use strategies in any proportions, and the payoff will remain unchanged and will still be equal to the cost of the game

5. THEORY OF GAMES AND STATISTICAL DECISIONS

5.1. Zero-sum matrix game

Economic and mathematical modeling is carried out under the following conditions:

Certainty;

Uncertainties.

Modeling in conditions of certainty assumes the availability of all the initial regulatory data necessary for this (matrix modeling, network planning and management).

Modeling at risk carried out under stochastic uncertainty, when the values ​​of some initial data are random and the laws of probability distribution of these random variables are known (regression analysis, queuing theory).

Modeling in conditions of uncertainty corresponds to the complete absence of some data necessary for this (game theory).

Mathematical models for making optimal decisions in conflict situations are built under conditions of uncertainty.

Game theory operates with the following basic concepts:

Strategy;

Win function.

On the move we will call the choice and implementation by the player of one of the actions provided for by the rules of the game.

Strategy - this is a technology for choosing an action option at each move, depending on the current situation.

Win function serves to determine the amount of payment from the losing player to the winning one.

In a matrix game, the payoff function is represented as payment matrix :

where is the amount of payment to player I, who chose the move, from player II, who chose the move.

In such a paired game, the values ​​of the payoff functions of both players in each situation are equal in magnitude and opposite in sign, i.e. and this game is called zero sum .

The process of "playing the matrix game" is represented as follows:

The payment matrix is ​​set;

Player I, regardless of player II, chooses one of the rows of this matrix, for example, -th;

Player II, regardless of player I, chooses one of the columns of this matrix, for example, - th;

The matrix element determines how much player I will receive from player II. Of course, if , then we are talking about the actual loss of player I.

We will call an antagonistic paired game with a payoff matrix a game.

Example

Let's consider the game.

The payment matrix is ​​set:

.

Let player I, independently of player II, choose the 3rd row of this matrix, and player II, independently of player I, choose the 2nd column of this matrix:

Then player I will receive 9 units from player II.

5.2. Optimal pure strategy in a matrix game

Optimal strategy is called a strategy of player I in which he will not reduce his winnings for any choice of strategy by player II, and such a strategy of player II in which he will not increase his loss for any choice of strategy by player I.

By choosing the th row of the payoff matrix as a move, player I ensures that he will win no less than the value in the worst case, when player II tries to minimize this value. Therefore, player I will choose the th row that will provide him with the maximum winnings:

.

Player II argues similarly and can certainly ensure a minimal loss:

.

The inequality is always true:

The quantity is called lower price of the game .

The quantity is called top price of the game .

Optimal strategies are called clean , if the equalities hold for them:

,

.

The quantity is called at the pure price of the game , If .

Optimal pure strategies form saddle point payment matrix.

For a saddle point the following conditions are met:

that is, the element is the smallest in the row and the largest in the column.

Thus, if the payoff matrix has saddle point , then you can find optimal pure strategies players.

The pure strategy of player I can be represented by an ordered set of numbers (vector), in which all numbers are equal to zero, except for the number in the -th place, which is equal to one.

Player II's pure strategy can be represented by an ordered set of numbers (a vector) in which all numbers are equal to zero, except for the number in the -th place, which is equal to one.

Example

.

By choosing any row of the payoff matrix as a move, player I ensures that in the worst case the winnings are no less than the value in the column marked:

Therefore, player I will choose the 2nd row of the payment matrix, which provides him with the maximum winnings regardless of the move of player II, who will try to minimize this value:

Player II reasons similarly and chooses the 1st column as his move:

Thus, there is a saddle point of the payment matrix:

corresponding to the optimal pure strategy for player I and for player II, in which player I will not reduce his winnings for any change in strategy by player II and player II will not increase his loss for any change in strategy by player I.

5.3. Optimal mixed strategy in a matrix game

If the payoff matrix does not have a saddle point, then it is irrational for any player to use one pure strategy. It is more profitable to use "probability mixtures" pure strategies. Then mixed strategies are identified as optimal.

Mixed strategy a player is characterized by the probability distribution of a random event consisting in the choice of a move by this player.

The mixed strategy of player I is such an ordered set of numbers (vector) that satisfies two conditions:

1) for , i.e. the probability of choosing each row of the payment matrix is ​​non-negative;

2), i.e., the choice of each of the rows of the payment matrix together represents a complete group of events.

Player II's mixed strategy will be an ordered set of numbers (vector) satisfying the conditions:

Payment amount to player I, who chose a mixed strategy

from player II, who chose a mixed strategy

,

represents the average value

.

Optimal called mixed strategies

And ,

if for any arbitrary mixed strategies and the following condition is satisfied:

i.e., with an optimal mixed strategy, player I’s gain is the greatest, and player II’s loss is the least.

If there is no saddle point in the payment matrix, then

,

i.e. there is a positive difference ( unallocated difference )

- ³ 0,

and players need to look for additional opportunities to confidently receive a larger share of this difference in their favor.

Example

Consider the game defined by the payoff matrix:

.

Let's determine whether there is a saddle point:

, .

It turns out that there is no saddle point in the payment matrix and the undistributed difference is equal to:

.

5.4. Finding optimal mixed strategies

for games 2x2

Determination of optimal mixed strategies for a payment matrix of dimension is carried out by finding the optimal points of a function of two variables.

Let the probability of player I choosing the first row of the payment matrix

equal to . Then the probability of choosing the second row is equal to .

Let the probability of player II choosing the first column be equal to . Then the probability of choosing the second column is equal to .

The amount of payment to player I by player II is equal to:

The extreme value of player I's gain and player II's loss corresponds to the following conditions:

;

.

Thus, the optimal mixed strategies of players I and II are respectively equal to:

5.5. Geometric solution of games 2×n

As the dimension of the payment matrix increases from to, it is no longer possible to reduce the determination of optimal mixed strategies to finding the optimum of a function of two variables. However, given that one of the players has only two strategies, a geometric solution can be used.

The main stages of finding a solution to the game are as follows.

Let us introduce a coordinate system on the plane. Let's plot the segment on the axis. We draw perpendiculars from the left and right ends of this segment.


The left and right ends of the unit segment correspond to two strategies and available to player I. We will plot the winnings of this player on the drawn perpendiculars. For example, for the payment matrix


such payoffs of player I when choosing a strategy will be and , and when choosing a strategy they will be and .

Let us connect by straight line segments the winning points of player I, corresponding to the strategies of player II. Then the formed broken line, limiting the graph from below, determines the lower limit of the payoff of player I.



Finding the optimal mixed strategy of player I

,

which corresponds to the point on the lower bound of player I's payoff with the maximum ordinate.

Let us pay attention to the fact that in the example under consideration, using only two strategies and , corresponding to straight lines intersecting at the found point on the lower boundary of the payoff of player I, player II can prevent player I from getting a larger payoff.

Thus, the game is reduced to a game and the optimal mixed strategy of player II in the example under consideration will be

,

where the probability is the same as in the game:

5.6. Game solvingm× n

If a matrix game does not have a solution in pure strategies (i.e., there is no saddle point) and, due to the large dimension of the payoff matrix, cannot be solved graphically, then to obtain a solution, use linear programming method .

Let a payment matrix of dimension be given:

.

Need to find probabilities , with which player I must choose his moves so that this mixed strategy guarantees him a win of no less than the value regardless of the choice of moves by player II.

For each move chosen by player II, the payoff of player I is determined by the dependencies:

Let us divide both sides of the inequalities by and introduce new notations:

Equality

Will take the form:

Since player I seeks to maximize the payoff, the inverse must be minimized. Then the linear programming problem for player I will take the form:

under restrictions

The problem for player II is constructed similarly as dual:

under restrictions

Solving problems using the simplex method, we get:

,

5.7. Features of solving matrix games

Before solving the problem of finding optimal strategies, two conditions should be checked:

Is it possible to simplify the payment matrix;

Does the payment matrix have a saddle point?

Let's consider the possibility of simplifying the payment matrix:

Due to the fact that player I strives to get the largest win, the th row can be deleted from the payment matrix, since he will never use this move if the following relation is satisfied with any other row:

Similarly, striving for the smallest loss, player II will never choose the -th column in the payoff matrix as a move, and this column can be crossed out if the following relation is satisfied with any other -th column:

The simplest solution to the game is the presence of a saddle point in the simplified payment matrix, which meets the following condition (by definition):

Example

The payment matrix is ​​given:

.

Simplification of the payment matrix:

Presence of a saddle point:

5.8. Playing with nature

Unlike game theory problems in problems of statistical decision theory an uncertain situation does not have an antagonistic conflict connotation and depends on objective reality, which is usually called "nature" .

In matrix games with nature, player II is a set of uncertain factors that influence the effectiveness of decisions made.

Matrix games with nature differ from ordinary matrix games only in that when choosing an optimal strategy for player I, it is no longer possible to rely on the fact that player II will strive to minimize his loss. Therefore, along with the payment matrix, we introduce risk matrix :

where is the amount of risk of player I when using a move in conditions, equal to the difference between the payoff that player I would receive if he knew that the condition would be established, i.e. , and the winnings that he will receive, not knowing when choosing a move that the condition will be established.

Thus, the payment matrix is ​​uniquely transformed into a risk matrix, but the inverse transformation is ambiguous.

Example

Winning matrix:

.

Risk matrix:

Possible two problem statements about choosing a solution in a matrix game with nature :

Maximizing winnings;

Minimizing risk.

The decision-making problem can be posed for one of two conditions:

- at risk , when the probability distribution function of nature's strategies is known, for example, the random variable of the occurrence of each of the expected specific economic situations;

- in conditions of uncertainty , when such a probability distribution function is unknown.

5.9. Solving problems in statistical decision theory

at risk

When making decisions under risk conditions, player I knows the probabilities the onset of states of nature.

Then it is advisable for player I to choose the strategy for which the average winning value taken by line is maximum :

.

When solving this problem with a risk matrix, we obtain the same solution, corresponding minimum average risk :

.

5.10. Solving problems in statistical decision theory

in conditions of uncertainty

When making decisions under conditions of uncertainty, you can use the following criteria :

Maximum Wald criterion;

Savage's minimum risk criterion;

Hurwitz's criterion of pessimism - optimism;

Laplace's principle of insufficient reason.

Let's consider maximin Wald test .

The game with nature is played as with a reasonable aggressive opponent, i.e., a reinsurance approach is taken from the position of extreme pessimism for the payment matrix:

.

Let's consider Savage's minimum risk criterion .

An approach similar to the previous one from the position of extreme pessimism for the risk matrix:

.

Let's consider Hurwitz criterion of pessimism - optimism .

The opportunity is offered not to be guided by either extreme pessimism or extreme optimism:

where is the degree of pessimism ;

at - extreme optimism,

at - extreme pessimism.

Let's consider Laplace's principle of insufficient reason .

It is assumed that all states of nature are equally probable:

,

.

Conclusions on the fifth section

In a matrix game, two players participate and the payoff function, which serves to determine the amount of payment from the losing player to the winning player, is represented in the form of a payment matrix. It was agreed that player I chooses one of the rows of the payment matrix as a move, and player II chooses one of its columns. Then at the intersection of the selected rows and columns of this matrix there is a numerical value of the payment to player I from player II (if this value is positive, then player I really won, and if it is negative, then player II essentially won).

If there is a saddle point in the payoff matrix, then the players have optimal pure strategies, i.e., to win, each of them must repeat his one optimal move. If there is no saddle point, then to win, each of them must use the optimal mixed strategy, that is, use a mixture of moves, each of which must be made with optimal probability.

Finding optimal mixed strategies for 2x2 games is done by calculating optimal probabilities using known formulas. Using the geometric solution of 2×n games, determining optimal mixed strategies in them is reduced to finding optimal mixed strategies for 2×2 games. To solve m×n games, the linear programming method is used to find optimal mixed strategies in them.

Some payment matrices can be simplified, as a result of which their dimension is reduced by removing rows and columns corresponding to unpromising moves.

If player II is a set of uncertain factors that depend on objective reality and do not have an antagonistic conflict overtones, then such a game is called a game with nature, and problems from the theory of statistical decisions are used to solve it. Then, along with the payment matrix, a risk matrix is ​​introduced and two formulations of the problem of choosing a solution in a matrix game with nature are possible: maximizing gain and minimizing risk.

Solving problems of the theory of statistical decisions under risk conditions shows that it is advisable for player I to choose the strategy for which the average value (mathematical expectation) of winnings, taken from a row of the payment matrix, is maximum, or (which is the same thing) the average value (mathematical expectation) of risk , taken by row of the risk matrix, is minimal. When making decisions under conditions of uncertainty, the following criteria are used: Wald's maximin criterion, Savage's minimum risk criterion, Hurwitz's pessimism-optimism criterion, Laplace's principle of insufficient reason.

Self-test questions

How are the basic concepts of game theory defined: move, strategy and payoff function?

What is the payoff function represented in a matrix game?

Why is a matrix game called zero-sum?

What is the process of playing a matrix game like?

What game is called the m×n game?

What matrix game strategy is called optimal?

What is the optimal strategy for a matrix game called pure?

What does the saddle point of the payment matrix mean?

What is the optimal strategy for a matrix game called mixed?

What does a player's mixed strategy look like?

What is the amount of payment to player I from player II who chose mixed strategies?

What mixed strategies are called optimal?

What does undistributed difference mean?

What method is used to find optimal mixed strategies for 2x2 games?

How are optimal mixed strategies found for 2×n games?

What method is used to find optimal mixed strategies for m×n games?

What are the features of solving matrix games?

What does simplification of the payment matrix mean and under what conditions can it be implemented?

Which matrix game is easier to solve when the payoff matrix has or does not have a saddle point?

What game theory problems are related to statistical decision theory problems?

How does the payment matrix translate into a risk matrix?

What two formulations of the problem of choosing solutions are possible in a matrix game with nature?

For what two conditions can decision-making problems be posed in a matrix game with nature?

What strategy is appropriate for player I to choose when solving a statistical decision theory problem under risk conditions?

What decision-making criteria can be used when solving statistical decision theory problems under conditions of uncertainty?

Examples of problem solving

1. The payment matrix indicates the amount of profit of the enterprise when it sells different types of products (columns) depending on established demand (rows). It is necessary to determine the optimal strategy of the enterprise for the production of different types of products and the corresponding maximum (on average) income from their sales.

Let us denote the given matrix by and introduce the variables . We will also use the matrix (vector). Then and , i.e. .

The inverse matrix is ​​calculated:

The values ​​are found:

.

Probabilities are calculated:

The average sales income is determined:

.

2. The Pharmacist company is a manufacturer of medicines and biomedical products in the region. It is known that the peak demand for some drugs occurs in the summer (cardiovascular drugs, analgesics), for others - in the autumn and spring (anti-infective, antitussive).

Costs per 1 standard unit units products for September-October were: for the first group (cardiovascular drugs and analgesics) – 20 rubles; for the second group (anti-infectious, antitussive drugs) – 15 rubles.

According to observations over the past few years, the company's marketing service has established that it can sell 3050 conventional units during the two months under consideration in warm weather conditions. units products of the first group and 1100 conventional units. units products of the second group; in cold weather conditions - 1525 arb. units products of the first group and 3690 conventional units. units second group.

In connection with possible changes in the weather, the task is set to determine the company's product production strategy that ensures maximum sales income at a selling price of 40 rubles. for 1 standard unit units products of the first group and 30 rub. - second group.

SOLUTION. The company has two strategies:

The weather will be warm this year;

The weather will be cold.

If the company adopts the strategy and in reality there is warm weather (nature strategy), then the manufactured products (3050 standard units of drugs of the first group and 1100 standard units of the second group) will be fully sold and the income will be

3050×(40-20)+1100×(30-15)=77500 rub.

In cool weather conditions (nature's strategy), the drugs of the second group will be sold in full, and the first group only in the amount of 1525 conventional units. units and some of the drugs will remain unsold. The income will be

1525×(40-20)+1100×(30-15)-20×()=16500 rub.

Likewise, if the form adopts the strategy and the weather is actually cold, then the income will be

1525×(40-20)+3690×(30-15)=85850 rub.

In warm weather, income will be

1525×(40-20)+1100×(30-15)-() ×15=8150 rub.

Considering the company and the weather as two players, we obtain the payment matrix

,

The price of the game is in the range

From the payment matrix it is clear that under all conditions the company’s income will be no less than 16,500 rubles, but if weather conditions coincide with the chosen strategy, then the company’s income can be 77,500 rubles.

Let's find a solution to the game.

Let us denote the probability of a firm using a strategy by , and a strategy by , and . Solving the game graphically, we get , while the price of the game is p.

The optimal plan for the production of drugs will be

Thus, it is advisable for the company to produce 2379 conventional units during September and October. units drugs of the first group and 2239.6 conventional units. units drugs of the second group, then in any weather she will receive an income of at least 46,986 rubles.

In conditions of uncertainty, if it is not possible for a company to use a mixed strategy (agreements with other organizations), we use the following criteria to determine the optimal strategy of the company:

Walde criterion:

Hurwitz criterion: for definiteness we accept , then for the company’s strategy

for strategy

It is advisable for a company to use a strategy.

Savage criterion. The maximum element in the first column is 77500, in the second column - 85850.

The elements of the risk matrix are found from the expression

,

where , ,

The risk matrix looks like

,

It is advisable to use the or strategy.

Therefore, it is advisable for the company to use the or strategy.

Let us note that each of the considered criteria cannot be considered completely satisfactory for the final choice of decisions, however, their joint analysis allows us to more clearly imagine the consequences of making certain management decisions.

Given a known probability distribution of various states of nature, the decision criterion is the maximum mathematical expectation of winning.

Let it be known for the problem under consideration that the probabilities of warm and cold weather are equal and amount to 0.5, then the optimal strategy of the company is determined as follows:

It is advisable for the company to use the or strategy.

Tasks for independent work

1. An enterprise can produce three types of products (A, B and C), while receiving a profit depending on demand. Demand, in turn, can take one of four states (I, II, III and IV). In the following matrix, the elements characterize the profit that the enterprise will receive when releasing the -th product and the -th state of demand:

"Pure" strategies

We are already familiar with jambs. However, what happens if the jambs are removed from the chain of any strategy? We will get a “pure strategy”. Pure strategies are those in the chain of actions of which, from the very root to the effective part, there are no ineffective substrategies (jambs), and this can often only be evidenced by the presence of all links in consciousness.

Of course, from the point of view of all possible outcomes of using a strategy, it is difficult for us to talk about the most effective one, since we may simply not have certain experience, and therefore certain intermediate strategies, but it is precisely from the point of view of our experience that the strategy should be as effective as possible.

The concept of pure strategies is also one of the key ones in these materials, so I will give an example:

Evening. You are in your native area and rushing home. The milk runs away. Flying past “a lot of suspicious types” you hear addressed to you: “Hey, you, [cut out by censorship]. Don’t walk here, it’ll snow on your head!”

What will you do? There can be many options. Someone will go to sort things out, someone will get scared and speed up their pace, someone will shout something back. However, let's think about what the pure strategy of behavior is in this case?

A person you don’t know shouts something to you on the street. You have your own affairs, which you actually go about. Judging by the text, positive benefits for you from communicating with this person are unlikely. The logical conclusion: calmly move on with your business. I draw attention to the fact that it is “calm”, without a shadow of negative emotions, but with a healthy indifference to what is happening. How many people would do that? I'm guessing the vast minority. Why?

Because most people have a whole layer of subconscious strategies tied in lower layers to self-preservation, in particular these can be: “Always respond to rudeness with rudeness”, “If someone says nasty things, then you need to run”, “If someone if he’s rude, you need to punch him in the face”, “If someone is rude, then there is danger”, and the like in different variations. Of course, not everyone will take any active action, but it will affect almost everyone emotionally. And this is a jamb.

Pure strategies are always emotionally neutral or positive, and this is embedded in your brain, you just have to use it.

You can read a little about pure strategies in the notes “Why pure strategies?” and "House, Hopkins, etc."

From the book Strategies of Geniuses. Albert Einstein by Dilts Robert

Strategies 1. Definition of the term “strategy”: a) Comes from the Greek word “strategos”, meaning: “military leader”, “the science, art of warfare”, “the art of leading social, political struggle”. b) A detailed plan for achieving a goal or advantageous

From the book Strategies of Geniuses (Aristotle Sherlock Holmes Walt Disney Wolfgang Amadeus Mozart) by Dilts Robert

From the book Can You Study Well?! A useful book for careless students author Karpov Alexey

STRATEGIES Your studies will take place at a completely different level of quality if you think and choose an action strategy. Strategy is an overall plan. This is a general line taking into account real conditions. These are goals, deadlines, taking into account unpredictability and diversity... This is the very feeling of the pulse

From the book Strategy of Mind and Success author Antipov Anatoly

From the book Emotional Intelligence by Daniel Goleman

IQ and Emotional Intelligence: Pure Types IQ and emotional intelligence are not in opposition, but rather separate competencies. We all combine intelligence with acute experience; people with high

From the book 12 Christian beliefs that can drive you crazy by Townsend John

Right Intentions or Pure Thoughts Right intention is the decision to do the right thing. We choose a good action that pleases God, usually without thinking about whether we really want to do it. We just do it - that's all. Many evangelical preachers

From the book Entering Life: Collection author author unknown

Rudolf Ivanovich ABEL: “REMEMBER HOW DZERZHINSKY SAID: “CLEAN HANDS, A COOL HEAD AND A WARM HEART...” Rudolf Ivanovich Abel spent more than thirty years working in Soviet intelligence. He was awarded the Order of Lenin, two Orders of the Red Banner, the Order of Labor

From the book Homo Sapiens 2.0 [Homo Sapiens 2.0 http://hs2.me] by Sapiens Homo

Strategies

From the book Homo Sapiens 2.0 by Sapiens 2.0 Homo

“Clean” strategies We are already familiar with the jambs. However, what happens if the jambs are removed from the chain of any strategy? We will get a “pure strategy”. Pure strategies are those in the chain of actions of which, starting from the very root to the effective part, there are no

From the book Get Started. Punch fear in the face, stop being “normal” and do something worthwhile. by Acuff John

From the book Man as an Animal author Nikonov Alexander Petrovich

Strategies General concept of strategies In principle, everyone understands to one degree or another what a strategy is. Possessing a certain set of knowledge obtained as a result of acquiring and processing experience, we build certain models of behavior. Strategy is a model for achieving a goal.

From the book Turn on your working memory to its full potential by Alloway Tracy

Why pure strategies? The lion's share of the material in this project constantly points out that it is necessary to use pure strategies for rewriting and be sure to look for a jamb based on them. This point is not obvious at first glance and

From the book Introvert in an Extroverted World author Romantseva Elizaveta

From the author's book

From the author's book

Strategies Computer strategies require concentration from the player, the ability to plan their actions and solve various problems. Recent research suggests that strategies can help improve the cognitive skills of players of all ages. According to

From the author's book

Pure types There is such a concept - “pure psychological type”. Actually, there is a concept, but there are practically no objects, that is, people who ideally fit this concept. There are no pure introverts and pure extroverts. Moreover, we agreed