In computational complexity theory, Yao's principle states that the expected cost of a randomized algorithm on the worst-case input is no better than the expected cost for a worst-case probability distribution on the inputs of the deterministic algorithm that performs best against that distribution. Thus, to establish a lower bound on the performance of randomized algorithms, it suffices to find an appropriate distribution of difficult inputs, and to prove that no deterministic algorithm can perform well against that distribution. This principle is named after Andrew Yao, who first proposed it. Yao's principle may be interpreted in game theoretic terms, via a two-playerzero-sum game in which one player, Alice, selects a deterministic algorithm, the other player, Bob, selects an input, and the payoff is the cost of the selected algorithm on the selected input. Any randomized algorithm R may be interpreted as a randomized choice among deterministic algorithms, and thus as a strategy for Alice. By von Neumann'sminimax theorem, Bob has a randomized strategy that performs at least as well against R as it does against the bestpure strategy Alice might choose; that is, Bob's strategy defines a distribution on the inputs such that the expected cost of R on that distribution is no better than the expected cost of any single deterministic algorithm against the same distribution.
Statement
The formulation below states the principle for Las Vegas randomized algorithms, i.e., distributions over deterministic algorithms that are correct on every input but have varying costs. It is straightforward to adapt the principle to Monte Carlo algorithms, i.e., distributions over deterministic algorithms that have bounded costs but can be incorrect on some inputs. Consider a problem over the inputs, and let be the set of all possible deterministic algorithms that correctly solve the problem. For any algorithm and input, let be the cost of algorithm run on input. Let be a probability distribution over the algorithms, and let denote a random algorithm chosen according to. Let be a probability distribution over the inputs, and let denote a random input chosen according to. Then, That is, the worst-case expected cost of the randomized algorithm is at least the cost of the best deterministic algorithm against input distribution.