Hoeffding's inequality

In probability theory, Hoeffding's inequality provides an upper bound on the probability that the sum of bounded independent random variables deviates from its expected value by more than a certain amount. Hoeffding's inequality was proven by Wassily Hoeffding in 1963.
Hoeffding's inequality is a generalization of the Chernoff bound, which applies only to Bernoulli random variables, and a special case of the Azuma–Hoeffding inequality and the McDiarmid's inequality. It is similar to, but incomparable with, the Bernstein inequality, proved by Sergei Bernstein in 1923.

Special case of Bernoulli random variables

Hoeffding's inequality can be applied to the important special case of identically distributed Bernoulli random variables, and this is how the inequality is often used in combinatorics and computer science. We consider a coin that shows heads with probability and tails with probability. We toss the coin times. The expected number of times the coin comes up heads is. Furthermore, the probability that the coin comes up heads at most times can be exactly quantified by the following expression:
where is the number of heads in coin tosses.
When for some, Hoeffding's inequality bounds this probability by a term that is exponentially small in :
Similarly, when for some, Hoeffding's inequality bounds the probability that we see at least more tosses that show heads than we would expect:
Hence Hoeffding's inequality implies that the number of heads that we see is concentrated around its mean, with exponentially small tail.
For example, taking gives:

General case of bounded random variables

Let be independent random variables bounded by the interval : . We define the empirical mean of these variables by
One of the inequalities in Theorem 1 of states
where.
Theorem 2 of is a generalization of the above inequality when it is known that are strictly bounded by the intervals :
which are valid for positive values of. Here is the expected value of. The inequalities can be also stated in terms of the sum
of the random variables:
Note that the inequalities also hold when the have been obtained using sampling without replacement; in this case the random variables are not independent anymore. A proof of this statement can be found in Hoeffding's paper. For slightly better bounds in the case of sampling without replacement, see for instance the paper by.

General case of sub-Gaussian random variables

A random variable is called sub-Gaussian, if
for some c>0. For a random variable, the following norm is finite if and only if it is sub-Gaussian:
Then let be zero-mean independent sub-Gaussian random variables, the general version of the Hoeffding's inequality states that:
where c > 0 is an absolute constant. See Theorem 2.6.2 of for details.

Proof

In this section, we give a proof of Hoeffding's inequality. The proof uses Hoeffding's Lemma:
Using this lemma, we can prove Hoeffding's inequality. Suppose are independent random variables such that
Let
Then for, Markov's inequality and the independence of implies:
To get the best possible upper bound, we find the minimum of the right hand side of the last inequality as a function of. Define
Note that is a quadratic function and achieves its minimum at
Thus we get

Usage

Confidence intervals

Hoeffding's inequality is useful to analyse the number of required samples needed to obtain a confidence interval by solving the inequality in Theorem 1:
The inequality states that the probability that the estimated and true values differ by more than is bounded by e^−2nt².
Symmetrically, the inequality is also valid for another side of the difference:
By adding them both up, we can obtain two-sided variant of this inequality:
This probability can be interpreted as the level of significance for a confidence interval around of size 2:
Solving the above for gives us the following:
Therefore, we require at least samples to acquire -confidence interval.
Hence, the cost of acquiring the confidence interval is sublinear in terms of confidence level and quadratic in terms of precision.
Note that this inequality is the most conservative of the three in Theorem 1, and there are more efficient methods of estimating a confidence interval.

Popular movies

The Hunger Games (film) - 2012 American dystopian action thriller science fiction-adventure film directed by Gary Ross and based on Suzanne Collins’s 2008 novel of the same name. It is the first insta...
untitled Captain Marvel sequel - part of Marvel Cinematic Universe....
Killers of the Flower Moon (film project) - Killers of the Flower Moon - film project in United States of America. It was presented as drama, detective fiction, thriller. The film project starred Leonardo Dicaprio, Robert De Niro. Director of...
Five Nights at Freddy's (film) - Five Nights at Freddy's - film published in 2017 in United States of America. Scenarist of the film - Scott Cawthon....

Popular books

Book of Revelation - The Book of Revelation is the final book of the New Testament, and consequently is also the final book of the Christian Bible. Its title is derived from the first word of the Koine Greek text: apok...
Book of Genesis - account of the creation of the world, the early history of humanity, Israel's ancestors and the origins...
Gospel of Matthew - The Gospel According to Matthew is the first book of the New Testament and one of the three synoptic gospels. It tells how Israel's Messiah, rejected and executed in Israel, pronounces judgement on ...
Michelin Guide - Michelin Guides are a series of guide books published by the French tyre company Michelin for more than a century. The term normally refers to the annually published Michelin Red Guide , the oldest...
Psalms - The Book of Psalms , commonly referred to simply as Psalms , the Psalter or "the Psalms", is the first book of the Ketuvim , the third section of the Hebrew Bible, and thus a book of th...
Ecclesiastes - Ecclesiastes is one of 24 books of the Tanakh , where it is classified as one of the Ketuvim . Originally written c. 450–200 BCE, it is also among the canonical Wisdom literature of the Old Tes...
The 48 Laws of Power - non-fiction book by American author Robert Greene. The book...

Popular television series

The Crown (TV series) - historical drama web television series about the reign of Queen Elizabeth II, created and principally written by Peter Morgan, and produced by Left Bank Pictures and Sony Pictures Tel...
Friends - American sitcom television series, created by David Crane and Marta Kauffman, which aired on NBC from September 22, 1994, to May 6, 2004, lasting ten seasons. With an ensemble cast sta...
Young Sheldon - spin-off prequel to The Big Bang Theory and begins with the character Sheldon...
Modern Family - American television mockumentary family sitcom created by Christopher Lloyd and Steven Levitan for the American Broadcasting Company. It ran for eleven seasons, from September 23...
Loki (TV series) - upcoming American web television miniseries created for Disney+ by Michael Waldron, based on the Marvel Comics character of the same name. It is set in the Marvel Cinematic Universe, shar...
Game of Thrones - American fantasy drama television series created by David Benioff and D. B. Weiss for HBO. It...
Shameless (American TV series) - American comedy-drama television series developed by John Wells which debuted on Showtime on January 9, 2011. It...