Probability space

In probability theory, a probability space or a probability triple is a mathematical construct that provides a formal model of a random process or "experiment". For example, one can define a probability space which models the throwing of a die.
A probability space consists of three elements:

A sample space, Ω, which is the set of all possible outcomes.
An event space, ℱ, which is a set of events, an event being a set of outcomes in the sample space.
A probability function, P, which assigns to each event in the event space a probability, which is a number between 0 and 1.

In order to provide a sensible model of probability, these elements must satisfy a number of axioms, detailed in the article.
In the example of the throw of a standard die, we would take the sample space to be {1, 2, 3, 4, 5, 6}. For the event space, we could simply use the set of all subsets of the sample space, which would then contain simple events such as {5}, as well as complex events such as {2, 4, 6}. Finally, for the probability function, we would map each event to the number of outcomes in that event divided by 6. So, for example, {5} would be mapped to 1/6, and {2, 4, 6} would be mapped to 3/6 = 1/2.
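The die model can be written out directly. The following is a minimal sketch, not a canonical implementation; the names omega, power_set, events and prob are chosen here for illustration only.

```python
from fractions import Fraction
from itertools import chain, combinations

# Sample space for one throw of a standard die.
omega = frozenset({1, 2, 3, 4, 5, 6})

def power_set(s):
    """Return every subset of s as a frozenset (the event space 2^Ω)."""
    items = sorted(s)
    subsets = chain.from_iterable(
        combinations(items, r) for r in range(len(items) + 1)
    )
    return {frozenset(c) for c in subsets}

# Event space: all subsets of the sample space.
events = power_set(omega)

def prob(event):
    """Probability function: number of outcomes in the event divided by 6."""
    return Fraction(len(event), len(omega))

print(len(events))                 # 64 events in total
print(prob(frozenset({5})))        # 1/6
print(prob(frozenset({2, 4, 6})))  # 1/2
```

Exact fractions are used instead of floats so that the probabilities can be compared for equality.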
When an experiment is conducted, we imagine that "nature" "selects" a single outcome, ω, from the sample space Ω. All the events in the event space that contain the selected outcome ω are said to "have occurred". This "selection" happens in such a way that were the experiment repeated many times, the number of occurrences of each event, as a fraction of the total number of experiments, would tend towards the probability assigned to that event by the probability function P.
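The frequency interpretation can be illustrated by simulation. This is only a sketch: a pseudo-random generator stands in for "nature", and the function name relative_frequency, the seed and the number of trials are assumptions made for the example.

```python
import random

def relative_frequency(event, trials, seed=0):
    """Fraction of simulated die throws whose outcome lies in the event."""
    rng = random.Random(seed)  # fixed seed so the run is reproducible
    hits = sum(1 for _ in range(trials) if rng.randint(1, 6) in event)
    return hits / trials

# Event "the die lands on an even number"; the model assigns it 1/2.
freq = relative_frequency({2, 4, 6}, trials=100_000)
print(freq)  # expected to be close to 0.5
```

With more trials the printed fraction should settle closer to the assigned probability, although any single run only approximates it.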
The Russian mathematician Andrey Kolmogorov introduced the notion of a probability space, together with other axioms of probability, in the 1930s. In modern probability theory there are a number of alternative approaches to axiomatization, for example the algebra of random variables.

Introduction

A probability space is a mathematical triplet (Ω, ℱ, P) that presents a model for a particular class of real-world situations. As with other models, its author ultimately defines which elements Ω, ℱ, and P will contain.

The sample space Ω is the set of all possible outcomes. An outcome is the result of a single execution of the model. Outcomes may be states of nature, possibilities, experimental results and the like. Every instance of the real-world situation must produce exactly one outcome. If outcomes of different runs of an experiment differ in any way that matters, they are distinct outcomes. Which differences matter depends on the kind of analysis we want to do. This leads to different choices of sample space.
The σ-algebra ℱ is a collection of all the events we would like to consider. This collection may or may not include each of the elementary events. Here, an "event" is a set of zero or more outcomes, i.e., a subset of the sample space. An event is considered to have "happened" during an experiment when the outcome of the latter is an element of the event. Since the same outcome may be a member of many events, it is possible for many events to have happened given a single outcome. For example, when the trial consists of throwing two dice, the set of all outcomes with a sum of 7 pips may constitute an event, whereas outcomes with an odd number of pips may constitute another event. If the outcome is the element of the elementary event of two pips on the first die and five on the second, then both of the events, "7 pips" and "odd number of pips", are said to have happened.
The probability measure P is a function returning an event's probability. A probability is a real number between zero and one. Thus P: ℱ → [0, 1] is a function. The probability measure function must satisfy two simple requirements: First, the probability of a countable union of mutually exclusive events must be equal to the countable sum of the probabilities of each of these events. For example, the probability of the union of the mutually exclusive events {H} (heads) and {T} (tails) in the random experiment of one coin toss, P({H} ∪ {T}), is the sum of the probability for {H} and the probability for {T}, P({H}) + P({T}). Second, the probability of the sample space Ω must be equal to 1. In the previous example the probability of the set of outcomes, P({H, T}), must be equal to one, because it is entirely certain that the outcome will be either H or T in a single coin toss.
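The two-dice example above, in which one outcome makes several events happen at once, can be sketched as follows. The dictionary events and the helper events_that_happened are illustrative names, and the third event ("12 pips") is added here only for contrast.

```python
from itertools import product

# Sample space: ordered pairs (first die, second die).
omega = set(product(range(1, 7), repeat=2))

# A few events, each a subset of the sample space.
events = {
    "7 pips": {o for o in omega if sum(o) == 7},
    "odd number of pips": {o for o in omega if sum(o) % 2 == 1},
    "12 pips": {o for o in omega if sum(o) == 12},  # added for contrast
}

def events_that_happened(outcome):
    """Names of all listed events that contain the given outcome."""
    return sorted(name for name, event in events.items() if outcome in event)

# Two pips on the first die and five on the second.
print(events_that_happened((2, 5)))  # ['7 pips', 'odd number of pips']
```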

Not every subset of the sample space must necessarily be considered an event: some of the subsets are simply not of interest, others cannot be "measured". This is not so obvious in a case like a coin toss. In a different example, one could consider javelin throw lengths, where the events typically are intervals like "between 60 and 65 meters" and unions of such intervals, but not sets like the "irrational numbers between 60 and 65 meters".

Definition

In short, a probability space is a measure space such that the measure of the whole space is equal to one.
The expanded definition is the following: a probability space is a triple (Ω, ℱ, P) consisting of:

the sample space Ω: an arbitrary non-empty set,
the σ-algebra ℱ ⊆ 2^Ω: a set of subsets of Ω, called events, such that:
* ℱ contains the sample space: Ω ∈ ℱ,
* ℱ is closed under complements: if A ∈ ℱ, then also (Ω ∖ A) ∈ ℱ,
* ℱ is closed under countable unions: if Aᵢ ∈ ℱ for i = 1, 2, ..., then also (⋃_{i=1}^∞ Aᵢ) ∈ ℱ
** The corollary from the previous two properties and De Morgan’s law is that ℱ is also closed under countable intersections: if Aᵢ ∈ ℱ for i = 1, 2, ..., then also (⋂_{i=1}^∞ Aᵢ) ∈ ℱ
the probability measure P: ℱ → [0, 1]: a function on ℱ such that:
* P is countably additive: if {Aᵢ}_{i=1}^∞ ⊆ ℱ is a countable collection of pairwise disjoint sets, then P(⋃_{i=1}^∞ Aᵢ) = Σ_{i=1}^∞ P(Aᵢ),
* the measure of the entire sample space is equal to one: P(Ω) = 1.
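For a finite sample space the axioms can be checked mechanically. The sketch below is illustrative only: the function names are invented for this example, and it relies on the assumption that the family of events is finite, so that closure under pairwise unions already gives closure under countable unions, and pairwise additivity gives countable additivity.

```python
from fractions import Fraction
from itertools import combinations

def is_sigma_algebra(omega, family):
    """Check the three sigma-algebra axioms for a finite family of sets."""
    omega = frozenset(omega)
    family = {frozenset(a) for a in family}
    if omega not in family:                          # contains the sample space
        return False
    if any(omega - a not in family for a in family):  # closed under complements
        return False
    # Finite family: pairwise unions suffice for countable unions.
    return all(a | b in family for a, b in combinations(family, 2))

def is_probability_measure(omega, family, P):
    """Check P(Ω) = 1, values in [0, 1], and additivity on disjoint events.

    Assumes family has already passed is_sigma_algebra.
    """
    omega = frozenset(omega)
    family = {frozenset(a) for a in family}
    if P(omega) != 1:
        return False
    if any(not 0 <= P(a) <= 1 for a in family):
        return False
    return all(P(a | b) == P(a) + P(b)
               for a, b in combinations(family, 2) if not (a & b))

# One flip of a fair coin.
coin = {"H", "T"}
coin_events = [set(), {"H"}, {"T"}, {"H", "T"}]
fair = lambda a: Fraction(len(a), 2)

print(is_sigma_algebra(coin, coin_events))              # True
print(is_probability_measure(coin, coin_events, fair))  # True
print(is_sigma_algebra(coin, [set(), {"H"}, coin]))     # False: {"T"} missing
```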

Discrete case

Discrete probability theory needs only at most countable sample spaces Ω. Probabilities can be ascribed to points of Ω by the probability mass function p: Ω → [0, 1] such that Σ_{ω∈Ω} p(ω) = 1. All subsets of Ω can be treated as events. The probability measure takes the simple form
P(A) = Σ_{ω∈A} p(ω) for all A ⊆ Ω.
The greatest σ-algebra ℱ = 2^Ω describes the complete information. In general, a σ-algebra ℱ ⊆ 2^Ω corresponds to a finite or countable partition Ω = B₁ ⊔ B₂ ⊔ ..., the general form of an event A ∈ ℱ being A = B_{k₁} ⊔ B_{k₂} ⊔ .... See also the examples.
The case p(ω) = 0 is permitted by the definition, but rarely used, since such ω can safely be excluded from the sample space.
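In the discrete case the measure of an event is simply a sum of point masses. A minimal sketch follows, assuming a hypothetical loaded die whose probability mass function was made up for this example; the names pmf and measure are likewise illustrative.

```python
from fractions import Fraction

# Hypothetical loaded die: face 6 comes up half the time.
pmf = {1: Fraction(1, 10), 2: Fraction(1, 10), 3: Fraction(1, 10),
       4: Fraction(1, 10), 5: Fraction(1, 10), 6: Fraction(1, 2)}

# A valid probability mass function sums to 1 over the sample space.
assert sum(pmf.values()) == 1

def measure(event):
    """P(A): sum of p(ω) over the outcomes ω in the event A."""
    return sum((pmf[w] for w in event), Fraction(0))

print(measure({2, 4, 6}))   # 7/10
print(measure(set()))       # 0
print(measure(pmf.keys()))  # 1
```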

General case

If Ω is uncountable, it may still happen that p(ω) ≠ 0 for some ω, where p(ω) denotes the probability P({ω}) of the single outcome ω; such ω are called atoms. They are an at most countable set, whose probability is the sum of probabilities of all atoms. If this sum is equal to 1 then all other points can safely be excluded from the sample space, returning us to the discrete case. Otherwise, if the sum of probabilities of all atoms is between 0 and 1, then the probability space decomposes into a discrete part and a non-atomic part.

Non-atomic case

If p(ω) = 0 for all ω ∈ Ω, then the discrete-case formula P(A) = Σ_{ω∈A} p(ω) fails: the probability of a set is not necessarily the sum over the probabilities of its elements, as summation is only defined for countable numbers of elements. This makes the probability space theory much more technical. A formulation stronger than summation, measure theory, is applicable. Initially the probabilities are ascribed to some “generator” sets. Then a limiting procedure allows assigning probabilities to sets that are limits of sequences of generator sets, or limits of limits, and so on. All these sets together form the σ-algebra ℱ. For technical details see Carathéodory's extension theorem. Sets belonging to ℱ are called measurable. In general they are much more complicated than generator sets, but much better behaved than non-measurable sets.

Complete probability space

A probability space (Ω, ℱ, P) is said to be a complete probability space if for all B ∈ ℱ with P(B) = 0 and all A ⊂ B one has A ∈ ℱ. Often, the study of probability spaces is restricted to complete probability spaces.

Examples

Discrete examples

Example 1

If the experiment consists of just one flip of a fair coin, then the outcome is either heads or tails: Ω = {H, T}. The σ-algebra ℱ = 2^Ω contains 2² = 4 events, namely: {H}, {T}, ∅, and {H, T}; in other words, ℱ = {∅, {H}, {T}, {H, T}}. There is a fifty percent chance of tossing heads and fifty percent for tails, so the probability measure in this example is P(∅) = 0, P({H}) = 0.5, P({T}) = 0.5, P({H, T}) = 1.

Example 2

The fair coin is tossed three times. There are 8 possible outcomes: Ω = {HHH, HHT, HTH, HTT, THH, THT, TTH, TTT}. The complete information is described by the σ-algebra ℱ = 2^Ω of 2⁸ = 256 events, where each of the events is a subset of Ω.
Alice knows the outcome of the second toss only. Thus her incomplete information is described by the partition Ω = A₁ ⊔ A₂ = {HHH, HHT, THH, THT} ⊔ {HTH, HTT, TTH, TTT}, where ⊔ is the disjoint union, and the corresponding σ-algebra ℱ_Alice = {∅, A₁, A₂, Ω}. Bryan knows only the total number of tails. His partition contains four parts: Ω = B₀ ⊔ B₁ ⊔ B₂ ⊔ B₃ = {HHH} ⊔ {HHT, HTH, THH} ⊔ {TTH, THT, HTT} ⊔ {TTT}; accordingly, his σ-algebra ℱ_Bryan contains 2⁴ = 16 events.
The two σ-algebras are incomparable: neither ℱ_Alice ⊆ ℱ_Bryan nor ℱ_Bryan ⊆ ℱ_Alice; both are sub-σ-algebras of 2^Ω.
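The σ-algebras of Alice and Bryan can be generated from their partitions by taking all unions of blocks. The following sketch does this for the three-toss example; the helper name sigma_algebra_from_partition is an assumption of this illustration.

```python
from itertools import combinations, product

# All 8 outcomes of three coin tosses, e.g. "HTH".
omega = {"".join(t) for t in product("HT", repeat=3)}

def sigma_algebra_from_partition(blocks):
    """All unions of blocks of a finite partition (including the empty union)."""
    blocks = [frozenset(b) for b in blocks]
    events = set()
    for r in range(len(blocks) + 1):
        for chosen in combinations(blocks, r):
            events.add(frozenset().union(*chosen))
    return events

# Alice knows only the second toss.
alice_blocks = [{w for w in omega if w[1] == "H"},
                {w for w in omega if w[1] == "T"}]
# Bryan knows only the total number of tails (0, 1, 2 or 3).
bryan_blocks = [{w for w in omega if w.count("T") == k} for k in range(4)]

f_alice = sigma_algebra_from_partition(alice_blocks)
f_bryan = sigma_algebra_from_partition(bryan_blocks)

print(len(f_alice), len(f_bryan))              # 4 16
print(f_alice <= f_bryan, f_bryan <= f_alice)  # False False
```

A partition with k blocks yields 2^k events, which matches the counts 2² = 4 and 2⁴ = 16 given above.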

Example 3

If 100 voters are to be drawn randomly from among all voters in California and asked whom they will vote for governor, then the set of all sequences of 100 Californian voters would be the sample space Ω. We assume that sampling without replacement is used: only sequences of 100 different voters are allowed. For simplicity an ordered sample is considered, that is, a sequence (Alice, Bryan) is different from (Bryan, Alice). We also take for granted that each potential voter knows exactly his/her future choice, that is, he/she doesn’t choose randomly.
Alice knows only whether or not Arnold Schwarzenegger has received at least 60 votes. Her incomplete information is described by the σ-algebra ℱ_Alice that contains: the set of all sequences in Ω where at least 60 people vote for Schwarzenegger; the set of all sequences where fewer than 60 vote for Schwarzenegger; the whole sample space Ω; and the empty set ∅.
Bryan knows the exact number of voters who are going to vote for Schwarzenegger. His incomplete information is described by the corresponding partition Ω = B₀ ⊔ B₁ ⊔ ... ⊔ B₁₀₀ and the σ-algebra ℱ_Bryan consists of 2¹⁰¹ events.
In this case Alice’s σ-algebra is a subset of Bryan’s: ℱ_Alice ⊂ ℱ_Bryan. Bryan’s σ-algebra is in turn a subset of the much larger “complete information” σ-algebra 2^Ω consisting of 2^(n(n−1)⋯(n−99)) events, where n is the number of all potential voters in California.

Non-atomic examples

Example 4

A number between 0 and 1 is chosen at random, uniformly. Here Ω = [0,1], ℱ is the σ-algebra of Borel sets on Ω, and P is the Lebesgue measure on [0,1].
In this case the open intervals of the form (a, b), where 0 < a < b < 1, could be taken as the generator sets. Each such set can be ascribed the probability of P((a, b)) = b − a, which generates the Lebesgue measure on [0,1], and the Borel σ-algebra on Ω.
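The assignment P((a, b)) = b − a can be checked approximately by sampling. This is a Monte Carlo sketch rather than a construction of the Lebesgue measure; the function name, the seed and the sample size are assumptions of the example.

```python
import random

def estimate_interval_probability(a, b, trials=200_000, seed=0):
    """Estimate P((a, b)) for a uniform draw from the unit interval."""
    rng = random.Random(seed)  # fixed seed so the run is reproducible
    # rng.random() draws from [0, 1); single points carry no probability
    # in the model, so open versus closed endpoints make no difference.
    hits = sum(1 for _ in range(trials) if a < rng.random() < b)
    return hits / trials

est = estimate_interval_probability(0.2, 0.5)
print(est)  # expected to be close to 0.5 - 0.2 = 0.3
```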

Example 5

A fair coin is tossed endlessly. Here one can take Ω = {0,1}^∞, the set of all infinite sequences of numbers 0 and 1. Cylinder sets may be used as the generator sets. Each such set describes an event in which the first n tosses have resulted in a fixed sequence, and the rest of the sequence may be arbitrary. Each such event can be naturally given the probability of 2⁻ⁿ.
These two non-atomic examples are closely related: a sequence (x₁, x₂, ...) ∈ {0,1}^∞ leads to the number 2⁻¹x₁ + 2⁻²x₂ + ... ∈ [0,1]. This is not a one-to-one correspondence between {0,1}^∞ and [0,1] however: it is an isomorphism modulo zero, which allows for treating the two probability spaces as two forms of the same probability space. In fact, all non-pathological non-atomic probability spaces are the same in this sense. They are so-called standard probability spaces. Basic applications of probability spaces are insensitive to standardness. However, non-discrete conditioning is easy and natural on standard probability spaces; otherwise it becomes obscure.
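The link between the two examples can be made concrete: under the binary-expansion map, a cylinder set fixed by its first n tosses lands on a dyadic interval of length 2⁻ⁿ, the same value as its probability. The sketch below computes that interval; the function name prefix_to_interval is invented for this illustration.

```python
from fractions import Fraction

def prefix_to_interval(bits):
    """Endpoints of the interval of numbers 2^-1*x1 + 2^-2*x2 + ...
    whose sequence (x1, x2, ...) starts with the given bits."""
    low = sum((Fraction(b, 2 ** (k + 1)) for k, b in enumerate(bits)),
              Fraction(0))
    # The remaining tosses contribute anything from 0 up to 2^-n.
    return low, low + Fraction(1, 2 ** len(bits))

low, high = prefix_to_interval([1, 0, 1])
print(low, high)   # 5/8 3/4
print(high - low)  # 1/8, i.e. 2^-3
```

Neighbouring intervals share an endpoint, for instance 1/2 arises both from 1000... and from 0111..., which is why the correspondence is one-to-one only up to a set of probability zero.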

Related concepts

Probability distribution

Any probability distribution defines a probability measure.

Random variables

A random variable X is a measurable function X: Ω → S from the sample space Ω to another measurable space S called the state space.
If A ⊂ S, the notation Pr(X ∈ A) is a commonly used shorthand for P({ω ∈ Ω : X(ω) ∈ A}).
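On a finite space this shorthand can be evaluated by summing over the preimage of A. The sketch below takes two fair dice as the underlying probability space and the sum of pips as the random variable X; these choices and the name pr_X_in are assumptions made for illustration.

```python
from fractions import Fraction
from itertools import product

# Underlying probability space: two fair dice, all 36 outcomes equally likely.
omega = list(product(range(1, 7), repeat=2))
P = {w: Fraction(1, 36) for w in omega}

def X(w):
    """Random variable: total number of pips; the state space S is {2, ..., 12}."""
    return w[0] + w[1]

def pr_X_in(A):
    """Pr(X ∈ A), computed as the measure of the preimage of A under X."""
    return sum((P[w] for w in omega if X(w) in A), Fraction(0))

print(pr_X_in({7}))     # 1/6
print(pr_X_in({2, 3}))  # 1/12
```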

Defining the events in terms of the sample space

If Ω is countable we almost always define ℱ as the power set of Ω, i.e. ℱ = 2^Ω, which is trivially a σ-algebra and the biggest one we can create using Ω. We can therefore omit ℱ and just write (Ω, P) to define the probability space.
On the other hand, if Ω is uncountable and we use ℱ = 2^Ω we get into trouble defining our probability measure P because ℱ is too “large”, i.e. there will often be sets to which it will be impossible to assign a unique measure. In this case, we have to use a smaller σ-algebra ℱ, for example the Borel algebra of Ω, which is the smallest σ-algebra that makes all open sets measurable.

Conditional probability

Kolmogorov’s definition of probability spaces gives rise to the natural concept of conditional probability. Every set A with non-zero probability (that is, P(A) > 0) defines another probability measure
P(B | A) = P(B ∩ A) / P(A)
on the space. This is usually pronounced as the “probability of B given A”.
For any event B such that P(B) > 0 the function Q defined by Q(A) = P(A | B) for all events A is itself a probability measure.
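A small numerical illustration on one throw of a fair die is given below. It is a sketch under that assumed model; the names prob, conditional, even and high are chosen for the example.

```python
from fractions import Fraction

omega = frozenset(range(1, 7))  # one throw of a fair die

def prob(event):
    """Uniform probability measure on the die outcomes."""
    return Fraction(len(event & omega), len(omega))

def conditional(b, a):
    """P(B | A) = P(B ∩ A) / P(A), defined only when P(A) > 0."""
    if prob(a) == 0:
        raise ValueError("conditioning event must have non-zero probability")
    return prob(b & a) / prob(a)

even = frozenset({2, 4, 6})
high = frozenset({4, 5, 6})

print(conditional(high, even))   # 2/3
print(conditional(omega, even))  # 1, so the conditioned measure is normalized
```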

Independence

Two events, A and B, are said to be independent if P(A ∩ B) = P(A)P(B).
Two random variables, X and Y, are said to be independent if any event defined in terms of X is independent of any event defined in terms of Y. Formally, they generate independent σ-algebras, where two σ-algebras G and H, which are subsets of ℱ, are said to be independent if any element of G is independent of any element of H.
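The product condition for events can be tested directly on a finite space. The following sketch uses two fair dice; the three events and the helper independent are illustrative choices, not part of the definition.

```python
from fractions import Fraction
from itertools import product

omega = frozenset(product(range(1, 7), repeat=2))  # two fair dice

def prob(event):
    """Uniform probability measure on the 36 outcomes."""
    return Fraction(len(event), len(omega))

def independent(a, b):
    """True when P(A ∩ B) = P(A) P(B)."""
    return prob(a & b) == prob(a) * prob(b)

first_even = frozenset(w for w in omega if w[0] % 2 == 0)
sum_is_7 = frozenset(w for w in omega if sum(w) == 7)
sum_is_8 = frozenset(w for w in omega if sum(w) == 8)

print(independent(first_even, sum_is_7))  # True:  3/36 == 1/2 * 1/6
print(independent(first_even, sum_is_8))  # False: 3/36 != 1/2 * 5/36
```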

Mutual exclusivity

Two events, A and B are said to be mutually exclusive or disjoint if the occurrence of one implies the non-occurrence of the other, i.e., their intersection is empty. This is a stronger condition than the probability of their intersection being zero.
If A and B are disjoint events, then P(A ∪ B) = P(A) + P(B). This extends to a (finite or countably infinite) sequence of events. However, the probability of the union of an uncountable set of events is not the sum of their probabilities. For example, if Z is a normally distributed random variable, then P(Z = x) is 0 for any x, but P(Z ∈ ℝ) = 1.
The event A∩B is referred to as “A and B”, and the event A∪B as “A or B”.
