Probability mass function


In probability and statistics, a probability mass function is a function that gives the probability that a discrete random variable is exactly equal to some value. Sometimes it is also known as the discrete density function. The probability mass function is often the primary means of defining a discrete probability distribution, and such functions exist for either scalar or multivariate random variables whose domain is discrete.
A probability mass function differs from a probability density function in that the latter is associated with continuous rather than discrete random variables. A PDF must be integrated over an interval to yield a probability.
The value of the random variable having the largest probability mass is called the mode.

Formal definition

Probability mass function is the probability distribution of a discrete random variable, and provides the possible values and their associated probabilities. It is the function defined by
for, where is a probability measure. can also be simplified as.
The probabilities associated with each possible values must be positive and sum up to 1. For all other values, the probabilities need to be 0.
Thinking of probability as mass helps to avoid mistakes since the physical mass is conserved as is the total probability for all hypothetical outcomes.

Measure theoretic formulation

A probability mass function of a discrete random variable can be seen as a special case of two more general measure theoretic constructions:
the distribution of and the probability density function of with respect to the counting measure. We make this more precise below.
Suppose that is a probability space
and that is a measurable space whose underlying σ-algebra is discrete, so in particular contains singleton sets of. In this setting, a random variable is discrete provided its image is countable.
The pushforward measure —called a distribution of in this context—is a probability measure on whose restriction to singleton sets induces a probability mass function since for each.
Now suppose that is a measure space equipped with the counting measure μ. The probability density function of with respect to the counting measure, if it exists, is the Radon–Nikodym derivative of the pushforward measure of , so and is a function from to the non-negative reals. As a consequence, for any we have
demonstrating that is in fact a probability mass function.
When there is a natural order among the potential outcomes, it may be convenient to assign numerical values to them and to consider also values not in the image of. That is, may be defined for all real numbers and for all as shown in the figure.
The image of has a countable subset on which the probability mass function is one. Consequently, the probability mass function is zero for all but a countable number of values of.
The discontinuity of probability mass functions is related to the fact that the cumulative distribution function of a discrete random variable is also discontinuous. If is a discrete random variable, then means that the casual event is certain ; on the contrary, means that the casual event is always impossible. This statement isn't true for a continuous random variable, for which for any possible : in fact, by definition, a continuous random variable can have an infinite set of possible values and thus the probability it has a single particular value x is equal to. Discretization is the process of converting a continuous random variable into a discrete one.

Examples

Finite

There are three major distributions associated, the Bernoulli distribution, the Binomial distribution and the geometric distribution.
Other distributions that can be modeled using a probability mass function are the Categorical distribution and the multinomial distribution.
Two or more discrete random variables have a joint probability mass function, which gives the probability of each possible combination of realizations for the random variables.