Generalized normal distribution


The generalized normal distribution or generalized Gaussian distribution is either of two families of parametric continuous probability distributions on the real line. Both families add a shape parameter to the normal distribution. To distinguish the two families, they are referred to below as "version 1" and "version 2". However this is not a standard nomenclature.

Version 1

Known also as the exponential power distribution, or the generalized error distribution, this is a parametric family of symmetric distributions. It includes all normal and Laplace distributions, and as limiting cases it includes all continuous uniform distributions on bounded intervals of the real line.
This family includes the normal distribution when and it includes the Laplace distribution when. As, the density converges pointwise to a uniform density on.
This family allows for tails that are either heavier than normal or lighter than normal. It is a useful way to parametrize a continuum of symmetric, platykurtic densities spanning from the normal to the uniform density, and a continuum of symmetric, leptokurtic densities spanning from the Laplace to the normal density.

Parameter estimation

Parameter estimation via maximum likelihood and the method of moments has been studied. The estimates do not have a closed form and must be obtained numerically. Estimators that do not require numerical calculation have also been proposed.
The generalized normal log-likelihood function has infinitely many continuous derivates only if is a positive, even integer. Otherwise, the function has continuous derivatives. As a result, the standard results for consistency and asymptotic normality of maximum likelihood estimates of only apply when.

Maximum likelihood estimator

It is possible to fit the generalized normal distribution adopting an approximate maximum likelihood method. With initially set to the sample first moment,
is estimated by using a Newton–Raphson iterative procedure, starting from an initial guess of,
where
is the first statistical moment of the absolute values and is the second statistical moment. The iteration is
where
and
and where and are the digamma function and trigamma function.
Given a value for, it is possible to estimate by finding the minimum of:
Finally is evaluated as
For, median is a more appropriate estimator of . Once is estimated, and can be estimated as described above.

Applications

This version of the generalized normal distribution has been used in modeling when the concentration of values around the mean and the tail behavior are of particular interest. Other families of distributions can be used if the focus is on other deviations from normality. If the symmetry of the distribution is the main interest, the skew normal family or version 2 of the generalized normal family discussed below can be used. If the tail behavior is the main interest, the student t family can be used, which approximates the normal distribution as the degrees of freedom grows to infinity. The t distribution, unlike this generalized normal distribution, obtains heavier than normal tails without acquiring a cusp at the origin.

Properties

Moments

Let be zero mean generalized Gaussian distribution of shape and scaling parameter . The moments of exist and are finite for any k greater than −1. For any non-negative integer k, the plain central moments are

Connection to Positive-Definite Functions

The probability density function of this version of the generalized normal distribution is a positive-definite function for .

Infinite divisibility

This version of generalized Gaussian distribution is an infinitely divisible distribution if and only if.

Generalizations

The multivariate generalized normal distribution, i.e. the product of exponential power distributions with the same and parameters, is the only probability density that can be written in the form and has independent marginals. The results for the special case of the Multivariate normal distribution is originally attributed to Maxwell.

Version 2

This is a family of continuous probability distributions in which the shape parameter can be used to introduce skew. When the shape parameter is zero, the normal distribution results. Positive values of the shape parameter yield left-skewed distributions bounded to the right, and negative values of the shape parameter yield right-skewed distributions bounded to the left. Only when the shape parameter is zero is the density function for this distribution positive over the whole real line: in this case the distribution is a normal distribution, otherwise the distributions are shifted and possibly reversed log-normal distributions.

Parameter estimation

Parameters can be estimated via maximum likelihood estimation or the method of moments. The parameter estimates do not have a closed form, so numerical calculations must be used to compute the estimates. Since the sample space depends on the true value of the parameter, some standard results about the performance of parameter estimates will not automatically apply when working with this family.

Applications

This family of distributions can be used to model values that may be normally distributed, or that may be either right-skewed or left-skewed relative to the normal distribution. The skew normal distribution is another distribution that is useful for modeling deviations from normality due to skew. Other distributions used to model skewed data include the gamma, lognormal, and Weibull distributions, but these do not include the normal distributions as special cases.

Other distributions related to the normal

The two generalized normal families described here, like the skew normal family, are parametric families that extends the normal distribution by adding a shape parameter. Due to the central role of the normal distribution in probability and statistics, many distributions can be characterized in terms of their relationship to the normal distribution. For example, the log-normal, folded normal, and inverse normal distributions are defined as transformations of a normally-distributed value, but unlike the generalized normal and skew-normal families, these do not include the normal distributions as special cases.

Actually all distributions with finite variance are in the limit highly related to the normal distribution. The Student-t distribution, the Irwin–Hall distribution and the Bates distribution also extend the normal distribution, and include in the limit the normal distribution. So there is no strong reason to prefer the "generalized" normal distribution of type 1, e.g. over a combination of Student-t and a normalized extended Irwin–Hall – this would include e.g. the triangular distribution.


A symmetric distribution which can model both tail and center behavior completely independently could be derived e.g. by using X = IH/chi.