Approximate entropy


In statistics, approximate entropy (ApEn) is a technique used to quantify the amount of regularity and the unpredictability of fluctuations over time-series data.
For example, there are two series of data:

  series 1: (10, 20, 10, 20, 10, 20, 10, 20, 10, 20, 10, 20, ...), which alternates 10 and 20.
  series 2: (10, 10, 20, 10, 20, 20, 20, 10, 10, 20, 10, 20, 20, ...), which has a value of either 10 or 20, chosen randomly, each with probability 1/2.

Moment statistics, such as mean and variance, will not distinguish between these two series. Nor will rank order statistics distinguish between them. Yet series 1 is "perfectly regular": knowing that one term has the value of 20 enables one to predict with certainty that the next term will have the value of 10. Series 2 is randomly valued: knowing that one term has the value of 20 gives no insight into what value the next term will have.
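This can be illustrated numerically. The sketch below (names and values chosen for illustration) builds a strictly alternating 10/20 series and a shuffled copy of it, so that both contain exactly the same values, and shows that their moment and rank order statistics coincide:

```python
import numpy as np

rng = np.random.default_rng(0)
series1 = np.array([20, 10] * 30)   # perfectly regular: 20, 10, 20, 10, ...
series2 = rng.permutation(series1)  # the same values, in random order

# Mean, variance, and median are identical for both series...
print(np.mean(series1), np.mean(series2))      # 15.0 15.0
print(np.var(series1), np.var(series2))        # 25.0 25.0
print(np.median(series1), np.median(series2))  # 15.0 15.0
# ...yet series1 is perfectly predictable and series2 is not.
```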
Regularity was originally measured by exact regularity statistics, which have mainly centered on various entropy measures.
However, accurate entropy calculation requires vast amounts of data, and the results are greatly influenced by system noise, so it is not practical to apply these methods to experimental data. ApEn was developed by Steve M. Pincus to handle these limitations by modifying an exact regularity statistic, Kolmogorov–Sinai entropy. ApEn was initially developed to analyze medical data, such as heart rate, and its applications later spread to finance, psychology, and human factors engineering.

The algorithm

Step 1
Form a time series of data u(1), u(2), ..., u(N). These are N raw data values, equally spaced in time.

Step 2
Fix m, a positive integer, and r, a positive real number. The value of m represents the length of compared runs of data, and r specifies a filtering level.

Step 3
Form a sequence of vectors x(1), x(2), ..., x(N − m + 1) in R^m, defined by x(i) = [u(i), u(i + 1), ..., u(i + m − 1)].

Step 4
Use the sequence x(1), x(2), ..., x(N − m + 1) to construct, for each i with 1 ≤ i ≤ N − m + 1,

  C_i^m(r) = (number of x(j) such that d[x(i), x(j)] ≤ r) / (N − m + 1),

in which d[x(i), x(j)] is defined as

  d[x(i), x(j)] = max_{a = 1, ..., m} |u(i + a − 1) − u(j + a − 1)|.

The u(i + a − 1) are the m scalar components of x(i). Here d represents the distance between the vectors x(i) and x(j), given by the maximum difference in their respective scalar components. Note that j takes on all values from 1 to N − m + 1, so the match provided when j = i will be counted (each vector matches itself).

Step 5
Define

  Φ^m(r) = (N − m + 1)^−1 Σ_{i=1}^{N−m+1} ln C_i^m(r),

where ln is the natural logarithm, for m and r fixed as in Step 2.

Step 6
Define approximate entropy as

  ApEn(m, r, N) = Φ^m(r) − Φ^{m+1}(r).

Parameter selection: typically choose m = 2 or m = 3, while r depends greatly on the application.

An implementation on Physionet, which is based on Pincus, uses d[x(i), x(j)] < r, whereas the original article uses d[x(i), x(j)] ≤ r in Step 4. While a concern for artificially constructed examples, this difference is usually not a concern in practice.
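The algorithm can be sketched directly in NumPy. The function and variable names below are mine, not from the original; the sketch follows the ≤ r matching convention:

```python
import numpy as np

def approx_entropy(u, m, r):
    """Illustrative sketch of ApEn(m, r, N), with self-matches counted."""
    u = np.asarray(u, dtype=float)  # the time series u(1), ..., u(N)
    N = len(u)

    def phi(m):
        # Vectors x(i) = [u(i), ..., u(i + m - 1)], i = 1, ..., N - m + 1
        x = np.array([u[i:i + m] for i in range(N - m + 1)])
        # d[x(i), x(j)]: maximum difference over the scalar components;
        # j runs over every vector, so the self-match j = i is counted.
        d = np.max(np.abs(x[:, None, :] - x[None, :, :]), axis=2)
        C = np.sum(d <= r, axis=1) / (N - m + 1)
        # Phi^m(r): the average of ln C_i^m(r)
        return np.mean(np.log(C))

    # ApEn(m, r, N) = Phi^m(r) - Phi^(m+1)(r)
    return phi(m) - phi(m + 1)
```

For a perfectly regular series, such as 51 samples of a pattern repeating with period 3, this returns a value whose magnitude is very close to zero.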

The interpretation

The presence of repetitive patterns of fluctuation in a time series renders it more predictable than a time series in which such patterns are absent. ApEn reflects the likelihood that similar patterns of observations will not be followed by additional similar observations. A time series containing many repetitive patterns has a relatively small ApEn; a less predictable process has a higher ApEn.

One example

Suppose N = 51, and the sequence consists of 51 samples of heart rate equally spaced in time:

  S_N = {85, 80, 89, 85, 80, 89, ...},

i.e., the sequence is periodic with period 3. Let's choose m = 2 and r = 3.
Form a sequence of vectors:

  x(1) = [u(1), u(2)] = [85, 80]
  x(2) = [u(2), u(3)] = [80, 89]
  x(3) = [u(3), u(4)] = [89, 85]
  x(4) = [u(4), u(5)] = [85, 80]
  ...

Distance is calculated as follows:

  d[x(1), x(1)] = max(|85 − 85|, |80 − 80|) = 0 ≤ r = 3
  d[x(1), x(2)] = max(|85 − 80|, |80 − 89|) = 9 > r
  d[x(1), x(3)] = max(|85 − 89|, |80 − 85|) = 5 > r
  d[x(1), x(4)] = max(|85 − 85|, |80 − 80|) = 0 ≤ r

Note x(1) = x(4) = x(7) = ..., so

  d[x(1), x(j)] ≤ r for j = 1, 4, 7, ..., 49.

Similarly,

  d[x(2), x(j)] ≤ r for j = 2, 5, 8, ..., 50
  d[x(3), x(j)] ≤ r for j = 3, 6, 9, ..., 48.

Therefore, the x(j) such that d[x(1), x(j)] ≤ r include x(1), x(4), ..., x(49), and the total number is 17, so C_1^2(3) = 17/50.
Please note in Step 4, 1 ≤ j ≤ N − m + 1 = 50. So the x(j) such that d[x(3), x(j)] ≤ r include x(3), x(6), ..., x(48), and the total number is 16, so C_3^2(3) = 16/50. Counting in the same way for every i, 34 of the 50 vectors have C_i^2(3) = 17/50 and the other 16 have C_i^2(3) = 16/50, so

  Φ^2(3) = (1/50) Σ_{i=1}^{50} ln C_i^2(3) ≈ −1.0982095.

Then we repeat the above steps for m = 3. First form a sequence of vectors:

  x(1) = [u(1), u(2), u(3)] = [85, 80, 89]
  x(2) = [u(2), u(3), u(4)] = [80, 89, 85]
  x(3) = [u(3), u(4), u(5)] = [89, 85, 80]
  ...

By calculating distances between the vectors x(i), x(j), 1 ≤ i, j ≤ 49, we find the vectors satisfying the filtering level have the following characteristic:

  d[x(i), x(j)] ≤ r exactly when j = i, i ± 3, i ± 6, ...

Therefore, 17 of the 49 vectors have C_i^3(3) = 17/49 and the other 32 have C_i^3(3) = 16/49, so

  Φ^3(3) = (1/49) Σ_{i=1}^{49} ln C_i^3(3) ≈ −1.0981985.

Finally,

  ApEn(2, 3, 51) = Φ^2(3) − Φ^3(3) ≈ −1.0996 × 10^−5.

The magnitude of the value is very small, so it implies the sequence is regular and predictable, which is consistent with the observation.
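The arithmetic of the example can be double-checked numerically. This sketch plugs the match counts found above (17 or 16 matches per vector) directly into the formula for Φ:

```python
import numpy as np

# m = 2: 50 vectors; 34 of them have C_i = 17/50, the other 16 have C_i = 16/50
phi2 = (34 * np.log(17 / 50) + 16 * np.log(16 / 50)) / 50
# m = 3: 49 vectors; 17 of them have C_i = 17/49, the other 32 have C_i = 16/49
phi3 = (17 * np.log(17 / 49) + 32 * np.log(16 / 49)) / 49

print(phi2)         # ~ -1.0982095
print(phi3)         # ~ -1.0981985
print(phi2 - phi3)  # ~ -1.0996e-05
```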

Python implementation


import numpy as np


def ApEn(U, m, r) -> float:
    """Approximate entropy."""

    def _maxdist(x_i, x_j):
        return max(abs(ua - va) for ua, va in zip(x_i, x_j))

    def _phi(m):
        x = [[U[j] for j in range(i, i + m)] for i in range(N - m + 1)]
        C = [
            len([1 for x_j in x if _maxdist(x_i, x_j) <= r]) / (N - m + 1.0)
            for x_i in x
        ]
        return (N - m + 1.0) ** (-1) * sum(np.log(C))

    N = len(U)
    return abs(_phi(m + 1) - _phi(m))


# Usage example
U = np.array([85, 80, 89] * 17)
print(ApEn(U, 2, 3))  # 1.0996541105257052e-05

randU = np.random.choice([85, 80, 89], size=17 * 3)
print(ApEn(randU, 2, 3))  # e.g. 0.8626664154888908 (varies from run to run)

Advantages

The advantages of ApEn include:
  1. Lower computational demand. ApEn can be designed to work for small data samples (N < 50 points) and can be applied in real time.
  2. Less effect from noise. If data is noisy, the ApEn measure can be compared to the noise level in the data to determine what quality of true information may be present in the data.
ApEn has been applied to classify EEG in psychiatric diseases, such as schizophrenia, epilepsy, and addiction.

Limitations

The ApEn algorithm counts each sequence as matching itself in order to avoid the occurrence of ln(0) in the calculations. This step introduces bias into ApEn, and the bias causes ApEn to have two poor properties in practice:
  1. ApEn is heavily dependent on the record length and is uniformly lower than expected for short records.
  2. It lacks relative consistency. That is, if ApEn of one data set is higher than that of another, it should, but does not, remain higher for all conditions tested.
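The first property can be illustrated with a sketch. Below, a minimal self-contained ApEn (self-matches counted, names and the seed chosen for illustration) scores the same memoryless 10/20 process on thirty-three short records and on one long record; with this seed, the short-record average comes out well below the long-record value:

```python
import numpy as np

def apen(u, m, r):
    """Minimal ApEn with self-matches counted, as in the algorithm."""
    u = np.asarray(u, dtype=float)
    N = len(u)

    def phi(m):
        x = np.array([u[i:i + m] for i in range(N - m + 1)])
        d = np.max(np.abs(x[:, None, :] - x[None, :, :]), axis=2)
        C = np.sum(d <= r, axis=1) / (N - m + 1)
        return np.mean(np.log(C))

    return abs(phi(m) - phi(m + 1))

rng = np.random.default_rng(42)
coin = rng.choice([10.0, 20.0], size=990)  # a memoryless 10/20 process

full = apen(coin, m=2, r=3)                # long record: near ln 2 ~ 0.693
windows = coin.reshape(33, 30)             # 33 short records of length 30
short_mean = np.mean([apen(w, m=2, r=3) for w in windows])
print(short_mean, full)  # short-record estimates are biased low
```

Averaging over many short windows separates the record-length bias from the sampling noise of any single short record.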