Half-precision floating-point format

In computing, half precision is a binary floating-point computer number format that occupies 16 bits in computer memory.
They can express values in the range ±65,504, with precision up to 0.0000000596046.
In the IEEE 754-2008 standard, the 16-bit base-2 format is referred to as binary16. It is intended for storage of floating-point values in applications where higher precision is not essential for performing arithmetic computations.
Although implementations of the IEEE Half-precision floating point are relatively new, several earlier 16-bit floating point formats have existed including that of Hitachi's HD61810 DSP of 1982, Scott's WIF and the 3dfx Voodoo Graphics processor.
Nvidia and Microsoft defined the half datatype in the Cg language, released in early 2002, and implemented it in silicon in the GeForce FX, released in late 2002. ILM was searching for an image format that could handle a wide dynamic range, but without the hard drive and memory cost of floating-point representations that are commonly used for floating-point computation. The hardware-accelerated programmable shading group led by John Airey at SGI invented the s10e5 data type in 1997 as part of the 'bali' design effort. This is described in a SIGGRAPH 2000 paper and further documented in US patent 7518615.
This format is used in several computer graphics environments including MATLAB, OpenEXR, JPEG XR, GIMP, OpenGL, Cg, Direct3D, and D3DX. The advantage over 8-bit or 16-bit binary integers is that the increased dynamic range allows for more detail to be preserved in highlights and shadows for images. The advantage over 32-bit single-precision binary formats is that it requires half the storage and bandwidth.
The F16C extension allows x86 processors to convert half-precision floats to and from single-precision floats.
Depending on the computer half-precision can be over an order of magnitude faster than double precision, e.g. 37 PFLOPS vs. for half 550 "AI-PFLOPS ".

IEEE 754 half-precision binary floating-point format: binary16

The IEEE 754 standard specifies a binary16 as having the following format:

Sign bit: 1 bit
Exponent width: 5 bits
Significand precision: 11 bits

The format is laid out as follows:
The format is assumed to have an implicit lead bit with value 1 unless the exponent field is stored with all zeros. Thus only 10 bits of the significand appear in the memory format but the total precision is 11 bits. In IEEE 754 parlance, there are 10 bits of significand, but there are 11 bits of significand precision.

Exponent encoding

The half-precision binary floating-point exponent is encoded using an offset-binary representation, with the zero offset being 15; also known as exponent bias in the IEEE 754 standard.

E_min = 00001₂ − 01111₂ = −14
E_max = 11110₂ − 01111₂ = 15
Exponent bias = 01111₂ = 15

Thus, as defined by the offset binary representation, in order to get the true exponent the offset of 15 has to be subtracted from the stored exponent.
The stored exponents 00000₂ and 11111₂ are interpreted specially.
The minimum strictly positive value is
2⁻²⁴ ≈ 5.96 × 10⁻⁸.
The minimum positive normal value is 2⁻¹⁴ ≈ 6.10 × 10⁻⁵.
The maximum representable value is × 2¹⁵ = 65504.

Half precision examples

These examples are given in bit representation
of the floating-point value. This includes the sign bit, exponent, and significand.
0 00000 0000000001₂ = 0001₁₆ = ≈ 0.000000059604645

0 00000 1111111111₂ = 03ff₁₆ = ≈ 0.000060975552

0 00001 0000000000₂ = 0400₁₆ = ≈ 0.000061035156

0 11110 1111111111₂ = 7bff₁₆ = = 65504

0 01110 1111111111₂ = 3bff₁₆ = ≈ 0.99951172

0 01111 0000000000₂ = 3c00₁₆ = = 1

0 01111 0000000001₂ = 3c01₁₆ = ≈ 1.00097656

0 01101 0101010101₂ = 3555₁₆ = = 0.33325195

1 10000 0000000000₂ = c000₁₆ = −2
0 00000 0000000000₂ = 0000₁₆ = 0
1 00000 0000000000₂ = 8000₁₆ = −0
0 11111 0000000000₂ = 7c00₁₆ = infinity
1 11111 0000000000₂ = fc00₁₆ = −infinity
By default, 1/3 rounds down like for double precision, because of the odd number of bits in the significand. So the bits beyond the rounding point are 0101... which is less than 1/2 of a unit in the last place.

Precision limitations on decimal values in 0, 1

Decimals between 2⁻²⁴ and 2⁻¹⁴ : fixed interval 2⁻²⁴
Decimals between 2⁻¹⁴ and 2⁻¹³: fixed interval 2⁻²⁴
Decimals between 2⁻¹³ and 2⁻¹²: fixed interval 2⁻²³
Decimals between 2⁻¹² and 2⁻¹¹: fixed interval 2⁻²²
Decimals between 2⁻¹¹ and 2⁻¹⁰: fixed interval 2⁻²¹
Decimals between 2⁻¹⁰ and 2⁻⁹: fixed interval 2⁻²⁰
Decimals between 2⁻⁹ and 2⁻⁸: fixed interval 2⁻¹⁹
Decimals between 2⁻⁸ and 2⁻⁷: fixed interval 2⁻¹⁸
Decimals between 2⁻⁷ and 2⁻⁶: fixed interval 2⁻¹⁷
Decimals between 2⁻⁶ and 2⁻⁵: fixed interval 2⁻¹⁶
Decimals between 2⁻⁵ and 2⁻⁴: fixed interval 2⁻¹⁵
Decimals between 2⁻⁴ and 2⁻³: fixed interval 2⁻¹⁴
Decimals between 2⁻³ and 2⁻²: fixed interval 2⁻¹³
Decimals between 2⁻² and 2⁻¹: fixed interval 2⁻¹²
Decimals between 2⁻¹ and 2⁻⁰: fixed interval 2⁻¹¹
Precision limitations on decimal values in 1, 2048
Decimals between 1 and 2: fixed interval 2⁻¹⁰
Decimals between 2 and 4: fixed interval 2⁻⁹
Decimals between 4 and 8: fixed interval 2⁻⁸
Decimals between 8 and 16: fixed interval 2⁻⁷
Decimals between 16 and 32: fixed interval 2⁻⁶
Decimals between 32 and 64: fixed interval 2⁻⁵
Decimals between 64 and 128: fixed interval 2⁻⁴
Decimals between 128 and 256: fixed interval 2⁻³
Decimals between 256 and 512: fixed interval 2⁻²
Decimals between 512 and 1024: fixed interval 2⁻¹
Decimals between 1024 and 2048: fixed interval 2⁰
Precision limitations on integer values
Integers between 0 and 2048 can be exactly represented
Integers between 2048 and 4096 round to a multiple of 2
Integers between 4096 and 8192 round to a multiple of 4
Integers between 8192 and 16384 round to a multiple of 8
Integers between 16384 and 32768 round to a multiple of 16
Integers between 32768 and 65519 round to a multiple of 32
Integers above 65519 are rounded to "infinity" if using round-to-even, or above 65535 if using round-to-zero, or above 65504 if using round-to-infinity.
ARM alternative half-precision

ARM processors support an "alternative half-precision" format, which does away with the special case for an exponent value of 31. It is almost identical to the IEEE format, but there is no encoding for infinity or NaNs; instead, an exponent of 31 encodes normalized numbers in the range 65536 to 131008.

Uses

Hardware and software for machine learning or neural networks tend to use half precision: such applications usually do a large amount of calculation, but don't require a high level of precision.
On older computers that access 8 or 16 bits at a time, half precision arithmetic is faster than single precision, and substantially faster than double precision. On systems with instructions that can handle multiple floating point numbers with in one instruction, half-precision often offers a higher average throughput.

Popular movies

The Hunger Games (film) - 2012 American dystopian action thriller science fiction-adventure film directed by Gary Ross and based on Suzanne Collins’s 2008 novel of the same name. It is the first insta...
untitled Captain Marvel sequel - part of Marvel Cinematic Universe....
Killers of the Flower Moon (film project) - Killers of the Flower Moon - film project in United States of America. It was presented as drama, detective fiction, thriller. The film project starred Leonardo Dicaprio, Robert De Niro. Director of...
Five Nights at Freddy's (film) - Five Nights at Freddy's - film published in 2017 in United States of America. Scenarist of the film - Scott Cawthon....

Popular books

Book of Revelation - The Book of Revelation is the final book of the New Testament, and consequently is also the final book of the Christian Bible. Its title is derived from the first word of the Koine Greek text: apok...
Book of Genesis - account of the creation of the world, the early history of humanity, Israel's ancestors and the origins...
Gospel of Matthew - The Gospel According to Matthew is the first book of the New Testament and one of the three synoptic gospels. It tells how Israel's Messiah, rejected and executed in Israel, pronounces judgement on ...
Michelin Guide - Michelin Guides are a series of guide books published by the French tyre company Michelin for more than a century. The term normally refers to the annually published Michelin Red Guide , the oldest...
Psalms - The Book of Psalms , commonly referred to simply as Psalms , the Psalter or "the Psalms", is the first book of the Ketuvim , the third section of the Hebrew Bible, and thus a book of th...
Ecclesiastes - Ecclesiastes is one of 24 books of the Tanakh , where it is classified as one of the Ketuvim . Originally written c. 450–200 BCE, it is also among the canonical Wisdom literature of the Old Tes...
The 48 Laws of Power - non-fiction book by American author Robert Greene. The book...

Popular television series

The Crown (TV series) - historical drama web television series about the reign of Queen Elizabeth II, created and principally written by Peter Morgan, and produced by Left Bank Pictures and Sony Pictures Tel...
Friends - American sitcom television series, created by David Crane and Marta Kauffman, which aired on NBC from September 22, 1994, to May 6, 2004, lasting ten seasons. With an ensemble cast sta...
Young Sheldon - spin-off prequel to The Big Bang Theory and begins with the character Sheldon...
Modern Family - American television mockumentary family sitcom created by Christopher Lloyd and Steven Levitan for the American Broadcasting Company. It ran for eleven seasons, from September 23...
Loki (TV series) - upcoming American web television miniseries created for Disney+ by Michael Waldron, based on the Marvel Comics character of the same name. It is set in the Marvel Cinematic Universe, shar...
Game of Thrones - American fantasy drama television series created by David Benioff and D. B. Weiss for HBO. It...
Shameless (American TV series) - American comedy-drama television series developed by John Wells which debuted on Showtime on January 9, 2011. It...

Half-precision floating-point format

IEEE 754 half-precision binary floating-point format: binary16

Exponent encoding

Half precision examples

Precision limitations on decimal values in 0, 1

Precision limitations on decimal values in 1, 2048

Precision limitations on integer values

ARM alternative half-precision

Uses