Video quality

Video quality is a characteristic of a video passed through a video transmission/processing system, a formal or informal measure of perceived video degradation. Video processing systems may introduce some amount of distortion or artifacts in the video signal, which negatively impacts the user's perception of a system. For many stakeholders such as content providers, service providers, and network operators, the assurance of video quality is an important task.
Video quality evaluation is performed to describe the quality of a set of video sequences under study. Video quality can be evaluated objectively or subjectively. Also, the quality of a system can be determined offline, or in-service.

From analog to digital video

Since the world's first video sequence was recorded and transmitted, many video processing systems have been designed. Such systems encode video streams and transmit them over various kinds of networks or channels. In the ages of analog video systems, it was possible to evaluate the quality aspects of a video processing system by calculating the system's frequency response using test signals.
Digital video systems have almost fully replaced analog ones, and quality evaluation methods have changed. The performance of a digital video processing and transmission system can vary significantly and depends, amongst others, on the characteristics of the input video signal, the settings used for encoding and transmission, and the channel fidelity or network performance.

Objective video quality

Objective video quality models are mathematical models that approximate results from subjective quality assessment, in which human observers are asked to rate the quality of a video. In this context, the term model may refer to a simple statistical model in which several independent variables are fit against results obtained in a subjective quality evaluation test using regression techniques. A model may also be a more complicated algorithm implemented in software or hardware.

Terminology

The terms model and metric are often used interchangeably in the field. However a metric has certain mathematical properties, which, by strict definition, do not apply to all video quality models.
The term “objective” relates to the fact that, in general, quality models are based on criteria that can be measured objectively – that is, free from human interpretation. They can be automatically evaluated by a computer program. Unlike a panel of human observers, an objective model should always deterministically output the same quality score for a given set of input parameters.
Objective quality models are sometimes also referred to as instrumental models, in order to emphasize their application as measurement instruments. Some authors suggest that the term “objective” is misleading, as it “implies that instrumental measurements bear objectivity, which they only do in case that they can be generalized.”

Classification of objective video quality models

Objective models can be classified by the amount of information available about the original signal, the received signal, or whether there is a signal present at all:

Full Reference Methods : FR models compute the quality difference by comparing the original video signal against the received video signal. Typically, every pixel from the source is compared against the corresponding pixel at the received video, with no knowledge about the encoding or transmission process in between. More elaborate algorithms may choose to combine the pixel-based estimation with other approaches such as described below. FR models are usually the most accurate at the expense of higher computational effort. As they require availability of the original video before transmission or coding, they cannot be used in all situations.
Reduced Reference Methods : RR models extract some features of both videos and compare them to give a quality score. They are used when all the original video is not available, or when it would be practically impossible to do so, e.g. in a transmission with a limited bandwidth. This makes them more efficient than FR models at the expense of lower accuracy.
No-Reference Methods : NR models try to assess the quality of a distorted video without any reference to the original signal. Due to the absence of an original signal, they may be less accurate than FR or RR approaches, but are more efficient to compute.
* Pixel-Based Methods : Pixel-based models use a decoded representation of the signal and analyze the quality based on the pixel information. Some of these evaluate specific degradation types only, such as blurring or other coding artifacts.
* Parametric/Bitstream Methods : These models make use of features extracted from the transmission container and/or video bitstream, e.g. MPEG-TS packet headers, motion vectors and quantization parameters. They do not have access to the original signal and require no decoding of the video, which makes them more efficient. In contrast to NR-P models, they have no access to the final decoded signal. However, the picture quality predictions they deliver are not very accurate.
* Hybrid Methods : Hybrid models combine parameters extracted from the bitstream with a decoded video signal. They are therefore a mix between NR-P and NR-B models.
Use of picture quality models for video quality estimation

Some models that are used for video quality assessment are simply image quality models, whose output is calculated for every frame of a video sequence. This quality measure of every frame can then be recorded and pooled over time to assess the quality of an entire video sequence. While this method is easy to implement, it does not factor in certain kinds of degradations that develop over time, such as the moving artifacts caused by packet loss and its concealment. A video quality model that considers the temporal aspects of quality degradations, like or the MOVIE Index, may be able to produce more accurate predictions of human-perceived quality.

Examples

No-reference metrics

An overview of recent no-reference image quality models has been given in a journal paper by Shahid et al. As mentioned above, these can be used for video applications as well. No-reference, pixel-based quality models designed specifically for video are however rare, with Video-BLIINDS being one example. The Video Quality Experts Group has a dedicated working group on developing no-reference metrics.

Simple full-reference metrics

The most traditional ways of evaluating quality of digital video processing system are FR-based. Among the oldest FR metrics are signal-to-noise ratio and peak signal-to-noise ratio, which are calculated between every frame of the original and the degraded video signal. PSNR is the most widely used objective image quality metric, and the average PSNR over all frames can be considered a video quality metric. PSNR is also used often during video codec development in order to optimize encoders. However, PSNR values do not correlate well with perceived picture quality due to the complex, highly non-linear behavior of the human visual system.

More complex full- or reduced-reference metrics

With the success of digital video, a large number of more precise FR metrics have been developed. These metrics are inherently more complex than PSNR, and need more computational effort to calculate predictions of video quality. Among those metrics specifically developed for video are and the MOVIE Index.
Based on the results of benchmarks by the and the HDTV Test Phase I ), some RR/FR metrics have been standardized in ITU-T as:

ITU-T Rec. , 2002
ITU-T Rec. , 2008
ITU-T Rec. , 2008
ITU-T Rec. , 2011
ITU-T Rec. , 2011

The Structural Similarity FR image quality metric is also often used for estimating video quality. Visual Information Fidelity – also an image quality metric – is a core element of the Netflix Video Multimethod Assessment Fusion, a tool that combines existing metrics to predict video quality.

Bitstream-based metrics

Full or reduced-reference metrics still require access to the original video bitstream before transmission, or at least part of it. In practice, an original stream may not always be available for comparison, for example when measuring the quality from the user side. In other situations, a network operator may want to measure the quality of video streams passing through their network, without fully decoding them. For a more efficient estimation of video quality in such cases, parametric/bitstream-based metrics have also been standardized:

ITU-T Rec. , 2012
ITU-T Rec. , 2012
ITU-T Rec. , 2016
Use in practice

Few of these standards have found commercial applications, including PEVQ and VQuad-HD. SSIM is also part of a commercially available video quality toolset. VMAF is used by Netflix to tune their encoding and streaming algorithms, and to quality-control all streamed content. It is also being used by other technology companies like Bitmovin and has been integrated into software such as FFmpeg.

Training and performance evaluation

Since objective video quality models are expected to predict results given by human observers, they are developed with the aid of subjective test results. During development of an objective model, its parameters should be trained so as to achieve the best correlation between the objectively predicted values and the subjective scores, often available as mean opinion scores.
The most widely used subjective test materials are in the public-domain and include still picture, motion picture, streaming video, high definition, 3-D and special-purposes picture quality related datasets. These so-called databases are created by various research laboratories around the world. Some of them have become de facto standards, including several public-domain subjective picture quality databases created and maintained by the as well the . A collection of databases can be found in the repository. The hosts freely available video test sequences for model development.
In theory, a model can be trained on a set of data in such a way that it produces perfectly matching scores on that dataset. However, such a model will be over-trained and will therefore not perform well on new datasets. It is therefore advised to validate models against new data and use the resulting performance as a real indicator of the model's prediction accuracy.
To measure the performance of a model, some frequently used metrics are the linear correlation coefficient, Spearman's rank correlation coefficient, and the root mean square error. Other metrics are the kappa coefficient and the outliers ratio. ITU-T Rec. gives an overview of statistical procedures to evaluate and compare objective models.

Uses and application of objective models

Objective video quality models can be used in various application areas. In video codec development, the performance of a codec is often evaluated in terms of PSNR or SSIM. For service providers, objective models can be used for monitoring a system. For example, an IPTV provider may choose to monitor their service quality by means of objective models, rather than asking users for their opinion, or waiting for customer complaints about bad video quality.
An objective model should only be used in the context that it was developed for. For example, a model that was developed using a particular video codec is not guaranteed to be accurate for another video codec. Similarly, a model trained on tests performed on a large TV screen should not be used for evaluating the quality of a video watched on a mobile phone.

Other approaches

When estimating quality of a video codec, all the mentioned objective methods may require repeating post-encoding tests in order to determine the encoding parameters that satisfy a required level of visual quality, making them time consuming, complex and impractical for implementation in real commercial applications. There is ongoing research into developing novel objective evaluation methods which enable prediction of the perceived quality level of the encoded video before the actual encoding is performed.

Subjective video quality

The main goal of many objective video quality metrics is to automatically estimate the average user's opinion on the quality of a video processed by a system. Procedures for subjective video quality measurements are described in ITU-R recommendation and ITU-T recommendation . In such tests, video sequences are shown to a group of viewers. The viewers' opinion is recorded and averaged into the mean opinion score to evaluate the quality of each video sequence. However, the testing procedure may vary depending on what kind of system is tested.

Popular movies

The Hunger Games (film) - 2012 American dystopian action thriller science fiction-adventure film directed by Gary Ross and based on Suzanne Collins’s 2008 novel of the same name. It is the first insta...
untitled Captain Marvel sequel - part of Marvel Cinematic Universe....
Killers of the Flower Moon (film project) - Killers of the Flower Moon - film project in United States of America. It was presented as drama, detective fiction, thriller. The film project starred Leonardo Dicaprio, Robert De Niro. Director of...
Five Nights at Freddy's (film) - Five Nights at Freddy's - film published in 2017 in United States of America. Scenarist of the film - Scott Cawthon....

Popular books

Book of Revelation - The Book of Revelation is the final book of the New Testament, and consequently is also the final book of the Christian Bible. Its title is derived from the first word of the Koine Greek text: apok...
Book of Genesis - account of the creation of the world, the early history of humanity, Israel's ancestors and the origins...
Gospel of Matthew - The Gospel According to Matthew is the first book of the New Testament and one of the three synoptic gospels. It tells how Israel's Messiah, rejected and executed in Israel, pronounces judgement on ...
Michelin Guide - Michelin Guides are a series of guide books published by the French tyre company Michelin for more than a century. The term normally refers to the annually published Michelin Red Guide , the oldest...
Psalms - The Book of Psalms , commonly referred to simply as Psalms , the Psalter or "the Psalms", is the first book of the Ketuvim , the third section of the Hebrew Bible, and thus a book of th...
Ecclesiastes - Ecclesiastes is one of 24 books of the Tanakh , where it is classified as one of the Ketuvim . Originally written c. 450–200 BCE, it is also among the canonical Wisdom literature of the Old Tes...
The 48 Laws of Power - non-fiction book by American author Robert Greene. The book...

Popular television series

The Crown (TV series) - historical drama web television series about the reign of Queen Elizabeth II, created and principally written by Peter Morgan, and produced by Left Bank Pictures and Sony Pictures Tel...
Friends - American sitcom television series, created by David Crane and Marta Kauffman, which aired on NBC from September 22, 1994, to May 6, 2004, lasting ten seasons. With an ensemble cast sta...
Young Sheldon - spin-off prequel to The Big Bang Theory and begins with the character Sheldon...
Modern Family - American television mockumentary family sitcom created by Christopher Lloyd and Steven Levitan for the American Broadcasting Company. It ran for eleven seasons, from September 23...
Loki (TV series) - upcoming American web television miniseries created for Disney+ by Michael Waldron, based on the Marvel Comics character of the same name. It is set in the Marvel Cinematic Universe, shar...
Game of Thrones - American fantasy drama television series created by David Benioff and D. B. Weiss for HBO. It...
Shameless (American TV series) - American comedy-drama television series developed by John Wells which debuted on Showtime on January 9, 2011. It...