The multitaper method overcomes some of the limitations of conventional Fourier analysis. When applying the Fourier transform to extract spectral information from a signal, we assume that each Fourier coefficient is a reliable representation of the amplitude and relative phase of the corresponding component frequency. This assumption, however, is not always valid. For instance, a single trial represents only one noisy realization of the underlying process of interest. A comparable situation arises in statistics when estimating measures of central tendency i.e., it is bad practice to estimate qualities of a population using individuals or very small samples. Likewise, a single sample of a process does not necessarily provide a reliable estimate of its spectral properties. Moreover, the naive power spectral density obtained from the signal's Fourier transform is a biased estimate of the true spectral content. These problems are often overcome by averaging over many realizations of the same event. However, this method is unreliable with small data sets and undesirable when one does not wish to attenuate signal components that vary across trials. Instead of ensemble averaging, the multitaper method reduces estimation bias by obtaining multiple independent estimates from the same sample. Each data taper is multiplied element-wise by the signal to provide a windowed trial from which one estimates the power at each component frequency. As each taper is pairwise orthogonal to all other tapers, the windowed signals provide statistically independent estimates of the underlying spectrum. The final spectrum is obtained by averaging over all the tapered spectra. Thomson chose the Slepian or discrete prolate spheroidal sequences as tapers since these vectors are mutually orthogonal and possess desirable spectral concentration properties. In practice, a weighted average is often used to compensate for increased energy loss at higher order tapers.
The method
Consider a p-dimensional zero mean stationary stochastic process Here T denotes the matrix transposition. In neurophysiology for example, p refers to the total number of channels and hence can represent simultaneous measurement of electrical activity of those p channels. Let the sampling interval between observations be, so that the Nyquist frequency is. The multitaper spectral estimator utilizes several different data tapers which are orthogonal to each other. The multitaper cross-spectral estimator between channel l and m is the average of K direct cross-spectral estimators between the same pair of channels and hence takes the form Here, is the kth direct cross spectral estimator between channel l and m and is given by where
The Slepian sequences
The sequence is the data taper for the kth direct cross-spectral estimator and is chosen as follows: We choose a set of K orthogonal data tapers such that each one provides a good protection against leakage. These are given by the Slepian sequences, after David Slepian with parameter W and orders k = 0 to K − 1. The maximum order K is chosen to be less than the Shannon number. The quantity 2W defines the resolution bandwidth for the spectral concentration problem and. When l = m, we get the multitaper estimator for the auto-spectrum of the lth channel. In recent years, a dictionary based on modulated DPSS was proposed as an overcomplete alternative to DPSS. See alsoWindow function:DPSS or Slepian window