Confirmatory composite analysis

In statistics, confirmatory composite analysis (CCA) is a sub-type of structural equation modeling (SEM).
Although CCA historically emerged from a re-orientation and re-start of partial least squares path modeling (PLS-PM),
it has become an independent approach, and the two should not be confused.
In many ways it is similar to, but also quite distinct from, confirmatory factor analysis (CFA).
It shares with CFA the process of model specification, model identification, model estimation, and model assessment.
However, in contrast to CFA, which always assumes the existence of latent variables, in CCA all variables can be observable, with their interrelationships expressed in terms of composites, i.e., linear compounds of subsets of the variables.
The composites are treated as the fundamental objects and path diagrams can be used to illustrate their relationships.
This makes CCA particularly useful for disciplines examining theoretical concepts that are designed to attain certain goals, so-called artifacts, and their interplay with theoretical concepts of behavioral sciences.

Development

The initial idea of CCA was sketched by Theo K. Dijkstra and Jörg Henseler in 2014.
It took until 2018 for the first full description of CCA to be published, by Florian Schuberth, Jörg Henseler and Theo K. Dijkstra.
As is common for statistical developments, interim results on CCA were shared with the scientific community in written form.
Moreover, CCA was presented at several conferences including the 5th Modern Modeling Methods Conference, the 2nd International Symposium on Partial Least Squares Path Modeling, the 5th CIM Community Workshop, and the Meeting of the SEM Working Group in 2018.

Statistical model

A composite is typically a linear combination of observable random variables. However, so-called second-order composites, i.e., linear combinations of latent variables or of other composites, are also conceivable.
For a random column vector $x$ of $K$ observable variables that is partitioned into $J$ sub-vectors $x_1, \dots, x_J$, composites can be defined as weighted linear combinations.
So the i-th composite $c_i$ equals:
$$c_i = w_i' x_i,$$
where the weights $w_i$ of each composite are appropriately normalized.
In the following, it is assumed that the weights are scaled in such a way that each composite has a variance of one, i.e., $w_i' \Sigma_{ii} w_i = 1$.
Moreover, without loss of generality, it is assumed that the observable random variables are standardized, having a mean of zero and a unit variance.
Generally, the variance-covariance matrices $\Sigma_{ii}$ of the sub-vectors are not constrained beyond being positive definite.
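Similar to the latent variables of a factor model, the composites explain the covariances between the sub-vectors, leading to the following inter-block covariance matrix for $i \neq j$:
$$\Sigma_{ij} = \rho_{ij} \lambda_i \lambda_j', \qquad \lambda_i = \Sigma_{ii} w_i,$$
where $\rho_{ij}$ is the correlation between the composites $c_i$ and $c_j$, $\Sigma_{ii}$ and $w_i$ are the variance-covariance matrix and the weight vector of block $i$, and $\lambda_i$ contains the covariances between the variables of block $i$ and their composite.
The composite model imposes rank-one constraints on the inter-block covariance matrices, i.e., $\operatorname{rank}(\Sigma_{ij}) = 1$.
Generally, the variance-covariance matrix of $x$ is positive definite if and only if the correlation matrix of the composites $R = (\rho_{ij})$ and the variance-covariance matrices $\Sigma_{ii}$ are both positive definite.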
In addition, the composites can be related via a structural model which constrains the correlation matrix $R$ of the composites indirectly via a set of simultaneous equations:
$$c_{\text{endo}} = B c_{\text{endo}} + C c_{\text{exo}} + z,$$
where the vector of composites $c$ is partitioned into an exogenous part $c_{\text{exo}}$ and an endogenous part $c_{\text{endo}}$, and the matrices $B$ and $C$ contain the so-called path coefficients.
Moreover, the vector $z$ contains the structural error terms, which have a zero mean and are uncorrelated with $c_{\text{exo}}$.
As the model need not be recursive, the matrix $B$ is not necessarily triangular and the elements of $z$ may be correlated.
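
The way the composite model structures the covariance matrix of the observable variables can be illustrated numerically. The following is a minimal sketch, not a reference implementation: the function name `implied_covariance` and the example values are illustrative assumptions, and it assumes that the covariance between two blocks is the composite correlation times the outer product of the blocks' covariances with their own composites.

```python
import numpy as np

def implied_covariance(intra_blocks, weights, composite_corr):
    """Assemble the indicator covariance matrix implied by a composite model.

    intra_blocks: list of positive definite intra-block covariance matrices
    weights: list of weight vectors, one per block (any scaling)
    composite_corr: correlation matrix of the composites
    """
    # Rescale each weight vector so that the composite has unit variance.
    scaled = [w / np.sqrt(w @ s @ w) for s, w in zip(intra_blocks, weights)]
    # Covariances between the variables of a block and their own composite.
    loadings = [s @ w for s, w in zip(intra_blocks, scaled)]
    n_blocks = len(intra_blocks)
    rows = []
    for i in range(n_blocks):
        row = []
        for j in range(n_blocks):
            if i == j:
                # Intra-block covariances are left unconstrained.
                row.append(intra_blocks[i])
            else:
                # Inter-block covariances have rank one.
                row.append(composite_corr[i, j] * np.outer(loadings[i], loadings[j]))
        rows.append(row)
    return np.block(rows)

# Illustrative values (assumed): two blocks with two variables each.
s1 = np.array([[1.0, 0.5], [0.5, 1.0]])
s2 = np.array([[1.0, 0.3], [0.3, 1.0]])
r = np.array([[1.0, 0.4], [0.4, 1.0]])
sigma = implied_covariance([s1, s2], [np.ones(2), np.ones(2)], r)
print(np.linalg.matrix_rank(sigma[:2, 2:]))  # rank of the inter-block part
```

Applying the scaled weights to the inter-block part of `sigma` recovers the composite correlation that was put in, which is a quick consistency check on the construction.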

Model identification

To ensure identification of the composite model, each composite must be correlated with at least one variable not forming the composite. In addition to this non-isolation condition, each composite needs to be normalized, e.g., by fixing one weight per composite, the length of each weight vector, or the composite's variance to a certain value.
If the composites are embedded in a structural model, the structural model must also be identified.
Finally, since the weight signs are still undetermined, it is recommended to select a dominant indicator per block of indicators that dictates the orientation of the composite.
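The degrees of freedom of the basic composite model, i.e., with no constraints imposed on the composites' correlation matrix, are calculated as follows:
df = number of non-redundant off-diagonal elements of the indicator covariance matrix
   - number of free correlations among the composites
   - number of free covariances between the composites and indicators not forming a composite
   - number of covariances among the indicators not forming a composite
   - number of free non-redundant off-diagonal elements of each intra-block covariance matrix
   - number of weights
   + number of blocks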
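
As a worked illustration, the count can be sketched for the special case in which every indicator belongs to exactly one block, so that no indicators outside the composites need to be considered. This is an assumption of the sketch, not a general formula, and the function name `composite_model_df` is hypothetical. The degrees of freedom are then the non-redundant off-diagonal elements of the indicator covariance matrix, minus the composite correlations, the intra-block off-diagonal elements and the weights, plus one normalization per block.

```python
def composite_model_df(block_sizes):
    """Degrees of freedom of a basic composite model.

    Assumes every indicator belongs to exactly one block and that the
    composites' correlation matrix is unconstrained.
    block_sizes: number of indicators in each block.
    """
    k = sum(block_sizes)   # total number of indicators
    j = len(block_sizes)   # number of blocks, i.e., composites
    off_diagonal = k * (k - 1) // 2
    composite_correlations = j * (j - 1) // 2
    intra_block = sum(s * (s - 1) // 2 for s in block_sizes)
    n_weights = k
    # One weight per block is absorbed by the normalization, hence "+ j".
    return off_diagonal - composite_correlations - intra_block - n_weights + j

# Three blocks of three indicators: 36 - 3 - 9 - 9 + 3
print(composite_model_df([3, 3, 3]))  # 18
```

The same number results from counting only the inter-block covariances (27 elements) against the parameters that describe them (3 correlations and 9 weights, less 3 normalizations).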

Model estimation

To estimate a composite model, various methods that create composites can be used, such as generalized canonical correlation analysis, principal component analysis, and linear discriminant analysis. Moreover, composite-based methods for SEM, such as partial least squares path modeling and generalized structured component analysis, can be employed to estimate the weights and the correlations among the composites.
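
One of the simplest of these options can be sketched with NumPy: the weights of each block are taken from its first principal component and rescaled so that the composite has unit variance. This is only an illustration under stated assumptions; the function name `estimate_composites_pca`, the simulated data, and the choice of the first indicator of each block as the dominant indicator are hypothetical, and other estimators will generally give different weights.

```python
import numpy as np

def estimate_composites_pca(data, blocks):
    """Estimate composite weights and composite correlations block by block.

    data: (n, K) array of observations
    blocks: list of lists with the column indices forming each composite
    """
    # Standardize the indicators to zero mean and unit variance.
    z = (data - data.mean(axis=0)) / data.std(axis=0, ddof=1)
    weights, scores = [], []
    for cols in blocks:
        zb = z[:, cols]
        s = np.atleast_2d(np.cov(zb, rowvar=False))
        # Eigenvalues come in ascending order; take the leading eigenvector.
        _, eigvec = np.linalg.eigh(s)
        w = eigvec[:, -1]
        # Assumption: the first indicator of the block is the dominant one,
        # so the composite is oriented to covary positively with it.
        if (s @ w)[0] < 0:
            w = -w
        # Rescale so that the composite has a variance of one.
        w = w / np.sqrt(w @ s @ w)
        weights.append(w)
        scores.append(zb @ w)
    corr = np.corrcoef(np.column_stack(scores), rowvar=False)
    return weights, corr

# Simulated data (assumed): four indicators sharing a common source.
rng = np.random.default_rng(0)
data = rng.normal(size=(200, 1)) + rng.normal(size=(200, 4))
weights, corr = estimate_composites_pca(data, [[0, 1], [2, 3]])
print(corr)
```

Because each block is handled separately here, the weights ignore the other blocks; methods such as generalized canonical correlation analysis or PLS-PM take the inter-block relationships into account when determining the weights.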

Evaluating model fit

In CCA, the model fit, i.e., the discrepancy between the estimated model-implied variance-covariance matrix and its sample counterpart, can be assessed in two non-exclusive ways.
On the one hand, measures of fit can be employed; on the other hand, a test for overall model fit can be used.
While the former relies on heuristic rules, the latter is based on statistical inferences.
Fit measures for composite models comprise statistics such as the standardized root mean square residual (SRMR) and the root mean squared error of outer residuals.
In contrast to fit measures for common factor models, fit measures for composite models are relatively unexplored and reliable thresholds still need to be determined.
To assess the overall model fit by means of statistical testing, the bootstrap test for overall model fit, also known as the Bollen-Stine bootstrap test, can be used to investigate whether a composite model fits the data.
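
Both ingredients can be sketched in a few lines. The code below is a hedged illustration rather than the procedure of any particular software package: `srmr` follows the common definition that averages the squared standardized residuals over the lower triangle including the diagonal (some implementations differ in this detail), and `bollen_stine_transform` shows only the data transformation on which the bootstrap test rests. The function names and example values are assumptions.

```python
import numpy as np

def srmr(sample_cov, implied_cov):
    """Standardized root mean square residual between two covariance matrices."""
    sd = np.sqrt(np.diag(sample_cov))
    # Residuals are standardized by the sample standard deviations.
    resid = (sample_cov - implied_cov) / np.outer(sd, sd)
    rows, cols = np.tril_indices_from(resid)
    return float(np.sqrt(np.mean(resid[rows, cols] ** 2)))

def _sym_power(matrix, power):
    """Power of a symmetric positive definite matrix via its eigendecomposition."""
    val, vec = np.linalg.eigh(matrix)
    return (vec * val ** power) @ vec.T

def bollen_stine_transform(data, implied_cov):
    """Transform the data so that its sample covariance equals implied_cov.

    Bootstrap samples drawn from the transformed data then mimic a world in
    which the model holds exactly.
    """
    centered = data - data.mean(axis=0)
    s = np.cov(centered, rowvar=False)
    return centered @ _sym_power(s, -0.5) @ _sym_power(implied_cov, 0.5)

# Illustrative values (assumed).
sample = np.array([[1.0, 0.5], [0.5, 1.0]])
implied = np.array([[1.0, 0.3], [0.3, 1.0]])
fit = srmr(sample, implied)

rng = np.random.default_rng(1)
raw = rng.normal(size=(100, 2))
transformed = bollen_stine_transform(raw, implied)
```

To complete the test, the model would be re-estimated on each bootstrap sample drawn from `transformed`, and the discrepancy observed for the original data would be compared with the resulting reference distribution.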

Alternative views on CCA

Besides the originally proposed CCA, the evaluation steps known from partial least squares structural equation modeling (PLS-SEM) have also been dubbed CCA.
PLS-SEM's evaluation steps, in the following called PLS-CCA, differ from CCA in several regards.
While PLS-CCA aims at confirming reflective and formative measurement models, CCA aims at assessing composite models. PLS-CCA omits overall model fit assessment, which is a crucial step in CCA as well as in SEM. Finally, PLS-CCA is strongly linked to PLS-PM, whereas for CCA, PLS-PM can be employed as one estimator, but this is in no way mandatory.
Hence, researchers who employ the term CCA need to be aware of which technique they are referring to.
