Major histocompatibility complex


The [Isotype (immunology)|]major histocompatibility complex is a large locus on vertebrate DNA containing a set of closely linked polymorphic genes that code for cell surface proteins essential for the adaptive immune system. This locus got its name because it was discovered in the study of tissue compatibility upon transplantation. Later studies revealed that tissue rejection due to incompatibility is an experimental artifact masking the real function of MHC molecules - binding an antigen derived from self-proteins or from pathogen and the antigen presentation on the cell surface for recognition by the appropriate T-cells. MHC molecules mediate interactions of leukocytes, also called white blood cells, which are immune cells, with other leukocytes or with body cells. The MHC determines compatibility of donors for organ transplant, as well as one's susceptibility to an autoimmune disease via cross-reacting immunization.
In a cell, protein molecules of the host's own phenotype or of other biologic entities are continually synthesized and degraded. Each MHC molecule on the cell surface displays a small peptide, molecular fraction of a protein, called an epitope. The presented self-antigens prevent an organism's immune system targeting its own cells. Presentation of pathogen-derived proteins results in the elimination of the infected cell by the immune system.
Diversity of antigen presentation, mediated by MHC antigens, is attained in at least three ways: an organism's MHC repertoire is polygenic ; MHC expression is codominant ; MHC gene variants are highly polymorphic. Sexual selection has been observed in male mice making mate choices of females with different MHCs and thus demonstrating sexual selection. Also, at least for MHC I presentation, there has been evidence of antigenic peptide splicing which can combine peptides from different proteins, vastly increasing antigen diversity.

Discovery

The first descriptions of the MHC were made by British immunologist Peter Gorer in 1936. MHC genes were first identified in inbred mice strains. Clarence Little transplanted tumors across differing strains and found rejection of transplanted tumors according to strains of host versus donor. George Snell selectively bred two mouse strains, attained a new strain nearly identical to one of the progenitor strains, but differing crucially in histocompatibility—that is, tissue compatibility upon transplantation—and thereupon identified an MHC locus. Later Jean Dausset demonstrated the existence of MHC genes in human and described the first human leucocyte antigen, the protein which we call now HLA-A2. Some years later  Baruj Benacerraf showed that polymorphic MHC genes not only determine an individual’s unique constitution of antigens but also regulate the interaction among the various cells of the immunological system. These three scientists have been awarded the for their discoveries concerning “genetically determined structures on the cell surface that regulate immunological reactions”.
The first fully sequenced and annotated MHC was published for humans in 1999 by a consortium of sequencing centers from UK, USA and Japan in Nature. It was a "virtual MHC" since it was a mosaic from different individuals. A much shorter MHC locus from chickens was published in the same issue of Nature. Many other species have been sequenced and the evolution of the MHC was studied, e.g. in the gray short-tailed opossum, a marsupial, MHC spans 3.95 Mb, yielding 114 genes, 87 shared with humans. Marsupial MHC genotypic variation lies between eutherian mammals and birds, taken as the minimal MHC encoding, but is closer in organization to that of nonmammals. The was created which provides a centralised repository for sequences of the Major Histocompatibility Complex from a number of different species. The database contains 77 species for the release from 2019-12-19.

Genes

MHC locus is present in all jawed vertebrates, it is assumed to have arisen about 450 million years ago. Despite the difference in the number of genes included in MHC of different species, the overall organization of the locus is rather similar. Usual MHC contains about hundred genes and pseudogenes, not all of them are involved in the immunity. In humans, the MHC region occurs on chromosome 6, between the flanking genetic markers MOG and COL11A2, and contains 224 genes spanning 3.6 megabase pairs. About half have known immune functions. The human MHC is also called the HLA complex. Similarly, there is SLA, BoLA, DLA for dogs, etc. However, historically, the MHC in mice is called the Histocompatibility system 2 or just the H-2, in rats - RT1, and in chicken - B-locus.
The MHC gene family is divided into three subgroups: MHC class I, MHC class II, and MHC class III. Among all those genes present in MHC, there are two types of genes coding for the proteins MHC class I molecules and MHC class II molecules that directly involved in the antigen presentation. These genes are highly polymorphic, 19031 alleles of class I HLA, and 7183 of class II HLA are deposited for human in the data base .
ClassEncodingExpression
I peptide-binding proteins, which select short sequences of amino acids for antigen presentation, as well as molecules aiding antigen-processing.One chain, called α, whose ligands are the CD8 receptor—borne notably by cytotoxic T cells—and inhibitory receptors borne by NK cells
II peptide-binding proteins and proteins assisting antigen loading onto MHC class II's peptide-binding proteins.Two chains, called α & β, whose ligands are the CD4 receptors borne by helper T cells.
IIIOther immune proteins, outside antigen processing and presentation, such as components of the complement cascade, the cytokines of immune signaling, and heat shock proteins buffering cells from stressesVarious

Proteins

MHC class I

molecules are expressed in all nucleated cells and also in platelets—in essence all cells but red blood cells. It presents epitopes to killer T cells, also called cytotoxic T lymphocytes. A CTL expresses CD8 receptors, in addition to TCRs. When a CTL's CD8 receptor docks to a MHC class I molecule, if the CTL's TCR fits the epitope within the MHC class I molecule, the CTL triggers the cell to undergo programmed cell death by apoptosis. Thus, MHC class I helps mediate cellular immunity, a primary means to address intracellular pathogens, such as viruses and some bacteria, including bacterial L forms, bacterial genus Mycoplasma, and bacterial genus Rickettsia. In humans, MHC class I comprises HLA-A, HLA-B, and HLA-C molecules.
The first crystal structure of Class I MHC molecule, human HLA-A2, was published in 1989. The structure revealed that MHC-I molecules are heterodimers, they have polymorphic heavy α-subunit whose gene occurs inside the MHC locus and small invariant β2 microglobulin subunit whose gene is located usually outside of it. Polymorphic heavy chain of MHC-I molecule contains N-terminal extra-cellular region composed by three domains, α1, α2, and α3, transmembrane helix to hold MHC-I molecule on the cell surface and short cytoplasmic tail. Two domains, α1 and α2 form deep peptide-binding groove between two long α-helices and the floor of the groove formed by eight β-strands. Immunoglobulin-like domain α3 involved in the interaction with CD8 co-receptor. β2 microglobulin provides stability of the complex and participates in the recognition of peptide-MHC class I complex by CD8 co-receptor. The peptide is non-covalently bound to MHC-I, it is held by the several pockets on the floor of the peptide-binding groove. Amino acid side-chains that are most polymorphic in human alleles fill up the central and widest portion of the binding groove, while conserved side-chains are clustered at the narrower ends of the groove.
Classical MHC molecules present epitopes to the TCRs of CD8+ T lymphocytes. Nonclassical molecules exhibit limited polymorphism, expression patterns, and presented antigens; this group is subdivided into a group encoded within MHC loci, as well as those not ; the antigen/ligand for many of these molecules remain unknown, but they can interact with each of CD8+ T cells, NKT cells, and NK cells.

MHC Class II

can be conditionally expressed by all cell types, but normally occurs only on "professional" antigen-presenting cells : macrophages, B cells, and especially dendritic cells. An APC takes up an antigenic protein, performs antigen processing, and returns a molecular fraction of it—a fraction termed the epitope—and displays it on the APC's surface coupled within an MHC class II molecule. On the cell's surface, the epitope can be recognized by immunologic structures like T cell receptors. The molecular region which binds to the epitope is the paratope.
On surfaces of helper T cells are CD4 receptors, as well as TCRs. When a naive helper T cell's CD4 molecule docks to an APC's MHC class II molecule, its TCR can meet and bind the epitope coupled within the MHC class II. This event primes the naive T cell. According to the local milieu, that is, the balance of cytokines secreted by APCs in the microenvironment, the naive helper T cell polarizes into either a memory Th cell or an effector Th cell of phenotype either type 1, type 2, type 17, or regulatory/suppressor, as so far identified, the Th cell's terminal differentiation.
MHC class II thus mediates immunization to—or, if APCs polarize Th0 cells principally to Treg cells, immune tolerance of—an antigen. The polarization during primary exposure to an antigen is key in determining a number of chronic diseases, such as inflammatory bowel diseases and asthma, by skewing the immune response that memory Th cells coordinate when their memory recall is triggered upon secondary exposure to similar antigens. B cells express MHC class II to present antigens to Th0, but when their B cell receptors bind matching epitopes, interactions which are not mediated by MHC, these activated B cells secrete soluble immunoglobulins: antibody molecules mediating humoral immunity.
Class II MHC molecules are also heterodimers, genes for both α and β subunits are polymorphic and located within MHC class II subregion. Peptide-binding groove of MHC-II molecules is forms by N-terminal domains of both subunits of the heterodimer, α1 and β1, unlike MHC-I molecules, where two domains of the same chain are involved. In addition, both subunits of MHC-II contain transmembrane helix and immunoglobulin domains α2 or β2 that can be recognized by CD4 co-receptors. In this way MHC molecules chaperone which type of lymphocytes may bind to the given antigen with high affinity, since different lymphocytes express different T-Cell Receptor co-receptors.
MHC class II molecules in humans have five to six
isotypes. Classic molecules present peptides to CD4+ lymphocytes. Nonclassic molecules, accessories, with intracellular functions, are not exposed on cell membranes, but in internal membranes in lysosomes, normally loading the antigenic peptides onto classic MHC class II molecules.
Sr.NoFeatureClass I MHCClass II MHC
1Constituting polypeptide chainsα chain
β2 chain
α chain
β chain
2Antigen binding domainα1and α2 domainsα1 and β1 domains
3Binds protein antigens of8-10 amino acids residues13-18 amino acids residues
4Peptide bending cleftFloor formed by β sheets and sides by a
helices, blocked at both the ends
Floor formed by β sheets and sides by a
helices, opened at both the ends
5Antigenic peptide motifs
involved in binding
Anchor residues located at amino and
carbon terminal ends
Anchor residues located almost uniformly
along the peptide
6Presents antigenic peptide toCD8+ T cellsCD4+ T cells

Class III

Class III molecules have physiologic roles unlike classes I and II, but are encoded between them in the short arm of human chromosome 6. Class III molecules include several secreted proteins with immune functions: components of the complement system, cytokines, and heat shock proteins.

Function

MHC is the tissue-antigen that allows the immune system to bind to, recognize, and tolerate itself. MHC is also the chaperone for intracellular peptides that are complexed with MHCs and presented to T cell receptors as potential foreign antigens. MHC interacts with TCR and its co-receptors to optimize binding conditions for the TCR-antigen interaction, in terms of antigen binding affinity and specificity, and signal transduction effectiveness.
Essentially, the MHC-peptide complex is a complex of auto-antigen/allo-antigen. Upon binding, T cells should in principle tolerate the auto-antigen, but activate when exposed to the allo-antigen. Disease states occur when this principle is disrupted.
Antigen presentation: MHC molecules bind to both T cell receptor and CD4/CD8 co-receptors on T lymphocytes, and the antigen epitope held in the peptide-binding groove of the MHC molecule interacts with the variable Ig-Like domain of the TCR to trigger T-cell activation
Autoimmune reaction: Having some MHC molecules increases the risk of autoimmune diseases more than having others. HLA-B27 is an example. It is unclear how exactly having the HLA-B27 tissue type increases the risk of ankylosing spondylitis and other associated inflammatory diseases, but mechanisms involving aberrant antigen presentation or T cell activation have been hypothesized.
Tissue allorecognition: MHC molecules in complex with peptide epitopes are essentially ligands for TCRs. T cells become activated by binding to the peptide-binding grooves of any MHC molecule that they were not trained to recognize during positive selection in the thymus.

Antigen processing and presentation

Peptides are processed and presented by two classical pathways:
CharacteristicMHC-I pathwayMHC-II pathway
Composition of the stable peptide-MHC complexPolymorphic chain α and β2 microglobulin, peptide bound to α chainPolymorphic chains α and β, peptide binds to both
Types of antigen-presenting cells All nucleated cellsDendritic cells, mononuclear phagocytes, B lymphocytes, some endothelial cells, epithelium of thymus
T lymphocytes able to respondCytotoxic T lymphocytes Helper T lymphocytes
Origin of antigenic proteinscytosolic proteins Proteins present in endosomes or lysosomes
Enzymes responsible for peptide generationCytosolic proteasomeProteases from endosomes and lysosomes
Location of loading the peptide on the MHC moleculeEndoplasmic reticulumSpecialized vesicular compartment
Molecules implicated in transporting the peptides and loading them on the MHC moleculesTAP DM, invariant chain

T lymphocyte recognition restrictions

In their development in the thymus, T lymphocytes are selected to recognize MHC molecules of the host, but not recognize other self antigens. Following selection, each T lymphocyte shows dual specificity: The TCR recognizes self MHC, but only non-self antigens.
MHC restriction occurs during lymphocyte development in the thymus through a process known as positive selection. T cells that do not receive a positive survival signal — mediated mainly by thymic epithelial cells presenting self peptides bound to MHC molecules — to their TCR undergo apoptosis. Positive selection ensures that mature T cells can functionally recognize MHC molecules in the periphery.
The TCRs of T lymphocytes recognise only sequential epitopes, also called linear epitopes, of only peptides and only if coupled within an MHC molecule.

In sexual mate selection

MHC molecules enable immune system surveillance of the population of protein molecules in a host cell, and greater MHC diversity permits greater diversity of antigen presentation. In 1976, Yamazaki et al demonstrated a sexual selection mate choice by male mice for females of a different MHC. Similar results have been obtained with fish. Some data find lower rates of early pregnancy loss in human couples of dissimilar MHC genes.
MHC may be related to mate choice in some human populations, a theory that found support by studies by Ober and colleagues in 1997, as well as by Chaix and colleagues in 2008. However, the latter findings have been controversial. If it exists, the phenomenon might be mediated by olfaction, as MHC phenotype appears strongly involved in the strength and pleasantness of perceived odour of compounds from sweat. Fatty acid esters—such as methyl undecanoate, methyl decanoate, methyl nonanoate, methyl octanoate, and methyl hexanoate—show strong connection to MHC.
In 1995, Claus Wedekind found that in a group of female college students who smelled T-shirts worn by male students for two nights, by far most women chose shirts worn by men of dissimilar MHCs, a preference reversed if the women were on oral contraceptives. Results of a 2002 experiment likewise suggest HLA-associated odors influence odor preference and may mediate social cues. In 2005 in a group of 58 subjects, women were more indecisive when presented with MHCs like their own, although with oral contraceptives, the women showed no particular preference. No studies show the extent to which odor preference determines mate selection.

Evolutionary diversity

Most mammals have MHC variants similar to those of humans, who bear great allelic diversity, especially among the nine classical genes—seemingly due largely to gene duplication—though human MHC regions have many pseudogenes. The most diverse loci, namely HLA-A, HLA-B, and HLA-C, have roughly 6000, 7200, and 5800 known alleles, respectively. Many HLA alleles are ancient, sometimes of closer homology to a chimpanzee MHC alleles than to some other human alleles of the same gene.
MHC allelic diversity has challenged evolutionary biologists for explanation. Most posit balancing selection, which is any natural selection process whereby no single allele is absolutely most fit, such as frequency-dependent selection and heterozygote advantage. Pathogenic coevolution, as a type of balancing selection, posits that common alleles are under greatest pathogenic pressure, driving positive selection of uncommon alleles—moving targets, so to say, for pathogens. As pathogenic pressure on the previously common alleles decreases, their frequency in the population stabilizes, and remain circulating in a large population. Genetic drift is also a major driving force in some species. It is possible that the combined effects of some or all of these factors cause the genetic diversity.
MHC diversity has also been suggested as a possible indicator for conservation, because large, stable populations tend to display greater MHC diversity, than smaller, isolated populations. Small, fragmented populations that have experienced a population bottleneck typically have lower MHC diversity. For example, relatively low MHC diversity has been observed in the cheetah, Eurasian beaver, and giant panda. In 2007 low MHC diversity was attributed a role in disease susceptibility in the Tasmanian devil, native to the isolated island of Tasmania, such that an antigen of a transmissible tumor, involved in devil facial tumour disease, appears to be recognized as a self antigen. To offset inbreeding, efforts to sustain genetic diversity in populations of endangered species and of captive animals have been suggested.
The MHC locus has 3 other paralogous loci in the human genome, namely 19pl3.1, 9q33-q34, and 1q21-q25. It is believed that the loci arouse from the two-round duplications in vertebrates of a single ProtoMHC locus, and the new domain organizations of the MHC genes were a result of later cis-duplication and exon shuffling in a process termed "the MHC Big Bang." Genes in this locus are apparently linked to intracellular intrinsic immunity in the basal Metazoan Trichoplax adhaerens.

In transplant rejection

In a transplant procedure, as of an organ or stem cells, MHC molecules act themselves as antigens and can provoke immune response in the recipient, thus causing transplant rejection. MHC molecules were identified and named after their role in transplant rejection between mice of different strains, though it took over 20 years to clarify MHC's role in presenting peptide antigens to cytotoxic T lymphocytes.
Each human cell expresses six MHC class I alleles and six to eight MHC class II alleles. The MHC variation in the human population is high, at least 350 alleles for HLA-A genes, 620 alleles for HLA-B, 400 alleles for DR, and 90 alleles for DQ. Any two individuals who are not identical twins will express differing MHC molecules. All MHC molecules can mediate transplant rejection, but HLA-C and HLA-DP, showing low polymorphism, seem least important.
When maturing in the thymus, T lymphocytes are selected for their TCR incapacity to recognize self antigens, yet T lymphocytes can react against the donor MHC's peptide-binding groove, the variable region of MHC holding the presented antigen's epitope for recognition by TCR, the matching paratope. T lymphocytes of the recipient take the incompatible peptide-binding groove as nonself antigen. The T lymphocytes' recognition of the foreign MHC as self is allorecognition.
Transplant rejection has various types known to be mediated by MHC :
In all of the above situations, immunity is directed at the transplanted organ, sustaining lesions. A cross-reaction test between potential donor cells and recipient serum seeks to detect presence of preformed anti-HLA antibodies in the potential recipient that recognize donor HLA molecules, so as to prevent hyperacute rejection. In normal circumstances, compatibility between HLA-A, -B, and -DR molecules is assessed. The higher the number of incompatibilities, the lower the five-year survival rate. Global databases of donor information enhance the search for compatible donors.

HLA biology

Human MHC class I and II are also called human leukocyte antigen. To clarify the usage, some of the biomedical literature uses HLA to refer specifically to the HLA protein molecules and reserves MHC for the region of the genome that encodes for this molecule, but this is not a consistent convention.
The most studied HLA genes are the nine classical MHC genes: HLA-A, HLA-B, HLA-C, HLA-DPA1, HLA-DPB1, HLA-DQA1, HLA-DQB1, HLA-DRA, and HLA-DRB1. In humans, the MHC gene cluster is divided into three regions: classes I, II, and III. The A, B and C genes belong to MHC class I, whereas the six D genes belong to class II.
MHC alleles are expressed in codominant fashion. This means the alleles inherited from both parents are expressed equally:
The set of alleles that is present in each chromosome is called the MHC haplotype. In humans, each HLA allele is named with a number. For instance, for a given individual, his haplotype might be HLA-A2, HLA-B5, HLA-DR3, etc... Each heterozygous individual will have two MHC haplotypes, one each from the paternal and maternal chromosomes.
The MHC genes are highly polymorphic; many different alleles exist in the different individuals inside a population. The polymorphism is so high, in a mixed population, no two individuals have exactly the same set of MHC molecules, with the exception of identical twins.
The polymorphic regions in each allele are located in the region for peptide contact. Of all the peptides that could be displayed by MHC, only a subset will bind strongly enough to any given HLA allele, so by carrying two alleles for each gene, a much larger set of peptides can be presented.
On the other hand, inside a population, the presence of many different alleles ensures there will always be an individual with a specific MHC molecule able to load the correct peptide to recognize a specific microbe. The evolution of the MHC polymorphism ensures that a population will not succumb to a new pathogen or a mutated one, because at least some individuals will be able to develop an adequate immune response to win over the pathogen. The variations in the MHC molecules are the result of the inheritance of different MHC molecules, and they are not induced by recombination, as it is the case for the antigen receptors.
Because of the high levels of allelic diversity found within its genes, MHC has also attracted the attention of many evolutionary biologists.