C3orf62


Chromosome 3 Open Reading Frame 62, is a protein that in humans is encoded by the C3orf62 gene. C3orf62 is a glycine depleted protein relative to the amount of glycine in proteins in the rest of the genome. C3orf62 has a KKXX-like motif and is predicted to be localized in the nucleus. Expression of C3orf62 remains highest in whole blood.

Gene

Locus

C3orf62 is mapped to the reverse strand of chromosome 3 at 3p21.31 and spans 9.313 bases. C3orf62 starts at 49,268,597 base pairs from the terminus of the short arm and ending at 49,277,909 base pairs pter. This gene is known to have 3 exons, 4 transcripts, and 37 orthologues.

Gene neighborhood

C3orf62 is flanked by Ubiquitin Specific Protease 4 and Coil-Coiled Domain Containing 36.

Aliases

C3orf62 possesses the following alternate names and synonyms: CC062; FLJ43654.

Protein

Primary sequence

C3orf62 human protein is 267 amino acids long, and has a molecular mass of 30,194 Daltons. The isoelectric point of C3orf62 is roughly 5.2.
The unmodified C3orf62 protein is a “glycine depleted protein” relative to amounts of glycine in proteins in the rest of the genome. It appears that glycine is evenly distributed throughout the C3orf62 sequence with no preference of areas to cluster in. Before post-translational modifications, C3orf62 is an acidic protein. No charge clusters are present in C3orf62, and no specific spacing of cysteine is found. The isoelectric point of C3orf62 is 5.211000.
NameEnsembl Transcript IDBase PairsProteinBiotypeCCDSUniprotRefseq
C3orf62-001ENST00000343010.74235267aaProtein encodingCCDS2792Q6ZUJ4NM_198562, NP_940964-
C3orf62-004ENST00000436325.1581190aaProtein encoding-C9JW57--
C3orf62-003ENST00000424960.160298aaNonsense mediated decay-H7BZX3--
C3orf62-002ENST00000479673.13330No proteinRetained intron----

Domains and motifs

There are no known transmembrane domains for C3orf62. C3orf62 has a KKXX-like motif in the C-terminus meaning C3orf62 may be responsible for retrieval of endoplasmic reticulum membrane proteins from the Golgi apparatus.

Secondary structure

Roughly 7 alpha helices are predicted for C3orf62 through Pele Protein Structure Protein Prediction and strengthened through orthologous secondary structure predictions by Ali2D.

Subcellular localization

C3orf62 is predicted to be localized in the nucleus. The k-nearest neighbors algorithm predicts C3orf62 to be classified as follows: k=9/23; 69.6% nuclear, 13.0% mitochondrial, 13.0% cytoskeletal, 4.3% cytoplasmic.

Expression

C3orf62 is expressed in more than 30 different tissues; highest expression is in whole blood. Specifically, highest expression of C3orf62 is in the following tissues: lung, tonsil, trachea, small intestine, mammary gland, and salivary gland. Through analysis of various microarray studies, C3orf62 is found to have consistently high expression compared to other genes tested in the datasets. C3orf62 has low expression in brain tissues.

Post-transcriptional modifications

C3orf62 possess two post-translational modifications, both are phosphorylation sites with locations at amino acid 210 and 224. A natural variant is found at amino acid 110.
It appears as though C3orf62 may have a YinOYang site at residue 115, meaning that this Threonine residue is predicted to be O-GlycNAcylated as well as phosphorylated. This site may be reversibly and dynamically modified by O-GlcNAc or Phosphate groups at different times in the cell.

Regulation of expression

Thirteen promoters have been predicted for C3orf62.

Transcript variants

Transcription of C3orf62 produces 5 alternatively spliced variants and 1 unspliced form. Of the four splice variants, two of them are protein coding, one is nonsense meditated decay, and one is a retained intron. QIAGEN denotes the following as transcription factor binding sites in the C3orf62 promoter: TFCP2, Pax-6, p53, MyoD, YY1, Ik-2, AREB6, IRF-7A3.

Function

Function of C3orf62 is not currently understood by the scientific community.

Interactions

Upwards of 12 interacting proteins have been predicted for C3orf62. Interacting proteins with the strongest confidence to interact with C3orf62 include: HAUS augmin-like complex subunit 1, Inhibitor of growth protein 5, Thioredoxin domain-containing protein 9, and MORF4-family associated proteins.
Chemicals known to interact with C3orf62 include the following: Aflatoxin B1, Hydralazine, Valproic acid, and Decitabine.

Clinical significance

Interstitial deletions of chromosome 3 are rare, and only a few patients with a microdeletion of 3p21.31 have been reported to date. Characteristic clinical features found in patients with a microdeletion of 3p21.31 include developmental delay and distinctive facial features.
In the gene region, NCBI SNP identified 1,326 SNPS on the reverse minus strand of C3orf62. In the coding region, NCBI SNP identified 147 common SNPs.

Homology

Paralogs

There are no known paralogs of C3orf62.

Orthologs

The ortholog space of C3orf62 is fairly narrow, with the majority of orthologs found in mammals. A small fraction of orthologs have also been found in the following classes: Reptila, Sarcopterygii, and Actinoptergii.
The groupings of nearly all Mammalia ortholog sequences of C3orf62 are as follows: E-value: 2e-94 to 1e-169; similarity 56-84%. Mammals in this group consist largely of primates but also include the following orders: Perissodactyla, Rodentia, Carnivora, Proboscidea, Cetartiodactyla, Cingulata, Artiodactyla, Eulipotyphla, Diselphimorphia, and Afrosoricida.
More distantly related ortholog sequences of C3orf62 include organisms from classes Reptilia, Sarcopterygii, and Actinopterygii ranging from an E-value of 8e-10 to 3e-59 with similarity of 24-39%. Organisms in this grouping consist of Testudines, Coelacanthiformes, Squamata, and Osteoglossiformes orders.
No ortholog sequences of C3orf62 were found for the following life forms: Bacteria, archaea, protist, plant, fungus, trichoplax, invertebrate, amphibian, or bird.
Genus and SpeciesCommon NameClassAccessionPercent Identity
Homo sapiensHumanMammaliaNP_940964100
Microcebus murinusGrey Mouse LemurMammaliaXP_01262671888
Propithecus coquereliCoquerel's sifaka MammaliaXP_01251088086.9
Equus caballusHorseMammaliaNP_00129587784.3
Loxodonta AfricanaAfrican elephantMammaliaXP_00340971183.2
Castor CanadensisNorth American BeaverMammaliaXP_02003731681.6
Otolemur garnettiiGarnett's Greater GalagoMammaliaXP_00380063381.6
Camelus bactrianusBactrian camelMammaliaXP_010967491.178.3
Ailuropoda melanoleucaGiant PandaMammaliaXP_01965662677.7
Canis lupus familiarisDogMammaliaXP_00343292477.2
Vicugna pacosAlpacaMammaliaXP_00619635677.2
Condylura cristataStar-nosed moleMammaliaXP_01257576076.8
Felis catusCatMammaliaXP_00398226975.1
Pteropus vampyrusLarge flying foxMammaliaXP_01137372073.3
Pantholops hodgsoniiTibetan antelopeMammaliaXP_00596931872.6
Ictidomys tridecemlineatusThirteen lines ground squirrelMammaliaXP_00532696771
Sorex araneusCommon ShrewMammaliaXP_01278968269.5
Monodelphis domesticaGray short-tailed opossumMammaliaXP_00136790765.4
Echinops telfairiLesser Hedgehog TenrecMammaliaXP_00471528363.7
Orcinus orcaKiller whaleMammaliaXP_00428398561.2
Dasypus novemcinctusNine banded armadilloMammaliaXP_00445195058.2
Dipodomys ordiiOrd's Kangaroo RatMammaliaXP_01288351156.3
Myotis lucifugusLittle Brown MyotisMammaliaXP_00610703339.3
Pelodiscus sinensisChinese softshell turtleReptilliaXP_01442623538.5
Chelonia mydasGreen Sea TurtleReptilliaXP_00706183737.1
Latimeria chalumnaeWest Indian Ocean coelacanth SarcopterygiiXP_00599274035.3
Anolis carolinensisGreen anole ReptilliaXP_00810322733.1
Gekko japonicusJapanese GeckoReptilliaXP_01526286130.1

Phylogeny

The most distant ortholog of C3orf62 are species of fish and amphibians. Orthologs of C3orf62 are not seen in birds, invertebrates, or bacteria.