SUPERFAMILY 1.73 HMM library and genome assignments server


E set domains superfamily

SCOP classification
Root:   SCOP hierarchy in SUPERFAMILY [ 0] (11)
Class:   All beta proteins [ 48724] (165)
Fold:   Immunoglobulin-like beta-sandwich [ 48725] (27)
  sandwich; 7 strands in 2 sheets; greek-key
some members of the fold have additional strands
Superfamily:   E set domains [ 81296] (20)
Families:   NF-kappa-B/REL/DORSAL transcription factors, C-terminal domain [ 81279] (7)
  subgroup of the larger IPT/TIG domain family
  E-set domains of sugar-utilizing enzymes [ 81282] (19)
  domains of unknown function associated with different type of catalytic domains in a different sequential location
subgroup of the larger IPT/TIG domain family
  Other IPT/TIG domains [ 89191]
  apart from the domains of transcription factors and sugar-utilizing enzymes
  Arthropod hemocyanin, C-terminal domain [ 81283]
  Class II viral fusion proteins C-terminal domain [ 81284] (2)
  Cytomegalovirus protein US2 [ 81285]
  Molybdenum-containing oxidoreductases-like dimerisation domain [ 81286]
  ML domain [ 81287] (2)
  implicated in lipid recognition, particularly in the recognition of pathogen related products
  RhoGDI-like [ 81288] (2)
  Cytoplasmic domain of inward rectifier potassium channel [ 81966] (3)
  Transglutaminase N-terminal domain [ 81289]
  Filamin repeat (rod domain) [ 81290] (4)
  Pfam 00630
  Arrestin [ 81291]
  Gingipain R (RgpB), C-terminal domain [ 81292]
  Copper resistance protein C (CopC, PcoC) [ 81969]
  Cellulosomal scaffoldin protein CipC, module x2.1 [ 81293]
  Quinohemoprotein amine dehydrogenase A chain, domains 4 and 5 [ 81294]
  Internalin Ig-like domain [ 81295] (3)
  truncated fold fused to an LRR domain
  SoxZ-like [ 141027]
  Enterochelin esterase N-terminal domain-like [ 141030]
  PfamB 013071


Superfamily statistics
Genomes (978) UniProt 15.0 PDB chains (SCOP 1.73)
Domains 24,905 20,882 217
Proteins 13,558 16,645 208


Functional annotation
General category Other
Detailed category Unknown function

Function annotation of SCOP domain superfamilies
InterPro annotation
Cross references IPR014756 SSF81296 Protein matches
Abstract

The immunoglobulin (Ig) like fold, which consists of a beta-sandwich of seven or more strands in two sheets with a greek-key topology, is one of the most common protein modules found in animals. Many different unrelated proteins share an Ig-like fold, which is often involved in interactions, commonly with other Ig-like domains via their beta-sheets [PubMed7932691]. Of these, the "early" set (E set) domains are possibly related to the immunoglobulin and/or fibronectin type III Ig-like protein superfamilies. Ig-like E set domains include:

  • C-terminal domain of certain transcription factors, such as the pro-inflammatory transcription factor NF-kappaB, and the T-cell transcription factors NFAT1 and NFAT5 [PubMed15380510].
  • Ig-like domains of sugar-utilising enzymes, such as galactose oxidase (C-terminal domain), sialidase (linker domain), and maltogenic amylase (N-terminal domain).
  • C-terminal domain of arthropod haemocyanin, where many loops are inserted into the fold. These proteins act as dioxygen-transporting proteins.
  • C-terminal domain of class II viral fusion proteins. These envelope glycoproteins are responsible for membrane fusion with target cells during viral invasion.
  • Cytomegaloviral US (unique short) proteins. These type I membrane proteins help suppress the host immune response by modulating surface expression of MHC class I molecules [PubMed14671122].
  • Molybdenium-containing oxidoreductase-like dimerisation domain found in enzymes such as sulphite reductase.
  • ML domains found in cholesterol-binding epididymal secretory protein E1, and in a major house-dust mite allergen; ML domains are implicated in lipid recognition, particularly the recognition of pathogen-related products.
  • Rho-GDI-like signalling proteins, which regulate the activity of small G proteins [PubMed15513926].
  • Cytoplasmic domain of inward rectifier potassium channels such as Girk1 and Kirbac1.1. These channels act as regulators of excitability in eukaryotic cells.
  • N-terminal domain of transglutaminases, including coagulation factor XIII; many loops are inserted into the fold in these proteins. These proteins act to catalyse the cross-linking of various protein substrates [PubMed15290350].
  • Filamin repeat rod domain found in proteins such as the F-actin cross-linking gelation factor ABP-120. These proteins interact with a variety of cellular proteins, acting as signalling scaffolds [PubMed15516996].
  • Arrestin family of proteins, which contain a tandem repeat of two elaborated Ig-like domains contacting each other head-to-head. These proteins are key to the redirection of GPCR signals to alternative pathways [PubMed15102497].
  • C-terminal domain of arginine-specific cysteine proteases, such as Gingipain-R, which act as major virulence factors of Porphyromonas gingivalis.
  • Copper-resistance proteins, such as CopC, which act as copper-trafficking proteins [PubMed12651950].
  • Cellulosomal scaffoldin proteins, such as CipC module x2.1. These proteins act as scaffolding proteins of cellulosomes, which contain cellulose-degrading enzymes [PubMed14756796].
  • Quinohaemoprotein amine dehydrogenases (A chain), which contain a tandem repeat of two Ig-like domains. These proteins function in electron transfer reactions.
  • Internalin Ig-like domains, which are truncated and fused to a leucine-rich repeat domain. These proteins are required for host cell invasion of Listeria species.


InterPro database

PDBeMotif information about ligands, sequence and structure motifs
Cross references PDB entries
Ligand binding statistics
Nucleic-acid binding statistics
Occurrence of secondary structure elements
Occurrence of small 3D structural motifs

PDBeMotif resource

Jump to [ Top of page · SCOP classification · InterPro annotation · PDBeMotif links · Functional annotation ]

Internal database links

Browse genome assignments for this superfamily. The SUPERFAMILY hidden Markov model library has been used to carry out SCOP domain assignments to all genomes at the superfamily level.


Alignments of sequences to 99 models in this superfamily are available by clicking on the 'Alignments' icon above. PDB sequences less than 40% identical are shown by default, but any other sequence(s) may be aligned. Select PDB sequences, genome sequences, or paste in or upload your own sequences.


Browse and view proteins in genomes which have different domain combinations including a E set domains domain.


Examine the distribution of domain superfamilies, or families, across the major taxonomic kingdoms or genomes within a kingdom. This gives an immediate impression of how superfamilies, or families, are restricted to certain kingdoms of life.


Explore domain occurrence network where nodes represent genomes and edges are domain architectures (shared between genomes) containing the superfamily of interest.

There are 99 hidden Markov models representing the E set domains superfamily. Information on how the models are built, and plots showing hydrophobicity, match emmission probabilities and insertion/deletion probabilities can be inspected.


Jump to [ Top of page · SCOP classification · InterPro annotation · PDBeMotif links · Functional annotation · Internal database links ]