SUPERFAMILY 1.75 HMM library and genome assignments server



Thyroglobulin type-1 domain superfamily

SCOP classification
Root: SCOP hierarchy in SUPERFAMILY [0] (11)
Class: Small proteins [56992] (90)
Fold: Thyroglobulin type-1 domain [57609]
Superfamily: Thyroglobulin type-1 domain [57610]
Families: Thyroglobulin type-1 domain [57611] (4)


Superfamily statistics
            Genomes (149)    Uniprot 2017_06 genome    PDB chains (SCOP 1.75)
Domains     3,349            12,083                    5
Proteins    1,846            5,234                     5
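
The two rows above can be combined into an average number of domains from this superfamily per assigned protein. The following is a minimal sketch of that arithmetic. The dictionary and the function name `domains_per_protein` are illustrative and not part of the SUPERFAMILY server; the counts are copied from the table.

```python
# Counts copied from the statistics table: (domains, proteins) per source.
counts = {
    "Genomes (149)": (3349, 1846),
    "Uniprot 2017_06 genome": (12083, 5234),
    "PDB chains (SCOP 1.75)": (5, 5),
}


def domains_per_protein(domains: int, proteins: int) -> float:
    """Average number of superfamily domains per protein carrying at least one."""
    return domains / proteins


ratios = {source: domains_per_protein(d, p) for source, (d, p) in counts.items()}

for source, ratio in ratios.items():
    print(f"{source}: {ratio:.2f} domains per protein")
```

A ratio above 1 indicates that some proteins carry the domain more than once, which is consistent with the repeated Tg type-1 domains described in the InterPro abstract.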


Functional annotation
General category Processes_IC
Detailed category Proteases

Document: Function annotation of SCOP domain superfamilies

Gene Ontology (high-coverage)

GO term FDR (all) SDFO level Annotation (direct or inherited)
Biological Process (BP) response to stimulus 0 Least Informative Direct
Biological Process (BP) multicellular organismal process 0.06832 Least Informative Inherited
Biological Process (BP) cellular component organization or biogenesis 1 Least Informative Inherited
Biological Process (BP) single-organism cellular process 1 Least Informative Inherited
Biological Process (BP) biological regulation 0.002109 Least Informative Inherited
Biological Process (BP) response to endogenous stimulus 0 Moderately Informative Direct
Biological Process (BP) response to organic substance 0 Moderately Informative Direct
Biological Process (BP) response to oxygen-containing compound 0 Moderately Informative Direct
Biological Process (BP) regulation of signaling 0.0001425 Moderately Informative Direct
Biological Process (BP) regulation of cell communication 0.000361 Moderately Informative Direct
Biological Process (BP) regulation of developmental process 0.0008021 Moderately Informative Direct
Biological Process (BP) regulation of cellular component organization 0.04375 Moderately Informative Inherited
Biological Process (BP) regulation of multicellular organismal process 0.2103 Moderately Informative Inherited
Biological Process (BP) regulation of molecular function 0.2225 Moderately Informative Inherited
Biological Process (BP) regulation of protein metabolic process 0.008964 Moderately Informative Inherited
Biological Process (BP) positive regulation of cellular process 1 Moderately Informative Inherited
Biological Process (BP) animal organ development 0.04347 Moderately Informative Inherited
Biological Process (BP) regulation of response to stimulus 0.01256 Moderately Informative Inherited
Biological Process (BP) regulation of cellular biosynthetic process 1 Moderately Informative Inherited
Biological Process (BP) regulation of gene expression 1 Moderately Informative Inherited
Biological Process (BP) regulation of macromolecule biosynthetic process 1 Moderately Informative Inherited
Biological Process (BP) regulation of nucleobase-containing compound metabolic process 1 Moderately Informative Inherited
Biological Process (BP) regulation of localization 0.1432 Moderately Informative Inherited
Biological Process (BP) negative regulation of metabolic process 0.6025 Moderately Informative Inherited
Biological Process (BP) negative regulation of cellular process 0.004077 Moderately Informative Inherited
Biological Process (BP) tissue development 0.09081 Moderately Informative Inherited
Biological Process (BP) anatomical structure morphogenesis 1 Moderately Informative Inherited
Biological Process (BP) cell differentiation 0.6742 Moderately Informative Inherited
Biological Process (BP) response to nitrogen compound 0 Informative Direct
Biological Process (BP) negative regulation of cellular component organization 0.00000007455 Informative Direct
Biological Process (BP) regulation of cell differentiation 0.000002174 Informative Direct
Biological Process (BP) regulation of cell proliferation 0.000004502 Informative Direct
Biological Process (BP) negative regulation of multicellular organismal process 0.000004574 Informative Direct
Biological Process (BP) regulation of cellular component movement 0.000004994 Informative Direct
Biological Process (BP) negative regulation of cellular protein metabolic process 0.000007338 Informative Direct
Biological Process (BP) regulation of locomotion 0.00001109 Informative Direct
Biological Process (BP) negative regulation of molecular function 0.00003303 Informative Direct
Biological Process (BP) regulation of proteolysis 0.00008928 Informative Direct
Biological Process (BP) regulation of anatomical structure morphogenesis 0.9917 Informative Inherited
Biological Process (BP) regulation of cellular component biogenesis 0.05944 Informative Inherited
Biological Process (BP) regulation of hydrolase activity 0.002805 Informative Inherited
Biological Process (BP) regulation of organelle organization 0.8088 Informative Inherited
Biological Process (BP) positive regulation of immune system process 0.002653 Informative Inherited
Biological Process (BP) regulation of nervous system development 0.2723 Informative Inherited
Biological Process (BP) generation of neurons 0.32 Informative Inherited
Biological Process (BP) positive regulation of developmental process 0.9286 Informative Inherited
Biological Process (BP) regulation of transcription from RNA polymerase II promoter 0.06359 Informative Inherited
Biological Process (BP) central nervous system development 0.2311 Informative Inherited
Biological Process (BP) epithelium development 0.3122 Informative Inherited
Biological Process (BP) localization of cell 0.002876 Informative Inherited
Biological Process (BP) movement of cell or subcellular component 0.06366 Informative Inherited
Biological Process (BP) locomotion 0.08524 Informative Inherited
Biological Process (BP) response to peptide hormone 0 Highly Informative Direct
Biological Process (BP) negative regulation of endopeptidase activity 0.000000000000001925 Highly Informative Direct
Biological Process (BP) negative regulation of cell development 0.00000000000235 Highly Informative Direct
Biological Process (BP) negative regulation of cytoskeleton organization 0.000000003339 Highly Informative Direct
Biological Process (BP) negative regulation of locomotion 0.00000004838 Highly Informative Direct
Biological Process (BP) negative regulation of cellular component movement 0.0000001219 Highly Informative Direct
Biological Process (BP) positive regulation of lymphocyte activation 0.0000002942 Highly Informative Direct
Biological Process (BP) regulation of cellular response to growth factor stimulus 0.0000007945 Highly Informative Direct
Biological Process (BP) regulation of epithelial cell migration 0.000003535 Highly Informative Direct
Biological Process (BP) positive regulation of cell adhesion 0.000009353 Highly Informative Direct
Biological Process (BP) extracellular matrix organization 0.000009858 Highly Informative Direct
Biological Process (BP) regulation of transmembrane receptor protein serine/threonine kinase signaling pathway 0.00008755 Highly Informative Direct
Biological Process (BP) positive regulation of leukocyte proliferation 0.0001784 Highly Informative Direct
Biological Process (BP) morphogenesis of an epithelium 0.0002946 Highly Informative Direct
Biological Process (BP) negative regulation of supramolecular fiber organization 0.0009352 Highly Informative Direct
Biological Process (BP) regulation of cell morphogenesis involved in differentiation 0.07264 Highly Informative Inherited
Biological Process (BP) regulation of T cell activation 0.0684 Highly Informative Inherited
Biological Process (BP) negative regulation of nervous system development 0.001903 Highly Informative Inherited
Biological Process (BP) regulation of lymphocyte proliferation 0.01088 Highly Informative Inherited
Biological Process (BP) biological adhesion 0.001447 Highly Informative Inherited
Molecular Function (MF) binding 0.08681 Least Informative Inherited
Molecular Function (MF) molecular function regulator 0.000003811 Moderately Informative Direct
Molecular Function (MF) carbohydrate derivative binding 0.0698 Moderately Informative Inherited
Molecular Function (MF) metal ion binding 0.003129 Moderately Informative Inherited
Molecular Function (MF) peptidase regulator activity 0.00001176 Informative Direct
Molecular Function (MF) enzyme inhibitor activity 0.0009228 Informative Direct
Molecular Function (MF) sulfur compound binding 0.0009645 Informative Direct
Molecular Function (MF) receptor binding 0.02376 Informative Inherited
Molecular Function (MF) enzyme activator activity 1 Informative Inherited
Molecular Function (MF) calcium ion binding 0.0000001112 Highly Informative Direct
Molecular Function (MF) cysteine-type endopeptidase inhibitor activity 0.0002757 Highly Informative Direct
Molecular Function (MF) phosphatase regulator activity 0.001543 Highly Informative Inherited
Cellular Component (CC) protein complex 0.9661 Least Informative Inherited
Cellular Component (CC) membrane 0.8171 Least Informative Inherited
Cellular Component (CC) extracellular region part 0 Moderately Informative Direct
Cellular Component (CC) cell projection 0.1954 Moderately Informative Inherited
Cellular Component (CC) plasma membrane 0.02185 Moderately Informative Inherited
Cellular Component (CC) plasma membrane region 0.00000009267 Informative Direct
Cellular Component (CC) cell surface 0.000002164 Informative Direct
Cellular Component (CC) extracellular space 0.0008509 Informative Direct
Cellular Component (CC) plasma membrane bounded cell projection part 0.003109 Informative Inherited
Cellular Component (CC) neuron projection 0.01033 Informative Inherited
Cellular Component (CC) sarcoplasm 0.00000000000005657 Highly Informative Direct
Cellular Component (CC) main axon 0.00000000000007126 Highly Informative Direct
Cellular Component (CC) basal part of cell 0.00000000000102 Highly Informative Direct
Cellular Component (CC) basolateral plasma membrane 0.000000003551 Highly Informative Direct
Cellular Component (CC) cell-cell junction 0.02797 Highly Informative Inherited
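
Rows in the table above follow a regular pattern: namespace, GO term, FDR, SDFO level, and annotation type. The following is a minimal sketch of how such rows could be parsed and filtered, assuming the rows are available as plain text lines in exactly this layout. The names `GoRow`, `parse_go_row` and `select_rows` are hypothetical, and the 0.001 cutoff is an illustrative choice, not a threshold documented by SUPERFAMILY.

```python
import re
from typing import Iterable, List, NamedTuple, Optional

# One table row: namespace, (code), GO term, FDR, SDFO level, annotation type.
ROW = re.compile(
    r"^(?P<namespace>Biological Process|Molecular Function|Cellular Component) "
    r"\((?P<code>BP|MF|CC)\) "
    r"(?P<term>.+?) "
    r"(?P<fdr>\d+(?:\.\d+)?) "
    r"(?P<level>Least Informative|Moderately Informative|Highly Informative|Informative) "
    r"(?P<source>Direct|Inherited)$"
)


class GoRow(NamedTuple):
    code: str
    term: str
    fdr: float
    level: str
    source: str


def parse_go_row(line: str) -> Optional[GoRow]:
    """Parse one row; return None for headers or lines in another layout."""
    m = ROW.match(line.strip())
    if m is None:
        return None
    return GoRow(m["code"], m["term"], float(m["fdr"]), m["level"], m["source"])


def select_rows(lines: Iterable[str], max_fdr: float = 0.001,
                level: str = "Highly Informative") -> List[GoRow]:
    """Keep rows at the given SDFO level whose FDR does not exceed max_fdr."""
    parsed = (parse_go_row(line) for line in lines)
    return [r for r in parsed if r is not None and r.fdr <= max_fdr and r.level == level]


# Sample rows copied from the table above.
sample = [
    "Biological Process (BP) response to stimulus 0 Least Informative Direct",
    "Biological Process (BP) negative regulation of endopeptidase activity 0.000000000000001925 Highly Informative Direct",
    "Molecular Function (MF) calcium ion binding 0.0000001112 Highly Informative Direct",
    "Cellular Component (CC) cell-cell junction 0.02797 Highly Informative Inherited",
]

for row in select_rows(sample):
    print(row.code, row.term, row.fdr)
```

The level alternatives in the pattern list the qualified forms before the bare "Informative", and the term is matched lazily so that it stops at the first number followed by a recognised level.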

Document: GO annotation of SCOP domains

UniProtKB KeyWords (KW)

KW term FDR (all) SDKW level Annotation (direct or inherited)
Biological process Cell adhesion 0.000002519 Informative Direct
Biological process Growth regulation 0 Highly Informative Direct
Cellular component Secreted 0 Moderately Informative Direct
Cellular component Cell junction 0.0000242 Moderately Informative Direct
Cellular component Extracellular matrix 0.000000003462 Informative Direct
Cellular component Tight junction 0.00000000000002606 Highly Informative Direct
Cellular component Basement membrane 0.00009938 Highly Informative Direct
Domain Signal 0 Least Informative Direct
Domain EGF-like domain 0.0001657 Informative Direct
Molecular function Calcium 0.0000000000004455 Moderately Informative Direct
Post-translational modification Developmental protein 0.000002224 Moderately Informative Direct
Post-translational modification Protease inhibitor 0.00000003921 Informative Direct
Post-translational modification Growth factor binding 0 Highly Informative Direct
Post-translational modification Glycoprotein 0 Least Informative Direct
Post-translational modification Disulfide bond 0 Least Informative Direct
Post-translational modification Proteoglycan 0.00000000000005484 Informative Direct
Post-translational modification Sulfation 0.00000002733 Informative Direct
Post-translational modification Heparan sulfate 0.0000001528 Highly Informative Direct

Document: KW annotation of SCOP domains

InterPro annotation
Cross references IPR000716 SSF57610 Protein matches
Abstract

Thyroglobulin (Tg) is a large glycoprotein specific to the thyroid gland and is the precursor of the iodinated thyroid hormones thyroxine (T4) and triiodothyronine (T3). The N-terminal section of Tg contains 10 repeats of a domain of about 65 amino acids, known as the Tg type-1 (Thyr-1) repeat [PubMed 3595599, PubMed 8797845]. This domain has also been found as a single or repeated sequence in the HLA class II associated invariant chain [PubMed 3038530]; human pancreatic carcinoma marker proteins GA733-1 and GA733-2 [PubMed 2333300]; nidogen (entactin), a sulphated glycoprotein that is widely distributed in basement membranes and tightly associated with laminin; insulin-like growth factor binding proteins (IGFBP) [PubMed 1709161]; saxiphilin, a transferrin-like protein from Rana catesbeiana (bullfrog) that binds specifically to the neurotoxin saxitoxin [PubMed 8146142]; chum salmon egg cysteine proteinase inhibitor; and equistatin, a thiol-protease inhibitor from Actinia equina (sea anemone) [PubMed 9153250]. The existence of Thyr-1 domains in such a wide variety of proteins raises questions about their activity and function, and their interactions with neighbouring domains. The Thyr-1 and related domains belong to MEROPS proteinase inhibitor family I31, clan IX.

Equistatin from A. equina is composed of three Thyr-1 domains; like other proteins that contain Thyr-1 domains (the thyropins), it binds reversibly and tightly to cysteine proteases (peptidase family C1). In equistatin, inhibition of papain is a function of domain-1. Unusually, domain-2 inhibits cathepsin D, an aspartic protease (peptidase family A1), and has no activity against papain. Domain-3 does not inhibit either papain or cathepsin D, and its function and target peptidase have yet to be determined [PubMed 9153250, PubMed 12650938].


InterPro database


PDBeMotif information about ligands, sequence and structure motifs
Cross references PDB entries
Ligand binding statistics
Nucleic-acid binding statistics
Occurrence of secondary structure elements
Occurrence of small 3D structural motifs

PDBeMotif resource


Internal database links

Browse genome assignments for this superfamily. The SUPERFAMILY hidden Markov model library has been used to carry out SCOP domain assignments to all genomes at the superfamily level.


Alignments of sequences to 4 models in this superfamily are available by clicking on the 'Alignments' icon above. PDB sequences less than 40% identical are shown by default, but any other sequence(s) may be aligned. Select PDB sequences, genome sequences, or paste in or upload your own sequences.


Browse and view proteins in genomes which have different domain combinations including a Thyroglobulin type-1 domain.


Examine the distribution of domain superfamilies, or families, across the major taxonomic kingdoms or genomes within a kingdom. This gives an immediate impression of how superfamilies, or families, are restricted to certain kingdoms of life.


Explore the domain occurrence network, in which nodes represent genomes and edges represent domain architectures (shared between genomes) that contain the superfamily of interest.

There are 4 hidden Markov models representing the Thyroglobulin type-1 domain superfamily. Information on how the models are built, and plots showing hydrophobicity, match emission probabilities and insertion/deletion probabilities, can be inspected.

