Trefoil knot fold

  (Redirected from Trefoil domain)

The trefoil knot fold is a protein fold in which the protein backbone is twisted into a trefoil knot shape. "Shallow" knots in which the tail of the polypeptide chain only passes through a loop by a few residues are uncommon, but "deep" knots in which many residues are passed through the loop are extremely rare. Deep trefoil knots have been found in the SPOUT superfamily.[1] including methyltransferase proteins involved in posttranscriptional RNA modification in all three Domains of Life, including bacterium Thermus thermophilus[2] and proteins,[3] in archaea[1] and in eukaryota.[4]

A deep trefoil knot in a Thermus thermophilus RNA methyltransferase domain (PDB ID 1IPA). The knotted C-terminus of the protein is shown in blue.

In many cases the trefoil knot is part of the active site or a ligand-binding site and is critical to the activity of the enzyme in which it appears. Before the discovery of the first knotted protein, it was believed that the process of protein folding could not efficiently produce deep knots in protein backbones. Studies of the folding kinetics of a dimeric protein from Haemophilus influenzae have revealed that the folding of trefoil knot proteins may depend on proline isomerization.[5] Computational algorithms have been developed to identify knotted protein structures, both to canvas the Protein Data Bank for previously undetected natural knots and to identify knots in protein structure predictions, where they are unlikely to accurately reproduce the native-state structure due to the rarity of knots in known proteins.[6] Currently, there is a web server pKNOT available to detect knots in proteins as well as to provide information on knotted proteins in the Protein Data Bank.[7] Knottins are small, diverse and stable proteins with important drug design potential. They can be classified in 30 families which cover a wide range of sequences (1621 sequenced), three-dimensional structures (155 solved) and functions (> 10). Inter knottin similarity lies mainly between 20% and 40% sequence identity and 1.5 to 4 A backbone deviations although they all share a tightly knotted disulfide core. This important variability is likely to arise from the highly diverse loops which connect the successive knotted cysteines. The prediction of structural models for all knottin sequences would open new directions for the analysis of interaction sites and to provide a better understanding of the structural and functional organization of proteins sharing this scaffold.[8]

Trefoil domainEdit

Trefoil (P-type) domain
Structure of pancreatic spasmolytic polypeptide.[9]

Trefoil (P-type) domain is a cysteine-rich domain of approximately forty five amino-acid residues has been found in some extracellular eukaryotic proteins.[10][11][12][13] It is known as either the 'P', 'trefoil' or 'TFF' domain, and contains six cysteines linked by three disulphide bonds with connectivity 1-5, 2-4, 3-6.

The domain has been found in a variety of extracellular eukaryotic proteins,[10][12][13] including protein pS2 (TFF1) a protein secreted by the stomach mucosa; spasmolytic polypeptide (SP) (TFF2), a protein of about 115 residues that inhibits gastrointestinal motility and gastric acid secretion; intestinal trefoil factor (ITF) (TFF3); Xenopus laevis stomach proteins xP1 and xP4; xenopus integumentary mucins A.1 (preprospasmolysin) and C.1, proteins which may be involved in defense against microbial infections by protecting the epithelia from the external environment; xenopus skin protein xp2 (or APEG); Zona pellucida sperm-binding protein B (ZP-B); intestinal sucrase-isomaltase (EC / EC, a vertebrate membrane bound, multifunctional enzyme complex which hydrolyzes sucrose, maltose and isomaltose; and lysosomal alpha-glucosidase (EC


Human gene encoding proteins containing the trefoil domain include:

External linksEdit


  1. ^ Zarembinski TI, Kim Y, Peterson K, Christendat D, Dharamsi A, Arrowsmith CH, Edwards AM, Joachimiak A. (2003). Deep trefoil knot implicated in RNA binding found in an archaebacterial protein. Proteins 50(2):177-83
  2. ^ Nureki O, Shirouzu M, Hashimoto K, Ishitani R, Terada T, Tamakoshi M, Oshima T, Chijimatsu M, Takio K, Vassylyev DG, Shibata T, Inoue Y, Kuramitsu S, Yokoyama S. (2002). An enzyme with a deep trefoil knot for the active-site architecture. Acta Crystallogr D 58(Pt 7):1129-37
  3. ^ Nureki O, Watanabe K, Fukai S, Ishii R, Endo Y, Hori H, Yokoyama S. (2004). Deep knot structure for construction of active site and cofactor binding site of tRNA modification enzyme. Structure 12(4):593-602
  4. ^ Leulliot N, Bohnsack MT, Graille M, Tollervey D, Van Tilbeurgh H.(2008). The yeast ribosome synthesis factor Emg1 is a novel member of the superfamily of alpha/beta knot fold methyltransferases. Nucleic Acids Res 36(2):629-39
  5. ^ Mallam AL, Jackson SE. (2006). Probing nature's knots: the folding pathway of a knotted homodimeric protein. J Mol Biol 359(5):1420-36
  6. ^ Khatib F, Weirauch MT, Rohl CA. (2006). Rapid knot detection and application to protein structure prediction. Bioinformatics 22(14):e252-9
  7. ^ Lai YL, Yen SC, Yu SH, Hwang JK (2007). pKNOT: the protein KNOT web server. Nucleic Acids Research 35:W420-424
  8. ^ (Jerome Gracy and Laurent Chiche (2010). Optimizing structural modeling for a specific protein scaffold: knottins or inhibitor cystine knots. BMC Bioinformatics. 11:535)
  9. ^ Gajhede M, Petersen TN, Henriksen A, et al. (December 1993). "Pancreatic spasmolytic polypeptide: first three-dimensional structure of a member of the mammalian trefoil family of peptides". Structure. 1 (4): 253–62. doi:10.1016/0969-2126(93)90014-8. PMID 8081739.
  10. ^ a b Otto B, Wright N (1994). "Trefoil peptides. Coming up clover". Curr. Biol. 4 (9): 835–838. doi:10.1016/S0960-9822(00)00186-X. PMID 7820556. S2CID 11245174.
  11. ^ Thim L, Wright NA, Hoffmann W, Otto WR, Rio MC (1997). "Rolling in the clover: trefoil factor family (TFF)-domain peptides, cell migration and cancer". FEBS Lett. 408 (2): 121–123. doi:10.1016/S0014-5793(97)00424-9. PMID 9187350. S2CID 26946754.
  12. ^ a b Bork P (1993). "A trefoil domain in the major rabbit zona pellucida protein". Protein Sci. 2 (4): 669–670. doi:10.1002/pro.5560020417. PMC 2142363. PMID 8518738.
  13. ^ a b Hoffmann W, Hauser F (1993). "The P-domain or trefoil motif: a role in renewal and pathology of mucous epithelia?". Trends Biochem. Sci. 18 (7): 239–243. doi:10.1016/0968-0004(93)90170-R. PMID 8267796.


  • Tkaczuk KL, Dunin-Horkawicz S, Purta E, Bujnicki JM. (2007). Structural and evolutionary bioinformatics of the SPOUT superfamily of methyltransferases. BMC Bioinformatics. 8:73
This article incorporates text from the public domain Pfam and InterPro: IPR000519