User:Daniel Key Takemoto/Sandbox 2

Structure of FMRPStructure of FMRP

Predicted FMRP

 

Fragile X messenger ribonucleoprotein (FMRP) is encoded by the fragile X messenger ribonucleoprotein 1 (FMR1) gene, located in the X chromosome, and is associated with the fragile X syndrome (FXS), Fragile X Tremor/Ataxia Syndrome (FXTAS) and Premature Ovarian Failure (POF1). FMRP functions as a synaptic regulator by binding to mRNAs and inhibiting its translation, therefore regulating the synthesis of proteins in the synapse. It is also an RNA binding protein, which is responsible for the transportation of mRNAs to the cytoplasm. The FMRP can also bind to its own FMR1 transcripts, possibly as a self-regulatory mechanism.

The FMRP is highly expressed in neurons and genitalia, and it's located mostly in the cytoplasm and lower levels in the nucleus. It contains domains related to its RNA binding function, either in the N-terminal or C-terminal domain; the Agenet and the KH0-motif are located in the N-terminal domain, and they, respectively, exerce functions in binding to methylated lysin and RNA binding; the KH1 and KH2 motifs are located in the central region of the protein; and the RGG box, in the C-terminal domain, acts as a binding to RNA, especifically to G-quadruplexes, a secondary RNA structure. The KH1, KH2 and RGG box domains allow the FMRP to bind and translate a number of mRNAs related to the synaptic plasticity. [1]

The protein has 20 non-redundant isoforms and the most common is isoform 7, and the longest isoform contains 632 aminoacids. [2].

The predicted image was generated from Ensembl, by the AlphaFold program.

Overall structure

 


<StructureSection load='4QVZ' size='350' side='right' caption='N-terminal domain of the FMRP (PDB entry 4QVZ)' scene=>

N-terminal domainN-terminal domain

The N-terminal domain (NTD) of the FMRP is the first 215 aminoacids of the protein, containing two in tandem Agenet domains and one KH domain. The NTD is important for the functions of the protein as a RBP and its participation in the ribonucleoprotein (RNP) complex.

The NTD was shown to adopt independent folding, and its tertiary structure was found to be compatible with two tandem AGENET DOMAINS. Both of the Agenet domains resemble those of Argonaute, a protein involved in RNA interference, as they resemble structuraly Tudor domains, it is possible to assume functional similarities between them. The BETA SHEETS OF THE AGENET DOMAIN highlighted in red, characterize the Agenet domain and its structure, and there is a 13-RESIDUE LOOP, highlighted in light blue, between the beta sheets of the Agenet 1 and Agenet 2 connecting them.

It's been shown that, when the FMRP is in the nucleus, the Agenet domain allow the FMRP to interact with the chromatin and regulate DNA damage response. And has also been demonstrated that the Agenet domains bind to methylated lysine, in support to the histone binding function of the FMRP, due to its aromatic cage in the structure; specifically, the Agenet motif 2 binds to trimethylated lysine. The AMINOACID RESIDUES highlited in red are part of the Agenet 1 and the ones highlited in blue are part of the Agenet 2 domain related to the recognition of methylated lysine as they all have aromatic cages with the potential for this binding.

[3]. The K-homology domain (KH domain) folding is similar to other RBPs domains, which play an important role in RNA binding and protein-protein interactions.

Here it is the highlighted in red. Many RNA-binding proteins contain the KH motif, a conserved RNA-binding domain, but different from the KH1 and KH2 motifs, which contain a GXXG canonical motif, that will be explained in further details in the next topic, instead this one has an A-K-E-A. It is most likely involved in RNA binding and regulation, as seen in other proteins that also contain KH motifs. To give a more thorough response addressing the precise role played by the KH theme in FMRP, additional details would be needed. The THREE BETA SHEETS OF THE KH0 MOTIF

Overall, the FMRP NTD plays an important role as an RNA-binding protein, and its involvement in RNP complexes and its specific domains and motifs allow it to bind to specific RNAs and regulate their translation. [4]

The 3D structure of the N-terminal domain was obtained by X-ray diffraction with a 3.19 Å resolution and expressed in E. coli.









Central portionCentral portion

The central portion of the protein contains the in tandem KH1 AND KH2 MOTIFS. The type of KH domains in eukaryots is type 1 fold, in which a β sheet composed of three antiparallel strands is abutted by three helices, and they both have a canonical C-X-X-G (glycine followed by any two aminoacids and another glycine) conserved domain highlighted in magenta, between the central helices of the domain. It is highlighted in blue two alpha helices, three beta sheets in red, a canonical GXXG motif in between the alpha helices in magenta, and a variable loop, these are the structural conditions for the KH domain, in eukaryotes. There is a gap in the KH motif, which can hold four nucleic acid bases; it happens when an alpha-helix 1, alpha-helix 2, a GXXG on the left, and a variable loop are present in the domain, in the case of the FMRP another beta-sheet. This helps the FMRP in its RBP function, as they are motifs that are related to the RNA binding function. [5]





RGG motifRGG motif

An important motif of the FMRP is the RGG box, which the protein uses to bind to guanine G-quadruplexes, a structure that consists of nucleic acid folding in which four guanines arrange in a planar conformation stabilized by Hoogsteen-trype hydrogen bonds, named tetrad. FMRP RGG motifs seem to prefer binding to specific structures, not linear motifs.

Different domains and motifs mediate the RNA binding mechanism, and the exon 15-encoded RGG (arginine - glycine - glycine) motif is one of them. The FMRP RGG motif is located in the C-terminal region of the protein and is well conserved in vertebrates. To easily represent the RGG motif binding to RNA, this motif will be highlighted in the scene RGG MOTIF Crystal structure of the complex between the human FMRP RGG motif and G-quadruplex RNA The RGG motif binds to G-quadruplexes when it adopts a sharp turn and specifically binds to guanines from two consecutive G-C base pairs in the duplex-quadruplex junction. Several tetrads can stack in a single G-quadruplex structure and be stabilized further by potassium cations, in the case of FMRP targets, whereas they are destabilized by lithium cations.

The regulation of particular mRNAs and the binding of FMRP with ribosomes and proteins depend on this interaction between the RGG motif and G-quadruplex RNA.

Therefore, understanding the interaction RGG-RNA is important for the comprehension of FMRP and FXS. The structure being represented on the right represents the FMRP RGG motif and the G-quadruplex secondary structure in the RNA. The protein structure was obtained by X-ray diffraction with a resolution of 3 Å. [6]


 




Associated diseasesAssociated diseases

Trinucleotides repeats (CGG) are located in CpG islands in the 5' untranslated region (UTR) of the gene related to the expression of the gene. Individuals carrying up to 44 repeats of trinucleotides are of common aleles. Individuals that have between 44 and 55 repeats are known to carry the premutation, usually associated with FXTAS and PFO1. However, when the repeat expansion is above 55 the individual is carrying the full mutation, which leads to the silencing of the gene, due to methylation, therefore there is absence or reduced levels of the FMRP, causing abnormal synaptic development and symptons associated with the FXS.


ReferencesReferences

  1. SUARDI, G. A. M.; HADDAD, L. A. Chapter Three - FMRP ribonucleoprotein complexes and RNA homeostasis.[1]
  2. Hu, Y., Chen, Z., Fu, Y., He, Q., Jiang, L., Zheng, J., Gao, Y., Mei, P., Chen, Z. and Ren, X. (2015). The amino-terminal structure of human fragile X mental retardation protein obtained using precipitant-immobilized imprinted polymers. Nature Communications, [online] 6(1), p.6634. [2]
  3. MYRICK, L. K. et al. Human FMRP contains an integral tandem Agenet (Tudor) and KH motif in the amino terminal domain. Human Molecular Genetics, v. 24, n. 6, p. 1733–1740, 20 nov. 2014.[3]
  4. MYRICK, L. K. et al. Human FMRP contains an integral tandem Agenet (Tudor) and KH motif in the amino terminal domain. Human Molecular Genetics, v. 24, n. 6, p. 1733–1740, 20 nov. 2014.[4]
  5. Valverde, R., Edwards, L. and Regan, L. (2008). Structure and function of KH domains. FEBS Journal, 275(11), pp.2712–2726. [5]
  6. VASILYEV, N. et al. Crystal structure reveals specific recognition of a G-quadruplex RNA by a β-turn in the RGG motif of FMRP. Proceedings of the National Academy of Sciences, v. 112, n. 39, p. E5391–E5400, 15 set. 2015.[6]