Srp20-Human Alternative Splicing Factor: Difference between revisions

No edit summary
No edit summary
Line 20: Line 20:


==== RNA Recognition Motif ====
==== RNA Recognition Motif ====
The SRP20 RRM (aa 1-86) a βαββαβ <scene name='78/786034/Imager1/1'>pattern</scene>, common of many other RRMs3. For substrate binding, researchers used a 4 base RNA with sequence CAUC, which matches the SRP20 recognition sequence found on corresponding H2A mRNA. The RNA bases each <scene name='78/782597/Imager2/5'>stack</scene> onto an aromatic side chain protruding from one of the SRP20 β-sheets, forming the primary interactions which allow substrate binding to the protein. In particular, C1 <scene name='78/782597/Imager3/2'>stacks</scene> on Y13 in β1, <scene name='78/782597/Imager4/2'>A2</scene> stacks on F50 in β3, and F48 of β3 sits in between the sugar rings of C1 and A2. It should also be noted that A2 adopts an irregular <scene name='78/782597/Imager5/3'>syn</scene> conformation when bound to the RRM, something that was observed only for guanine in the 2 position previously. U3 <scene name='78/782597/Imager8/4'>Stacks</scene> onto F48 in β3, also W40 and A42 in β2, however when bound, U3 <scene name='78/786034/Imager9/1'>bulges</scene> out of line in comparison to the rest of the substrate. C4 partially stacks over <scene name='78/782597/Imager6/4'>A2</scene>, and also forms hydrogen <scene name='78/782597/Imager7/3'>bonds</scene> between the C4 amino group, A2 2’ oxygen, and a main chain phosphate oxygen.
The SRP20 RRM (aa 1-86) contains a βαββαβ <scene name='78/786034/Imager1/1'>pattern</scene>, common of many other RRMs. To explore substrate binding, researchers used a 4 base RNA with sequence CAUC, which matches the SRP20 recognition sequence found on the corresponding H2A mRNA. The RNA bases each <scene name='78/782597/Imager2/5'>stack</scene> onto an aromatic side chain protruding from one of the SRP20 β-sheets, forming the primary interactions that allow the substrate to bind to the protein. In particular, C1 <scene name='78/782597/Imager3/2'>stacks</scene> on Y13 in β1, <scene name='78/782597/Imager4/2'>A2</scene> stacks on F50 in β3, and F48 of β3 sits in between the sugar rings of C1 and A2. It should also be noted that A2 adopts an irregular <scene name='78/782597/Imager5/3'>syn</scene> conformation when bound to the RRM, something that was previously observed only for guanine in the 2 position. U3 <scene name='78/782597/Imager8/4'>Stacks</scene> onto F48 in β3, as W40 and A42 in β2.  However, when bound, U3 <scene name='78/786034/Imager9/1'>bulges</scene> out of line in comparison to the rest of the substrate. C4 partially stacks over <scene name='78/782597/Imager6/4'>A2</scene>, and also forms hydrogen <scene name='78/782597/Imager7/3'>bonds</scene> between the C4 amino group, A2 2’ oxygen, and a main chain phosphate oxygen.
While all 4 bases form a number of hydrophobic stacking interactions, alteration to the last 3 bases of substrate sequence does not significantly impact binding affinity, while C to G mutation of C1 results in a 10-fold decrease in binding affinity. This suggests that C1 interacts specifically with the protein, while positions 2-4 interact nonspecifically3. The SRp20 RRM is able to recognize C1 with high specificity primarily through 4 <scene name='78/782597/Imager10/4'>hydrogen bonds</scene>: from the C1 amino protons to Leu 80 backbone carbonyl oxygen and to Glu 79 side-chain carbonyl oxygen, from C1 N3 to Asn82 amide, and C1 O2 with Ser 81 side chain hydroxyl group.  
While all 4 bases form a number of hydrophobic stacking interactions, alteration to the last 3 bases of the substrate sequence does not significantly impact binding affinity, while the C to G mutation of C1 results in a 10-fold decrease in binding affinity. This suggests that C1 interacts specifically with the protein, while positions 2-4 interact nonspecifically3. The SRp20 RRM is able to recognize C1 with high specificity primarily through 4 <scene name='78/782597/Imager10/4'>hydrogen bonds</scene>: from the C1 amino protons to Leu80 backbone carbonyl oxygen and to Glu79 side-chain carbonyl oxygen, from C1 N3 to Asn82 amide, and from C1 O2 to Ser81 side chain hydroxyl group.  
The semi specific RNA recognition is a mechanism which reduces evolutionary pressure on bound mRNA by increasing the number of possible RNA recognition sequences. As a result, tolerance for possible mutation in the RNA sequence is increased, meaning SRp20 can bind a more diverse range of substrates, or even original substrates that were mutated during replication (eg. H2A mRNA with a point mutation) thereby increasing organism survival chance by reducing the probability of physiological impact as a result of certain mutations<ref name="Hargous">PMID:17036044</ref>.  
The semi-specific RNA recognition is a mechanism which reduces evolutionary pressure on bound mRNA by increasing the number of possible RNA recognition sequences. As a result, tolerance for possible mutation in the RNA sequence is increased, meaning SRp20 can bind a diverse range of substrates, or even to original substrates that were mutated during replication (eg. H2A mRNA with a point mutation).  Therefore, this mechanism increases organism survival chance by reducing the probability of physiological impact as a result of certain mutations<ref name="Hargous">PMID:17036044</ref>.  


==== Tip Associated Protein Binding Domain ====
==== Tip Associated Protein Binding Domain ====


In addition to RNA recognition and alternative splicing functions, SRP20 has been shown to associate with Tip Associated Protein (TAP), an mRNA export factor, to promote transport of bound mRNA out of the nucleus for eventual translation3. In particular SRp20 promotes the export of H2A histone mRNA, by binding the CAUC consensus sequence on the mRNA and binding TAP. Previous experiments have shown that SRp20 binding TAP is dependent on presence of both the <scene name='78/786034/Rrmredgreen/1'>SRp20 RRM</scene> and a short arginine rich C-terminal segment after the RRM (aa 1-83 and 84-90 respectively) ('''Figure 1'''). Previous research also shows that mutation of any one of the three arginine residues present between residues 84-90 to glutamate prevents TAP binding, indicative of the importance of these arginine residues in TAP association. The structure is only solved to residue 86 so only the <scene name='78/786034/Arginine1/1'>first arginine residue</scene> is present. The same study also found that transfer of the TAP-binding motif to a non-functional REF2 RRM still allowed for TAP-binding and nuclear export of the target protein, suggesting that not only is the TAP-binding motif transferable, it does not depend on interaction with the host RRM to retain function<ref name="Hargous">PMID:17036044</ref>.
In addition to RNA recognition and alternative splicing functions, SRp20 has been shown to associate with Tip Associated Protein (TAP), an mRNA export factor, to promote transport of bound mRNA out of the nucleus for eventual translation. In particular, SRp20 promotes the export of H2A histone mRNA by binding the CAUC consensus sequence on the mRNA and TAP. Previous experiments show that SRp20 binding TAP is dependent on the presence of both the <scene name='78/786034/Rrmredgreen/1'>SRp20 RRM</scene> and a short arginine-rich C-terminal segment after the RRM (aa 1-83 and 84-90 respectively) ('''Figure 1'''). Previous research also shows that mutation of any one of the three arginine residues between residues 84-90 to glutamate prevents TAP binding, indicative of the importance of these arginine residues in TAP association. The structure is only solved to residue 86 so only the <scene name='78/786034/Arginine1/1'>first arginine residue</scene> is present. The same study also found that transfer of the TAP-binding motif to a non-functional REF2 RRM still allowed for TAP binding and nuclear export of the target protein, suggesting that not only is the TAP binding motif transferable, but it also does not depend on interaction with the host RRM to retain function<ref name="Hargous">PMID:17036044</ref>.


==== SR Domain ====
==== SR Domain ====
While the RNA-protein interaction occurs at the RRMs, SR domains are typically responsible for the recruitment of other proteins that act in the splicing mechanism. The serines within the SR domain are phosphorylated by kinases within the cell to direct them to pre-mRNA sites.  Phosphorylation acts as recruiting tools for the SRp20 protein and other SR proteins to promote splicing. However, some SR proteins have shown that phosphorylation actually leads to a decrease in splicing, as in the [https://www.ncbi.nlm.nih.gov/pubmed/14765198 SRp38] protein undergoing heat shock. Research has recently revealed that serines are dephosphorylated as splicing continues, indicating how far along the mRNA strand is in splicing. Dephosphorylation then serves as a signal to the cell that the mRNA is ready to be exported out of the nucleus. Rephosphorylation then appears to trigger the SR protein to enter back into the nucleus.  
While the RNA-protein interaction occurs at the RRMs, SR domains are typically responsible for the recruitment of other proteins that act in the splicing mechanism. The serines within the SR domain are phosphorylated by kinases within the cell to direct them to pre-mRNA sites.  Phosphorylation acts as a recruiting tool for the SRp20 protein and other SR proteins to promote splicing. However, some SR proteins have shown that phosphorylation actually leads to a decrease in splicing, as in the [https://www.ncbi.nlm.nih.gov/pubmed/14765198 SRp38] protein undergoing heat shock. Research has recently revealed that serines are dephosphorylated as splicing continues, indicating how much of the mRNA strand is being spliced. This suggests that dephosphorylation serves as a signal to the cell that the mRNA is ready to be exported out of the nucleus. Rephosphorylation then appears to trigger the SR protein to enter back into the nucleus.  
The SR domain also appears to modulate mRNA stability, though different SR proteins have been shown to have greater effects. Some SR proteins appears to increase the sensitivity of certain mRNA sequences to degradation; this degradation appears to be controlled by SR domain interactions with the 3’ UTR of the pre-mRNA. These effects are not present without the SR domain of the protein. The destabilizing effect of SRp20 specifically has yet to be studied<ref name="Hargous">PMID:17036044</ref>.
The SR domain also appears to modulate mRNA stability, though different SR proteins have been shown to have different effects. Some SR proteins appear to increase the sensitivity of certain mRNA sequences to degradation; this degradation appears to be controlled by SR domain interactions with the 3’ UTR of the pre-mRNA. These effects are not present without the SR domain of the protein. The destabilizing effect of SRp20 specifically has yet to be studied<ref name="Hargous">PMID:17036044</ref>.


=== 9G8 and SRP20 ===
=== 9G8 and SRP20 ===
[https://en.wikipedia.org/wiki/SFRS7 9G8] is another SR protein that is 80% similar in amino acid sequence. They are two of the smallest proteins in the SR family and both contain an RRM that promotes export of mRNA through interaction with the TAP protein. The only other protein shown to promote transport through the TAP protein is SF2, one of the first SR proteins discovered<ref name="Huang">PMID:12667464</ref>. Both RRMs adopt a βαββαβ mentioned earlier. Both RRMs interact with RNA with limited selectivity and therefore recognize many different RNA sequences. The 9G8 RRM contains an large hydrophobic core on its B-sheet. Observing the 9G8 protein has proved useful in understanding SRp20 less stable protein structure. In fact, the one of the only significant structural difference between the two proteins lies in the loops between a-helix 2 and B-4 where several amino acids are not conserved.  
[https://en.wikipedia.org/wiki/SFRS7 9G8] is another SR protein that is 80% similar in amino acid sequence to SRp20. They are two of the smallest proteins in the SR family and both contain an RRM that promotes export of mRNA through interaction with the TAP protein. The only other protein shown to promote transport through the TAP protein is SF2, one of the first SR proteins discovered<ref name="Huang">PMID:12667464</ref>. Both RRMs adopt the previously mentioned βαββαβ pattern. Both RRMs interact with RNA with limited selectivity and therefore recognize many different RNA sequences. The 9G8 RRM contains a large hydrophobic core on its B-sheet. Observing the 9G8 protein has proved useful in understanding SRp20's less stable protein structure. In fact, one of the only significant structural differences between the two proteins lies in the loops between a-helix 2 and B-4 where several amino acids are not conserved.  
Aside from the RRM, both proteins have one SR-rich domain although 9G8 includes about 40 more amino acids this domain<ref name="Corbo2013">PMID:23685143</ref>. [[Image:9G8srp20comparison.png|300px|right|thumb|'''Figure 5.''' SRp20 and 9G8 proteins. The PDB file for SRp20 (left; PDB file: 2i2y) does not include the SR-rich domain but 9G8 (right; PDB file: 2hvz) does. Boxed image shows an RSR region in the PG8 protein with oxygens highlighted in red. Images taken from PyMol software.jpg]] Within the bigger 9G8 protein, there is a zinc knuckle that allows for binding of pyrimidine-rich RNA sequences. This zinc knuckle is not present in SRp20, lending the protein to binding of more purine-rich sequences<ref name="Huang">PMID:12667464</ref>. Not only are these two proteins similar, but they also play similar roles in mRNA export out of the nucleus. Both move continuously between the nucleus and cytoplasm which requires phosphorylation of its serine residues located in the SR-rich domain. Serine phosphorylation has been shown to have great importance in the proper functioning of many SR proteins<ref name="Shepard">PMID:19857271</ref>.  
Aside from the RRM, both proteins have one SR-rich domain although 9G8 includes about 40 more amino acids this domain<ref name="Corbo2013">PMID:23685143</ref>. [[Image:9G8srp20comparison.png|300px|right|thumb|'''Figure 5.''' SRp20 and 9G8 proteins. The PDB file for SRp20 (left; PDB file: 2i2y) does not include the SR-rich domain but 9G8 (right; PDB file: 2hvz) does. Boxed image shows an RSR region in the PG8 protein with oxygens highlighted in red. Images taken from PyMol software.jpg]] Within the bigger 9G8 protein, there is a zinc knuckle that allows for binding of pyrimidine-rich RNA sequences. This zinc knuckle is not present in SRp20, allowing the protein to bind to more purine-rich sequences<ref name="Huang">PMID:12667464</ref>. Not only are these two proteins similar, but they also play similar roles in mRNA export out of the nucleus. Both move continuously between the nucleus and cytoplasm which requires phosphorylation of its serine residues located in the SR-rich domain. Serine phosphorylation has been shown to have great importance in the proper functioning of many SR proteins<ref name="Shepard">PMID:19857271</ref>.  
</StructureSection>
</StructureSection>
== Medical Significance ==
== Medical Significance ==
=== Cancer ===
=== Cancer ===
SRp20 has been linked to [https://en.wikipedia.org/wiki/Cancer cancer] in many instances, as well as other AS proteins. SRp20 has been seen to activate the AS of the [http://proteopedia.org/wiki/index.php/CD44 CD44 adhesion molecule.] SRp20 facilitates the splicing of exon v9, one important for function of CD44. Loss of function of SRp20 leading to loss of function of CD44 will lead to loss of “stickiness” of cells, a way that cancer cells can spread to other areas of the body. SRp20 can also affect the alternative splicing of [https://en.wikipedia.org/wiki/Oncogene oncogenes] and [https://en.wikipedia.org/wiki/Tumor_suppressor_gene tumor suppressors]. Expression of the signaling pathway for SRp20 translation is directly triggered by oncogenic signalling pathways. For example, the [https://en.wikipedia.org/wiki/Wnt_signaling_pathway Wnt] pathway that is associated with cancer development also drives further expression of SRp20 and other SR proteins<ref name="Corbo2012">PMID:22623141</ref>. These proteins aid in cell growth and proliferation, in that they increase expression of proteins they are involved in splicing. One study found that silencing SRp20 slowed cell proliferation, further supporting the idea that SRp20 helps with this process<ref name="Corbo2013">PMID:23685143</ref>. Cell cycle regulator proteins [http://proteopedia.org/wiki/index.php/FOXM1 FoxM1], [https://en.wikipedia.org/wiki/CDC25B Cdc25B], and [http://proteopedia.org/wiki/index.php/Plk1 PLK1] are alternatively spliced by SRp20, specifically regulating the G2/M cycles. These proteins are also associated with regulation of cell apoptosis. When these genes are overexpressed due to overexpressed SRp20, cells will no longer be able to apoptose when they are damaged, and the cell cycle will continue without regulation, causing overactive cell proliferation and eventually cancer. SRp20 has been seen to be overexpressed in ovarian, lung, breast, stomach, skin, bladder, colon, liver, thyroid, and kidney cancer tissue<ref name="Jia">PMID:21179588</ref>. SRp20 also regulates genes associated with cellular senescence, or cellular immortality, which contributes to cancer formation. Specifically, SRp20 alternatively splices the [https://ghr.nlm.nih.gov/gene/TP53 TP53] gene, which generates the [http://proteopedia.org/wiki/index.php/P53 p53] senescence protein.  
SRp20 has been linked to [https://en.wikipedia.org/wiki/Cancer cancer] in many instances, as have other AS proteins. SRp20 has been seen to activate the AS of the [http://proteopedia.org/wiki/index.php/CD44 CD44 adhesion molecule] by facilitating the splicing of exon v9, which is important for CD44 functionality. Loss of function of SRp20 leading to loss of function of CD44 will lead to loss of “stickiness” of cells, allowing cancer cells to spread to other areas of the body. SRp20 can also affect the alternative splicing of [https://en.wikipedia.org/wiki/Oncogene oncogenes] and [https://en.wikipedia.org/wiki/Tumor_suppressor_gene tumor suppressors]. Expression of the signaling pathway for SRp20 translation is directly triggered by oncogenic signaling pathways. For example, the [https://en.wikipedia.org/wiki/Wnt_signaling_pathway Wnt] pathway, that is associated with cancer development, also drives further expression of SRp20 and other SR proteins<ref name="Corbo2012">PMID:22623141</ref>. These proteins aid in cell growth and proliferation, by increasing expression of proteins that are involved in splicing. One study found that silencing SRp20 slowed cell proliferation, further supporting the idea that SRp20 aids in the process<ref name="Corbo2013">PMID:23685143</ref>. Cell cycle regulator proteins [http://proteopedia.org/wiki/index.php/FOXM1 FoxM1], [https://en.wikipedia.org/wiki/CDC25B Cdc25B], and [http://proteopedia.org/wiki/index.php/Plk1 PLK1] regulate the G2/M cycles and are alternatively spliced by SRp20. These proteins are also associated with regulation of cell apoptosis. When these genes are overexpressed due to overexpressed SRp20, cells will no longer be able to apoptose when they are damaged, and the cell cycle will continue without regulation, causing overactive cell proliferation and eventually cancer. SRp20 has been seen to be overexpressed in ovarian, lung, breast, stomach, skin, bladder, colon, liver, thyroid, and kidney cancer tissue<ref name="Jia">PMID:21179588</ref>. SRp20 also regulates genes associated with cellular senescence, or cellular immortality, which contributes to cancer formation. Specifically, SRp20 alternatively splices the [https://ghr.nlm.nih.gov/gene/TP53 TP53] gene, which generates the [http://proteopedia.org/wiki/index.php/P53 p53] senescence protein, promoting "cell immortality," which is characteristic of cancer.  


=== Neurological Disorders ===
=== Neurological Disorders ===
Along with cancer, SRp20 mutations have been linked to [https://en.wikipedia.org/wiki/Alzheimer%27s_disease Alzheimers], a neurodegenerative disorder. SRp20 is involved in AS of a wide array of RNAs, including that of the [https://en.wikipedia.org/wiki/Tropomyosin_receptor_kinase_B TRKB] gene to generate TrkB-Shc transcripts that are involved in generation of the disorder. SRp20 also promotes exclusion of exon 10 in the [http://proteopedia.org/wiki/index.php/Tau TAU] gene, a gene important in establishing microtubules in axons or transport processes.Underexpression of SRp20 results in dysfunction of the TAU gene, less microtubule functionality, and the brain deterioration characteristic of Alzheimers<ref name="Ebneth">PMID:9813097</ref><ref name="Corbo2012">PMID:22623141</ref>.
Along with cancer, SRp20 mutations have been linked to [https://en.wikipedia.org/wiki/Alzheimer%27s_disease Alzheimers], a neurodegenerative disorder. SRp20 is involved in AS of a wide array of RNAs, including that of the [https://en.wikipedia.org/wiki/Tropomyosin_receptor_kinase_B TRKB] gene which generates TrkB-Shc transcripts that are involved in generation of the disorder. SRp20 also promotes exclusion of exon 10 in the [http://proteopedia.org/wiki/index.php/Tau TAU] gene, a gene important in establishing microtubules in axons or transport processes. Underexpression of SRp20 results in dysfunction of the TAU gene, less microtubule functionality, and the brain deterioration characteristic of Alzheimers<ref name="Ebneth">PMID:9813097</ref><ref name="Corbo2012">PMID:22623141</ref>.


=== Other Genetic Disorders ===
=== Other Genetic Disorders ===
SRp20 and other SR proteins have been shown to prevent [https://en.wikipedia.org/wiki/R-loop R-loops] from forming, 3-stranded nucleic acid structures consisting of RNA and DNA. R-loops have been known to promote mutations, recombination, and chromosome rearrangement. One proposed mechanism for R-loop prevention by SRp20 is that SRp20, being a protein involved in RNA metabolism, is a binding partner of the [https://www.uniprot.org/uniprot/P11387 TOP1] protein. Underexpression of TOP1 also promotes R-loop formation. TOP1 has kinase activity that potentially phosphorylates the SR domain of SRp20, which contributes to its function. Underexpression of TOP1 would lead to loss-of-function of SRp20, which could lead to cancer and Alzheimers as mentioned above, as well as cause R-loops to form. R-loops have been associated with disorders such as [https://www.ndss.org/about-down-syndrome/down-syndrome/ Down Syndrome]<ref name="Naro">PMID:25926848</ref>.
SRp20 and other SR proteins have been shown to prevent [https://en.wikipedia.org/wiki/R-loop R-loops] from forming, triple-stranded nucleic acid structures consisting of RNA and DNA. R-loops have been known to promote mutations, recombination, and chromosome rearrangement. One proposed mechanism for R-loop prevention by SRp20 is that SRp20, being a protein involved in RNA metabolism, is a binding partner of the [https://www.uniprot.org/uniprot/P11387 TOP1] protein. Underexpression of TOP1 also promotes R-loop formation. TOP1 has kinase activity that potentially phosphorylates the SR domain of SRp20, which contributes to its function. Underexpression of TOP1 would lead to loss-of-function of SRp20, which could lead to cancer and Alzheimers as mentioned above, as well as cause R-loops to form. R-loops have been associated with disorders such as [https://www.ndss.org/about-down-syndrome/down-syndrome/ Down Syndrome]<ref name="Naro">PMID:25926848</ref>.


== References ==
== References ==
<references/>
<references/>

Proteopedia Page Contributors and Editors (what is this?)Proteopedia Page Contributors and Editors (what is this?)

Neel Bhagat