Tag | Content |
---|
EKPD ID | EPS-HOS-00066 |
Classification | Group/Family: Asp-Based PTP/FCP; Score: 164.6; E-Value: 8.8E-48; Start: 91; End: 249; Domain Length: 159 |
Status | Reviewed |
Ensembl Protein | ENSP00000392248 |
UniProt Accession | Q9GZU7; Q7Z5Q3; Q7Z5Q4; |
Protein Name | Carboxy-terminal domain RNA polymerase II polypeptide A small phosphatase 1 |
Protein Synonyms/Alias | Nuclear LIM interactor-interacting factor 3; NLI-IF; NLI-interacting factor 3; Small C-terminal domain phosphatase 1; SCP1; Small CTD phosphatase 1; |
Gene Name | CTDSP1 |
Gene Synonyms/Alias | CTDSP1; NIF3; NLIIF; SCP1; |
Ensembl Information | |
Organism | Homo sapiens |
Functional Description | Preferentially catalyzes the dephosphorylation of 'Ser-5' within the tandem 7-residue repeats in the C-terminal domain (CTD) of POLR2A, the largest subunit of RNA polymerase II. Negatively regulates RNA polymerase II transcription, possibly by controlling the transition from initiation/capping to processive transcript elongation. Recruited by REST to neuronal genes that contain RE-1 elements, leading to neuronal gene silencing in non-neuronal cells. |
Protein Length | 260 |
Protein Sequence (FASTA) | MVAAPWATQE QEEGRGIQPG DRGDQKSAAS QKPRSRGILH SLFCCVCRDD GEALPAHSGA 60 | PLLVEENGAI PKTPVQYLLP EAKAQDSDKI CVVIDLDETL VHSSFKPVNN ADFIIPVEID 120 | GVVHQVYVLK RPHVDEFLQR MGELFECVLF TASLAKYADP VADLLDKWGA FRARLFRESC 180 | VFHRGNYVKD LSRLGRDLRR VLILDNSPAS YVFHPDNAVP VASWFDNMSD TELHDLLPFF 240 | EQLSRVDDVY SVLRQPRPGS 260 |
Nucleotide Sequence (FASTA) | ATGGTGGCCG CCCCGTGGGC TACCCAGGAG CAGGAGGAGG GCCGAGGGAT CCAGCCCGGG 60 | GACCGGGGTG ACCAGAAGTC AGCAGCTTCC CAGAAGCCCC GAAGCCGGGG CATCCTCCAC 120 | TCACTCTTCT GCTGTGTCTG CCGGGATGAT GGGGAGGCCC TGCCTGCTCA CAGCGGGGCG 180 | CCCCTGCTTG TGGAGGAGAA TGGCGCCATC CCTAAGACCC CAGTCCAATA CCTGCTCCCT 240 | GAGGCCAAGG CCCAGGACTC AGACAAGATC TGCGTGGTCA TCGACCTGGA CGAGACCCTG 300 | GTGCACAGCT CCTTCAAGCC AGTGAACAAC GCGGACTTCA TCATCCCTGT GGAGATTGAT 360 | GGGGTGGTCC ACCAGGTCTA CGTGTTGAAG CGTCCTCACG TGGATGAGTT CCTGCAGCGA 420 | ATGGGCGAGC TCTTTGAATG TGTGCTGTTC ACTGCTAGCC TCGCCAAGTA CGCAGACCCA 480 | GTAGCTGACC TGCTGGACAA ATGGGGGGCC TTCCGGGCCC GGCTGTTTCG AGAGTCCTGC 540 | GTCTTCCACC GGGGGAACTA CGTGAAGGAC CTGAGCCGGT TGGGTCGAGA CCTGCGGCGG 600 | GTGCTCATCC TGGACAATTC ACCTGCCTCC TATGTCTTCC ATCCAGACAA TGCTGTACCG 660 | GTGGCCTCGT GGTTTGACAA CATGAGTGAC ACAGAGCTCC ACGACCTCCT CCCCTTCTTC 720 | GAGCAACTCA GCCGTGTGGA CGACGTGTAC TCAGTGCTCA GGCAGCCACG GCCAGGGAGC 780 | TAG 783 |
Domain Profile | S: 2 tLVLDLDetLvhsksekdee.........e.ektkasvkkRPgldeFLkelskayevvif 51 | ++V+DLDetLvhs++++ ++ + ++++v kRP++deFL+++ + +e v+f | Q: 91 CVVIDLDETLVHSSFKPVNNadfiipveiDgVVHQVYVLKRPHVDEFLQRMGELFECVLF 150 | 9*************98888346888776527899************************** |
| S: 52 tasskeyadavldkldpkkklfkkrlyresctlengkyvKdLsllgrdlskvvivDdspr 111 | tas +yad+v d ld+ f++rl+resc++++g+yvKdLs lgrdl++v+i+D+sp | Q: 151 TASLAKYADPVADLLDK-WGAFRARLFRESCVFHRGNYVKDLSRLGRDLRRVLILDNSPA 209 | ****************7.778*************************************** |
| S: 112 sfelqpdnlipiepfygdsdkdtellkllpflealeksddv 152 | s+ ++pdn++p+ +++++ +dtel++llpf+e+l+++ddv | Q: 210 SYVFHPDNAVPVASWFDN-MSDTELHDLLPFFEQLSRVDDV 249 | ******************.999**************99998 |
Domain Sequence (FASTA) | CVVIDLDETL VHSSFKPVNN ADFIIPVEID GVVHQVYVLK RPHVDEFLQR MGELFECVLF 60 | TASLAKYADP VADLLDKWGA FRARLFRESC VFHRGNYVKD LSRLGRDLRR VLILDNSPAS 120 | YVFHPDNAVP VASWFDNMSD TELHDLLPFF EQLSRVDDV 159 |
Keyword | 3D-structure; Alternative splicing; Complete proteome; Hydrolase; Metal-binding; Nucleus; Polymorphism; Protein phosphatase; Reference proteome. |
Sequence Source | Ensembl |
Orthology | |
Gene Ontology | |
KEGG | |
InterPros | |
Pfam | |
SMARTs | |
Prosites | |
Prints | |
Created Date | 20-Feb-2013 |
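
The domain coordinates in the Classification row (Start 91, End 249, Domain Length 159) can be cross-checked against the protein sequence. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive, which is consistent with 249 - 91 + 1 = 159. The helper names `clean` and `slice_inclusive` are illustrative and are not part of any EKPD tooling; the sequence is copied from the Protein Sequence field above.

```python
# Protein sequence as displayed in the record, in 10-residue blocks.
PROTEIN_BLOCKS = """
MVAAPWATQE QEEGRGIQPG DRGDQKSAAS QKPRSRGILH SLFCCVCRDD GEALPAHSGA
PLLVEENGAI PKTPVQYLLP EAKAQDSDKI CVVIDLDETL VHSSFKPVNN ADFIIPVEID
GVVHQVYVLK RPHVDEFLQR MGELFECVLF TASLAKYADP VADLLDKWGA FRARLFRESC
VFHRGNYVKD LSRLGRDLRR VLILDNSPAS YVFHPDNAVP VASWFDNMSD TELHDLLPFF
EQLSRVDDVY SVLRQPRPGS
"""

# Values taken from the record's Classification and sequence fields.
DOMAIN_START = 91
DOMAIN_END = 249
CDS_LENGTH = 783  # nucleotides, including the terminal stop codon


def clean(seq_text: str) -> str:
    """Remove all whitespace from a block-formatted sequence."""
    return "".join(seq_text.split())


def slice_inclusive(seq: str, start: int, end: int) -> str:
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    if not (1 <= start <= end <= len(seq)):
        raise ValueError(f"coordinates {start}-{end} fall outside 1-{len(seq)}")
    # Python slices are 0-based and end-exclusive, hence start - 1.
    return seq[start - 1:end]


protein = clean(PROTEIN_BLOCKS)
domain = slice_inclusive(protein, DOMAIN_START, DOMAIN_END)

# The coding sequence should hold one codon per residue plus a stop codon.
codons_including_stop = CDS_LENGTH // 3

print(len(protein), len(domain), codons_including_stop)
print(domain[:10], domain[-9:])
```

With the values in this record, the slice is 159 residues long, begins with CVVIDLDETL and ends with EQLSRVDDV, matching the Domain Sequence field. The 783-nucleotide coding sequence corresponds to 261 codons, that is, 260 residues plus the stop codon.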