EPS-HOS-00066
Eukaryotic Protein Kinase & Protein Phosphatase Database
EKPD ID: EPS-HOS-00066
Classification
Group/Family        Score   E-Value   Start   End   Domain Length
Asp-Based PTP/FCP   164.6   8.8E-48   91      249   159
Status: Reviewed
Ensembl Protein: ENSP00000392248
UniProt Accession: Q9GZU7; Q7Z5Q3; Q7Z5Q4
Protein Name: Carboxy-terminal domain RNA polymerase II polypeptide A small phosphatase 1
Protein Synonyms/Alias: Nuclear LIM interactor-interacting factor 3; NLI-IF; NLI-interacting factor 3; Small C-terminal domain phosphatase 1; SCP1; Small CTD phosphatase 1
Gene Name: CTDSP1
Gene Synonyms/Alias: CTDSP1; NIF3; NLIIF; SCP1
Ensembl Information
Ensembl Gene ID   Ensembl Protein ID   Ensembl Transcript ID
ENSG00000144579   ENSP00000273062      ENST00000273062
ENSG00000144579   ENSP00000411308      ENST00000431127
ENSG00000144579   ENSP00000392248      ENST00000443891
ENSG00000144579   ENSP00000404301      ENST00000452977
ENSG00000144579   ENSP00000403256      ENST00000428361
Organism: Homo sapiens
Functional Description: Preferentially catalyzes the dephosphorylation of 'Ser-5' within the tandem 7-residue repeats in the C-terminal domain (CTD) of POLR2A, the largest subunit of RNA polymerase II. Negatively regulates RNA polymerase II transcription, possibly by controlling the transition from initiation/capping to processive transcript elongation. Recruited by REST to neuronal genes that contain RE-1 elements, leading to neuronal gene silencing in non-neuronal cells.
Protein Length: 260
Protein Sequence
(FASTA)
MVAAPWATQE QEEGRGIQPG DRGDQKSAAS QKPRSRGILH SLFCCVCRDD GEALPAHSGA 60
PLLVEENGAI PKTPVQYLLP EAKAQDSDKI CVVIDLDETL VHSSFKPVNN ADFIIPVEID 120
GVVHQVYVLK RPHVDEFLQR MGELFECVLF TASLAKYADP VADLLDKWGA FRARLFRESC 180
VFHRGNYVKD LSRLGRDLRR VLILDNSPAS YVFHPDNAVP VASWFDNMSD TELHDLLPFF 240
EQLSRVDDVY SVLRQPRPGS 260
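The listing above is laid out in blocks of ten residues with a running position count at the end of each line. A minimal Python sketch, assuming that layout holds, joins the blocks into one contiguous string and compares its length with the stated Protein Length of 260. The helper name `join_numbered_blocks` is hypothetical and not part of EKPD.

```python
import re

# Protein sequence as listed in this record: 10-residue blocks
# followed by a running position count on each line.
PROTEIN_LISTING = """
MVAAPWATQE QEEGRGIQPG DRGDQKSAAS QKPRSRGILH SLFCCVCRDD GEALPAHSGA 60
PLLVEENGAI PKTPVQYLLP EAKAQDSDKI CVVIDLDETL VHSSFKPVNN ADFIIPVEID 120
GVVHQVYVLK RPHVDEFLQR MGELFECVLF TASLAKYADP VADLLDKWGA FRARLFRESC 180
VFHRGNYVKD LSRLGRDLRR VLILDNSPAS YVFHPDNAVP VASWFDNMSD TELHDLLPFF 240
EQLSRVDDVY SVLRQPRPGS 260
"""


def join_numbered_blocks(listing: str) -> str:
    """Drop whitespace and position counts, keeping only sequence letters."""
    return re.sub(r"[^A-Za-z]", "", listing).upper()


protein = join_numbered_blocks(PROTEIN_LISTING)
print(len(protein))  # compare with the "Protein Length" field of the record
```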
Nucleotide Sequence
(FASTA)
ATGGTGGCCG CCCCGTGGGC TACCCAGGAG CAGGAGGAGG GCCGAGGGAT CCAGCCCGGG 60
GACCGGGGTG ACCAGAAGTC AGCAGCTTCC CAGAAGCCCC GAAGCCGGGG CATCCTCCAC 120
TCACTCTTCT GCTGTGTCTG CCGGGATGAT GGGGAGGCCC TGCCTGCTCA CAGCGGGGCG 180
CCCCTGCTTG TGGAGGAGAA TGGCGCCATC CCTAAGACCC CAGTCCAATA CCTGCTCCCT 240
GAGGCCAAGG CCCAGGACTC AGACAAGATC TGCGTGGTCA TCGACCTGGA CGAGACCCTG 300
GTGCACAGCT CCTTCAAGCC AGTGAACAAC GCGGACTTCA TCATCCCTGT GGAGATTGAT 360
GGGGTGGTCC ACCAGGTCTA CGTGTTGAAG CGTCCTCACG TGGATGAGTT CCTGCAGCGA 420
ATGGGCGAGC TCTTTGAATG TGTGCTGTTC ACTGCTAGCC TCGCCAAGTA CGCAGACCCA 480
GTAGCTGACC TGCTGGACAA ATGGGGGGCC TTCCGGGCCC GGCTGTTTCG AGAGTCCTGC 540
GTCTTCCACC GGGGGAACTA CGTGAAGGAC CTGAGCCGGT TGGGTCGAGA CCTGCGGCGG 600
GTGCTCATCC TGGACAATTC ACCTGCCTCC TATGTCTTCC ATCCAGACAA TGCTGTACCG 660
GTGGCCTCGT GGTTTGACAA CATGAGTGAC ACAGAGCTCC ACGACCTCCT CCCCTTCTTC 720
GAGCAACTCA GCCGTGTGGA CGACGTGTAC TCAGTGCTCA GGCAGCCACG GCCAGGGAGC 780
TAG 783
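The 783-nucleotide listing should correspond to 260 codons plus a stop codon. The sketch below checks this by translating it with the standard genetic code, assuming the listing is an in-frame coding sequence starting at its first base. The names `CODON_TABLE` and `translate` are illustrative and do not come from the database.

```python
import re

# Nucleotide sequence as listed in this record (10-base blocks, running counts).
CDS_LISTING = """
ATGGTGGCCG CCCCGTGGGC TACCCAGGAG CAGGAGGAGG GCCGAGGGAT CCAGCCCGGG 60
GACCGGGGTG ACCAGAAGTC AGCAGCTTCC CAGAAGCCCC GAAGCCGGGG CATCCTCCAC 120
TCACTCTTCT GCTGTGTCTG CCGGGATGAT GGGGAGGCCC TGCCTGCTCA CAGCGGGGCG 180
CCCCTGCTTG TGGAGGAGAA TGGCGCCATC CCTAAGACCC CAGTCCAATA CCTGCTCCCT 240
GAGGCCAAGG CCCAGGACTC AGACAAGATC TGCGTGGTCA TCGACCTGGA CGAGACCCTG 300
GTGCACAGCT CCTTCAAGCC AGTGAACAAC GCGGACTTCA TCATCCCTGT GGAGATTGAT 360
GGGGTGGTCC ACCAGGTCTA CGTGTTGAAG CGTCCTCACG TGGATGAGTT CCTGCAGCGA 420
ATGGGCGAGC TCTTTGAATG TGTGCTGTTC ACTGCTAGCC TCGCCAAGTA CGCAGACCCA 480
GTAGCTGACC TGCTGGACAA ATGGGGGGCC TTCCGGGCCC GGCTGTTTCG AGAGTCCTGC 540
GTCTTCCACC GGGGGAACTA CGTGAAGGAC CTGAGCCGGT TGGGTCGAGA CCTGCGGCGG 600
GTGCTCATCC TGGACAATTC ACCTGCCTCC TATGTCTTCC ATCCAGACAA TGCTGTACCG 660
GTGGCCTCGT GGTTTGACAA CATGAGTGAC ACAGAGCTCC ACGACCTCCT CCCCTTCTTC 720
GAGCAACTCA GCCGTGTGGA CGACGTGTAC TCAGTGCTCA GGCAGCCACG GCCAGGGAGC 780
TAG 783
"""

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate complete codons from the first base; '*' marks a stop codon."""
    usable = len(cds) - len(cds) % 3
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


cds = re.sub(r"[^ACGTacgt]", "", CDS_LISTING).upper()
peptide = translate(cds)
print(len(cds), len(peptide) - 1)  # nucleotides, residues before the stop
```

The translated peptide, minus the trailing stop, can then be compared directly with the Protein Sequence section of the record.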
Domain Profile
S: 2     tLVLDLDetLvhsksekdee.........e.ektkasvkkRPgldeFLkelskayevvif  51
         ++V+DLDetLvhs++++ ++         +   ++++v kRP++deFL+++ + +e v+f
Q: 91    CVVIDLDETLVHSSFKPVNNadfiipveiDgVVHQVYVLKRPHVDEFLQRMGELFECVLF  150
         9*************98888346888776527899**************************
S: 52    tasskeyadavldkldpkkklfkkrlyresctlengkyvKdLsllgrdlskvvivDdspr  111
         tas  +yad+v d ld+    f++rl+resc++++g+yvKdLs lgrdl++v+i+D+sp 
Q: 151   TASLAKYADPVADLLDK-WGAFRARLFRESCVFHRGNYVKDLSRLGRDLRRVLILDNSPA  209
         ****************7.778***************************************
S: 112   sfelqpdnlipiepfygdsdkdtellkllpflealeksddv  152
         s+ ++pdn++p+ +++++  +dtel++llpf+e+l+++ddv
Q: 210   SYVFHPDNAVPVASWFDN-MSDTELHDLLPFFEQLSRVDDV  249
         ******************.999**************99998
Domain Sequence
(FASTA)
CVVIDLDETL VHSSFKPVNN ADFIIPVEID GVVHQVYVLK RPHVDEFLQR MGELFECVLF 60
TASLAKYADP VADLLDKWGA FRARLFRESC VFHRGNYVKD LSRLGRDLRR VLILDNSPAS 120
YVFHPDNAVP VASWFDNMSD TELHDLLPFF EQLSRVDDV 159
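The Classification row gives the domain as Start 91, End 249, Domain Length 159. These values are consistent if the coordinates are 1-based and inclusive, which is an assumption here rather than something the record states. A short sketch with hypothetical helper names checks the arithmetic against the domain listing and shows how such coordinates map onto Python slices:

```python
import re

# Domain sequence as listed in this record.
DOMAIN_LISTING = """
CVVIDLDETL VHSSFKPVNN ADFIIPVEID GVVHQVYVLK RPHVDEFLQR MGELFECVLF 60
TASLAKYADP VADLLDKWGA FRARLFRESC VFHRGNYVKD LSRLGRDLRR VLILDNSPAS 120
YVFHPDNAVP VASWFDNMSD TELHDLLPFF EQLSRVDDV 159
"""

START, END = 91, 249  # from the Classification table


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive interval."""
    return end - start + 1


def slice_1based(seq: str, start: int, end: int) -> str:
    """Return seq[start..end] for 1-based, inclusive coordinates."""
    return seq[start - 1:end]


domain = re.sub(r"[^A-Za-z]", "", DOMAIN_LISTING)
print(span_length(START, END), len(domain))
print(slice_1based(domain, 1, 10))  # first ten residues of the domain
```

Under the same assumption, `slice_1based(protein, 91, 249)` applied to the full 260-residue protein would be expected to return this domain sequence.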
Keyword: 3D-structure; Alternative splicing; Complete proteome; Hydrolase; Metal-binding; Nucleus; Polymorphism; Protein phosphatase; Reference proteome.
Sequence Source: Ensembl
Orthology
Ortholog group
Species                       EKPD ID
Ailuropoda melanoleuca        EPS-AIM-00082
Anolis carolinensis           EPS-ANC-00083
Bos taurus                    EPS-BOT-00067
Caenorhabditis elegans        EPS-CAE-00093
Callithrix jacchus            EPS-CAJ-00066
Canis familiaris              EPS-CAF-00085
Cavia porcellus               EPS-CAP-00064
Ciona intestinalis            EPS-CII-00035
Ciona savignyi                EPS-CIS-00025
Danio rerio                   EPS-DAR-00123
Dipodomys ordii               EPS-DIO-00027
Felis catus                   EPS-FEC-00081
Gadus morhua                  EPS-GAM-00066
Gasterosteus aculeatus        EPS-GAA-00115
Gorilla gorilla               EPS-GOG-00083
Ictidomys tridecemlineatus    EPS-ICT-00064
Latimeria chalumnae           EPS-LAC-00085
Loxodonta africana            EPS-LOA-00064
Macaca mulatta                EPS-MAM-00070
Macropus eugenii              EPS-MAE-00026
Monodelphis domestica         EPS-MOD-00068
Mus musculus                  EPS-MUM-00067
Mustela putorius furo         EPS-MUP-00059
Nomascus leucogenys           EPS-NOL-00080
Ochotona princeps             EPS-OCP-00034
Oreochromis niloticus         EPS-ORN-00089
Oryctolagus cuniculus         EPS-ORC-00064
Oryzias latipes               EPS-ORL-00103
Otolemur garnettii            EPS-OTG-00066
Pan troglodytes               EPS-PAT-00063
Pongo abelii                  EPS-POA-00085
Procavia capensis             EPS-PRC-00029
Rattus norvegicus             EPS-RAN-00070
Sarcophilus harrisii          EPS-SAH-00065
Takifugu rubripes             EPS-TAR-00109
Tetraodon nigroviridis        EPS-TEN-00084
Tursiops truncatus            EPS-TUT-00046
Xenopus tropicalis            EPS-XET-00080
Xiphophorus maculatus         EPS-XIM-00123
Gene Ontology
GO:0005634; C:nucleus
GO:0008420; F:CTD phosphatase activity
GO:0046872; F:metal ion binding
GO:0050768; P:negative regulation of neurogenesis
GO:0045665; P:negative regulation of neuron differentiation
GO:0006470; P:protein dephosphorylation
GO:0006357; P:regulation of transcription from RNA polymerase II promoter
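Each Gene Ontology line above pairs a GO identifier with a one-letter aspect code and a term name. The letters C, F and P denote cellular component, molecular function and biological process. A small sketch for splitting such lines follows. The `GoTerm` type and `parse_go_line` function are hypothetical names, and the `ID; X:name` layout is assumed from the entries shown here.

```python
from typing import NamedTuple

# One-letter aspect codes used in the listing above.
ASPECTS = {
    "C": "cellular_component",
    "F": "molecular_function",
    "P": "biological_process",
}


class GoTerm(NamedTuple):
    go_id: str
    aspect: str
    name: str


def parse_go_line(line: str) -> GoTerm:
    """Split a line such as 'GO:0005634; C:nucleus' into its three parts."""
    go_id, rest = line.split(";", 1)
    code, name = rest.strip().split(":", 1)
    return GoTerm(go_id.strip(), ASPECTS[code], name.strip())


term = parse_go_line("GO:0008420; F:CTD phosphatase activity")
print(term.go_id, term.aspect, term.name)
```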
KEGG
hsa:58190;
InterPros
IPR011948; Dullard_phosphatase.
IPR023214; HAD-like_dom.
IPR004274; NIF.
Pfam
PF03031; NIF; 1.
SMARTs
SM00577; CPDc; 1.
Prosites
PS50969; FCP1; 1.
Prints
Created Date: 20-Feb-2013