EPS-GLM-00031
Eukaryotic Protein Kinase & Protein Phosphatase Database
Tag | Content
EKPD ID | EPS-GLM-00031
Classification
Group/Family | Score | E-Value | Start | End | Domain Length
DSP/aDSP | 40.5 | 8.1E-10 | 134 | 208 | 75
Status | Unreviewed
Ensembl Protein | GLYMA19G33690.1
UniProt Accession | I1N9F9;
Protein Name
Protein Synonyms/Alias
Gene Name
Gene Synonyms/Alias
Ensembl Information
Ensembl Gene ID | Ensembl Protein ID | Ensembl Transcript ID
GLYMA19G33690 | GLYMA19G33690.1 | GLYMA19G33690.1
Organism | Glycine max
Functional Description
Protein Length | 634
Protein Sequence
MSYHMHNNEE THRGLKRKPP HMDRGQERRP YNTNSRPYDM NAVPDGWLDC PPHGQEIGCI 60
IPSKVPLGDS FNDYISSQKY TPKQAIHQQR VLGRELGLVI DLTNTTRYYR VSDWTKEGIG 120
HVKIRCTGRD AVPDDESVKK FCDKVLDFCS QRTNTKKYIL VHCTHGHNRT GYMIVHFLVR 180
TESISVTEAI NKFACARHPG IYKQDYIDAL YMFYHEKKPE DLVCPQTPEW KRISDPDFRG 240
TAVPAVDNCA HIPEQGTIVR NEVLTSDDAL GDPIPPNQLR SMQELCYQLL KLGTGGRGCW 300
QFPGSHPVSL NRDNLQLLRQ RYYYATWKAD GTRYMMLITC DGCYLIDRKF LFQRINMRFP 360
CRYTNGGTPE RNHHYTLLDG EMIIDTDPHT HKQERRYLIY DLIAINQVSL TELPFYERWK 420
LLEKEVIEPR NMEREGLSKS INPYYRYDLE PFSVRRKGFW LLSTVSKLLH KFIPQLSHSS 480
DGLVFQGWDD PYVPRTHEGL LKWKYPEMNS VDFLCEVGAG DRPLLFLFER GRKKLMEENV 540
IFKDASDISS YSGKIIECYW DSAEHHWVCM RIRIDKATPN DINTYRKVMR SIKDNITEDV 600
LLNEINQIIR LPLYADRIQR DIKAHQHMIS LRRK 634
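The sequence above is printed in groups of ten residues, with a running position counter at the end of each row. A minimal sketch of how such a block might be flattened into one contiguous string before further analysis follows; the helper name `clean_sequence_block` is hypothetical and not part of the database.

```python
import re


def clean_sequence_block(block: str) -> str:
    """Join a numbered, space-grouped sequence block into one string.

    Assumes each row ends with an optional running position counter
    (for example ' 60') and that groups are separated by whitespace.
    """
    rows = []
    for line in block.splitlines():
        # Drop the trailing position counter, if one is present.
        line = re.sub(r"\s+\d+\s*$", "", line)
        # Remove the whitespace between the 10-residue groups.
        rows.append(re.sub(r"\s+", "", line))
    return "".join(rows)


# First and last rows of the protein sequence, copied from the record.
first_row = "MSYHMHNNEE THRGLKRKPP HMDRGQERRP YNTNSRPYDM NAVPDGWLDC PPHGQEIGCI 60"
last_row = "LLNEINQIIR LPLYADRIQR DIKAHQHMIS LRRK 634"

cleaned_first = clean_sequence_block(first_row)
cleaned_last = clean_sequence_block(last_row)

print(len(cleaned_first), len(cleaned_last))  # 60 34
```

The last row starts at position 601 and contributes 34 residues, which agrees with the stated protein length of 634.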
Nucleotide Sequence
ATGAGCTATC ACATGCATAA TAATGAAGAA ACACACAGAG GTTTGAAAAG AAAGCCCCCA 60
CATATGGATC GAGGGCAAGA ACGGAGACCA TACAATACGA ATTCTAGACC TTATGATATG 120
AACGCGGTTC CTGATGGTTG GTTAGACTGC CCACCACATG GGCAAGAAAT AGGATGTATA 180
ATCCCTTCAA AGGTTCCACT TGGTGATTCC TTCAATGATT ACATTTCAAG TCAAAAATAT 240
ACTCCCAAGC AAGCAATTCA TCAACAAAGA GTCTTAGGTA GAGAACTTGG TTTAGTGATT 300
GATCTGACAA ATACTACTCG CTACTATCGT GTATCAGATT GGACAAAAGA AGGGATTGGT 360
CATGTCAAGA TAAGATGCAC AGGAAGAGAT GCTGTACCTG ATGATGAATC CGTAAAAAAA 420
TTTTGTGATA AGGTTCTGGA TTTTTGCTCC CAAAGAACAA ACACAAAGAA ATATATACTT 480
GTACATTGCA CACATGGGCA TAATCGTACA GGTTATATGA TTGTTCACTT TCTTGTGCGC 540
ACGGAATCTA TTTCTGTTAC TGAGGCGATA AATAAATTTG CTTGCGCACG CCACCCAGGA 600
ATTTACAAGC AGGACTACAT TGATGCCCTG TACATGTTTT ATCATGAAAA GAAGCCTGAA 660
GATCTTGTTT GTCCTCAAAC TCCTGAATGG AAGAGAATTT CTGATCCCGA TTTTCGTGGC 720
ACAGCTGTTC CAGCTGTGGA CAATTGTGCA CACATTCCTG AACAGGGGAC TATTGTGAGG 780
AATGAAGTAT TAACAAGTGA TGATGCTTTG GGAGATCCAA TTCCTCCAAA CCAGCTGCGT 840
TCAATGCAAG AACTTTGTTA TCAACTGCTC AAGTTGGGCA CAGGGGGAAG AGGATGCTGG 900
CAGTTTCCTG GATCACACCC TGTTTCTCTG AACAGGGACA ATCTGCAACT CTTAAGACAA 960
CGATATTACT ATGCCACGTG GAAAGCTGAT GGGACACGAT ATATGATGCT GATTACTTGC 1020
GATGGATGTT ATCTGATTGA CAGAAAATTT CTTTTCCAAA GGATCAATAT GCGATTTCCT 1080
TGCAGATATA CTAATGGGGG TACACCTGAG AGGAATCACC ACTACACATT ACTTGATGGG 1140
GAGATGATTA TTGACACAGA TCCACATACA CATAAGCAAG AGAGAAGATA CCTCATTTAT 1200
GATTTGATAG CTATTAATCA AGTCTCTTTG ACAGAGCTGC CATTCTATGA AAGGTGGAAA 1260
TTGCTGGAGA AAGAAGTAAT TGAACCTCGT AATATGGAAC GTGAGGGCCT CTCAAAGAGT 1320
ATAAATCCTT ATTATAGATA TGACTTGGAA CCATTTAGTG TGAGGAGGAA GGGTTTTTGG 1380
TTACTATCTA CCGTGTCAAA GCTCCTTCAT AAATTCATTC CACAACTATC GCATTCATCA 1440
GATGGTCTTG TATTCCAGGG TTGGGATGAT CCATATGTCC CTCGCACACA TGAAGGTCTT 1500
CTGAAGTGGA AATACCCTGA GATGAACTCT GTTGATTTCC TCTGTGAGGT GGGAGCTGGT 1560
GATCGTCCCT TGCTTTTTCT ATTTGAACGA GGAAGGAAGA AGTTAATGGA AGAGAATGTT 1620
ATTTTTAAAG ATGCATCGGA CATTTCTTCC TATTCAGGGA AGATCATTGA ATGCTACTGG 1680
GATTCTGCGG AGCATCATTG GGTCTGCATG CGGATCAGAA TAGACAAAGC CACTCCAAAT 1740
GATATCAATA CCTATAGAAA GGTAATGCGA AGCATAAAGG ATAACATCAC AGAAGATGTA 1800
TTATTGAATG AGATCAATCA GATCATTCGC CTACCCTTGT ATGCAGACAG AATACAGAGG 1860
GATATCAAGG CTCACCAGCA TATGATCTCT TTGCGGCGCA AGTGA 1905
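As a consistency check between the two sequences, the nucleotide length of 1905 equals 3 × (634 + 1), that is, 634 codons plus one stop codon. The sketch below translates the final row of the coding sequence and compares it with the end of the protein. It assumes the standard genetic code (NCBI translation table 1); the function name `translate` is hypothetical.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence; '*' marks a stop codon."""
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# Final row of the nucleotide sequence (positions 1861-1905), spaces removed.
# Position 1861 starts a codon because 1860 is divisible by 3.
last_row = "GATATCAAGGCTCACCAGCATATGATCTCTTTGCGGCGCAAGTGA"

tail = translate(last_row)
print(tail)                # DIKAHQHMISLRRK*
print(3 * (634 + 1))       # 1905
```

The translated tail matches the last 14 residues of the protein sequence (`DIKAHQHMISLRRK`), followed by the stop codon `TGA`.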
Domain Profile
S: 64    dvtdllqyfdeaaefideakekggkvlvHCaaGvsRSatlvlaYLmkveglslsdAieav  123
         d++ + ++ d++ +f  + ++ ++ +lvHC++G +R++++++ +L+++e +s+++Ai+++
Q: 134   DDESVKKFCDKVLDFCSQRTNTKKYILVHCTHGHNRTGYMIVHFLVRTESISVTEAINKF  193
         567899****************************************************99
S: 124   krkrpsiepnegfle  138
            r   + +++++ 
Q: 194   ACARHPGIYKQDYID  208
         987754445555555
Domain Sequence
DDESVKKFCD KVLDFCSQRT NTKKYILVHC THGHNRTGYM IVHFLVRTES ISVTEAINKF 60
ACARHPGIYK QDYID 75
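The domain sequence corresponds to residues 134 to 208 of the full protein, using 1-based inclusive coordinates as in the Classification table. The following sketch shows one way to recover it from the protein; to stay short it uses only the two protein rows covering positions 121 to 240, and the helper `slice_1based` is a hypothetical name.

```python
def slice_1based(segment: str, segment_start: int, start: int, end: int) -> str:
    """Return residues start..end (1-based, inclusive) from a segment
    whose first residue sits at position segment_start in the protein."""
    if start < segment_start or end < start:
        raise ValueError("requested range is outside the segment")
    return segment[start - segment_start : end - segment_start + 1]


# Protein rows covering positions 121-180 and 181-240, spaces removed.
segment = (
    "HVKIRCTGRDAVPDDESVKKFCDKVLDFCSQRTNTKKYILVHCTHGHNRTGYMIVHFLVR"
    "TESISVTEAINKFACARHPGIYKQDYIDALYMFYHEKKPEDLVCPQTPEWKRISDPDFRG"
)

domain = slice_1based(segment, segment_start=121, start=134, end=208)

print(len(domain))   # 75
print(domain[:10])   # DDESVKKFCD
```

The length follows from End - Start + 1 = 208 - 134 + 1 = 75, matching the Domain Length column.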
Keyword | Complete proteome; Hydrolase; Reference proteome.
Sequence Source | Ensembl
Orthology
Ortholog group
Gene Ontology
GO:0005634; C:nucleus
GO:0004484; F:mRNA guanylyltransferase activity
GO:0004651; F:polynucleotide 5'-phosphatase activity
GO:0004725; F:protein tyrosine phosphatase activity
GO:0008138; F:protein tyrosine/serine/threonine phosphatase activity
GO:0006370; P:7-methylguanosine mRNA capping
GO:0035335; P:peptidyl-tyrosine dephosphorylation
KEGG
gmx:100778292;
InterPros
IPR000340; Dual-sp_phosphatase_cat-dom.
IPR017074; mRNA_cap_enz_bifunc.
IPR001339; mRNA_cap_enzyme.
IPR013846; mRNA_cap_enzyme_C.
IPR012340; NA-bd_OB-fold.
IPR016027; NA-bd_OB-fold-like.
IPR000387; Tyr/Dual-specificity_Pase.
IPR016130; Tyr_Pase_AS.
Pfam
PF00782; DSPc; 1.
PF03919; mRNA_cap_C; 1.
PF01331; mRNA_cap_enzyme; 1.
SMARTs
Prosites
PS00383; TYR_PHOSPHATASE_1; 1.
PS50056; TYR_PHOSPHATASE_2; 1.
Prints
Created Date | 20-Feb-2013