EPS-ART-00096
Eukaryotic Protein Kinase & Protein Phosphatase Database
TagContent
EKPD IDEPS-ART-00096
Classification
Group/FamilyScoreE-ValueStartEndDomain Length
PPP/PP7265.87.5E-79631891261
StatusReviewed
Ensembl ProteinAT1G48120.1
UniProt AccessionQ9LNG5;
Protein NameSerine/threonine-protein phosphatase 7 long form homolog
Protein Synonyms/Alias
Gene NameAT1G48120
Gene Synonyms/Alias At1g48120; F21D18.16;
Ensembl Information
Ensembl Gene IDEnsembl Protein IDEnsembl Transcript ID
AT1G48120AT1G48120.1AT1G48120.1
OrganismArabidopsis thaliana
Functional Description
Protein Length1340
Protein Sequence
(FASTA)
MEVQSLLNFD LDPGPVDQSI LVWQHEHRSA AIWEDEVPPR ELTCRHKLLG MRDWPLDPLV 60
CQKLIEFGLY GVYKVAFIQL DYALITALVE RWRPETHTFH LPAGEITVTL QDVNILLGLR 120
VDGPAVTGST KYNWADLCED LLGHRPGPKD LHGSHVSLAW LRENFRNLPA DPDEVTLKCH 180
TRAFVLALMS GFLYGDKSKH DVALTFLPLL RDFDEVAKLS WGSATLALLY RELCRASKRT 240
VSTICGPLVL LQLWAWERLH VGRPGRLKDV GASYMDGIDG PLPDPLGCRW RASLSHKENP 300
RGGLDFYRDQ FDQQKDEQVI WQPYTPDLLA KIPLICVSGE NIWRTVAPLI CFDVVEWHRP 360
DRVLRQFGLH QTIPAPCDNE KALHAIDKRG KSEYDWSARH SRHIGLWEAR VSSVVSGEPE 420
CSPMDYNDPY MEWYRRITRR IISPMNERRP GQFLPTGFAF QVLVQRVAAI HARSRASLEE 480
ELTVGSARQT LQDIVDMCAG ALQLNAPLGS LSNGSVAQAP TPEPFLMLPQ PTPTIIPQKP 540
MGGEMVCLPL NDMEIDDGLA AEPLELMPPV QDIGCEQSLS SVSQKPLFWP SGGKLTFSWV 600
CEVMLVFDWS SKNLPPCEFS SVLPFNVLDE LVLFASKILK KEPNCVRIDS EKAEVVVVGD 660
LHGQLHDLLY LMQDAGFPDG DRFYVFNGNY VDIGAWGLET FLLLLSWKVL LPARVYLLRG 720
SHESESCTSM YGFKNEVLTK YGDKGAAVYK KCLECFQLLP LASVIAGKVY TAHGGLFRDV 780
SSFLSDKQER NRKRKRTQKK QTDNTVLDTE DRSESLPLGS LKDLSKVKRR VIDPPTEGSN 840
LIPGDILWSD PSKDTGLFLN KERGIGLLWG PDCTAKFLQD NNLKWIIRGK GAPDERAKRD 900
DLAPMNGGYA EDHEGLITLF SAPDHPQFQD TEERHNNKAA YIILQIPECE ELKFQPLEAV 960
SPRPKAEAYY DFRRLIHPPS NLVHNITNSV DSPSSVPDDK DNLISSENVE YKSMDLSEQM 1020
EVDEKDDVDS KYSESITDEV AAFGTPASGD RDMVDFSDKT ENGSKEADHS ETAEISKDLS 1080
DTVGKPESCS RTRGTYEAIG TDAKLKSNTP EAINLEPQPG CDLYVPDSGN STESRTEKAA 1140
EEACVGRISI DDCSTTGDAA VELEITYDEK LDRVVTEITG NDAAECMTDG NRDIATDGAE 1200
NLEPSTSKLN YSEPSEDIDD STMKFRHNTS CVADSDLETV NGGVNADCSS SSKCLTSKPV 1260
VAHDKFTNLT KPSHDKGYGE SADKPERVIK LVTYSKRKSS DKKHMIESNE DPQQKVNDSV 1320
DSKNKGSLDK SQSVPGDMDS 1340
Nucleotide Sequence
(FASTA)
ATGGAGGTGC AAAGTCTATT AAACTTTGAT TTGGATCCTG GTCCAGTTGA TCAATCTATA 60
TTGGTGTGGC AACATGAGCA TAGATCAGCT GCTATATGGG AAGATGAGGT TCCTCCTCGT 120
GAACTGACAT GTCGGCACAA GTTATTGGGG ATGCGAGATT GGCCTCTGGA TCCTCTCGTG 180
TGTCAAAAGT TGATAGAGTT TGGTCTATAT GGAGTTTACA AGGTTGCCTT TATACAACTT 240
GATTATGCTC TGATAACAGC TTTGGTGGAG AGATGGAGAC CCGAAACGCA TACTTTTCAT 300
CTTCCTGCTG GAGAGATCAC TGTGACTTTA CAAGATGTGA ATATTTTGTT GGGTCTACGT 360
GTCGATGGAC CTGCAGTAAC TGGTAGTACA AAATACAACT GGGCTGATTT ATGTGAGGAT 420
TTGCTTGGTC ATAGGCCAGG TCCCAAGGAT CTTCATGGTT CTCATGTGTC CTTAGCGTGG 480
CTGCGTGAGA ATTTTCGGAA TTTACCTGCT GATCCTGATG AAGTGACGTT GAAGTGTCAC 540
ACTAGAGCTT TTGTATTAGC TTTGATGAGT GGTTTTCTTT ATGGAGATAA GTCTAAACAT 600
GATGTGGCTT TGACGTTTCT CCCTCTGCTA AGGGACTTTG ATGAAGTGGC TAAGCTTAGC 660
TGGGGTAGTG CTACTCTAGC ACTTCTCTAC AGAGAGTTGT GCCGGGCAAG TAAGAGGACT 720
GTTTCTACCA TATGTGGCCC TCTTGTTCTG CTGCAGTTAT GGGCATGGGA ACGACTGCAT 780
GTTGGTCGTC CAGGAAGACT TAAAGATGTT GGTGCTTCTT ATATGGATGG CATAGATGGT 840
CCTTTACCAG ATCCATTAGG CTGTAGGTGG AGGGCTTCTC TGAGTCATAA AGAGAATCCT 900
CGTGGAGGAC TGGACTTTTA CAGGGACCAG TTTGACCAGC AGAAAGATGA GCAGGTTATA 960
TGGCAGCCTT ACACTCCAGA TCTTCTAGCA AAGATTCCTC TGATATGTGT TAGTGGTGAG 1020
AACATATGGC GAACTGTTGC ACCACTGATT TGTTTTGATG TGGTTGAGTG GCATCGTCCT 1080
GATCGGGTAC TTAGACAGTT TGGTCTTCAC CAAACAATAC CAGCACCTTG TGATAATGAA 1140
AAAGCTCTTC ATGCTATTGA CAAAAGAGGC AAATCAGAAT ATGACTGGTC AGCACGTCAT 1200
AGTCGACATA TTGGATTGTG GGAGGCACGT GTGTCATCTG TTGTATCCGG TGAGCCAGAG 1260
TGTAGCCCAA TGGACTATAA TGATCCTTAT ATGGAGTGGT ACCGCAGAAT TACTAGAAGA 1320
ATTATTAGTC CTATGAACGA GAGGCGCCCC GGACAATTTC TACCCACTGG TTTTGCTTTC 1380
CAAGTGCTTG TTCAGAGGGT TGCAGCCATT CATGCTCGAT CTAGGGCTTC ACTTGAGGAG 1440
GAACTTACTG TTGGCTCCGC GAGGCAAACA TTACAAGATA TCGTTGATAT GTGTGCAGGC 1500
GCCCTTCAAC TCAATGCGCC TTTAGGCTCG TTATCTAATG GCTCCGTTGC TCAAGCTCCA 1560
ACTCCAGAAC CATTTTTGAT GTTACCTCAA CCTACACCTA CAATAATACC CCAAAAGCCG 1620
ATGGGTGGAG AAATGGTATG CTTACCATTG AATGACATGG AGATAGATGA TGGTCTTGCA 1680
GCAGAACCGT TGGAACTGAT GCCACCTGTT CAGGATATAG GATGTGAGCA GTCTTTATCA 1740
TCAGTTTCCC AGAAACCTCT TTTCTGGCCG TCGGGAGGGA AACTTACTTT CAGCTGGGTT 1800
TGTGAAGTAA TGTTGGTGTT TGATTGGTCG TCTAAGAATC TCCCACCTTG TGAATTTTCA 1860
AGTGTTCTGC CTTTTAATGT CTTGGATGAG CTGGTTCTCT TTGCTTCCAA AATCCTGAAA 1920
AAAGAGCCAA ACTGCGTTAG GATAGACAGT GAGAAAGCAG AGGTCGTTGT GGTTGGAGAT 1980
CTACATGGGC AGCTACATGA TTTGCTGTAC CTTATGCAGG ATGCTGGATT CCCGGATGGT 2040
GATCGGTTCT ATGTGTTCAA TGGGAATTAT GTAGACATAG GAGCATGGGG TTTAGAAACT 2100
TTCTTGCTTC TATTATCATG GAAGGTTCTT CTACCGGCAA GAGTGTATCT TCTACGTGGC 2160
AGTCACGAGA GTGAAAGTTG TACATCAATG TATGGATTTA AAAATGAAGT ATTAACCAAG 2220
TATGGAGATA AAGGAGCAGC TGTCTACAAG AAGTGCTTGG AGTGCTTCCA GTTACTCCCT 2280
TTGGCTTCTG TAATTGCTGG AAAGGTATAT ACAGCTCATG GAGGTCTTTT CCGTGATGTA 2340
TCAAGCTTTC TTTCAGACAA GCAAGAACGG AACAGAAAAC GCAAGCGAAC TCAGAAAAAG 2400
CAAACGGATA ATACTGTCCT TGACACTGAA GATAGATCTG AGTCCTTGCC TCTTGGCTCG 2460
TTGAAGGATT TATCCAAAGT GAAAAGACGT GTTATAGATC CTCCTACTGA AGGTTCAAAT 2520
TTGATTCCTG GTGACATTCT TTGGTCAGAT CCATCGAAAG ACACCGGCCT CTTTCTTAAC 2580
AAAGAGAGAG GTATCGGTTT GTTGTGGGGT CCTGATTGCA CCGCAAAGTT TCTACAAGAC 2640
AATAATTTAA AGTGGATCAT CAGAGGGAAG GGAGCCCCTG ACGAAAGGGC AAAACGAGAT 2700
GACCTTGCAC CAATGAATGG AGGATATGCA GAAGACCATG AAGGGCTTAT AACTTTGTTC 2760
AGTGCTCCTG ATCATCCTCA GTTTCAGGAT ACAGAGGAGA GACACAACAA CAAAGCAGCC 2820
TACATAATAT TACAAATCCC TGAATGTGAG GAGCTTAAAT TTCAACCGCT TGAAGCAGTA 2880
TCTCCGAGGC CAAAGGCTGA GGCGTACTAC GACTTTAGAC GCCTGATTCA TCCACCCAGT 2940
AATTTAGTAC ACAATATTAC GAATTCTGTT GATTCACCAT CCTCAGTGCC TGATGACAAA 3000
GATAATTTAA TATCTTCTGA AAATGTGGAA TATAAGTCCA TGGATCTCTC TGAACAGATG 3060
GAAGTTGATG AAAAAGATGA TGTTGATTCC AAGTACTCAG AATCTATAAC TGATGAAGTT 3120
GCAGCTTTTG GAACTCCAGC ATCTGGTGAT AGAGATATGG TTGATTTTTC AGATAAGACT 3180
GAAAATGGTT CTAAAGAAGC TGACCATTCC GAAACTGCAG AGATTAGTAA AGATTTATCT 3240
GATACAGTTG GAAAACCGGA GTCTTGCAGC AGAACAAGAG GTACATATGA AGCTATCGGA 3300
ACTGATGCAA AGCTAAAATC TAACACTCCT GAAGCTATTA ATCTGGAACC GCAGCCAGGT 3360
TGTGATCTGT ATGTGCCTGA TTCTGGAAAT TCTACAGAAT CTAGGACCGA AAAAGCTGCA 3420
GAAGAGGCTT GTGTTGGCAG AATAAGCATA GATGACTGTA GCACCACTGG TGATGCTGCA 3480
GTGGAATTAG AAATTACATA TGATGAGAAA CTGGATAGAG TGGTAACTGA AATCACAGGG 3540
AACGATGCAG CTGAATGTAT GACAGACGGT AATAGAGACA TAGCCACCGA TGGTGCTGAG 3600
AACTTGGAAC CTTCGACTAG CAAACTTAAT TATTCAGAGC CCAGTGAAGA TATTGATGAT 3660
TCTACCATGA AGTTCAGACA TAATACTAGC TGCGTTGCTG ATTCTGATCT TGAGACTGTG 3720
AATGGTGGGG TTAATGCAGA CTGTAGCAGT TCATCGAAAT GTCTAACCAG CAAGCCAGTG 3780
GTGGCTCATG ACAAGTTTAC AAATCTTACA AAACCTTCTC ATGATAAAGG TTATGGAGAG 3840
TCTGCAGATA AGCCAGAGAG AGTAATTAAA CTGGTTACAT ATTCTAAGCG TAAGTCATCT 3900
GACAAAAAAC ACATGATTGA ATCTAATGAA GATCCCCAAC AGAAGGTCAA TGATTCTGTC 3960
GATTCAAAGA ACAAAGGTTC TCTTGACAAG TCACAATCAG TTCCAGGAGA TATGGACTCA 4020
TGA 4023
Domain Profile
S: 22    vlfeakkvlkqlpnishvstavskevtvvGdlhGklddlllvlkknGlPsernpyvfnGd  81
         ++  a+k+lk+ pn  ++++    ev+vvGdlhG+l+dll +++++G+P  ++ yvfnG+
Q: 631   LVLFASKILKKEPNCVRIDSE-KAEVVVVGDLHGQLHDLLYLMQDAGFPDGDRFYVFNGN  689
         5556899***********865.569***********************************
S: 82    fvdrGkrglevllvllslflvfPnavylnrGnhedkvmnaryGfekevlskykrkgkril  141
         +vd G++gle++l+lls+++++P +vyl+rG he++ ++++yGf++evl+ky++kg  ++
Q: 690   YVDIGAWGLETFLLLLSWKVLLPARVYLLRGSHESESCTSMYGFKNEVLTKYGDKGAAVY  749
         ************************************************************
S: 142   rileevyewlPlgsivdsrvlvvhGGlsests..ldllksverkkrksvllpPeessrd.  198
         +++ e+++ lPl+s++ ++v+++hGGl++  s  l   ++++rk++++   +  ++  d 
Q: 750   KKCLECFQLLPLASVIAGKVYTAHGGLFRDVSsfLSDKQERNRKRKRTQKKQTDNTVLDt  809
         ****************************96551155555556666665555555555558
S: 199   ........lgrldtdgepresltetewe..qii..dilwsdPratnGlvPntlrGaGvyf  246
                 lg+l++  + ++++ +++ e  ++i  dilwsdP+  +Gl+ n+ rG+G+++
Q: 810   edrseslpLGSLKDLSKVKRRVIDPPTEgsNLIpgDILWSDPSKDTGLFLNKERGIGLLW  869
         88889999****************9998889*****************************
S: 247   GPdvtedflkkyelklvirshe  268
         GPd+t +fl+  +lk +ir + 
Q: 870   GPDCTAKFLQDNNLKWIIRGKG  891
         *******************875
Domain Sequence
(FASTA)
LVLFASKILK KEPNCVRIDS EKAEVVVVGD LHGQLHDLLY LMQDAGFPDG DRFYVFNGNY 60
VDIGAWGLET FLLLLSWKVL LPARVYLLRG SHESESCTSM YGFKNEVLTK YGDKGAAVYK 120
KCLECFQLLP LASVIAGKVY TAHGGLFRDV SSFLSDKQER NRKRKRTQKK QTDNTVLDTE 180
DRSESLPLGS LKDLSKVKRR VIDPPTEGSN LIPGDILWSD PSKDTGLFLN KERGIGLLWG 240
PDCTAKFLQD NNLKWIIRGK G 261
KeywordComplete proteome; Hydrolase; Iron; Manganese; Metal-binding; Protein phosphatase; Reference proteome.
Sequence SourceEnsembl
Orthology
Ortholog group
Brassica rapa"; ?>
EPS-BRR-00148
Gene Ontology
GO:0046872; F:metal ion binding
GO:0004721; F:phosphoprotein phosphatase activity
KEGG
ath:AT1G48120;
InterPros
IPR019557; AminoTfrase-like_pln_mobile.
IPR004843; Metallo_PEstase_dom.
IPR006186; Ser/Thr-sp_prot-phosphatase.
Pfam
PF00149; Metallophos; 1.
PF10536; PMD; 1.
SMARTs
SM00156; PP2Ac; 1.
Prosites
PS00125; SER_THR_PHOSPHATASE; FALSE_NEG.
Prints
PR00114; STPHPHTASE.
Created Date20-Feb-2013