EKS-PHP-00125
Eukaryotic Protein Kinase & Protein Phosphatase Database
TagContent
EKPD IDEKS-PHP-00125
Classification
Group/FamilyScoreE-ValueStartEndDomain Length
Other/Haspin174.26.6E-51493761269
StatusUnreviewed
Ensembl ProteinPP1S218_52V6.1
UniProt AccessionA9TF75;
Protein Name
Protein Synonyms/Alias Predicted protein;
Gene NameA9TF75_PHYPA
Gene Synonyms/Alias PHYPADRAFT_144655;
Ensembl Information
Ensembl Gene IDEnsembl Protein IDEnsembl Transcript ID
PP1S218_52V6PP1S218_52V6.1PP1S218_52V6.1
OrganismPhyscomitrella patens
Functional Description
Protein Length804
Protein Sequence
(FASTA)
MAEHAEAAES SFFHDIFSME PKKVTAVYKR RPKKAEPQHP DRSSIGLGTT TGKESGIFVF 60
RDEGAKSRDI LGRKIKGRKD SWLHRSLSFR GRRSSLMPTH RQPLGSLASN IYGNPSNILL 120
KSNPGNLASK TRRTWAGVPS SYVKEQRAIF ADVDAFDLDE KEAGFSPTPK KQPPVRDSLD 180
FAKSQMRAYS FDEAFRHPTL SDSELLSAAL RTPGRPFGKA DSRKYDVSRP LSSFTEQSIP 240
AQNSFWLHTD SGINDGQDED ISSPQTLRDK SIQLRSSRSS ADFVSLKENS LRLSDSSWIS 300
NDEENQLALP SPEVSWDQQM RRRSSKSSVD SISQGRNSRS ISSIDQEIPT PVGTRKNSSL 360
DMETASLLNR LEALHIREEE SHSMFSKSFR TLVPEEAEEE REGIGHCIFR KMSHCTPVPE 420
EDEDENEEQL DRPDGSQTGD GLNASLEEDI AEKAVEQLEA DTFLSVFEVL VQECRQTEIL 480
TLGEALSGFC DLRQIKKLGE GTFGEAFKGG SSVFKIVPMD GKFQVNGEAQ KTSAEMLSEV 540
VLSNALNELR GGPMRNEPNI CSTFVETKAT RICQGCYDPE LVRAWEEWDS LNTSENDHPS 600
IFPNQQLYVV FFLTDGGRDL ESFSLENFNE ARSLLLQIVL ALAVAEEACE FEHRDLHWGN 660
IVLSRDQREH VVFRLLGQEK QVKTYGLSVS LIDFTLSRIN TGNQVLFCNL AADPALFEGP 720
KNDVQANTYR RMKKVTGGQW EQRFLQTNCL WIHYVADILL TKKTFSSSPA EKRSLRAFRK 780
RVMLYESSGA AVLDEFFNGM WADV 804
Nucleotide Sequence
(FASTA)
ATGGCAGAGC ATGCTGAGGC TGCGGAATCT TCCTTTTTTC ATGATATCTT CAGCATGGAA 60
CCAAAGAAAG TTACTGCTGT TTACAAACGC CGTCCGAAAA AGGCCGAACC CCAACATCCT 120
GATCGCTCGA GTATCGGTTT GGGAACCACT ACCGGCAAAG AAAGTGGAAT ATTTGTTTTC 180
AGAGATGAAG GGGCCAAAAG CAGGGATATC CTAGGAAGGA AAATTAAAGG AAGGAAAGAT 240
AGTTGGCTCC ATCGATCCCT CTCATTCAGA GGAAGACGAA GTAGCCTAAT GCCTACTCAC 300
CGGCAGCCTT TGGGCTCTCT TGCGAGCAAT ATCTACGGCA ATCCTTCAAA TATACTTCTC 360
AAGAGCAACC CAGGCAATTT GGCGTCGAAA ACCAGGAGGA CTTGGGCTGG AGTTCCATCT 420
TCATACGTGA AAGAACAGCG AGCGATTTTT GCAGATGTTG ATGCTTTTGA CTTAGATGAA 480
AAAGAGGCTG GTTTCAGTCC GACTCCCAAG AAGCAGCCAC CAGTGCGGGA TAGTTTAGAT 540
TTCGCAAAAT CCCAAATGCG AGCTTACAGC TTTGATGAAG CATTTCGTCA TCCAACTCTC 600
TCTGACAGTG AACTATTAAG TGCTGCTCTA CGGACTCCAG GCCGACCTTT TGGTAAAGCG 660
GATTCCAGAA AGTACGATGT CTCTCGTCCT CTATCTTCGT TTACTGAACA AAGTATTCCA 720
GCACAAAATT CATTCTGGTT ACATACTGAT TCTGGCATCA ACGACGGTCA AGATGAAGAT 780
ATTTCTTCAC CACAAACACT CAGAGATAAA TCCATTCAAC TACGCAGCAG TAGGAGTTCT 840
GCAGACTTTG TCAGCCTTAA AGAGAACTCT CTCCGTTTAT CGGATAGTTC TTGGATCAGC 900
AACGACGAAG AAAATCAACT GGCTCTGCCA TCTCCAGAAG TGTCATGGGA CCAACAGATG 960
AGACGACGAA GCAGCAAATC TTCTGTGGAT TCTATTAGCC AAGGACGAAA CAGCAGAAGC 1020
ATAAGCAGCA TTGATCAGGA GATCCCAACT CCTGTGGGAA CTCGGAAGAA TTCCAGCCTT 1080
GATATGGAAA CTGCGTCCCT CCTCAATCGA TTGGAGGCCT TGCATATTAG GGAAGAGGAA 1140
AGTCACAGCA TGTTCAGCAA GAGCTTTCGC ACTCTAGTAC CCGAGGAGGC TGAGGAGGAG 1200
AGGGAAGGAA TAGGTCACTG TATATTCCGT AAGATGAGTC ATTGTACTCC AGTACCTGAG 1260
GAGGATGAAG ACGAAAATGA AGAGCAGTTG GACAGACCTG ATGGCTCACA GACTGGAGAT 1320
GGGCTTAATG CATCATTGGA AGAAGACATT GCGGAAAAGG CTGTAGAACA GTTGGAAGCA 1380
GATACCTTTC TATCAGTGTT TGAGGTTCTC GTACAAGAAT GTCGTCAAAC GGAGATCCTG 1440
ACATTGGGTG AAGCACTTTC GGGTTTCTGC GATTTACGGC AAATTAAGAA ATTAGGAGAG 1500
GGAACATTTG GAGAAGCCTT TAAAGGGGGT AGTAGTGTTT TTAAGATAGT GCCAATGGAT 1560
GGGAAATTTC AAGTTAATGG TGAGGCACAA AAGACTTCTG CGGAAATGCT GAGCGAGGTG 1620
GTTTTGTCCA ACGCTTTAAA CGAATTGCGG GGTGGGCCAA TGAGGAATGA ACCCAATATT 1680
TGCAGCACAT TCGTTGAGAC CAAAGCCACG CGAATCTGCC AGGGCTGCTA TGACCCTGAG 1740
CTCGTCAGGG CTTGGGAGGA ATGGGATTCC CTCAATACGT CTGAAAATGA TCATCCCTCT 1800
ATCTTCCCCA ATCAGCAGTT GTATGTGGTA TTTTTCCTGA CAGATGGAGG CAGGGATCTC 1860
GAGAGTTTTT CTTTGGAGAA TTTCAATGAA GCACGGAGTC TCTTACTCCA GATTGTATTA 1920
GCACTCGCAG TAGCAGAGGA GGCTTGCGAA TTTGAGCACC GGGATCTTCA CTGGGGCAAC 1980
ATAGTGCTGT CAAGGGATCA ACGTGAACAT GTAGTCTTTC GATTGCTTGG CCAAGAGAAG 2040
CAAGTGAAAA CATATGGCCT GTCTGTGTCC CTCATCGACT TCACGCTCTC CCGAATCAAT 2100
ACAGGAAATC AAGTTCTGTT CTGTAACTTA GCTGCAGATC CGGCTTTATT TGAGGGCCCC 2160
AAGAACGATG TGCAGGCTAA TACATATCGA AGAATGAAGA AGGTAACTGG CGGCCAATGG 2220
GAGCAGCGGT TTCTTCAGAC GAATTGTCTC TGGATTCACT ATGTAGCGGA TATTCTTCTT 2280
ACCAAAAAGA CTTTCAGCTC TTCCCCAGCG GAGAAGCGGT CATTGCGTGC TTTCAGAAAG 2340
AGGGTGATGC TATATGAGTC ATCAGGAGCT GCTGTTTTGG ATGAATTCTT TAATGGCATG 2400
TGGGCTGATG TCTAA 2415
Domain Profile
S: 2     ekvkklGeGayGevfsvtwkgkevvlKiiplegsdv........keealkEliilkklsk  53
         ++ kklGeG +Ge+f+    g + v Ki+p+ g+ +        + e+l+E++ ++ l++
Q: 493   RQIKKLGEGTFGEAFK----GGSSVFKIVPMDGKFQvngeaqktSAEMLSEVVLSNALNE  548
         5789**********95....9999*********9999999999999**************
S: 54    lske......nstpnFlellgakvvkgdvpkellkawdkydkekes..........dqly  97
         l+        n +  F+e   + + +g +++el++aw++ d+ + s          +qly
Q: 549   LRGGpmrnepNICSTFVETKATRICQGCYDPELVRAWEEWDSLNTSendhpsifpnQQLY  608
         *98888889999************************************************
S: 98    lvllleeggtsldniklksesqllsilqqlvlslaiaekelkfeHrDlhlgNvLikktkk  157
         +v++l +gg++l++++l++ +++ s l q+vl+la+ae++ +feHrDlh+gN+ +++ + 
Q: 609   VVFFLTDGGRDLESFSLENFNEARSLLLQIVLALAVAEEACEFEHRDLHWGNIVLSRDQR  668
         ************************************************************
S: 158   keleYtldgkkiklkskgvlvtiIDftksrlvkkdkevvyedlevdeelfegkg.dkqfd  216
         +++ ++l g++ ++k++g+ v++IDft+sr ++ +++v++++l  d +lfeg + d q +
Q: 669   EHVVFRLLGQEKQVKTYGLSVSLIDFTLSR-INTGNQVLFCNLAADPALFEGPKnDVQAN  727
         ******************************.9********************76488***
S: 217   vyrlmranvknewkefepktnllwltylsskllk  250
         +yr m++    +w++   +tn lw++y+++ ll+
Q: 728   TYRRMKKVTGGQWEQRFLQTNCLWIHYVADILLT  761
         *****************************99987
Domain Sequence
(FASTA)
RQIKKLGEGT FGEAFKGGSS VFKIVPMDGK FQVNGEAQKT SAEMLSEVVL SNALNELRGG 60
PMRNEPNICS TFVETKATRI CQGCYDPELV RAWEEWDSLN TSENDHPSIF PNQQLYVVFF 120
LTDGGRDLES FSLENFNEAR SLLLQIVLAL AVAEEACEFE HRDLHWGNIV LSRDQREHVV 180
FRLLGQEKQV KTYGLSVSLI DFTLSRINTG NQVLFCNLAA DPALFEGPKN DVQANTYRRM 240
KKVTGGQWEQ RFLQTNCLWI HYVADILLT 269
KeywordComplete proteome; Reference proteome.
Sequence SourceEnsembl
Orthology
Ortholog group
Arabidopsis lyrata"; ?>Arabidopsis thaliana"; ?>Brachypodium distachyon"; ?>Chlamydomonas reinhardtii"; ?>Glycine max"; ?>Hordeum vulgare"; ?>Musa acuminata"; ?>Oryza brachyantha"; ?>Oryza glaberrima"; ?>Oryza indica"; ?>Oryza sativa"; ?>Populus trichocarpa"; ?>Selaginella moellendorffii"; ?>Setaria italica"; ?>Solanum lycopersicum"; ?>Solanum tuberosum"; ?>Vitis vinifera"; ?>Zea mays"; ?>
EKS-ARL-00123
EKS-ART-00100
EKS-BRD-00127
EKS-CHR-00047
EKS-GLM-00001
EKS-HOV-00106
EKS-MUA-00217
EKS-ORB-00121
EKS-ORG-00113
EKS-ORI-00198
EKS-ORS-00154
EKS-POT-00215
EKS-SEM-00155
EKS-SEI-00134
EKS-SOL-00126
EKS-SOT-00085
EKS-VIV-00145
EKS-ZEM-00193
Gene Ontology
GO:0005524; F:ATP binding
GO:0004672; F:protein kinase activity
KEGG
ppp:PHYPADRAFT_144655;
InterPros
IPR024604; DUF3635.
IPR011009; Kinase-like_dom.
IPR000719; Prot_kinase_cat_dom.
Pfam
PF12330; DUF3635; 1.
SMARTs
Prosites
PS50011; PROTEIN_KINASE_DOM; 1.
Prints
Created Date20-Feb-2013