EKS-GLM-01171
Eukaryotic Protein Kinase & Protein Phosphatase Database
Tag: Content
EKPD ID: EKS-GLM-01171
Classification
Group/Family  Score  E-Value  Start  End   Domain Length
TKL/IRAK      202.9  2.5E-59  928    1200  273
Status: Unreviewed
Ensembl Protein: GLYMA04G12860.2
UniProt Accession: K7KJI2
Protein Name
Protein Synonyms/Alias
Gene Name
Gene Synonyms/Alias
Ensembl Information
Ensembl Gene ID  Ensembl Protein ID  Ensembl Transcript ID
GLYMA04G12860    GLYMA04G12860.2     GLYMA04G12860.2
Organism: Glycine max
Functional Description
Protein Length: 1227
Protein Sequence
(FASTA)
MKHESEKPQK GEKMKREKPY LMKKMPWSPR REAPIVVRFV VIALFIMITV PTPPTAADAE 60
AEAATTTSDA VLLIQFKHLH VSSDPYSFLS DWDPHAPSPC AWRGITCSSS GGVSAIDLSG 120
AALSGTLHLP TLTSLSSLQN LILRGNSFSS FNLTVSPICT LETLDLSHNN FSGKFPFANL 180
APCIRLSYLN LSNNLITAGP GPWPELAQLD LSRNRVSDVD LLVSALGSST LVFLNFSDNK 240
LAGQLSETLV SKSLNLSTLD LSYNLFSGKV PPRLLNDAVQ VLDFSFNNFS EFDFGFGSCE 300
NLVRLSFSHN AISSNEFPRG LGNCNNLEVL DLSHNELMME IPSEILLNLK SLKSLFLAHN 360
KFSGEIPSEL GSLCKTLVEL DLSENNLSGS LPLSFTQCSS LQSLNLARNY FSGNFLVSVV 420
NKLRSLKYLN AAFNNITGPV PVSLVSLKEL RVLDLSSNRF SGNVPSSLCP SGLENLILAG 480
NYLSGTVPSQ LGECRNLKTI DFSFNSLNGS IPWKVWALPN LTDLIMWANK LTGEIPEGIC 540
VKGGNLETLI LNNNLISGSI PKSIANCTNM IWVSLASNRL TGEITAGIGN LNALAILQLG 600
NNSLSGRIPP EIGECKRLIW LDLNSNNLTG DIPFQLADQA GLVIPGRVSG KQFAFVRNEG 660
GTSCRGAGGL VEFEDIRTER LEGFPMVHSC PLTRIYSGWT VYTFASNGSM IYLDLSYNLL 720
SGSIPENLGE MAYLQVLNLG HNRLSGNIPD RLGGLKAIGV LDLSHNSLNG SIPGALEGLS 780
FLSDLDVSNN NLTGSIPSGG QLTTFPAARY ENNSGLCGVP LSACGASKNH SVAVGGWKKK 840
QPAAAGVVIG LLCFLVFALG LVLALYRVRK TQRKEEMREK YIESLPTSGG SSWKLSSFPE 900
PLSINVATFE KPLRKLTFAH LLEATNGFSA ESLIGSGGFG EVYKAKLKDG CVVAIKKLIH 960
VTGQGDREFM AEMETIGKIK HRNLVQLLGY CKVGEERLLV YEYMRWGSLE AVLHERAKGG 1020
GSKLDWAARK KIAIGSARGL AFLHHSCIPH IIHRDMKSSN ILLDENFEAR VSDFGMARLV 1080
NALDTHLTVS TLAGTPGYVP PEYYQSFRCT AKGDVYSYGV ILLELLSGKR PIDSSEFGDD 1140
SNLVGWSKML YKEKRINEIL DPDLIVQTSS ESELLQYLRI AFECLDERPY RRPTMIQVMA 1200
MFKELQVDTF NDMLDSFSLR DNVIDEA 1227
Nucleotide Sequence
(FASTA)
ATGAAGCATG AGAGTGAAAA ACCTCAAAAA GGTGAAAAAA TGAAAAGAGA GAAACCATAC 60
CTTATGAAGA AGATGCCATG GTCACCAAGA AGAGAAGCAC CGATAGTAGT GAGGTTCGTT 120
GTGATTGCCT TATTCATCAT GATAACAGTA CCAACACCAC CAACAGCAGC AGATGCAGAA 180
GCAGAAGCAG CAACAACAAC TTCTGATGCA GTTTTATTGA TACAGTTCAA GCATTTACAC 240
GTTTCCTCTG ATCCCTACAG CTTCCTCTCC GACTGGGACC CACACGCGCC ATCCCCGTGC 300
GCGTGGCGAG GTATCACCTG CTCCTCCTCC GGCGGCGTCA GCGCCATCGA CCTCAGCGGC 360
GCTGCCCTCT CCGGCACACT GCACCTCCCC ACACTCACGT CACTTTCATC GCTCCAAAAC 420
CTAATCCTAC GTGGCAACTC CTTCTCCTCT TTCAACCTCA CCGTTTCGCC AATTTGTACG 480
CTCGAAACAC TCGACCTCTC TCACAACAAC TTCTCCGGCA AGTTTCCTTT CGCAAACCTC 540
GCTCCATGTA TCCGCCTTAG CTATCTCAAC CTCTCTAATA ATCTCATCAC CGCTGGGCCT 600
GGGCCCTGGC CCGAGCTGGC CCAACTTGAT TTGTCTAGAA ACCGTGTCTC CGACGTGGAC 660
CTTCTCGTTT CCGCTCTCGG AAGCTCAACT CTCGTTTTTC TTAACTTCTC GGACAATAAA 720
CTCGCGGGTC AACTCAGCGA AACGCTCGTT TCGAAGAGCC TGAATCTCTC CACTTTGGAC 780
CTCTCTTATA ATCTTTTCTC CGGAAAGGTT CCGCCGAGGC TTCTAAACGA CGCCGTTCAG 840
GTCCTGGATT TCTCGTTCAA CAATTTCTCG GAATTTGACT TCGGTTTCGG TTCGTGTGAG 900
AATCTAGTTC GGTTGAGTTT CTCGCACAAT GCAATCTCTT CAAACGAGTT TCCGCGCGGG 960
TTGGGTAACT GCAACAATCT TGAGGTTCTA GATCTTTCTC ACAATGAGCT CATGATGGAG 1020
ATTCCGTCGG AAATTCTTCT GAATTTGAAG AGTTTGAAGT CTCTGTTTCT CGCACACAAC 1080
AAATTTTCCG GCGAAATCCC GAGTGAGCTT GGAAGCCTTT GCAAAACTCT AGTTGAACTT 1140
GATCTCTCGG AGAACAATCT TTCTGGTTCG TTGCCTTTGA GTTTCACTCA ATGTTCTTCT 1200
CTGCAGAGTC TGAATCTCGC GAGAAACTAT TTTTCTGGGA ACTTCCTTGT TTCCGTGGTG 1260
AACAAGCTTC GGAGTCTAAA GTATCTAAAC GCAGCGTTTA ACAACATAAC GGGACCGGTT 1320
CCGGTGTCGC TTGTGAGCTT GAAAGAGCTT CGGGTTCTTG ACCTGAGCTC GAACCGGTTC 1380
AGCGGCAATG TTCCGTCGTC TTTATGTCCT TCCGGGTTGG AGAATTTGAT CCTCGCTGGC 1440
AATTACCTTT CAGGGACGGT ACCGTCACAG CTCGGTGAGT GTAGGAACTT GAAAACTATT 1500
GATTTCAGCT TTAACAGTTT GAACGGTTCC ATACCGTGGA AGGTGTGGGC TTTGCCTAAT 1560
TTAACTGATT TGATTATGTG GGCTAATAAA CTCACTGGAG AAATCCCCGA GGGAATTTGT 1620
GTTAAGGGAG GGAACTTGGA GACGTTGATT TTGAACAACA ATTTAATTTC TGGGTCCATT 1680
CCGAAGTCAA TTGCGAATTG CACCAACATG ATATGGGTGT CGTTGGCGAG CAACCGGTTA 1740
ACCGGGGAGA TAACGGCTGG GATTGGGAAT TTGAATGCAT TGGCGATTCT TCAGCTGGGG 1800
AATAACTCGC TCAGTGGGAG GATTCCGCCG GAGATAGGCG AGTGCAAGAG GTTGATATGG 1860
TTGGATTTGA ATAGCAATAA CCTAACCGGG GATATCCCTT TCCAGCTTGC TGATCAGGCA 1920
GGGTTGGTTA TCCCAGGTAG GGTTTCGGGG AAGCAGTTTG CGTTTGTGAG GAATGAGGGT 1980
GGGACTAGTT GCAGGGGTGC TGGTGGGTTG GTTGAGTTTG AGGATATCAG GACAGAGAGG 2040
CTTGAAGGTT TTCCAATGGT GCATTCATGC CCGTTGACAC GGATTTACTC CGGTTGGACT 2100
GTGTATACTT TTGCTTCCAA TGGGAGTATG ATCTACCTTG ACCTTTCCTA CAACTTGTTG 2160
TCTGGTAGCA TCCCTGAGAA TTTGGGTGAG ATGGCCTATT TGCAGGTGCT GAATTTGGGG 2220
CACAATAGGT TGAGTGGGAA CATTCCTGAT AGGCTTGGTG GTTTGAAAGC AATAGGGGTG 2280
CTTGATCTGT CTCATAATAG TCTTAATGGG TCCATCCCTG GGGCATTGGA GGGTCTTTCT 2340
TTTCTCAGTG ACCTTGATGT GTCTAATAAT AATCTCACTG GGTCCATTCC TTCTGGAGGT 2400
CAGTTAACTA CTTTTCCAGC TGCCAGATAT GAGAACAACT CTGGCCTTTG TGGGGTGCCT 2460
TTGTCAGCGT GTGGGGCTTC AAAGAATCAC TCCGTTGCTG TTGGGGGTTG GAAGAAGAAG 2520
CAGCCTGCTG CAGCTGGGGT TGTCATTGGT TTGCTTTGCT TCCTCGTGTT TGCACTTGGG 2580
CTTGTGTTGG CTTTGTACCG AGTGAGGAAG ACACAGAGGA AGGAGGAGAT GAGGGAAAAG 2640
TATATAGAGA GTCTTCCAAC TTCTGGGGGC AGTAGTTGGA AGCTATCCAG CTTTCCTGAG 2700
CCTTTGAGCA TCAATGTTGC CACCTTTGAG AAGCCTCTGC GGAAGCTGAC GTTTGCGCAT 2760
CTTCTTGAGG CTACTAATGG TTTCAGTGCC GAGAGTTTGA TAGGTTCTGG GGGGTTTGGT 2820
GAGGTGTACA AAGCTAAGCT AAAAGATGGT TGTGTTGTTG CTATCAAGAA GCTCATTCAC 2880
GTGACGGGTC AGGGAGATAG GGAGTTCATG GCTGAGATGG AAACTATTGG GAAGATTAAG 2940
CATAGGAACC TGGTTCAGCT GCTGGGTTAC TGTAAAGTTG GAGAGGAGAG GCTGCTTGTG 3000
TATGAGTACA TGAGATGGGG AAGTCTCGAG GCTGTTTTAC ATGAGAGAGC AAAAGGAGGA 3060
GGTTCAAAGC TTGATTGGGC AGCAAGGAAG AAGATTGCCA TAGGGTCAGC AAGAGGTCTC 3120
GCATTTCTTC ACCATAGTTG CATTCCTCAC ATTATACACA GGGACATGAA GTCCAGCAAT 3180
ATTCTTCTTG ATGAAAATTT TGAGGCCAGA GTTTCTGATT TTGGCATGGC GAGATTGGTT 3240
AATGCCCTCG ACACTCATCT CACTGTTAGC ACACTTGCCG GAACACCTGG TTATGTACCC 3300
CCTGAGTACT ACCAGAGTTT TAGATGTACA GCAAAAGGGG ATGTCTATAG CTATGGTGTC 3360
ATACTGCTAG AGCTTCTATC AGGGAAGAGA CCAATTGACT CTTCTGAGTT TGGTGATGAT 3420
AGCAATCTTG TTGGATGGTC AAAGATGCTT TACAAAGAGA AAAGAATTAA TGAAATACTT 3480
GATCCTGATT TAATTGTGCA AACATCTAGT GAAAGTGAAC TATTACAATA TTTGAGAATT 3540
GCTTTTGAAT GTCTGGATGA GAGACCATAC CGGCGGCCAA CCATGATACA AGTGATGGCT 3600
ATGTTTAAAG AGCTTCAGGT TGACACATTT AATGATATGC TTGATAGCTT CTCTCTGAGA 3660
GACAATGTTA TTGATGAAGC ATGA 3684
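The protein and nucleotide sequences above are printed in groups of ten characters with a running position counter at the end of each line, so they need light cleaning before use in other tools. The following is a minimal sketch of that cleaning step; the function name `parse_block` is hypothetical and not part of EKPD, and the sample text is the last two lines of the protein block from this record.

```python
def parse_block(text: str) -> str:
    """Join a numbered, 10-character-grouped sequence block into one string."""
    parts = []
    for line in text.strip().splitlines():
        tokens = line.split()
        # Drop the trailing running position counter (e.g. "1200", "3684").
        if tokens and tokens[-1].isdigit():
            tokens = tokens[:-1]
        parts.append("".join(tokens))
    return "".join(parts)


# Last two lines of the protein block in this record (residues 1141-1227).
protein_tail = """
SNLVGWSKML YKEKRINEIL DPDLIVQTSS ESELLQYLRI AFECLDERPY RRPTMIQVMA 1200
MFKELQVDTF NDMLDSFSLR DNVIDEA 1227
"""

tail = parse_block(protein_tail)
print(len(tail))  # 87, i.e. 1227 - 1140

# Consistency check on the lengths stated in the record: the coding
# sequence holds one codon per residue plus a stop codon.
protein_length, cds_length = 1227, 3684
print(cds_length == 3 * (protein_length + 1))  # True
```

Applying the same function to the full protein block should give a string of 1227 residues, and to the nucleotide block a string of 3684 bases ending in the stop codon TGA.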
Domain Profile
S: 1     fsednrigeGgfgevyrgelknt.avavkklkeeadlslkelkqsfltElkvlarfrHdn  59
         fs +  ig Ggfgevy+++lk+  +va+kkl + +     + +++f +E++++ +++H n
Q: 928   FSAESLIGSGGFGEVYKAKLKDGcVVAIKKLIHVT----GQGDREFMAEMETIGKIKHRN  983
         788999**************986599*****8765....567899***************
S: 60    ilellgysaeseklcLvYqymknGsLedrLqcq..kgsepLsWpqRlsillGtaraiefL  117
         +++llgy+  +e+  LvY+ym+ GsLe+ L+ +  +g + L W  R +i++G ar++ fL
Q: 984   LVQLLGYCKVGEERLLVYEYMRWGSLEAVLHERakGGGSKLDWAARKKIAIGSARGLAFL  1043
         ******************************86654899**********************
S: 118   Heasp.slihgdiksaNiLLDekltpKlgDfglarfapesekqsstvlrtskvrgtlaYl  176
         H+    ++ih+d+ks+NiLLDe+++++++Dfg+ar      +  +t l++s++ gt  Y+
Q: 1044  HHSCIpHIIHRDMKSSNILLDENFEARVSDFGMAR----LVNALDTHLTVSTLAGTPGYV  1099
         998766*****************************....444556667799*********
S: 177   peefirvgqltvkvDvySfGiVllEvltGlravdedr..ktkyLkdllkeeieeekveil  234
         p+e+ ++ + t k DvyS+G++llE+l+G r +d ++  +  +L+   k   +e+++   
Q: 1100  PPEYYQSFRCTAKGDVYSYGVILLELLSGKRPIDSSEfgDDSNLVGWSKMLYKEKRI---  1156
         *******************************96665533788999998888888885...
S: 235   ekfldkk..agkleeellealielalaclaekakkrPtmsqvle  276
         ++ ld    +++  e+ l + +++a +cl+e+  +rPtm qv+ 
Q: 1157  NEILDPDliVQTSSESELLQYLRIAFECLDERPYRRPTMIQVMA  1200
         4444444113334455566677899****************986
Domain Sequence
(FASTA)
FSAESLIGSG GFGEVYKAKL KDGCVVAIKK LIHVTGQGDR EFMAEMETIG KIKHRNLVQL 60
LGYCKVGEER LLVYEYMRWG SLEAVLHERA KGGGSKLDWA ARKKIAIGSA RGLAFLHHSC 120
IPHIIHRDMK SSNILLDENF EARVSDFGMA RLVNALDTHL TVSTLAGTPG YVPPEYYQSF 180
RCTAKGDVYS YGVILLELLS GKRPIDSSEF GDDSNLVGWS KMLYKEKRIN EILDPDLIVQ 240
TSSESELLQY LRIAFECLDE RPYRRPTMIQ VMA 273
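The Start and End values in the Classification table (928 and 1200) appear to be 1-based, inclusive residue positions, which is consistent with the stated Domain Length of 273 and with the domain sequence beginning FSAESLIGSG. The sketch below illustrates that convention; the helper names are hypothetical, and the fragment is residues 921-960 copied from the protein sequence in this record.

```python
def domain_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive residue range."""
    return end - start + 1


def slice_domain(seq: str, start: int, end: int, offset: int = 1) -> str:
    """Cut residues start..end (1-based, inclusive) from seq.

    offset is the 1-based position of seq[0] in the full protein, so a
    fragment can be sliced using full-length coordinates.
    """
    return seq[start - offset : end - offset + 1]


# Residues 921-960 of the protein sequence listed in this record.
fragment = "LLEATNGFSAESLIGSGGFGEVYKAKLKDGCVVAIKKLIH"

print(domain_length(928, 1200))                       # 273
print(slice_domain(fragment, 928, 937, offset=921))   # FSAESLIGSG
```

With the full 1227-residue protein as `seq` and the default `offset=1`, `slice_domain(seq, 928, 1200)` should reproduce the domain sequence shown above.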
Keyword: Complete proteome; Reference proteome.
Sequence Source: Ensembl
Orthology
Ortholog group
Arabidopsis lyrata       EKS-ARL-00343
Arabidopsis thaliana     EKS-ART-00370
Brachypodium distachyon  EKS-BRD-00276
Brassica rapa            EKS-BRR-00478
Hordeum vulgare          EKS-HOV-00249
Oryza glaberrima         EKS-ORG-00210
Oryza indica             EKS-ORI-00334
Oryza sativa             EKS-ORS-00269
Populus trichocarpa      EKS-POT-00565
Setaria italica          EKS-SEI-00257
Solanum lycopersicum     EKS-SOL-00418
Solanum tuberosum        EKS-SOT-00395
Sorghum bicolor          EKS-SOB-00281
Vitis vinifera           EKS-VIV-00380
Gene Ontology
KEGG
InterPros
Pfam
SMARTs
Prosites
Prints
Created Date: 20-Feb-2013