EKS-SOB-00281
Eukaryotic Protein Kinase & Protein Phosphatase Database
TagContent
EKPD IDEKS-SOB-00281
Classification
Group/FamilyScoreE-ValueStartEndDomain Length
TKL/IRAK210.27.5E-629121182271
StatusUnreviewed
Ensembl ProteinSb02g019470.1
UniProt AccessionC5X896;
Protein Name
Protein Synonyms/Alias Putative uncharacterized protein Sb02g019470;
Gene NameSb02g019470
Gene Synonyms/Alias Sb02g019470; SORBIDRAFT_02g019470;
Ensembl Information
Ensembl Gene IDEnsembl Protein IDEnsembl Transcript ID
Sb02g019470Sb02g019470.1Sb02g019470.1
OrganismSorghum bicolor
Functional Description
Protein Length1214
Protein Sequence
(FASTA)
MAASTTAAAW FLVAVVLVLL HATAPAIAGA EDEAAALLAF RRASVADDPR GALSGWAMAN 60
ATAAAPCSWA GVSCAPPPDG RVVAINLTGM ALVGELRLDA LLALPALQRL DLRGNAFYGN 120
LSHAHAAASA SPCALVEVDM SSNTFNGTLP AAFLATCGAL QSLNLSRNAL VGGGFPFAPS 180
LRSLDLSRNH LADVGLLNYS FAGCHGLRYL NLSANQFVGR LPELATCSAV SVLDVSWNHM 240
SGALPAGFMA AAPPNLTHLS IAGNNFSGDV SAYDFGGCAN LTVLDWSFNG LSSSELPPSL 300
ANCGRLEMLD VSGNKLLGGP IPTFLTGFSS LKRLALAGNE FSGTIPDELS QLCGRIVELD 360
LSSNRLVGGL PASFAKCRSL EVLDLSGNQL SGSFVDSVVS TISSLRELRL SFNNITGQNP 420
LPVLAAGCPL LEVIDLGSNE LDGEIMEDLC SSLPSLRKLF LPNNYLKGTV PKSLGNCANL 480
ESIDLSFNFL VGQIPKEIIL LPKLIDLVMW ANGLSGEIPD MLCSNGTTLE TLVLSYNNFT 540
GGIPPSITRC VNLIWVSFSG NHLIGSVPHG FGKLQKLAIL QLNKNQLSGP VPAELGSCIN 600
LIWLDLNSNS FTGIIPPELA SQTGLIPGGI VSGKQFAFLR NEAGNICPGA GVLFEFFGIR 660
PERLAAFPTV HLCPSTRIYV GTMDYKFQSN GSMIFLDLSY NRLTGTIPAG LGNMMFLEVM 720
NLGHNDLNGT IPYEFSGLKL VGAMDLSNNH LTGGIPPGLG TLSFLADLDV SSNNLSGPIP 780
LTGQLSTFPQ SRYANNPGLC GIPLPPCGHD PGQGSVPSAS SGRRKTVGGS ILVGIALSML 840
ILLLLLVTLC KLRKNQKTEE IRTGYIESLP TSGTSSWKLS GVHEPLSINV ATFEKPLRKL 900
TFAHLLEATD GFSAETLIGS GGFGEVYKAK LKDGTVVAIK KLIHFTGQGD REFTAEMETI 960
GKIKHRNLVP LLGYCKIGDE RLLVYEYMKH GSLDVVLHDQ AKAGVKLDWA ARKKIAIGSA 1020
RGLAFLHHSC IPHIIHRDMK SSNVLLDSNL DARVSDFGMA RLMNALDTHL SVSTLAGTPG 1080
YVPPEYYQSF RCTTKGDVYS YGVVLLELLS GKKPIDPTEF GDNNLVGWVK QMVKENRSSE 1140
IFDPTLTNTK SGEAELYQSL KIARECLDDR PNQRPTMIQV MAMFKELQLD SDSDFLDGFS 1200
INSSTIDESA EKSS 1214
Nucleotide Sequence
(FASTA)
ATGGCCGCCT CAACGACGGC GGCCGCGTGG TTCCTTGTGG CGGTGGTGCT GGTGCTGCTT 60
CACGCGACGG CTCCCGCCAT TGCTGGCGCA GAGGACGAGG CGGCGGCGCT GCTCGCCTTC 120
AGGCGTGCGT CCGTGGCGGA CGACCCGCGC GGCGCGCTCT CGGGCTGGGC GATGGCGAAC 180
GCCACGGCGG CGGCGCCGTG CTCGTGGGCC GGCGTCTCGT GCGCGCCGCC GCCCGACGGC 240
CGCGTCGTGG CCATCAACCT CACCGGCATG GCGCTCGTCG GCGAGCTCCG TCTCGACGCG 300
CTCCTCGCGC TCCCGGCGCT CCAGCGCCTC GACCTCCGCG GCAATGCCTT CTACGGCAAC 360
CTCTCGCACG CGCACGCCGC GGCGTCGGCT TCGCCGTGCG CGCTCGTGGA GGTGGACATG 420
TCGTCGAACA CGTTCAACGG GACGCTCCCC GCGGCGTTCC TGGCGACGTG CGGCGCGCTG 480
CAGTCCTTGA ACCTGTCGCG CAACGCCCTC GTCGGCGGGG GCTTCCCGTT CGCGCCGTCG 540
CTGCGGTCGC TCGACCTGTC GCGCAACCAC CTGGCGGACG TCGGCCTCCT CAACTACTCC 600
TTCGCCGGCT GCCACGGCCT GCGGTACCTC AACCTCTCCG CCAACCAGTT CGTCGGCCGG 660
CTGCCGGAGC TCGCGACGTG CAGCGCGGTC TCCGTGCTCG ACGTGTCGTG GAACCACATG 720
TCCGGCGCGC TGCCCGCCGG GTTCATGGCC GCGGCGCCGC CGAACCTGAC GCACCTCAGC 780
ATCGCCGGGA ACAACTTCTC CGGGGATGTC TCGGCGTACG ACTTCGGCGG CTGCGCCAAC 840
CTGACGGTGC TGGACTGGTC CTTCAACGGC CTGAGCAGCA GCGAGCTGCC GCCGAGCCTC 900
GCCAACTGCG GCCGCCTCGA GATGCTGGAC GTGTCCGGGA ACAAGCTCCT CGGCGGCCCG 960
ATCCCGACGT TCCTAACTGG CTTCTCCTCG CTGAAGCGGC TGGCATTGGC CGGCAACGAG 1020
TTCTCCGGCA CCATTCCGGA CGAGCTCAGC CAGCTGTGCG GCAGAATTGT TGAGCTGGAC 1080
CTGTCAAGCA ACCGGCTGGT TGGCGGCTTG CCGGCAAGCT TTGCCAAGTG CAGGTCACTC 1140
GAGGTGCTTG ACCTCAGTGG CAACCAACTC TCGGGGAGCT TCGTGGACAG TGTCGTCAGC 1200
ACCATCTCCT CGCTGCGTGA GCTGCGGCTT TCGTTCAACA ACATCACCGG GCAGAACCCG 1260
CTGCCGGTGC TGGCGGCGGG TTGCCCATTG CTCGAGGTGA TCGACCTCGG TTCCAATGAG 1320
CTTGACGGTG AGATAATGGA AGATCTCTGC TCGTCACTGC CGTCGCTGCG GAAGCTCTTC 1380
CTCCCCAACA ATTACCTCAA GGGCACAGTG CCGAAGTCGC TGGGCAACTG CGCCAATCTG 1440
GAGTCCATTG ATCTGAGCTT CAACTTCCTT GTTGGTCAGA TTCCAAAGGA GATAATTCTG 1500
CTGCCAAAGC TTATTGACCT GGTGATGTGG GCGAACGGCC TGTCCGGCGA GATCCCGGAC 1560
ATGCTTTGCT CCAACGGCAC TACGCTGGAG ACGCTGGTGC TGAGCTACAA CAACTTCACT 1620
GGAGGCATTC CCCCTTCCAT CACCAGATGT GTCAACCTCA TCTGGGTGTC GTTCTCCGGC 1680
AACCACCTCA TAGGGAGTGT GCCTCACGGC TTCGGTAAGC TCCAGAAGCT TGCCATCCTG 1740
CAGCTCAACA AAAACCAGCT GTCTGGCCCT GTGCCGGCAG AGCTCGGCAG CTGCATCAAC 1800
CTCATCTGGC TGGACCTCAA CAGCAACAGC TTCACCGGCA TAATACCGCC GGAGTTGGCT 1860
AGCCAGACAG GGCTTATTCC GGGGGGCATT GTGTCAGGGA AGCAGTTTGC ATTCCTGCGG 1920
AACGAGGCCG GCAACATCTG TCCCGGTGCC GGTGTGCTCT TCGAGTTCTT CGGCATCAGG 1980
CCTGAGAGGC TGGCTGCGTT CCCAACTGTG CACCTTTGCC CATCCACAAG GATATACGTC 2040
GGCACAATGG ACTACAAGTT TCAGAGCAAT GGGAGCATGA TCTTCCTCGA CCTCTCCTAC 2100
AACCGCCTCA CCGGTACAAT CCCGGCAGGC CTTGGGAACA TGATGTTTCT GGAAGTCATG 2160
AACCTGGGGC ACAATGACCT CAACGGTACA ATACCATATG AATTCTCAGG GCTCAAGCTA 2220
GTTGGCGCCA TGGACCTCTC AAACAATCAT CTCACTGGCG GCATCCCCCC GGGGCTTGGT 2280
ACTCTGAGTT TCCTTGCTGA TTTGGATGTA TCCAGCAACA ACCTGTCTGG TCCAATTCCT 2340
TTGACTGGCC AGCTCAGTAC GTTCCCGCAA TCCCGGTATG CCAACAATCC TGGCCTCTGT 2400
GGCATCCCTC TACCTCCCTG TGGTCATGAT CCAGGGCAGG GAAGTGTGCC ATCAGCTTCA 2460
TCTGGAAGGA GGAAGACCGT TGGGGGAAGC ATTCTTGTTG GAATCGCGCT CTCCATGCTC 2520
ATACTGCTGT TGCTGCTGGT CACTCTCTGC AAGCTCAGGA AGAACCAGAA GACTGAAGAG 2580
ATTAGAACTG GGTACATCGA AAGCCTTCCA ACATCTGGCA CGTCTAGCTG GAAGCTTTCA 2640
GGTGTTCATG AGCCACTAAG CATCAATGTG GCCACATTTG AGAAGCCTCT GAGGAAGCTC 2700
ACCTTTGCGC ATCTCCTCGA GGCCACCGAT GGCTTCAGTG CTGAGACTCT CATAGGCTCA 2760
GGGGGGTTTG GTGAGGTGTA CAAGGCCAAG CTCAAGGACG GCACAGTTGT TGCCATCAAG 2820
AAGCTGATCC ATTTCACAGG CCAAGGCGAC AGGGAATTCA CAGCAGAGAT GGAGACCATT 2880
GGCAAGATCA AGCACCGCAA CCTTGTGCCT CTGCTCGGAT ATTGCAAGAT TGGCGATGAG 2940
CGGCTCCTTG TGTACGAGTA CATGAAGCAT GGCAGTCTAG ACGTGGTGCT GCATGACCAG 3000
GCCAAGGCTG GTGTGAAGCT TGACTGGGCA GCAAGGAAGA AGATCGCCAT CGGTTCAGCA 3060
AGGGGTCTCG CCTTCCTCCA CCACAGCTGC ATCCCACACA TCATCCACCG GGACATGAAG 3120
TCAAGCAATG TGCTTCTTGA CAGCAACCTG GATGCCCGTG TCTCTGACTT TGGGATGGCG 3180
AGGCTGATGA ATGCACTAGA CACACATCTG AGTGTGAGCA CACTTGCAGG TACGCCTGGA 3240
TACGTGCCAC CTGAGTACTA CCAGAGTTTC AGGTGCACGA CCAAGGGTGA TGTATACAGC 3300
TACGGTGTCG TGCTCTTGGA GCTTCTGTCA GGGAAGAAGC CGATCGATCC AACCGAGTTT 3360
GGAGACAACA ATCTTGTCGG CTGGGTGAAG CAGATGGTCA AGGAGAACAG AAGCAGTGAG 3420
ATCTTTGATC CTACTCTGAC AAACACAAAG TCAGGGGAAG CTGAGCTCTA CCAGTCTCTG 3480
AAGATCGCTC GCGAGTGCTT GGACGACAGA CCGAACCAAA GGCCAACCAT GATACAGGTG 3540
ATGGCCATGT TCAAAGAGCT GCAGCTTGAC TCCGACAGTG ACTTCCTCGA TGGATTCTCA 3600
ATCAACTCAT CAACGATAGA TGAATCAGCA GAGAAATCCT CGTA 3644
Domain Profile
S: 1     fsednrigeGgfgevyrgelkn.tavavkklkeeadlslkelkqsfltElkvlarfrHdn  59
         fs ++ ig Ggfgevy+++lk+ t+va+kkl +     + + +++f +E++++ +++H n
Q: 912   FSAETLIGSGGFGEVYKAKLKDgTVVAIKKLIHF----TGQGDREFTAEMETIGKIKHRN  967
         78899***************962699*****875....4567999***************
S: 60    ilellgysaeseklcLvYqymknGsLedrLqcq.kgsepLsWpqRlsillGtaraiefLH  118
         +++llgy+  +++  LvY+ymk GsL   L+ q k+   L W  R +i++G ar++ fLH
Q: 968   LVPLLGYCKIGDERLLVYEYMKHGSLDVVLHDQaKAGVKLDWAARKKIAIGSARGLAFLH  1027
         *******************************998999**********************9
S: 119   easp.slihgdiksaNiLLDekltpKlgDfglarfapesekqsstvlrtskvrgtlaYlp  177
         +    ++ih+d+ks+N+LLD +l ++++Dfg+ar      +  +t l +s++ gt  Y+p
Q: 1028  HSCIpHIIHRDMKSSNVLLDSNLDARVSDFGMAR----LMNALDTHLSVSTLAGTPGYVP  1083
         98766*****************************....555667777899**********
S: 178   eefirvgqltvkvDvySfGiVllEvltGlra.vdedrktkyLkdllkeeieeekv.eile  235
         +e+ ++ + t+k DvyS+G+VllE+l+G +     +   ++L+  +k+ ++e++  ei++
Q: 1084  PEYYQSFRCTTKGDVYSYGVVLLELLSGKKPiDPTEFGDNNLVGWVKQMVKENRSsEIFD  1143
         *****************************98345666899********998877525665
S: 236   kfldkkagkleeellealielalaclaekakkrPtmsqvle  276
           l +  +  e el ++ +++a +cl+++ ++rPtm qv+ 
Q: 1144  PTLTNTKSG-EAELYQS-LKIARECLDDRPNQRPTMIQVMA  1182
         555443333.5555555.567899**************986
Domain Sequence
(FASTA)
FSAETLIGSG GFGEVYKAKL KDGTVVAIKK LIHFTGQGDR EFTAEMETIG KIKHRNLVPL 60
LGYCKIGDER LLVYEYMKHG SLDVVLHDQA KAGVKLDWAA RKKIAIGSAR GLAFLHHSCI 120
PHIIHRDMKS SNVLLDSNLD ARVSDFGMAR LMNALDTHLS VSTLAGTPGY VPPEYYQSFR 180
CTTKGDVYSY GVVLLELLSG KKPIDPTEFG DNNLVGWVKQ MVKENRSSEI FDPTLTNTKS 240
GEAELYQSLK IARECLDDRP NQRPTMIQVM A 271
KeywordComplete proteome; Leucine-rich repeat; Reference proteome; Repeat.
Sequence SourceEnsembl
Orthology
Ortholog group
Arabidopsis lyrata"; ?>Arabidopsis thaliana"; ?>Brachypodium distachyon"; ?>Brassica rapa"; ?>Glycine max"; ?>Hordeum vulgare"; ?>Oryza brachyantha"; ?>Oryza glaberrima"; ?>Oryza indica"; ?>Oryza sativa"; ?>Populus trichocarpa"; ?>Setaria italica"; ?>Solanum lycopersicum"; ?>Solanum tuberosum"; ?>Vitis vinifera"; ?>Zea mays"; ?>
EKS-ARL-00343
EKS-ART-00370
EKS-BRD-00276
EKS-BRR-00478
EKS-GLM-01171
EKS-HOV-00249
EKS-ORB-00350
EKS-ORG-00210
EKS-ORI-00482
EKS-ORS-00269
EKS-POT-00565
EKS-SEI-00257
EKS-SOL-00418
EKS-SOT-00395
EKS-VIV-00380
EKS-ZEM-00396
Gene Ontology
GO:0005886; C:plasma membrane
GO:0005524; F:ATP binding
GO:0004674; F:protein serine/threonine kinase activity
KEGG
sbi:SORBI_02g019470;
InterPros
IPR013320; ConA-like_subgrp.
IPR011009; Kinase-like_dom.
IPR001611; Leu-rich_rpt.
IPR013210; LRR-contain_N2.
IPR000719; Prot_kinase_cat_dom.
IPR017441; Protein_kinase_ATP_BS.
IPR008271; Ser/Thr_kinase_AS.
Pfam
PF00560; LRR_1; 2.
PF08263; LRRNT_2; 1.
PF00069; Pkinase; 1.
SMARTs
Prosites
PS51450; LRR; 15.
PS00107; PROTEIN_KINASE_ATP; 1.
PS50011; PROTEIN_KINASE_DOM; 1.
PS00108; PROTEIN_KINASE_ST; 1.
Prints
Created Date20-Feb-2013