EKS-ORS-00532
Eukaryotic Protein Kinase & Protein Phosphatase Database
TagContent
EKPD IDEKS-ORS-00532
Classification
Group/FamilyScoreE-ValueStartEndDomain Length
TKL/IRAK186.32.7E-5416821950269
StatusUnreviewed
Ensembl ProteinLOC_Os08g02990.1
UniProt AccessionQ9FW76;
Protein Name
Protein Synonyms/Alias Putative gag-pol polyprotein;
Gene NameOSJNBa0026L12.26
Gene Synonyms/Alias OSJNBa0026L12.26;
Ensembl Information
Ensembl Gene IDEnsembl Protein IDEnsembl Transcript ID
LOC_Os08g02990LOC_Os08g02990.1LOC_Os08g02990.1
OrganismOryza sativa
Functional Description
Protein Length2015
Protein Sequence
(FASTA)
MDTDGKLDVL LKLVEGNEKK RVEADECTRA EYQELKKTVE SRIPVVEKKV EVLSEALLEL 60
NLKRPQDLDS ACAIALLQEE ALEGVKASHY RRSDPGVMLK TNRTGPIVGV TQHTKNNLNQ 120
SEDRRGTESA RAKDDKVAAL RAYRRSKGLC FVCGERWGKE HKCATTVQLH VVEQLLAVLQ 180
PEIDSDSDNI ELGTSPEQHG TLLTISQQAL WGTESNQSIK VNGWVQGMEL IMLIDSGSTH 240
SFVDEQIALK LMGARRLQQP LTVRIADGGV MQCTREIADC RWWMQGQSFC SNFRLLPLGN 300
YDAILGMDWL TMHSPMKVDW VQKWMEFQYL GRDVRIQGIT NQPAQCSHIT HNQLSGMNRK 360
GSLLYFIQLQ NMAIQENSTI SDSVKPMLLE FADLFEEPTE LPPRRACDHS INLIPGAKPI 420
NLRPYRHNPA LKDEIERQIA EMLKSGVIQP SQSPFSSPAI LVKKKDHTWR LVIDYRKLNA 480
ITVKTKYLVP VMEELLDELS GSKWFTKLDL RSGYHQIRMA EGEEHKTAFQ THSGHYEYKV 540
MSFGLTGAPA TFLGAMNDTL KYVLRKFVLV FFDDILIYSP DFESHLTHVR QVLQLLQQYQ 600
WKSQLSYLGH IIGANGVATD PQKVQDILNW EIPTNVKKLR GFLGLAGYYR KFVQGFGLKS 660
KPLTNLLRKG VPFVWSTEAD SAFQALKTSL ASAPVLALPN FQKTFVVETD ASDYGIGAVL 720
SQEKHPIAYI SKALGPRTRG LSTYEKECLA MIMAVDHWRS YLQHVEFIIL TDHHSLMHLS 780
DQRLHTPWQH KAFTKLLGLQ YKICYRKGST NAAADALSRK PQQSEEEFNA ISQCVPQWLM 840
EVLQSYDTDP HATQLVAALT LNPNSKPHFS LQHGVLRYKG NIWIGNSPDL QLKIINEMHA 900
SPVGGHSGFP VTYRRIKQLF AWNGMKSQIK ETLANCQICA QAKPDRSRYP RLLQPLPVPK 960
GAWQTISLDF IEGLPRSSHY NCILVVVDKF SKYSHFIPLR HPFTAIDVAK AFMSNVYKLH 1020
GLPQIIISDR DKIFTSQLWE QLFLRSGTKL HYSSAYHPQS DGQTERGNQC LEIYLKCFVQ 1080
SAPSKWPSWL HLAEYWYNNS YHSTIDRTPF EALYGYPPRH FGISIRDCDN SELITWMQDR 1140
KLMQQLVQQH LHRAQQQMKL FADKKRSFRQ FQVGEWVYLK LQPYVQSSVA PRANHKLAFK 1200
YFGPFQILEK LGTVAYKLQL PATSSIHPVF HVSLLKEAKG FQPSVHTPLS PTYSAVQYPI 1260
ALLDHRLTKK AQATWEDLHD LKARFPNALA WGQAKTQGEG IVRSSSDSAE EDTEEELSLE 1320
AEQDDDEKMQ AEAEDAEQGG VMDHQPVLRR TSRTVKPNPL YHGPQLDPNI FFLSVFCDDL 1380
YSPALVVVAL TFNHTNFGPD EQTNIRLEGD AAFSADVSFS GDGGGWVDIS ANRLDGNIDH 1440
SRGRVSYALP VAAGDDRFVA MEFDTFNDTI VHDPDATYDH LGVDVNSVVS KRILTLPSFT 1500
LVGNMTAVVE YDNVSSILAM RLHLGYGLSG PRHRPDYNLS YKVDLKSVLP EQVAVGFSAA 1560
TSTSVELHQL RSWYFSSSLE PKATPPPVAP PSPSPPPTSG SGSGGVVAGA IVGAALFVVL 1620
LFAMVAVVVL VRRRHQRKKM REAEEANDDD DDTEGDPIME IENGMGPRRF AYHVLVNATK 1680
SFAAEEKLGQ GGFGAVYRGY LREQGLAVAI KRFIKDSSNQ GRREYKSEIK VISRLRHRNL 1740
VQLIGWFHGR NELLLVYELV PNRSLDVHLY GNGTFLTWPM RINIVIGLGS ALLYLHEEWE 1800
QCVVHRDIKP SNVMLDESFN TKLGDFGLAR LIDHADGVQT MTHPSGTPGY IDPECVITGK 1860
ASAESDVYSF GVVLLEVVCA RRPMSLLDDQ NNGLFRLVEW VWDLYGQGAI HNAADKRLNN 1920
DYDVVEMERV IAVGLWCAHP DRCQRPSIRA AMMVLQSSGP MPMLPAKMPV ATYAPPVASS 1980
EGQLSSSTAP PAHRLTQELG FWVAGKDMRG KDEGN 2015
Nucleotide Sequence
(FASTA)
ATGGATACGG ATGGGAAGCT TGATGTGTTG CTCAAGTTGG TGGAGGGGAA CGAGAAGAAG 60
CGGGTGGAAG CGGATGAGTG TACGAGGGCG GAGTACCAAG AGCTCAAGAA GACGGTGGAA 120
TCCAGAATTC CTGTGGTTGA GAAGAAGGTG GAGGTGTTGA GCGAGGCGTT GCTCGAGCTG 180
AATCTCAAGA GGCCCCAGGA TTTAGATTCT GCTTGTGCCA TAGCCCTGCT GCAGGAGGAA 240
GCTTTAGAAG GTGTTAAGGC CAGTCATTAC AGGAGGTCAG ACCCAGGGGT TATGCTCAAA 300
ACAAATAGAA CTGGACCTAT TGTGGGAGTT ACTCAGCATA CTAAGAACAA TTTGAATCAA 360
TCTGAAGACA GAAGGGGCAC TGAATCTGCT AGAGCTAAAG ATGATAAGGT GGCTGCTCTG 420
AGGGCTTATA GGAGGTCCAA GGGACTATGC TTTGTCTGTG GAGAACGATG GGGTAAAGAA 480
CATAAGTGTG CAACTACAGT TCAGCTTCAT GTAGTGGAAC AACTGTTGGC AGTGTTACAA 540
CCTGAAATTG ACAGTGATTC TGATAACATT GAATTGGGAA CATCACCAGA ACAACATGGC 600
ACTTTATTGA CCATATCTCA ACAAGCTCTG TGGGGAACTG AATCAAATCA GTCTATCAAG 660
GTGAATGGTT GGGTGCAGGG AATGGAACTG ATCATGCTTA TTGACTCAGG CAGCACACAT 720
TCGTTTGTGG ATGAACAAAT TGCACTTAAG TTGATGGGAG CTAGAAGGTT GCAACAACCC 780
CTAACTGTTA GAATTGCTGA TGGAGGTGTC ATGCAGTGCA CTAGGGAAAT TGCTGACTGT 840
CGTTGGTGGA TGCAGGGACA GAGTTTCTGC AGTAATTTCC GACTGCTACC CCTGGGAAAT 900
TATGATGCCA TATTGGGTAT GGACTGGCTC ACAATGCACA GCCCAATGAA GGTAGACTGG 960
GTTCAAAAAT GGATGGAATT CCAATACCTG GGAAGGGATG TTCGAATTCA AGGAATTACT 1020
AATCAGCCAG CACAGTGTTC TCATATAACA CACAACCAGT TATCTGGCAT GAACAGGAAG 1080
GGTTCACTGT TGTATTTTAT CCAGTTACAA AATATGGCCA TACAGGAGAA TAGCACTATT 1140
TCTGATTCAG TCAAACCTAT GTTGTTAGAG TTTGCTGACC TCTTTGAGGA ACCAACTGAA 1200
TTACCTCCGA GAAGAGCTTG TGATCACAGC ATTAACCTCA TTCCAGGGGC TAAACCCATT 1260
AATCTGAGAC CTTATAGGCA CAATCCTGCT CTCAAGGATG AAATAGAGAG GCAGATAGCT 1320
GAGATGCTCA AGTCTGGGGT GATTCAACCC AGCCAAAGTC CTTTTTCTTC CCCAGCTATA 1380
CTTGTCAAGA AAAAAGACCA CACTTGGAGA TTGGTAATAG ATTACAGAAA GTTGAATGCC 1440
ATTACAGTCA AAACAAAATA TCTAGTTCCA GTGATGGAGG AACTACTGGA TGAACTATCT 1500
GGTTCAAAAT GGTTTACCAA GCTGGACTTA AGATCAGGTT ATCACCAAAT TAGAATGGCT 1560
GAGGGAGAGG AACACAAAAC AGCCTTTCAA ACTCATTCTG GGCACTATGA ATATAAGGTG 1620
ATGTCTTTTG GCTTGACTGG GGCACCTGCA ACATTTTTGG GGGCTATGAA TGACACTCTA 1680
AAATATGTAC TTAGAAAATT CGTCCTGGTT TTCTTTGACG ATATCCTGAT CTATAGTCCA 1740
GATTTTGAGT CACATTTGAC TCATGTCAGA CAAGTCTTAC AACTACTTCA GCAATATCAG 1800
TGGAAGTCAC AGTTATCTTA TTTGGGCCAC ATCATCGGTG CAAATGGGGT TGCAACTGAT 1860
CCACAAAAGG TTCAGGATAT ACTCAATTGG GAGATTCCTA CCAATGTTAA GAAGTTGAGA 1920
GGATTTCTGG GATTGGCTGG TTATTACAGA AAATTTGTAC AAGGCTTTGG TCTGAAAAGT 1980
AAACCTCTGA CCAATCTTTT GAGGAAGGGT GTTCCTTTTG TTTGGAGTAC AGAAGCAGAT 2040
TCAGCCTTCC AGGCTCTTAA GACATCTCTG GCCTCAGCTC CTGTCTTGGC TTTACCAAAT 2100
TTTCAAAAAA CCTTTGTGGT GGAAACTGAT GCTAGTGACT ATGGAATCGG TGCAGTACTA 2160
TCACAGGAGA AGCACCCAAT TGCCTATATA AGCAAAGCCC TGGGTCCCAG GACCAGAGGT 2220
CTGTCCACTT ATGAAAAGGA GTGCCTAGCA ATGATCATGG CAGTTGATCA CTGGAGGTCT 2280
TACCTGCAAC ATGTTGAGTT TATAATACTC ACTGATCACC ACAGTCTCAT GCACCTATCA 2340
GACCAAAGAC TCCACACCCC CTGGCAACAT AAGGCCTTCA CAAAGTTACT GGGTTTACAA 2400
TACAAGATTT GTTACAGAAA GGGGAGTACT AATGCTGCTG CAGATGCTCT TTCTAGAAAA 2460
CCACAGCAAT CTGAGGAAGA GTTCAATGCT ATTTCTCAGT GTGTTCCTCA ATGGCTGATG 2520
GAAGTTTTGC AGAGCTATGA TACTGATCCA CATGCTACTC AGTTAGTTGC TGCCCTGACA 2580
CTTAACCCTA ATTCTAAGCC TCATTTTTCA CTTCAGCATG GAGTCTTGAG ATATAAGGGA 2640
AATATTTGGA TAGGCAACAG TCCTGACTTG CAACTTAAAA TCATTAATGA GATGCATGCC 2700
AGTCCAGTAG GAGGGCACTC TGGATTTCCA GTGACTTACA GAAGAATCAA ACAATTATTT 2760
GCCTGGAATG GAATGAAATC CCAAATTAAG GAAACACTTG CCAATTGTCA AATTTGTGCT 2820
CAAGCAAAAC CGGATAGATC AAGGTATCCA AGACTATTGC AGCCTTTACC AGTCCCTAAA 2880
GGTGCTTGGC AGACTATATC ATTGGATTTC ATTGAAGGGC TTCCCAGGTC CAGTCACTAC 2940
AATTGCATCT TAGTAGTAGT GGACAAATTT TCTAAATACT CCCACTTCAT ACCCTTGAGA 3000
CATCCTTTCA CTGCAATTGA TGTAGCAAAG GCCTTCATGA GTAATGTGTA CAAACTCCAT 3060
GGACTTCCAC AAATTATCAT CTCTGACAGA GATAAAATTT TTACAAGCCA ATTGTGGGAG 3120
CAGTTATTTC TGAGATCTGG CACTAAATTA CATTACAGTT CTGCTTATCA CCCTCAATCA 3180
GATGGACAAA CAGAACGAGG AAATCAATGC CTGGAAATAT ATTTGAAATG CTTTGTCCAG 3240
TCAGCACCTT CCAAATGGCC TTCTTGGTTG CATTTAGCTG AGTACTGGTA TAACAACTCA 3300
TACCATTCAA CTATTGACAG AACTCCATTT GAAGCCCTGT ATGGATATCC ACCTCGCCAT 3360
TTTGGGATCA GCATCAGGGA CTGTGACAAC AGTGAGTTGA TAACTTGGAT GCAGGATAGA 3420
AAATTGATGC AACAGTTAGT TCAACAACAT CTGCATCGTG CACAGCAACA AATGAAACTT 3480
TTTGCTGATA AGAAAAGAAG TTTTCGACAG TTCCAGGTGG GTGAGTGGGT CTATCTCAAG 3540
TTACAGCCCT ATGTGCAAAG TTCAGTGGCC CCCAGAGCGA ATCACAAGCT GGCTTTCAAA 3600
TATTTTGGAC CATTTCAAAT CCTGGAGAAG TTGGGTACTG TTGCTTACAA ACTTCAGTTG 3660
CCAGCCACAA GTTCCATTCA TCCAGTCTTT CATGTATCAC TTTTAAAGGA AGCAAAGGGG 3720
TTCCAGCCAT CAGTGCATAC ACCATTATCT CCAACTTACT CTGCTGTCCA GTATCCTATA 3780
GCATTACTGG ATCATCGTCT TACCAAAAAA GCACAAGCTA CCTGGGAGGA TCTGCATGAC 3840
TTGAAGGCCA GATTTCCTAA TGCTTTGGCT TGGGGACAAG CCAAAACTCA AGGGGAAGGG 3900
ATTGTCAGAT CGTCATCTGA CAGTGCTGAG GAAGATACTG AAGAAGAACT GAGCTTGGAA 3960
GCAGAGCAGG ACGATGATGA GAAGATGCAA GCTGAAGCTG AAGATGCAGA GCAGGGAGGA 4020
GTTATGGACC ATCAACCCGT GTTGCGGCGC ACTTCAAGGA CTGTCAAGCC CAACCCACTG 4080
TACCATGGGC CACAATTAGA TCCTAACATC TTCTTCTTGT CCGTCTTCTG CGACGATCTC 4140
TACTCGCCGG CTCTAGTCGT CGTGGCCTTG ACCTTCAACC ACACCAACTT CGGCCCGGAC 4200
GAGCAGACGA ACATCAGGCT TGAAGGCGAC GCGGCCTTCA GCGCCGACGT CTCCTTCAGC 4260
GGCGACGGTG GCGGGTGGGT TGACATCAGC GCCAACAGGC TCGACGGCAA CATCGACCAC 4320
AGCCGTGGCA GGGTGTCGTA CGCCCTTCCG GTGGCCGCCG GCGACGACCG GTTCGTCGCC 4380
ATGGAGTTCG ACACCTTCAA CGACACCATC GTGCACGACC CCGACGCCAC CTACGACCAC 4440
CTCGGCGTCG ACGTCAACTC CGTCGTGTCC AAGAGGATCT TGACCTTGCC AAGCTTCACC 4500
CTCGTCGGGA ACATGACCGC CGTCGTCGAG TACGACAACG TCTCCAGCAT CCTGGCCATG 4560
AGGCTGCACC TCGGCTATGG GCTTAGTGGC CCCCGTCACC GGCCGGATTA CAACCTTAGC 4620
TACAAGGTTG ACCTCAAGAG CGTGTTGCCG GAGCAGGTCG CCGTCGGCTT CTCCGCCGCG 4680
ACGTCCACAT CCGTCGAGCT GCATCAGCTA CGTTCATGGT ACTTCAGCTC GTCGCTGGAA 4740
CCCAAGGCCA CACCGCCACC AGTGGCGCCG CCGTCACCGT CACCACCGCC TACCTCCGGT 4800
TCAGGAAGTG GCGGGGTTGT AGCGGGAGCC ATCGTCGGCG CAGCACTGTT CGTCGTGCTC 4860
CTCTTTGCCA TGGTAGCAGT CGTCGTACTC GTACGACGAC GACATCAGAG GAAGAAGATG 4920
AGGGAGGCGG AGGAGGCGAA CGACGACGAC GATGACACGG AAGGCGATCC CATCATGGAG 4980
ATCGAGAACG GGATGGGGCC GAGGCGGTTC GCGTACCATG TGCTCGTCAA CGCAACCAAG 5040
AGCTTCGCGG CGGAGGAGAA GCTTGGCCAG GGCGGATTCG GCGCAGTGTA CCGGGGCTAT 5100
CTAAGAGAGC AGGGCCTCGC CGTCGCCATT AAGAGGTTCA TCAAAGATTC CTCCAACCAA 5160
GGGAGGAGGG AGTACAAGTC AGAGATCAAG GTCATAAGCC GGCTGCGCCA CCGCAATCTG 5220
GTTCAGCTCA TCGGCTGGTT CCATGGCCGC AACGAGCTCC TCCTCGTCTA CGAGCTCGTC 5280
CCCAACCGCA GCCTTGACGT CCATCTCTAC GGCAATGGTA CCTTCCTCAC ATGGCCAATG 5340
AGGATCAACA TTGTTATTGG GCTCGGATCC GCCCTGCTCT ACCTCCATGA GGAGTGGGAG 5400
CAATGTGTTG TGCACCGTGA CATCAAGCCA AGCAATGTCA TGCTGGATGA ATCCTTCAAT 5460
ACGAAGCTAG GTGACTTCGG CCTCGCAAGG CTCATTGACC ATGCTGATGG GGTACAGACA 5520
ATGACGCACC CCTCTGGGAC GCCCGGCTAT ATTGACCCCG AATGTGTGAT CACTGGTAAA 5580
GCAAGTGCCG AATCTGATGT CTACAGTTTT GGGGTTGTTT TATTAGAAGT GGTATGTGCA 5640
AGGAGACCAA TGAGTCTACT GGATGATCAA AATAATGGTC TCTTCCGACT AGTCGAGTGG 5700
GTCTGGGATC TATACGGCCA GGGAGCCATT CACAACGCAG CTGACAAGCG GCTCAACAAT 5760
GACTATGATG TGGTTGAGAT GGAGCGTGTC ATTGCTGTCG GGCTCTGGTG TGCGCATCCA 5820
GACCGATGCC AGCGGCCATC AATCAGAGCT GCTATGATGG TCCTCCAGTC CAGTGGACCA 5880
ATGCCGATGC TACCAGCCAA GATGCCCGTG GCAACATATG CACCGCCGGT GGCCTCATCG 5940
GAGGGGCAGC TCTCGTCATC GACGGCTCCG CCTGCCCATC GCCTCACTCA AGAACTAGGG 6000
TTTTGGGTCG CTGGAAAAGA CATGAGAGGA AAGGATGAGG GAAATTGA 6048
Domain Profile
S: 1     fsednrigeGgfgevyrgelknt..avavkklkeeadlslkelkqsfltElkvlarfrHd  58
         f+ ++++g+Ggfg vyrg l+++  ava+k++   ++ s ++ ++++++E+kv++r+rH 
Q: 1682  FAAEEKLGQGGFGAVYRGYLREQglAVAIKRF---IKDSSNQGRREYKSEIKVISRLRHR  1738
         78899***************998446666666...566778899****************
S: 59    nilellgysaeseklcLvYqymknGsLedrLqcqkgsepLsWpqRlsillGtaraiefLH  118
         n+++l+g++  +++l LvY++++n sL  +L    + + L+Wp+R++i++G++ a+ +LH
Q: 1739  NLVQLIGWFHGRNELLLVYELVPNRSLDVHLYG--NGTFLTWPMRINIVIGLGSALLYLH  1796
         ***************************987765..6689*********************
S: 119   easp.slihgdiksaNiLLDekltpKlgDfglarfapesekqsstvlrtskvrgtlaYlp  177
         e+ +  ++h+dik++N++LDe+++ KlgDfglar+  +    ++ v +  + +gt  Y+ 
Q: 1797  EEWEqCVVHRDIKPSNVMLDESFNTKLGDFGLARLIDH----ADGVQTMTHPSGTPGYID  1852
         *988799***************************4443....34444445789*******
S: 178   eefirvgqltvkvDvySfGiVllEvltGlra.vdedrktkyLkdllkeeieeekveilek  236
         +e + +g+ + ++DvySfG+VllEv+  +r     d +++ L  l++  ++  +   +++
Q: 1853  PECVITGKASAESDVYSFGVVLLEVVCARRPmSLLDDQNNGLFRLVEWVWDLYGQGAIHN  1912
         **************************9999844567888899999999999999999999
S: 237   fldkkagkl.eeellealielalaclaekakkrPtmsq  273
           dk+ ++  +   +e +i ++l c++ ++ +rP++  
Q: 1913  AADKRLNNDyDVVEMERVIAVGLWCAHPDRCQRPSIRA  1950
         9****98731555567899***************9976
Domain Sequence
(FASTA)
FAAEEKLGQG GFGAVYRGYL REQGLAVAIK RFIKDSSNQG RREYKSEIKV ISRLRHRNLV 60
QLIGWFHGRN ELLLVYELVP NRSLDVHLYG NGTFLTWPMR INIVIGLGSA LLYLHEEWEQ 120
CVVHRDIKPS NVMLDESFNT KLGDFGLARL IDHADGVQTM THPSGTPGYI DPECVITGKA 180
SAESDVYSFG VVLLEVVCAR RPMSLLDDQN NGLFRLVEWV WDLYGQGAIH NAADKRLNND 240
YDVVEMERVI AVGLWCAHPD RCQRPSIRA 269
KeywordComplete proteome; Reference proteome.
Sequence SourceEnsembl
Orthology
Ortholog group
Gene Ontology
GO:0003723; F:RNA binding
GO:0003964; F:RNA-directed DNA polymerase activity
GO:0015074; P:DNA integration
GO:0006278; P:RNA-dependent DNA replication
KEGG
InterPros
IPR023780; Chromo_domain.
IPR016197; Chromodomain-like.
IPR001584; Integrase_cat-core.
IPR021109; Peptidase_aspartic.
IPR005162; Retrotrans_gag.
IPR012337; RNaseH-like_dom.
IPR013242; RVP_2.
IPR000477; RVT.
Pfam
PF00385; Chromo; 1.
PF03732; Retrotrans_gag; 1.
PF00665; rve; 1.
PF08284; RVP_2; 1.
PF00078; RVT_1; 1.
SMARTs
Prosites
PS50994; INTEGRASE; 1.
PS50878; RT_POL; 1.
Prints
Created Date20-Feb-2013