EKS-ORN-00152
Eukaryotic Protein Kinase & Protein Phosphatase Database
Tag: Content
EKPD ID: EKS-ORN-00152
Classification
Group/Family | Score | E-Value | Start | End | Domain Length
TK/Met | 435.4 | 9.4E-131 | 1058 | 1315 | 258
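The Start, End and Domain Length values are consistent with 1-based, inclusive residue numbering (1315 - 1058 + 1 = 258). The sketch below illustrates that arithmetic and the matching slice on a fragment of the protein sequence. The function names are hypothetical, and the inclusive-coordinate reading is an assumption inferred from the numbers in this record rather than something the database states.

```python
def domain_length(start: int, end: int) -> int:
    """Length of a domain given 1-based, inclusive start/end residue positions."""
    return end - start + 1


def slice_residues(segment: str, segment_first: int, start: int, end: int) -> str:
    """Extract residues start..end (1-based, inclusive) from a segment whose
    first character is residue number `segment_first` in the full protein."""
    return segment[start - segment_first : end - segment_first + 1]


# Residues 1021-1080 of the protein, copied from this record with spaces removed.
SEGMENT_1021_1080 = "LSVPLMSRDNISMVSLNSNLLEEVKDVLIPPEMLSIEDSQIIGKGHFGTVYHGYLLDSNK"

print(domain_length(1058, 1315))
print(slice_residues(SEGMENT_1021_1080, 1021, 1058, 1080))
```

Under these assumptions the extracted fragment should begin with the same residues as the Domain Sequence listed further down in the record.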
Status: Unreviewed
Ensembl Protein: ENSONIP00000005149
UniProt Accession: I3J8F9
Protein Name
Protein Synonyms/Alias
Gene Name: MST1R (2 of 2)
Gene Synonyms/Alias: MST1R (2 of 2)
Ensembl Information
Ensembl Gene ID | Ensembl Protein ID | Ensembl Transcript ID
ENSONIG00000004085 | ENSONIP00000005149 | ENSONIT00000005153
Organism: Oreochromis niloticus
Functional Description
Protein Length: 1364
Protein Sequence
(FASTA)
MVSLTALLTV WIWMQSQTAS GQHTCPSTPQ RSIDFTVKYS LPHFQTPKPI QNIAVNPDNE 60
QVYIGYQNAV VGVNSSMNKI WELKTGPIGS PDCETCLACD VEVKPEDPVD TDNEILLLDP 120
ALNIVPYLYI CGSTQYGICY FTNITLEVPR PQCLYTKERN SPTYCPDCVA SPFGTKATIA 180
EDGQISYFFV AASVNDRITQ TYPRRSLSVL RPLSTEDGFD MVTNVTVLPG LRDSYSINYI 240
YSFSAGDYVY FLFLQRENPS KSDSAFQTRL GRLSISNPEM WMYRELILEC RYEPKRRRRR 300
RGVFKDTVYN GLQAAYFGQI GKDLADELRV AEGENILYGV FAEVNKHGQP QKNSALCAFP 360
LAKINQAIEA GAEACCKSST EQLSRGLCHF QPCENCPHEN SDNRCTDKPT LVSKPHYRVD 420
LFNSEMRNVL FTSVLVNTIG THTVGHFGTS DGRILQMILS LYNRIIFANY SLGETAVSRN 480
AAEYSEDSLL FVVGNKMFRV PSTGPGCAHF KTCPMCLNAP SFMNCGWCSG ICSRQEECSS 540
QWNKKSCGPI ITEFFPKMAP AGGVTEVTLC GSEFQSSVRL AIISGKSHIV TVGSGTVCDV 600
LPEKSNSNRL VCKIKEETPN QGLNITVEVH EGEVKGIYSV EGTAQISGLS FVTPSITEIK 660
PDYGPMFGGT AVTLTGRYLN TGVTRNVQIG DKNCTIQNVS EGNGNLSSIV CHTEAAEAVG 720
ELPVIVIIDS LQVTDAKMFS YKNNPVINSV LPNCSFQRGS ELVIEGQNLD VANKIVVEYT 780
WKSRSSERFQ QVCNGTRNAT HMECWAPAFP ERIPDTIFKT GALSIHIDGK NDVWKRNFSY 840
YPNAQVIPFE NDDSILLLKP GETEVSLHHK KLDAVQTCMK ITMTIGDVVC KAQVLQNELT 900
CRVPKGLVVP SDGLPVRVFV NHEVYDVGTV LFDDSFNQNA IVGIVLGIIA ALVVGAALAL 960
IVMIHLRKKK RATLENRLSR MMSHNLRPSN NNPSSPTDDY RQGSAGLAFQ GLLYSASSDH 1020
LSVPLMSRDN ISMVSLNSNL LEEVKDVLIP PEMLSIEDSQ IIGKGHFGTV YHGYLLDSNK 1080
KEIHCAIKSL NRITDFGEVD QFLREGIIMK GFNHPNILSL LGIMLPKEGL PLVVLPYMKH 1140
GDLRHFIRSE NRNPTVKDLI GFGLQVARGM EYLAQKKFVH RDLAARNCML DENFTVKVAD 1200
FGMARDVYDK EYYSIQDQKR VKLPVKWMAI ESLQTQKFTT KSDVWSYGIL LWELLTRGAS 1260
PYPAVDPYDI TQYLLKGRRL LQPQYCPDTL YVIMLACWDP EPECRPTFHT LATEVQRILS 1320
NLKGEHYISL KVTYVNLDQP RPYSSLVGNE ATTSDLDTDT NAAS 1364
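The sequence above is printed in 10-residue groups with a running position count at the end of each line. A minimal sketch of turning such a block into one contiguous string follows. The function name `parse_sequence_block` is hypothetical, and only the first two lines of the record are used as sample input.

```python
def parse_sequence_block(block: str) -> str:
    """Join a sequence printed in 10-character groups, dropping the
    running position count that ends each line."""
    residues = []
    for line in block.strip().splitlines():
        for token in line.split():
            if not token.isdigit():  # numeric tokens are position counts, not residues
                residues.append(token)
    return "".join(residues)


# First two lines of the protein sequence block from this record.
SAMPLE_BLOCK = """
MVSLTALLTV WIWMQSQTAS GQHTCPSTPQ RSIDFTVKYS LPHFQTPKPI QNIAVNPDNE 60
QVYIGYQNAV VGVNSSMNKI WELKTGPIGS PDCETCLACD VEVKPEDPVD TDNEILLLDP 120
"""

sequence = parse_sequence_block(SAMPLE_BLOCK)
print(len(sequence))
print(sequence[:20])
```

Applied to the full block, the length of the result can be compared against the Protein Length field as a sanity check.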
Nucleotide Sequence
(FASTA)
ATGGTCAGTT TGACTGCCTT GTTGACAGTA TGGATATGGA TGCAATCACA AACTGCTTCG 60
GGGCAGCACA CATGTCCTTC TACACCTCAG AGGTCCATAG ATTTCACTGT AAAATATTCC 120
CTCCCCCACT TCCAAACCCC AAAACCAATA CAGAACATTG CAGTGAACCC GGATAATGAA 180
CAAGTTTACA TTGGATATCA GAATGCGGTA GTGGGAGTCA ACAGTTCTAT GAATAAAATA 240
TGGGAGTTAA AAACTGGACC TATAGGCAGT CCTGATTGTG AAACCTGTCT GGCATGCGAT 300
GTAGAAGTTA AACCTGAGGA CCCAGTGGAT ACAGACAACG AGATTCTGCT TTTGGATCCT 360
GCTCTAAATA TAGTTCCATA CTTGTACATT TGTGGAAGTA CTCAGTATGG GATCTGTTAC 420
TTTACTAACA TTACCCTTGA AGTTCCTAGA CCTCAATGTT TATACACAAA AGAAAGGAAC 480
TCTCCAACCT ACTGTCCTGA CTGTGTGGCC AGCCCATTTG GTACCAAGGC CACAATCGCT 540
GAAGATGGGC AGATATCATA CTTCTTTGTT GCAGCCTCTG TCAATGACAG AATAACGCAG 600
ACGTACCCAA GGAGGTCATT ATCAGTGCTG AGGCCACTTT CAACTGAAGA TGGCTTTGAC 660
ATGGTCACAA ACGTGACAGT GCTTCCTGGA CTACGTGACT CTTACAGCAT TAACTACATC 720
TACAGTTTCT CTGCCGGGGA TTATGTCTAC TTTCTTTTCT TGCAGAGGGA AAATCCCTCC 780
AAGAGCGATT CAGCTTTTCA GACTCGTCTG GGACGACTGT CTATTTCTAA TCCAGAGATG 840
TGGATGTACA GAGAGCTTAT TTTGGAGTGC CGGTACGAGC CAAAGCGCAG GAGAAGACGG 900
AGAGGGGTTT TCAAGGACAC TGTTTATAAT GGATTACAGG CAGCATATTT CGGACAAATC 960
GGGAAGGATT TGGCTGATGA GCTGAGGGTA GCTGAGGGAG AAAACATATT GTATGGGGTG 1020
TTTGCGGAGG TAAACAAGCA TGGCCAGCCT CAGAAGAACT CAGCCCTCTG TGCCTTTCCT 1080
TTAGCTAAAA TAAACCAAGC AATCGAGGCT GGTGCGGAGG CCTGCTGTAA GTCAAGTACA 1140
GAGCAGCTAT CCAGAGGTCT CTGCCACTTC CAGCCATGTG AGAACTGCCC ACATGAAAAC 1200
TCTGATAACA GATGCACTGA TAAGCCCACT CTGGTGTCAA AGCCACACTA CAGAGTCGAT 1260
CTCTTCAACA GCGAGATGAG AAATGTACTG TTTACTTCAG TTCTGGTGAA CACGATTGGG 1320
ACTCATACGG TGGGCCATTT TGGCACGTCG GATGGCAGGA TACTCCAGAT GATTCTGAGT 1380
CTCTATAACC GTATTATTTT TGCCAACTAT TCACTGGGAG AGACTGCAGT GTCGAGGAAT 1440
GCAGCTGAGT ACTCTGAGGA TTCACTTCTC TTTGTGGTTG GAAATAAGAT GTTCAGGGTG 1500
CCCTCTACAG GACCAGGGTG TGCACATTTT AAGACGTGCC CCATGTGTTT GAACGCTCCG 1560
TCTTTCATGA ACTGTGGCTG GTGTTCGGGA ATCTGCTCAA GGCAGGAGGA GTGTAGCTCA 1620
CAGTGGAACA AAAAGTCTTG CGGACCTATC ATAACAGAGT TTTTTCCTAA AATGGCCCCC 1680
GCTGGTGGTG TGACTGAAGT GACGCTGTGC GGTTCAGAAT TTCAGTCTAG TGTGCGACTG 1740
GCCATCATCA GCGGCAAATC CCACATCGTT ACTGTGGGCT CAGGGACCGT GTGTGATGTC 1800
CTGCCTGAAA AGAGCAATAG TAACAGACTA GTGTGCAAAA TCAAGGAAGA AACACCAAAC 1860
CAGGGCCTCA ACATCACTGT GGAAGTGCAT GAGGGAGAAG TGAAGGGCAT TTATTCAGTT 1920
GAAGGCACAG CTCAGATTTC TGGCTTGTCA TTTGTGACAC CCAGCATAAC AGAAATCAAG 1980
CCTGATTACG GGCCCATGTT TGGAGGAACA GCGGTTACAT TAACAGGCAG ATATCTTAAC 2040
ACAGGAGTAA CAAGAAATGT CCAAATTGGA GATAAAAACT GCACCATTCA AAATGTTTCT 2100
GAAGGAAACG GGAATTTGTC TTCAATCGTC TGCCACACAG AAGCCGCTGA AGCTGTTGGG 2160
GAACTACCTG TGATAGTCAT TATTGACAGC CTTCAAGTGA CTGACGCTAA GATGTTTTCC 2220
TACAAGAATA ATCCTGTTAT AAACTCTGTG TTGCCGAATT GCAGTTTCCA AAGAGGCTCC 2280
GAGCTGGTGA TAGAAGGTCA GAATCTTGAT GTCGCTAACA AAATTGTGGT TGAGTACACT 2340
TGGAAATCTC GCTCCAGTGA GCGTTTTCAA CAGGTTTGCA ATGGCACAAG AAACGCCACA 2400
CACATGGAGT GCTGGGCTCC TGCTTTTCCA GAACGAATTC CAGATACCAT TTTTAAAACT 2460
GGAGCGCTTT CTATTCACAT AGATGGGAAA AATGACGTCT GGAAAAGAAA CTTTTCCTAC 2520
TACCCTAATG CCCAAGTCAT TCCCTTTGAA AATGATGACA GTATATTACT TTTAAAACCA 2580
GGAGAGACCG AGGTTTCACT GCATCATAAA AAATTGGATG CAGTACAAAC ATGTATGAAG 2640
ATCACAATGA CCATTGGTGA TGTGGTTTGC AAAGCACAGG TTTTGCAAAA TGAGCTGACC 2700
TGCAGGGTTC CTAAAGGGCT GGTTGTTCCC AGCGACGGGC TGCCTGTTAG GGTGTTTGTG 2760
AACCATGAAG TTTATGATGT GGGTACAGTG CTCTTTGATG ACAGCTTCAA CCAAAATGCG 2820
ATTGTAGGCA TTGTCCTGGG TATCATTGCC GCACTGGTAG TAGGAGCTGC CCTTGCATTA 2880
ATTGTGATGA TTCATTTAAG GAAAAAGAAG AGAGCCACCC TAGAGAATCG TTTATCAAGG 2940
ATGATGTCAC ATAATCTACG CCCGAGCAAC AATAATCCTT CTTCTCCAAC AGATGACTAC 3000
AGACAAGGTT CAGCAGGACT GGCTTTCCAG GGATTATTGT ACAGTGCCAG CTCTGATCAT 3060
CTGTCTGTTC CTTTGATGTC ACGAGACAAT ATCTCAATGG TTAGCCTGAA TTCCAATCTT 3120
CTTGAAGAGG TCAAAGATGT GCTAATCCCA CCTGAGATGC TCTCAATTGA GGATAGTCAG 3180
ATTATTGGCA AAGGTCACTT TGGGACAGTT TATCATGGAT ACCTGTTAGA CAGCAACAAG 3240
AAAGAAATCC ACTGTGCTAT TAAATCACTG AACAGGATAA CAGATTTCGG GGAAGTAGAC 3300
CAGTTCCTCA GAGAGGGGAT CATTATGAAA GGCTTTAACC ACCCTAACAT ACTGTCACTG 3360
CTGGGTATCA TGCTGCCCAA AGAAGGGCTC CCCCTGGTGG TTCTGCCGTA TATGAAGCAT 3420
GGAGATCTGC GCCATTTCAT CCGTTCTGAG AACAGGAATC CAACAGTGAA AGACTTGATT 3480
GGGTTTGGTC TTCAGGTTGC CAGGGGAATG GAGTACTTAG CCCAAAAGAA ATTTGTTCAC 3540
AGAGACCTGG CTGCGCGTAA CTGCATGCTG GATGAAAACT TCACAGTAAA GGTGGCTGAC 3600
TTTGGTATGG CAAGGGACGT CTACGACAAG GAGTACTACA GCATTCAAGA TCAGAAAAGG 3660
GTGAAGCTCC CAGTAAAGTG GATGGCCATC GAAAGCCTGC AAACACAGAA GTTCACAACC 3720
AAGTCTGATG TTTGGTCATA TGGCATCTTA CTGTGGGAGC TGTTAACCAG AGGTGCTAGC 3780
CCGTATCCAG CAGTGGACCC CTATGACATC ACGCAGTACT TGTTGAAGGG ACGTCGGCTT 3840
CTGCAGCCAC AGTATTGCCC AGACACCCTC TATGTAATCA TGCTGGCGTG TTGGGACCCG 3900
GAGCCCGAGT GTAGACCTAC CTTCCATACC TTGGCTACCG AAGTACAGCG CATCCTGTCC 3960
AATCTGAAAG GAGAGCACTA CATCAGTTTG AAGGTTACCT ACGTCAACTT AGACCAGCCA 4020
AGGCCTTACT CTTCCCTTGT TGGAAATGAA GCCACGACCT CAGACTTGGA CACAGACACT 4080
AATGCTGCCA GCTGA 4095
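The nucleotide length agrees with the protein length if the coding sequence includes a terminal stop codon: 3 × (1364 + 1) = 4095. The sketch below checks that relation and translates the first and last lines of the nucleotide block. It assumes the standard genetic code, which is not stated in this record, and the names `CODON_TABLE` and `translate` are illustrative only.

```python
from itertools import product

# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1; whitespace is ignored and any
    trailing partial codon is dropped. '*' marks a stop codon."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First 60 and last 15 nucleotides, copied from this record.
FIRST_60_NT = "ATGGTCAGTT TGACTGCCTT GTTGACAGTA TGGATATGGA TGCAATCACA AACTGCTTCG"
LAST_15_NT = "AATGCTGCCA GCTGA"

print(translate(FIRST_60_NT))
print(translate(LAST_15_NT))
print(4095 == 3 * (1364 + 1))
```

The last 15 nucleotides start at position 4081, which is a codon boundary, so translating them in frame 1 is valid here.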
Domain Profile
S: 3     keeviGkGhfGvvykgelsdsaskkikvavksleritdiekveeflreglvmkeldhpnv  62
          +++iGkGhfG+vy+g l+ds++k+i++a+ksl+ritd ++v++flreg++mk+++hpn+
Q: 1058  DSQIIGKGHFGTVYHGYLLDSNKKEIHCAIKSLNRITDFGEVDQFLREGIIMKGFNHPNI  1117
         5799********************************************************
S: 63    lsllGialddeglplvvlpymkkGdlksfirneernltvkdllefalqvakgmeylaskk  122
         lsllGi+l++eglplvvlpymk+Gdl++fir+e+rn+tvkdl++f+lqva+gmeyla+kk
Q: 1118  LSLLGIMLPKEGLPLVVLPYMKHGDLRHFIRSENRNPTVKDLIGFGLQVARGMEYLAQKK  1177
         ************************************************************
S: 123   fvhrdlaarnclldekltvkvadfGlardvldkelyvveeerdarlpvkwlaleslqtfk  182
         fvhrdlaarnc+lde++tvkvadfG+ardv+dke+y+ ++++  +lpvkw+a+eslqt+k
Q: 1178  FVHRDLAARNCMLDENFTVKVADFGMARDVYDKEYYSIQDQKRVKLPVKWMAIESLQTQK  1237
         ************************************************************
S: 183   qfttksdvwsfGvllwelltrgatpysevasfdltlylkegrrlrkpeycpdklydvmlk  242
          fttksdvws+G+llwelltrga+py+ v+++d+t+yl +grrl +p+ycpd+ly +ml+
Q: 1238  -FTTKSDVWSYGILLWELLTRGASPYPAVDPYDITQYLLKGRRLLQPQYCPDTLYVIMLA  1296
         .***********************************************************
S: 243   cwkakpaerpqfsdlvsii  261
         cw ++p+ rp+f  l + +
Q: 1297  CWDPEPECRPTFHTLATEV  1315
         ************9998876
Domain Sequence
(FASTA)
DSQIIGKGHF GTVYHGYLLD SNKKEIHCAI KSLNRITDFG EVDQFLREGI IMKGFNHPNI 60
LSLLGIMLPK EGLPLVVLPY MKHGDLRHFI RSENRNPTVK DLIGFGLQVA RGMEYLAQKK 120
FVHRDLAARN CMLDENFTVK VADFGMARDV YDKEYYSIQD QKRVKLPVKW MAIESLQTQK 180
FTTKSDVWSY GILLWELLTR GASPYPAVDP YDITQYLLKG RRLLQPQYCP DTLYVIMLAC 240
WDPEPECRPT FHTLATEV 258
Keyword: ATP-binding; Complete proteome; Disulfide bond; Kinase; Membrane; Nucleotide-binding; Phosphoprotein; Receptor; Reference proteome; Transferase; Transmembrane; Transmembrane helix; Tyrosine-protein kinase.
Sequence Source: Ensembl
Orthology
Ortholog group
Gasterosteus aculeatus: EKS-GAA-00143
Mustela putorius furo: EKS-MUP-00125
Rattus norvegicus: EKS-RAN-00121
Takifugu rubripes: EKS-TAR-00156
Tetraodon nigroviridis: EKS-TEN-00143
Xenopus tropicalis: EKS-XET-00103
Xiphophorus maculatus: EKS-XIM-00147
Gene Ontology
GO:0016021; C:integral to membrane
GO:0005524; F:ATP binding
GO:0004713; F:protein tyrosine kinase activity
GO:0004872; F:receptor activity
GO:0007275; P:multicellular organismal development
KEGG
InterPros
IPR013783; Ig-like_fold.
IPR014756; Ig_E-set.
IPR002909; IPT_TIG_rcpt.
IPR011009; Kinase-like_dom.
IPR003659; Plexin-like.
IPR016201; Plexin-like_fold.
IPR000719; Prot_kinase_cat_dom.
IPR017441; Protein_kinase_ATP_BS.
IPR001627; Semaphorin/CD100_Ag.
IPR001245; Ser-Thr/Tyr_kinase_cat_dom.
IPR008266; Tyr_kinase_AS.
IPR020635; Tyr_kinase_cat_dom.
IPR015943; WD40/YVTN_repeat-like_dom.
Pfam
PF07714; Pkinase_Tyr; 1.
PF01403; Sema; 1.
PF01833; TIG; 2.
SMARTs
SM00429; IPT; 3.
SM00423; PSI; 1.
SM00630; Sema; 1.
SM00219; TyrKc; 1.
Prosites
PS00107; PROTEIN_KINASE_ATP; 1.
PS50011; PROTEIN_KINASE_DOM; 1.
PS00109; PROTEIN_KINASE_TYR; 1.
PS51004; SEMA; 1.
Prints
PR00109; TYRKINASE.
Created Date: 20-Feb-2013