EKS-GOG-00123
Eukaryotic Protein Kinase & Protein Phosphatase Database
Tag: Content
EKPD ID: EKS-GOG-00123
Classification
Group/Family  Score  E-Value   Start  End   Domain Length
TK/Met        465.4  6.8E-140  1084   1342  259
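The Start, End and Domain Length values are mutually consistent if the coordinates are read as 1-based and inclusive. That reading is an assumption, though it matches the Q: positions in the Domain Profile. A minimal sketch of the arithmetic, using a hypothetical helper name:

```python
def domain_length(start: int, end: int) -> int:
    """Length of a domain given 1-based, inclusive start and end positions."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Start and End as listed in the Classification table of this record.
length = domain_length(1084, 1342)
print(length)
```

The result agrees with the Domain Length column and with the residue count of the Domain Sequence.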
Status: Unreviewed
Ensembl Protein: ENSGGOP00000011976
UniProt Accession: G3R996;
Protein Name
Protein Synonyms/Alias
Gene Name: MST1R
Gene Synonyms/Alias: MST1R;
Ensembl Information
Ensembl Gene ID     Ensembl Protein ID  Ensembl Transcript ID
ENSGGOG00000012260  ENSGGOP00000011976  ENSGGOT00000012320
Organism: Gorilla gorilla
Functional Description
Protein Length: 1401
Protein Sequence
(FASTA)
MELLPPLPQS FLLLLLLPAK LAAGEDWQCP RTPYAASRDF DMKYVVPSFS AGGLVQAMVT 60
YEGDRNESAV FVAIRNRLHV LGPDLKSVQS LATGPAGDPG CQTCAACGPG PHGPPGDTDT 120
KVLVLDPALP ALVSCGSSLQ GRCFLHDLEP QGTAVHLAAP ACLFSAHHNR PDDCPDCVAS 180
PLGTRVTVVE QGQASYFYVA SSLDAAVAAS FSPRSVSIRR LKADASGFAP GFVALSVLPK 240
HLVSYSIEYV HSFHTGAFVY FLTVQPASVT DDPSALHTRL ARLSATEPEL GDYRELVLNC 300
RFAPKRRRRG APEGGQSYPV LRVAHSAPVG AQLATELSIA EGQEVLFGVF VTGKDGGPGV 360
GPNSVVCAFP IDLLDTLIDE GVERCCESPV HPGLRRGLDF FQLPSFCPNP PGLEALSPNT 420
SCRHFPLLVS SSFSRVDLFN GLLGPVQVTA LYVTRLDNVT VAHMGTMDGR ILQVELARSL 480
NYLLYVSNFS LGDSGQPVQR DVSRLGDHLL FASGDQVFQV PIRGPGCRHF LTCGRCLSAW 540
RFMGCGWCGN MCGQQKECPG SWQQDHCPPK LTEFHPHSGP LRGSTRLTLC GSNFYLHPSG 600
LVPEGTHQVT VGQSPCRPLP KDSSKLRPVP RKDFVEEFEC ELEPLGTQAV GPTNVSLTVT 660
NMPPGKHFRV DGTSVLRGFS FMEPVLIAVQ PLFGPRAGGT CLTLEGQSLS VGTSRAVLVN 720
GTECLLARVS EGQLLCATPP GATVASVPLS LQVGGAQVPG SWTFQYREDP VVLSISPNCG 780
YINSHITICG QHLTSAWHLV LSFHDGLRAV ESRQCERQLP EQQLCRLPEY VVRDPQGWVA 840
GNLSARGDGA AGFTLPGFRF LPPPHPPSAN LVPLKPEEHA IKFEYIGLGA VADCVGINVT 900
VGGESCQHEF RGDMVVCPLP PSLQLGQDGA PLQVCVDGEC HILGRVVRPG PDGVPQSTLL 960
GILLPLLLLV AALATALVFS YWWRRKQLVL PLNLNDLASL DQTAGAIPLP ILYSGSDYRS 1020
GLALPAIDGL DSTTCVHGAS FSDSKDESCV PLLRKESIQL RDLDSALLVE VKDVLIPHER 1080
VVTHSDRVIG KGHFGVVYHG EYIDQAQNRI QCAIKSLSRI TEMQQVEAFL REGLLMRGLN 1140
HPNVLALIGI MLPPEGLPHV LLPYMCHGDL LQFIRSPQRN PTVKDLISFG LQVARGMEYL 1200
AEQKFVHRDL AARNCMLDES FTVKVADFGL ARDILDKEYY SVRQHRHARL PVKWMALESL 1260
QTYRFTTKSD VWSFGVLLWE LLTRGAPPYR HIDPFDLTHF LAQGRRLPQP EYCPDSLYQV 1320
MQQCWEADPA VRPTFGVLVG EVEQIVSALL GDHYVQLPAT YMNLGPSTSH EMNVRPEQPQ 1380
SSPMPGNVRR PRPLSEPPRP T 1401
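The sequence above is laid out in blocks of ten residues with a running residue count at the end of each line. A minimal sketch of how such lines could be turned back into a plain sequence string while checking the counters; the function name and the `offset` parameter are hypothetical, and only the last two lines of the record are used as input:

```python
def parse_blocked_sequence(text: str, offset: int = 0) -> str:
    """Join space-separated blocks and verify each line's running counter.

    `offset` is the number of residues that precede the first line given.
    """
    chunks = []
    total = offset
    for line in text.strip().splitlines():
        *blocks, counter = line.split()   # last field is the running count
        chunk = "".join(blocks)
        total += len(chunk)
        if int(counter) != total:
            raise ValueError(f"counter {counter} does not match {total}")
        chunks.append(chunk)
    return "".join(chunks)


# Final two lines of the protein sequence; 1320 residues come before them.
tail_lines = """
MQQCWEADPA VRPTFGVLVG EVEQIVSALL GDHYVQLPAT YMNLGPSTSH EMNVRPEQPQ 1380
SSPMPGNVRR PRPLSEPPRP T 1401
"""
tail = parse_blocked_sequence(tail_lines, offset=1320)
print(len(tail), tail[-5:])
```

Passing all lines with `offset=0` would check the whole sequence against the stated Protein Length in the same way.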
Nucleotide Sequence
(FASTA)
ATGGAGCTCC TCCCGCCGCT GCCTCAGTCC TTCCTGTTGC TGCTGCTGTT GCCTGCCAAG 60
CTCGCGGCGG GCGAGGACTG GCAGTGCCCG CGCACCCCCT ACGCGGCCTC TCGCGACTTT 120
GACATGAAGT ACGTGGTGCC CAGCTTCTCC GCCGGAGGCC TGGTGCAGGC CATGGTGACC 180
TACGAGGGCG ACAGAAATGA GAGTGCTGTG TTTGTAGCCA TACGCAATCG CCTGCATGTG 240
CTTGGGCCTG ACCTGAAGTC TGTCCAGAGC CTGGCCACGG GCCCTGCTGG AGACCCTGGC 300
TGCCAGACGT GTGCAGCCTG TGGCCCAGGA CCCCACGGCC CTCCCGGTGA CACAGACACA 360
AAGGTGCTGG TGCTGGATCC CGCGCTGCCT GCGCTGGTCA GTTGTGGCTC CAGCCTGCAG 420
GGCCGCTGCT TCCTGCATGA CCTAGAGCCC CAAGGGACAG CCGTGCATCT GGCAGCGCCA 480
GCCTGCCTCT TCTCAGCCCA CCATAACCGG CCCGATGACT GCCCCGACTG TGTGGCCAGC 540
CCATTGGGCA CCCGTGTAAC TGTGGTTGAG CAAGGCCAGG CCTCCTATTT CTACGTGGCA 600
TCCTCACTGG ACGCAGCCGT GGCTGCCAGC TTCAGCCCAC GCTCAGTGTC TATCAGGCGT 660
CTCAAGGCCG ACGCCTCGGG ATTCGCACCG GGCTTTGTGG CGTTGTCAGT GCTGCCCAAG 720
CATCTTGTCT CCTACAGTAT TGAATACGTG CACAGCTTCC ACACGGGAGC CTTCGTATAC 780
TTCCTGACTG TACAGCCGGC CAGCGTGACA GATGATCCTA GTGCCCTGCA CACACGCCTG 840
GCACGGCTTA GCGCCACTGA GCCAGAGTTG GGTGACTATC GGGAGCTGGT CCTCAACTGC 900
AGATTTGCTC CAAAACGCAG GCGCCGGGGG GCCCCAGAAG GCGGACAGTC CTACCCTGTG 960
CTGCGGGTGG CCCACTCCGC TCCAGTGGGT GCCCAACTTG CCACTGAGCT GAGCATCGCT 1020
GAGGGCCAGG AAGTGCTATT TGGGGTCTTT GTGACTGGCA AGGATGGTGG TCCTGGCGTG 1080
GGCCCCAACT CTGTCGTCTG TGCCTTCCCC ATTGACCTGC TGGACACACT AATTGATGAG 1140
GGTGTGGAGC GCTGTTGTGA ATCCCCAGTC CATCCAGGCC TCCGGCGAGG CCTCGACTTC 1200
TTCCAGTTGC CCAGTTTTTG CCCCAACCCG CCTGGCCTGG AAGCCCTCAG CCCCAACACC 1260
AGCTGCCGCC ACTTCCCTCT GCTGGTCAGT AGCAGCTTCT CACGTGTGGA CCTATTCAAT 1320
GGGCTGTTGG GACCAGTACA GGTCACTGCA TTGTATGTGA CACGCCTTGA CAACGTCACA 1380
GTGGCGCACA TGGGCACAAT GGATGGGCGT ATCCTGCAGG TGGAGCTGGC CAGGTCACTA 1440
AACTACTTGC TGTATGTGTC CAACTTCTCA CTGGGTGACA GTGGGCAGCC CGTGCAGCGG 1500
GATGTCAGTC GTCTTGGGGA CCACCTACTC TTCGCCTCTG GGGACCAGGT TTTCCAGGTA 1560
CCTATCCGAG GCCCTGGCTG CCGCCACTTC CTGACCTGTG GGCGTTGCCT AAGTGCATGG 1620
CGTTTCATGG GCTGTGGCTG GTGTGGGAAC ATGTGCGGCC AGCAGAAGGA GTGTCCTGGC 1680
TCCTGGCAAC AGGACCACTG CCCACCTAAG CTTACTGAGT TCCACCCCCA CAGTGGACCT 1740
CTAAGGGGCA GTACAAGGCT GACCCTGTGT GGCTCCAACT TCTACCTGCA CCCTTCTGGT 1800
CTGGTGCCTG AGGGAACCCA TCAGGTCACT GTGGGCCAAA GTCCCTGCCG GCCACTGCCC 1860
AAGGACAGCT CAAAACTCAG ACCAGTGCCC CGGAAAGACT TTGTAGAGGA GTTTGAGTGT 1920
GAACTGGAGC CCTTGGGCAC CCAGGCAGTG GGGCCTACCA ACGTCAGCCT CACCGTGACT 1980
AACATGCCAC CGGGCAAGCA CTTCCGGGTA GACGGCACCT CCGTGCTGAG AGGCTTCTCT 2040
TTTATGGAGC CAGTGCTGAT AGCAGTGCAA CCCCTCTTTG GCCCACGGGC AGGAGGCACC 2100
TGTCTCACTC TTGAAGGCCA GAGTCTGTCT GTAGGCACCA GCCGGGCTGT GCTGGTCAAT 2160
GGGACTGAGT GTCTGCTAGC ACGGGTCAGT GAGGGGCAGC TTTTATGTGC CACACCCCCT 2220
GGGGCCACGG TGGCCAGTGT CCCCCTTAGC CTGCAGGTGG GGGGTGCCCA GGTACCTGGT 2280
TCCTGGACCT TCCAGTACAG AGAAGACCCT GTCGTGCTAA GCATCAGCCC CAACTGTGGC 2340
TACATCAACT CCCACATCAC CATCTGTGGC CAGCATCTAA CTTCAGCATG GCACTTAGTG 2400
CTGTCATTCC ATGACGGGCT TAGGGCAGTG GAAAGCAGGC AGTGTGAGAG GCAGCTTCCA 2460
GAGCAGCAGC TGTGCCGCCT TCCTGAATAT GTGGTCCGAG ACCCCCAGGG ATGGGTGGCA 2520
GGGAATCTGA GTGCCCGGGG GGATGGAGCT GCTGGCTTTA CACTGCCTGG CTTTCGCTTC 2580
CTACCCCCAC CCCATCCACC CAGTGCCAAC CTAGTTCCAC TGAAGCCTGA GGAGCATGCC 2640
ATTAAGTTTG AGTATATTGG GCTGGGCGCT GTGGCTGACT GTGTGGGTAT CAACGTGACC 2700
GTGGGTGGTG AGAGCTGCCA GCACGAGTTC CGGGGGGACA TGGTTGTCTG CCCCCTGCCC 2760
CCATCCCTGC AGCTTGGCCA GGATGGTGCC CCATTGCAGG TCTGCGTAGA TGGTGAATGT 2820
CATATCCTGG GTAGAGTGGT GCGGCCAGGG CCAGACGGGG TCCCACAGAG CACGCTCCTT 2880
GGTATCCTGC TGCCTTTGCT GCTGCTTGTG GCTGCACTGG CGACTGCACT GGTCTTCAGC 2940
TACTGGTGGC GGAGGAAGCA GCTAGTTCTT CCTCTCAACC TGAATGACCT GGCATCCCTG 3000
GACCAGACTG CTGGAGCCAT ACCCCTGCCT ATTCTGTACT CGGGCTCTGA CTACAGAAGT 3060
GGCCTTGCAC TCCCTGCCAT TGATGGTCTG GATTCCACCA CTTGTGTCCA TGGAGCATCC 3120
TTCTCCGATA GTAAAGATGA ATCCTGTGTG CCACTGCTGC GGAAAGAGTC CATCCAGCTA 3180
AGGGACCTGG ACTCTGCGCT CTTGGTTGAG GTCAAGGATG TGCTGATTCC CCATGAGCGG 3240
GTGGTCACCC ACAGTGACCG AGTCATTGGC AAAGGCCACT TTGGAGTTGT CTACCACGGA 3300
GAATACATAG ACCAGGCCCA GAATCGAATC CAATGTGCCA TCAAGTCACT AAGTCGCATC 3360
ACAGAGATGC AGCAGGTGGA GGCCTTCCTG CGAGAGGGGC TGCTCATGCG TGGCCTGAAC 3420
CACCCGAATG TGCTGGCTCT CATTGGTATC ATGTTGCCAC CTGAGGGCCT GCCCCATGTG 3480
CTGCTGCCCT ATATGTGCCA CGGTGACCTG CTCCAGTTCA TCCGCTCACC TCAGCGGAAC 3540
CCCACCGTGA AGGACCTCAT CAGCTTTGGC CTGCAGGTAG CCCGCGGCAT GGAGTACCTG 3600
GCAGAGCAGA AGTTTGTGCA CAGGGACCTG GCTGCGCGGA ACTGCATGCT GGACGAGTCA 3660
TTCACAGTCA AGGTGGCTGA CTTTGGTTTG GCCCGCGACA TCCTGGACAA GGAGTACTAT 3720
AGTGTTCGAC AGCATCGCCA CGCTCGCCTA CCTGTGAAGT GGATGGCGCT GGAGAGCCTG 3780
CAGACCTATA GATTTACCAC CAAGTCTGAT GTGTGGTCAT TTGGTGTGCT GCTGTGGGAA 3840
CTGCTGACAC GGGGTGCCCC ACCATACCGC CACATTGACC CTTTTGACCT TACCCACTTC 3900
CTGGCCCAGG GTCGGCGCCT GCCCCAGCCT GAGTATTGCC CTGATTCTCT GTACCAAGTG 3960
ATGCAGCAAT GCTGGGAGGC AGACCCAGCA GTGCGACCCA CCTTCGGAGT ACTAGTGGGG 4020
GAGGTGGAGC AGATAGTGTC TGCACTGCTT GGGGACCATT ATGTGCAGCT GCCAGCAACC 4080
TACATGAACT TGGGCCCCAG CACCTCTCAT GAGATGAATG TGCGTCCAGA ACAGCCACAG 4140
TCCTCACCCA TGCCAGGGAA TGTACGCCGG CCCCGGCCAC TCTCAGAGCC TCCTCGGCCC 4200
ACTTGA 4206
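The nucleotide length is consistent with the protein length: 4206 = 3 × 1401 + 3, that is, 1401 codons followed by one stop codon (TGA). A minimal sketch of a translation check, assuming the standard genetic code applies; the codon table is built locally, and the function name is hypothetical:

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()
    if len(cds) % 3:
        raise ValueError("sequence length is not a multiple of three")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First line of the nucleotide sequence (positions 1-60) and the final six bases.
first = "ATGGAGCTCC TCCCGCCGCT GCCTCAGTCC TTCCTGTTGC TGCTGCTGTT GCCTGCCAAG"
print(translate(first))
print(translate("ACTTGA"))
```

The first line translates to the first 20 residues of the protein sequence, and the last two codons give the final threonine followed by the stop.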
Domain Profile
S: 2     dkeeviGkGhfGvvykgelsdsaskkikvavksleritdiekveeflreglvmkeldhpn  61
         ++++viGkGhfGvvy+ge++d+a+++i++a+ksl+rit++++ve+flregl+m++l+hpn
Q: 1084  HSDRVIGKGHFGVVYHGEYIDQAQNRIQCAIKSLSRITEMQQVEAFLREGLLMRGLNHPN  1143
         7899********************************************************
S: 62    vlsllGialddeglplvvlpymkkGdlksfirneernltvkdllefalqvakgmeylask  121
         vl+l+Gi+l++eglp+v+lpym++Gdl +fir+++rn+tvkdl++f+lqva+gmeyla++
Q: 1144  VLALIGIMLPPEGLPHVLLPYMCHGDLLQFIRSPQRNPTVKDLISFGLQVARGMEYLAEQ  1203
         ************************************************************
S: 122   kfvhrdlaarnclldekltvkvadfGlardvldkelyvveeerdarlpvkwlaleslqtf  181
         kfvhrdlaarnc+lde++tvkvadfGlard+ldke+y+v+++r+arlpvkw+aleslqt+
Q: 1204  KFVHRDLAARNCMLDESFTVKVADFGLARDILDKEYYSVRQHRHARLPVKWMALESLQTY  1263
         ************************************************************
S: 182   kqfttksdvwsfGvllwelltrgatpysevasfdltlylkegrrlrkpeycpdklydvml  241
         + fttksdvwsfGvllwelltrga+py+++++fdlt++l++grrl++peycpd+ly+vm+
Q: 1264  R-FTTKSDVWSFGVLLWELLTRGAPPYRHIDPFDLTHFLAQGRRLPQPEYCPDSLYQVMQ  1322
         *.**********************************************************
S: 242   kcwkakpaerpqfsdlvsii  261
         +cw+a+pa rp+f  lv  +
Q: 1323  QCWEADPAVRPTFGVLVGEV  1342
         *************9998765
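The layout above resembles HMMER-style alignment output. Under the assumption that the S: rows are the profile consensus and the Q: rows are the query protein, the letters on the middle line mark identical positions. A minimal sketch that counts identities for one block (the function name is hypothetical):

```python
def count_identities(consensus: str, query: str) -> int:
    """Count aligned positions where both rows carry the same residue.

    Comparison is case-insensitive; gap characters never count as a match.
    """
    if len(consensus) != len(query):
        raise ValueError("aligned rows must have equal length")
    return sum(
        1
        for s, q in zip(consensus, query)
        if s.isalpha() and s.upper() == q.upper()
    )


# Last alignment block of the profile: S: 242-261 against Q: 1323-1342.
s_row = "kcwkakpaerpqfsdlvsii"
q_row = "QCWEADPAVRPTFGVLVGEV"
matches = count_identities(s_row, q_row)
print(matches, len(q_row))
```

For this block the count equals the number of letters on the middle line of the alignment.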
Domain Sequence
(FASTA)
HSDRVIGKGH FGVVYHGEYI DQAQNRIQCA IKSLSRITEM QQVEAFLREG LLMRGLNHPN 60
VLALIGIMLP PEGLPHVLLP YMCHGDLLQF IRSPQRNPTV KDLISFGLQV ARGMEYLAEQ 120
KFVHRDLAAR NCMLDESFTV KVADFGLARD ILDKEYYSVR QHRHARLPVK WMALESLQTY 180
RFTTKSDVWS FGVLLWELLT RGAPPYRHID PFDLTHFLAQ GRRLPQPEYC PDSLYQVMQQ 240
CWEADPAVRP TFGVLVGEV 259
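The domain sequence is the stretch of the full protein from position 1084 to 1342. A minimal sketch of converting such 1-based, inclusive coordinates into a Python slice, using only the protein line that covers residues 1081 to 1140 as input; the helper name and the `first_position` parameter are hypothetical:

```python
def extract(segment: str, first_position: int, start: int, end: int) -> str:
    """Return residues start..end (1-based, inclusive) from a segment
    whose first residue sits at `first_position` in the full protein."""
    lo = start - first_position
    hi = end - first_position + 1
    if lo < 0 or hi > len(segment):
        raise ValueError("requested range lies outside the segment")
    return segment[lo:hi]


# Protein residues 1081-1140, copied from the Protein Sequence field.
segment = "VVTHSDRVIGKGHFGVVYHGEYIDQAQNRIQCAIKSLSRITEMQQVEAFLREGLLMRGLN"
print(extract(segment, 1081, 1084, 1093))
```

The ten residues returned match the first block of the Domain Sequence.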
Keyword: ATP-binding; Complete proteome; Disulfide bond; Kinase; Membrane; Nucleotide-binding; Phosphoprotein; Receptor; Reference proteome; Transferase; Transmembrane; Transmembrane helix; Tyrosine-protein kinase.
Sequence Source: Ensembl
Orthology
Ortholog group
Ailuropoda melanoleuca: EKS-AIM-00116
Bos taurus: EKS-BOT-00124
Callithrix jacchus: EKS-CAJ-00126
Canis familiaris: EKS-CAF-00127
Cavia porcellus: EKS-CAP-00121
Danio rerio: EKS-DAR-00380
Echinops telfairi: EKS-ECT-00004
Equus caballus: EKS-EQC-00120
Felis catus: EKS-FEC-00115
Gasterosteus aculeatus: EKS-GAA-00144
Homo sapiens: EKS-HOS-00126
Ictidomys tridecemlineatus: EKS-ICT-00116
Loxodonta africana: EKS-LOA-00119
Macaca mulatta: EKS-MAM-00120
Monodelphis domestica: EKS-MOD-00121
Mus musculus: EKS-MUM-00125
Mustela putorius furo: EKS-MUP-00125
Myotis lucifugus: EKS-MYL-00134
Nomascus leucogenys: EKS-NOL-00116
Oreochromis niloticus: EKS-ORN-00151
Ornithorhynchus anatinus: EKS-ORA-00102
Oryctolagus cuniculus: EKS-ORC-00116
Oryzias latipes: EKS-ORL-00140
Pan troglodytes: EKS-PAT-00112
Pteropus vampyrus: EKS-PTV-00106
Rattus norvegicus: EKS-RAN-00121
Sarcophilus harrisii: EKS-SAH-00116
Sorex araneus: EKS-SOA-00004
Sus scrofa: EKS-SUS-00110
Takifugu rubripes: EKS-TAR-00155
Tetraodon nigroviridis: EKS-TEN-00144
Tursiops truncatus: EKS-TUT-00176
Xiphophorus maculatus: EKS-XIM-00148
Gene Ontology
GO:0016021; C:integral to membrane
GO:0005524; F:ATP binding
GO:0004714; F:transmembrane receptor protein tyrosine kinase activity
GO:0007275; P:multicellular organismal development
GO:0007169; P:transmembrane receptor protein tyrosine kinase signaling pathway
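Each Gene Ontology line pairs a GO identifier with a one-letter aspect code and a term name. The codes C, F and P stand for cellular component, molecular function and biological process. A minimal sketch of splitting such a line into its parts; the class and function names are hypothetical:

```python
from typing import NamedTuple

ASPECTS = {
    "C": "cellular component",
    "F": "molecular function",
    "P": "biological process",
}


class GoTerm(NamedTuple):
    go_id: str
    aspect: str
    name: str


def parse_go_line(line: str) -> GoTerm:
    """Split 'GO:nnnnnnn; X:term name' into identifier, aspect and name."""
    go_id, rest = line.split(";", 1)
    code, name = rest.strip().split(":", 1)
    return GoTerm(go_id.strip(), ASPECTS[code], name.strip())


term = parse_go_line(
    "GO:0004714; F:transmembrane receptor protein tyrosine kinase activity"
)
print(term.go_id, "|", term.aspect, "|", term.name)
```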
KEGG
InterPros
IPR013783; Ig-like_fold.
IPR014756; Ig_E-set.
IPR002909; IPT_TIG_rcpt.
IPR011009; Kinase-like_dom.
IPR003659; Plexin-like.
IPR016201; Plexin-like_fold.
IPR002165; Plexin_repeat.
IPR000719; Prot_kinase_cat_dom.
IPR017441; Protein_kinase_ATP_BS.
IPR001627; Semaphorin/CD100_Ag.
IPR001245; Ser-Thr/Tyr_kinase_cat_dom.
IPR008266; Tyr_kinase_AS.
IPR020635; Tyr_kinase_cat_dom.
IPR016244; Tyr_kinase_HGF/MSP_rcpt.
IPR015943; WD40/YVTN_repeat-like_dom.
Pfam
PF07714; Pkinase_Tyr; 1.
PF01437; PSI; 1.
PF01403; Sema; 1.
PF01833; TIG; 2.
SMARTs
SM00429; IPT; 3.
SM00423; PSI; 1.
SM00630; Sema; 1.
SM00219; TyrKc; 1.
Prosites
PS00107; PROTEIN_KINASE_ATP; 1.
PS50011; PROTEIN_KINASE_DOM; 1.
PS00109; PROTEIN_KINASE_TYR; 1.
PS51004; SEMA; 1.
Prints
PR00109; TYRKINASE.
Created Date: 20-Feb-2013