EKS-NOL-00116
Eukaryotic Protein Kinase & Protein Phosphatase Database
Tag: Content
EKPD ID: EKS-NOL-00116
Classification
Group/Family    Score    E-Value    Start    End     Domain Length
TK/Met          303.8    9.9E-91    1148     1316    169
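The Start, End, and Domain Length values agree with each other if the coordinates are read as 1-based and inclusive. That convention is an assumption here; the record does not state it. A minimal Python sketch of the arithmetic, using the hypothetical helper names `domain_length` and `domain_slice`:

```python
def domain_length(start: int, end: int) -> int:
    """Length of a region given 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1


def domain_slice(protein: str, start: int, end: int) -> str:
    """Cut a region out of a protein string using the same assumed convention."""
    return protein[start - 1:end]


# Values copied from the Classification table of this record.
START, END, REPORTED_LENGTH = 1148, 1316, 169

# True when the reported length matches the coordinates.
consistent = domain_length(START, END) == REPORTED_LENGTH
print(consistent)
```

Applied to the full protein sequence of this entry, `domain_slice(sequence, 1148, 1316)` should return the 169-residue kinase domain, provided the sequence has first been joined into one contiguous string.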
Status: Unreviewed
Ensembl Protein: ENSNLEP00000008444
UniProt Accession: G1R5H2
Protein Name
Protein Synonyms/Alias
Gene Name: MST1R
Gene Synonyms/Alias: MST1R
Ensembl Information
Ensembl Gene ID       Ensembl Protein ID    Ensembl Transcript ID
ENSNLEG00000006926    ENSNLEP00000008444    ENSNLET00000008843
Organism: Nomascus leucogenys
Functional Description
Protein Length: 1375
Protein Sequence
(FASTA)
MELLPPLPQS FLLLLLLPAK PAAGEDWQCP HTPYAASRDF DVKYVVPSFS AGGLVQAMVT 60
YEGDRNESAV FVAIRNRLHV LGPDLKSVQS LATGPAGDPG CQTCAACGPG PHGPPGDTDT 120
KVLVLEPALP ALISCGSSLQ GRCFLHDLEP QGTAVHLAGP ACLFSAHRNR PDDCPDCVAS 180
PLGTRVTVVE QGQASYFYVA SSLDAAVAAS FSPRSVSIRR LKADASGFAP GFVALSVLPK 240
HLVSYSIEYV HSFHTGAFVY FLTVQPASVT DAPSALHTRL ARLSATEPEL GDYRELVLDC 300
RFAPKRRRRG APEGGQPYPV LRVAHSAPVG AQLATELSIA EGQEVLFGVF VAGKDGGPGM 360
GPNSVVCAFP IDLLDTLIDE GVERCCESPV HPGLRRGLDF FQSPSFCPNP PGLEALSPNT 420
SCRHFPLLVS SSFSRVDLFN GLLGPVQVTA LYVTRLDNVT VAHMGTMDGR ILQVELARSL 480
NYLLYVSNFS LGDSGQPVQR DVSRLGDHLL FASGDQVFQV PIRGPGCHHF LTCGRCLRAQ 540
RFMGCGWCGN MCGQQKECPG SWQQDHCPPK LTEFHPHSGP LRGSTRLTLC GSNFYLHPSG 600
LVPEGTHQVT VGQSPCRPLP KDSSKLRSVP RKDFVEEFEC ELEPLGTQAV GPTNVSLTVT 660
NMPPGKHFRV DGTSVLRGFS FMEPVLIAVQ PLFGPRAGGT CLTLEGQSLS VGTSRAVLVN 720
GTECLLARVS EGQLLCATPP GATVASVPLS LQVGGAQVPG SWTFHYREDP VVLSISPKCG 780
YINSHITICG QHLTSAWHLV LSFHDGLRAV ESRCERQLPE QQLCRLPEYV VRDPQGWVAG 840
NLSARGDGAA GFTLPGFRFL PPPHPPSANL VPLKPEEHAI KFEYIGLGAV ADCVGVNVTV 900
GGESCQHEFR GDTVVCPLPP SLQLGQDGAP LQVCVDGECH ILGRVVQPGP DGVPQSTLLG 960
ILLPLLLLVA ALATALVFSY WWQRKQLVLP PNLNDLASLD QTAGATPLPI LYSGSDYRSG 1020
LALPATDGLD STTCVHGASF SDSEDESCVP LLRNESIQLR DLDSALLAEV KDVLIPHEWV 1080
VTHSDRVIGK GWGPGGAGAE MESQHRLGWA WWLTPVIPAL WEAEAGRSRG QEIETVLAHS 1140
ETPSLLKIQK KPANPTVKDL ISFGLQVARG MEYLAEQKFV HRDLAARNCM LDESFTVKVA 1200
DFGLARDILD KEYYSVRQHR HARLPVKWMA LESLQTYRFT TKSDVWSFGV LLWELLTRGA 1260
PPYPHIDPFD LTHFLAQGRR LPQPEYCPDS LYQVMQQCWE VDPAVRPTFG VLVGEVEQIV 1320
SALLGDHYVQ LPATYMNLGP STSHEMNVRP EQPQSSPMPG SARRPRPLSE PPRPT 1375
Nucleotide Sequence
(FASTA)
ATGGAGCTCC TCCCGCCGCT GCCTCAGTCC TTCCTGTTGC TGCTGCTGTT GCCTGCCAAG 60
CCCGCGGCGG GCGAGGACTG GCAGTGCCCG CACACCCCCT ACGCGGCCTC TCGCGACTTT 120
GACGTGAAGT ACGTGGTGCC CAGCTTCTCC GCCGGAGGCC TGGTACAGGC CATGGTGACC 180
TACGAGGGCG ACAGAAATGA GAGTGCTGTG TTTGTAGCCA TACGCAATCG CCTGCACGTG 240
CTTGGGCCTG ACCTGAAGTC TGTCCAGAGC CTGGCCACGG GCCCTGCTGG GGACCCTGGC 300
TGCCAGACGT GTGCAGCCTG TGGCCCAGGC CCCCATGGCC CTCCCGGTGA CACAGACACA 360
AAGGTGCTGG TGCTGGAGCC CGCGCTGCCT GCGCTGATCA GTTGTGGCTC CAGCCTGCAG 420
GGCCGCTGCT TCCTGCATGA CCTAGAGCCC CAAGGGACAG CCGTGCATCT GGCGGGGCCA 480
GCCTGCCTCT TCTCAGCCCA CCGTAACCGG CCCGATGACT GCCCCGACTG TGTGGCCAGC 540
CCACTGGGCA CCCGTGTGAC TGTGGTTGAG CAAGGCCAGG CCTCCTATTT CTACGTGGCA 600
TCCTCACTGG ACGCAGCCGT GGCTGCCAGC TTCAGCCCAC GCTCAGTGTC TATCAGGCGT 660
CTCAAGGCCG ACGCCTCGGG ATTCGCACCG GGTTTTGTGG CATTGTCAGT GCTGCCCAAG 720
CATCTTGTCT CCTACAGTAT TGAATACGTG CACAGCTTCC ACACGGGAGC CTTTGTCTAT 780
TTCCTGACTG TACAGCCGGC CAGCGTGACA GATGCTCCTA GTGCCCTGCA CACACGCCTG 840
GCACGGCTTA GCGCCACTGA GCCAGAGTTG GGTGACTATC GGGAGCTGGT CCTCGACTGC 900
AGATTTGCTC CAAAACGCAG GCGCCGGGGG GCCCCAGAGG GCGGACAGCC CTACCCTGTG 960
CTGCGGGTGG CCCACTCCGC TCCAGTGGGT GCCCAACTTG CCACTGAGCT GAGCATCGCT 1020
GAGGGCCAGG AAGTGCTATT TGGGGTCTTT GTGGCTGGCA AGGATGGTGG TCCTGGCATG 1080
GGCCCCAACT CTGTCGTCTG TGCCTTCCCC ATTGACCTGC TGGACACACT AATTGATGAG 1140
GGTGTGGAGC GCTGTTGTGA ATCCCCAGTC CATCCAGGCC TCCGGCGAGG CCTGGACTTC 1200
TTCCAGTCGC CCAGTTTTTG CCCCAACCCG CCTGGCCTGG AGGCCCTCAG CCCCAACACC 1260
AGCTGCCGCC ACTTCCCTCT GCTGGTCAGT AGCAGCTTCT CACGTGTGGA CCTATTCAAT 1320
GGGCTGTTGG GACCAGTACA GGTCACTGCA CTGTATGTGA CACGCCTTGA CAACGTCACA 1380
GTGGCGCACA TGGGCACTAT GGATGGGCGT ATCCTGCAGG TGGAGCTGGC CAGGTCACTC 1440
AACTACTTGC TGTATGTGTC CAACTTCTCA CTGGGTGACA GCGGGCAGCC CGTGCAGCGG 1500
GATGTCAGTC GTCTTGGGGA CCACCTACTC TTCGCCTCTG GGGACCAGGT TTTCCAGGTA 1560
CCTATCCGAG GCCCTGGCTG CCACCACTTC CTCACCTGTG GGCGTTGCCT AAGGGCACAG 1620
CGTTTCATGG GCTGTGGCTG GTGTGGGAAC ATGTGTGGAC AGCAGAAGGA GTGTCCTGGC 1680
TCCTGGCAAC AGGACCACTG CCCACCTAAG CTTACTGAGT TCCACCCCCA CAGTGGACCT 1740
CTAAGGGGCA GTACAAGGCT GACCCTGTGT GGCTCCAACT TCTACCTGCA CCCTTCTGGT 1800
CTGGTGCCTG AGGGAACCCA TCAGGTCACT GTGGGCCAAA GTCCCTGCCG GCCACTGCCC 1860
AAGGACAGCT CAAAACTCAG ATCAGTGCCC CGAAAAGACT TTGTAGAGGA GTTTGAGTGT 1920
GAACTGGAGC CCTTGGGCAC CCAGGCAGTG GGGCCTACCA ACGTCAGCCT CACCGTGACT 1980
AACATGCCAC CGGGCAAGCA CTTCCGGGTA GACGGCACCT CCGTGCTGAG AGGCTTCTCT 2040
TTCATGGAGC CAGTGCTGAT AGCAGTGCAA CCCCTCTTTG GCCCACGGGC AGGAGGCACC 2100
TGTCTCACTC TTGAAGGCCA GAGTCTGTCT GTAGGCACCA GCCGGGCTGT GCTGGTCAAT 2160
GGGACTGAGT GTCTGCTAGC ACGGGTCAGT GAGGGGCAGC TTTTATGTGC CACACCCCCT 2220
GGGGCCACGG TGGCCAGTGT CCCCCTTAGC CTGCAGGTGG GGGGTGCCCA GGTACCTGGT 2280
TCCTGGACCT TCCACTACAG AGAAGACCCT GTCGTGCTAA GCATCAGCCC CAAATGTGGC 2340
TACATCAACT CCCACATCAC CATCTGTGGC CAGCATCTAA CTTCAGCATG GCACTTAGTG 2400
CTGTCATTCC ATGATGGGCT TAGGGCAGTG GAGAGCAGGT GTGAGAGGCA GCTTCCAGAG 2460
CAGCAGCTGT GCCGCCTTCC TGAATATGTG GTCCGAGACC CCCAGGGATG GGTGGCAGGG 2520
AATCTGAGTG CCCGGGGGGA TGGAGCTGCT GGCTTTACAC TGCCTGGCTT TCGCTTCCTA 2580
CCCCCACCCC ATCCACCCAG TGCCAACCTA GTTCCACTGA AGCCTGAGGA GCATGCCATT 2640
AAGTTTGAGT ATATTGGGCT GGGCGCTGTG GCTGACTGCG TGGGTGTCAA CGTGACCGTG 2700
GGTGGTGAGA GCTGCCAGCA CGAGTTCCGG GGGGACACGG TTGTCTGCCC CCTGCCCCCA 2760
TCCCTGCAGC TTGGCCAAGA TGGTGCCCCA TTGCAGGTCT GCGTGGATGG TGAATGTCAT 2820
ATCCTGGGTA GAGTAGTGCA GCCAGGGCCA GATGGGGTCC CACAGAGCAC GCTCCTTGGT 2880
ATCCTGCTGC CTTTGCTGCT ACTAGTGGCC GCACTGGCCA CTGCACTGGT CTTCAGCTAC 2940
TGGTGGCAGA GGAAGCAGCT AGTTCTTCCT CCCAACCTGA ATGACCTGGC ATCCCTGGAC 3000
CAGACTGCTG GAGCCACACC CCTGCCTATT CTGTACTCAG GCTCTGACTA CAGAAGTGGC 3060
CTTGCACTCC CTGCCACTGA TGGTCTGGAT TCCACCACTT GTGTCCATGG AGCATCCTTC 3120
TCCGATAGTG AAGATGAATC CTGTGTCCCA CTGCTGCGGA ACGAGTCCAT CCAGCTAAGG 3180
GACCTGGACT CTGCGCTCTT GGCCGAGGTC AAGGATGTGC TGATTCCCCA TGAGTGGGTG 3240
GTCACCCACA GTGACCGAGT CATTGGCAAA GGTTGGGGGC CAGGTGGGGC TGGGGCAGAG 3300
ATGGAGTCTC AACATAGGCT AGGCTGGGCG TGGTGGCTCA CGCCTGTAAT CCCAGCACTT 3360
TGGGAGGCCG AGGCGGGCAG ATCACGAGGT CAGGAGATCG AGACCGTCCT GGCTCACAGT 3420
GAAACCCCGT CTTTACTAAA AATACAAAAA AAACCAGCCA ACCCCACCGT GAAGGACCTC 3480
ATCAGCTTTG GCCTGCAGGT AGCCCGCGGC ATGGAGTACC TGGCAGAGCA GAAGTTTGTG 3540
CACAGGGACC TGGCTGCACG GAACTGCATG CTGGATGAGT CATTCACAGT CAAGGTGGCT 3600
GACTTTGGTT TGGCCCGCGA CATCCTGGAC AAGGAGTACT ATAGTGTTCG ACAGCATCGC 3660
CACGCTCGCC TGCCTGTCAA GTGGATGGCG CTGGAGAGCC TGCAGACCTA TAGATTTACC 3720
ACCAAGTCTG ATGTGTGGTC ATTTGGTGTG CTGCTGTGGG AACTGCTGAC ACGGGGTGCC 3780
CCACCATACC CCCACATCGA CCCTTTTGAC CTCACCCACT TCCTGGCCCA GGGTCGGCGC 3840
CTGCCCCAGC CTGAGTATTG CCCTGATTCT CTGTACCAAG TGATGCAGCA ATGCTGGGAG 3900
GTGGACCCAG CAGTGAGACC CACCTTCGGA GTACTAGTGG GGGAAGTGGA GCAGATAGTG 3960
TCTGCACTGC TTGGGGACCA TTATGTGCAG CTGCCAGCAA CCTACATGAA CTTGGGCCCC 4020
AGCACCTCGC ATGAGATGAA TGTGCGTCCA GAACAGCCTC AGTCCTCACC CATGCCCGGT 4080
AGTGCACGTC GGCCCCGGCC ACTCTCAGAG CCTCCTCGGC CCACTTGA 4128
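The nucleotide length of 4128 equals 3 × (1375 + 1), which is what a coding sequence for a 1375-residue protein plus one stop codon would give. The Python sketch below checks that arithmetic and translates the final line of the nucleotide block with the standard genetic code. The helper name `translate` is hypothetical, and using the standard code for this record is an assumption.

```python
from itertools import product

# Standard genetic code in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA string; spaces are ignored, '*' marks a stop."""
    cds = cds.replace(" ", "").upper()
    usable = len(cds) - len(cds) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


# Last line of the nucleotide block. It starts at position 4081, which is
# in frame because 4080 is divisible by 3.
LAST_LINE = "AGTGCACGTC GGCCCCGGCC ACTCTCAGAG CCTCCTCGGC CCACTTGA"

tail = translate(LAST_LINE)
length_ok = 4128 == 3 * (1375 + 1)
print(tail, length_ok)
```

The translated tail can be compared with the last residues of the protein sequence in this record as a quick consistency check between the two blocks.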
Domain Profile
S: 92    irneernltvkdllefalqvakgmeylaskkfvhrdlaarnclldekltvkvadfGlard  151
         i+++  n+tvkdl++f+lqva+gmeyla++kfvhrdlaarnc+lde++tvkvadfGlard
Q: 1148  IQKKPANPTVKDLISFGLQVARGMEYLAEQKFVHRDLAARNCMLDESFTVKVADFGLARD  1207
         6777889*****************************************************
S: 152   vldkelyvveeerdarlpvkwlaleslqtfkqfttksdvwsfGvllwelltrgatpysev  211
         +ldke+y+v+++r+arlpvkw+aleslqt++ fttksdvwsfGvllwelltrga+py+++
Q: 1208  ILDKEYYSVRQHRHARLPVKWMALESLQTYR-FTTKSDVWSFGVLLWELLTRGAPPYPHI  1266
         *******************************.****************************
S: 212   asfdltlylkegrrlrkpeycpdklydvmlkcwkakpaerpqfsdlvsii  261
         ++fdlt++l++grrl++peycpd+ly+vm++cw+ +pa rp+f  lv  +
Q: 1267  DPFDLTHFLAQGRRLPQPEYCPDSLYQVMQQCWEVDPAVRPTFGVLVGEV  1316
         *******************************************9998765
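In the profile above, the S rows appear to be the domain model consensus and the Q rows the query protein, with the line between them marking identical and similar positions. That reading is an assumption based on the layout. Under it, the number of identical columns in one block can be counted with a short Python sketch. The function name `count_identities` is hypothetical.

```python
from typing import Tuple


def count_identities(subject: str, query: str) -> Tuple[int, int]:
    """Return (identical columns, aligned columns), skipping gap columns."""
    if len(subject) != len(query):
        raise ValueError("aligned strings must have equal length")
    identical = aligned = 0
    for s, q in zip(subject, query):
        if s in "-." or q in "-.":
            continue  # a gap in either row is not an aligned pair
        aligned += 1
        if s.upper() == q.upper():
            identical += 1
    return identical, aligned


# Third alignment block of this record (S: 212-261, Q: 1267-1316).
S_BLOCK = "asfdltlylkegrrlrkpeycpdklydvmlkcwkakpaerpqfsdlvsii"
Q_BLOCK = "DPFDLTHFLAQGRRLPQPEYCPDSLYQVMQQCWEVDPAVRPTFGVLVGEV"

identical, aligned = count_identities(S_BLOCK, Q_BLOCK)
print(f"{identical}/{aligned} identical columns")
```

The comparison is case-insensitive because the consensus row mixes lower and upper case.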
Domain Sequence
(FASTA)
IQKKPANPTV KDLISFGLQV ARGMEYLAEQ KFVHRDLAAR NCMLDESFTV KVADFGLARD 60
ILDKEYYSVR QHRHARLPVK WMALESLQTY RFTTKSDVWS FGVLLWELLT RGAPPYPHID 120
PFDLTHFLAQ GRRLPQPEYC PDSLYQVMQQ CWEVDPAVRP TFGVLVGEV 169
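The sequence blocks in this record are printed in groups of ten residues with a running position count at the end of each line, so they need to be joined before use. A minimal Python sketch of that clean-up, shown on the domain sequence above; the function name `join_numbered_blocks` is hypothetical.

```python
from typing import Iterable


def join_numbered_blocks(lines: Iterable[str]) -> str:
    """Join space-separated residue blocks, dropping the trailing position count."""
    parts = []
    for line in lines:
        fields = line.split()
        if fields and fields[-1].isdigit():
            fields = fields[:-1]  # remove the running counter at the end of the line
        parts.append("".join(fields))
    return "".join(parts)


# Domain sequence lines copied from this record.
DOMAIN_LINES = [
    "IQKKPANPTV KDLISFGLQV ARGMEYLAEQ KFVHRDLAAR NCMLDESFTV KVADFGLARD 60",
    "ILDKEYYSVR QHRHARLPVK WMALESLQTY RFTTKSDVWS FGVLLWELLT RGAPPYPHID 120",
    "PFDLTHFLAQ GRRLPQPEYC PDSLYQVMQQ CWEVDPAVRP TFGVLVGEV 169",
]

domain = join_numbered_blocks(DOMAIN_LINES)
print(len(domain))
```

The same function should work on the protein and nucleotide blocks, since they use the same layout.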
Keyword: Complete proteome; Disulfide bond; Membrane; Reference proteome; Transmembrane; Transmembrane helix.
Sequence Source: Ensembl
Orthology
Ortholog group
Ailuropoda melanoleuca: EKS-AIM-00116
Bos taurus: EKS-BOT-00124
Callithrix jacchus: EKS-CAJ-00126
Canis familiaris: EKS-CAF-00127
Cavia porcellus: EKS-CAP-00121
Danio rerio: EKS-DAR-00380
Echinops telfairi: EKS-ECT-00004
Equus caballus: EKS-EQC-00120
Felis catus: EKS-FEC-00115
Gorilla gorilla: EKS-GOG-00123
Homo sapiens: EKS-HOS-00126
Ictidomys tridecemlineatus: EKS-ICT-00116
Loxodonta africana: EKS-LOA-00119
Macaca mulatta: EKS-MAM-00120
Monodelphis domestica: EKS-MOD-00121
Mus musculus: EKS-MUM-00125
Mustela putorius furo: EKS-MUP-00125
Myotis lucifugus: EKS-MYL-00134
Oreochromis niloticus: EKS-ORN-00151
Ornithorhynchus anatinus: EKS-ORA-00102
Oryctolagus cuniculus: EKS-ORC-00116
Oryzias latipes: EKS-ORL-00140
Pan troglodytes: EKS-PAT-00112
Pteropus vampyrus: EKS-PTV-00106
Rattus norvegicus: EKS-RAN-00121
Sarcophilus harrisii: EKS-SAH-00116
Sorex araneus: EKS-SOA-00004
Sus scrofa: EKS-SUS-00110
Takifugu rubripes: EKS-TAR-00155
Tetraodon nigroviridis: EKS-TEN-00144
Xenopus tropicalis: EKS-XET-00103
Xiphophorus maculatus: EKS-XIM-00148
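The identifiers in the ortholog group all follow the shape `EKS-XXX-NNNNN`, where the three-letter part looks like a species code and the digits a serial number. This pattern is inferred from the IDs on this page only and is not a documented specification. A Python sketch that splits an ID under that assumption, with the hypothetical names `EkpdId` and `parse_ekpd_id`:

```python
import re
from typing import NamedTuple


class EkpdId(NamedTuple):
    species_code: str  # e.g. "NOL"; interpreting it as a species code is an assumption
    serial: int


# Pattern inferred from the identifiers in this record, not from a published spec.
_ID_PATTERN = re.compile(r"^EKS-([A-Z]{3})-(\d{5})$")


def parse_ekpd_id(identifier: str) -> EkpdId:
    """Split an identifier such as 'EKS-NOL-00116' into its two variable parts."""
    match = _ID_PATTERN.match(identifier.strip())
    if match is None:
        raise ValueError(f"unexpected identifier format: {identifier!r}")
    return EkpdId(match.group(1), int(match.group(2)))


print(parse_ekpd_id("EKS-HOS-00126"))
```

Raising on any unexpected shape keeps malformed or truncated IDs from passing silently into downstream lookups.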
Gene Ontology
GO:0016021; C:integral to membrane
GO:0005524; F:ATP binding
GO:0004713; F:protein tyrosine kinase activity
GO:0004872; F:receptor activity
GO:0007275; P:multicellular organismal development
GO:0043406; P:positive regulation of MAP kinase activity
GO:0051897; P:positive regulation of protein kinase B signaling cascade
GO:0009615; P:response to virus
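Each Gene Ontology line above carries an accession, a one-letter aspect code, and a term name. The Python sketch below splits a line into those three parts. Reading C, F, and P as cellular component, molecular function, and biological process follows the usual UniProt-style convention and is an assumption for this record; the function name `parse_go_line` is hypothetical.

```python
from typing import Tuple

# Aspect codes as used in UniProt-style GO cross-references (assumed to apply here).
ASPECTS = {
    "C": "cellular component",
    "F": "molecular function",
    "P": "biological process",
}


def parse_go_line(line: str) -> Tuple[str, str, str]:
    """Split 'GO:0005524; F:ATP binding' into (accession, aspect, term)."""
    accession, _, rest = line.partition(";")
    aspect_code, _, term = rest.strip().partition(":")
    return accession.strip(), ASPECTS[aspect_code], term.strip()


print(parse_go_line("GO:0004713; F:protein tyrosine kinase activity"))
```

Only the first colon after the semicolon is used as a separator, so term names that themselves contain colons would be kept intact.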
KEGG
InterPros
IPR013783; Ig-like_fold.
IPR014756; Ig_E-set.
IPR002909; IPT_TIG_rcpt.
IPR011009; Kinase-like_dom.
IPR003659; Plexin-like.
IPR016201; Plexin-like_fold.
IPR002165; Plexin_repeat.
IPR000719; Prot_kinase_cat_dom.
IPR001627; Semaphorin/CD100_Ag.
IPR001245; Ser-Thr/Tyr_kinase_cat_dom.
IPR008266; Tyr_kinase_AS.
IPR015943; WD40/YVTN_repeat-like_dom.
Pfam
PF07714; Pkinase_Tyr; 1.
PF01437; PSI; 1.
PF01403; Sema; 1.
PF01833; TIG; 2.
SMARTs
SM00429; IPT; 3.
SM00423; PSI; 1.
SM00630; Sema; 1.
Prosites
PS50011; PROTEIN_KINASE_DOM; 1.
PS00109; PROTEIN_KINASE_TYR; 1.
PS51004; SEMA; 1.
Prints
PR00109; TYRKINASE.
Created Date: 20-Feb-2013