EKS-MAM-00226
Eukaryotic Protein Kinase & Protein Phosphatase Database
Tag: Content
EKPD ID: EKS-MAM-00226
Classification
Group/Family    Score    E-Value    Start    End    Domain Length
Other/Haspin    314.4    1.0E-93    487      757    271
Status: Unreviewed
Ensembl Protein: ENSMMUP00000035415
UniProt Accession: F7DR84;
Protein Name
Protein Synonyms/Alias
Gene Name: GSG2
Gene Synonyms/Alias: GSG2;
Ensembl Information
Ensembl Gene ID       Ensembl Protein ID    Ensembl Transcript ID
ENSMMUG00000005853    ENSMMUP00000035415    ENSMMUT00000008201
Organism: Macaca mulatta
Functional Description
Protein Length: 800
Protein Sequence
(FASTA)
MAASPPGFGS RLFRTYGAAD GRRQRRPGRE AAQWFPPQDR RRFFNSSGSS DASIGDPSQS 60
DDPDDPDDPD DPDFPGSPVR RRRRRPGGRV PKDRPSLTVT PKRRKLRARP SLTVTPRRLG 120
LRTRPPQKCS TPCGPLRLPP FPSRDPGRLS PDLSACGQPR DGDEVGTSAS LFSSLASPGP 180
GSPTPRDRAI SIGTSASLVA ASAVPSGLYL PEVSLDQASL PCSQEEATGG VRLTRMVHQT 240
RASFRSAIFG LVNSGTPEDS EFGADGKNMR ESCSKRKLVG SGPECPGLSN TGKRRATGQV 300
SCQERALQEA VPREHQEASV SKGRIVPGGI DRLERTRPSR KSKHQEARET SLLHSHHFKK 360
GQKMRKDSFP TQDLTPLQNA SFWTKTRASF SFHKKKIVTD VSEVCSIYTT ATSLSGSLLS 420
ECSNRHVMNK TSGALSSWHS SSMYLLSPLN TLSISNKKAS DAEKVYGECN QKGPVPFSHC 480
LPTEKLQCCE KIGEGVFGEV FRTIADHAPV AIKIIAIEGP DLVNGSHQKT FEEILPEIII 540
SKELSLLSGE VCNRTEGFIG LNSVHCVQGS YPPLLLKAWD HYHSTKGSAN DRPDFFKDDQ 600
LFIVLEFEFG GTDLEQMKTK LSSLATAKSI LHQLTASLAV AEASLRFEHR DLHWGNVLLK 660
KTSLKELHYT LNGKSSTIPT RGLQVSIIDY TLSRLERDGI VVFCDVSMDE DLFTGDGDYQ 720
FDIYRLMKKE NNNCWGEYHP YSNVLWLHYL TDKILKQMTF KSKCNTPAMK QIRKQIREFH 780
RTMLNFSSAA DLLCQHSLFK 800
Nucleotide Sequence
(FASTA)
ATGGCGGCTT CGCCCCCGGG ATTTGGGAGC CGGCTTTTCC GCACTTATGG GGCTGCCGAC 60
GGCAGGAGAC AGCGGCGGCC GGGCCGGGAA GCCGCGCAGT GGTTCCCGCC GCAGGACCGG 120
AGGCGTTTCT TCAACAGCAG CGGCAGCAGC GACGCCAGCA TCGGCGACCC CTCGCAGTCC 180
GACGATCCTG ACGATCCCGA CGATCCGGAC GACCCCGACT TCCCCGGCAG CCCGGTGAGG 240
CGGCGGCGGA GGCGTCCCGG CGGCCGAGTC CCCAAGGACC GGCCCAGCCT GACCGTGACC 300
CCAAAGCGCC GGAAGCTGCG AGCTCGCCCG AGCCTCACCG TGACCCCAAG ACGCCTGGGG 360
CTGCGAACTC GGCCCCCGCA GAAGTGCAGC ACCCCCTGCG GCCCGCTCCG ACTCCCGCCC 420
TTCCCCAGCC GCGACCCCGG CCGCCTCAGC CCGGACCTCA GCGCGTGCGG CCAGCCCAGG 480
GACGGCGACG AGGTGGGCAC CAGTGCTTCC CTGTTCAGCT CTCTGGCCTC GCCCGGCCCC 540
GGGTCCCCAA CGCCAAGGGA CAGGGCCATC TCGATCGGCA CCTCCGCCTC TCTGGTTGCA 600
GCGTCAGCCG TCCCGAGCGG CCTCTACCTC CCGGAAGTCT CCCTGGACCA AGCGTCTCTC 660
CCCTGCTCCC AGGAGGAAGC GACAGGAGGA GTCAGGCTCA CCAGGATGGT CCACCAAACC 720
CGTGCCAGCT TCAGGTCAGC AATCTTTGGC CTTGTGAACT CAGGAACCCC TGAGGATTCC 780
GAGTTTGGGG CAGATGGGAA GAATATGAGG GAATCCTGCA GTAAAAGGAA ACTGGTGGGA 840
AGTGGACCGG AGTGTCCAGG TCTGTCAAAC ACAGGCAAGA GGAGGGCCAC GGGCCAAGTC 900
TCTTGTCAAG AGAGAGCGCT TCAAGAGGCC GTCCCGAGAG AGCATCAGGA AGCCAGCGTT 960
TCCAAGGGCC GCATTGTGCC AGGGGGAATC GACAGGCTGG AGAGAACTAG ACCAAGCCGG 1020
AAGAGCAAAC ATCAGGAGGC AAGGGAAACC TCTCTCCTCC ATTCCCACCA CTTTAAAAAG 1080
GGCCAAAAGA TGAGAAAAGA TTCGTTCCCC ACCCAGGACC TGACTCCTTT ACAGAATGCC 1140
AGCTTTTGGA CCAAAACCAG GGCTTCCTTC AGTTTCCACA AGAAGAAAAT TGTGACTGAT 1200
GTGTCAGAGG TCTGCAGCAT CTATACCACT GCCACTTCTC TCTCTGGATC CCTCCTATCA 1260
GAATGTTCAA ACCGGCATGT CATGAACAAA ACAAGTGGTG CTCTGTCCTC TTGGCACTCC 1320
TCCTCAATGT ATTTGCTAAG CCCCTTAAAC ACTCTAAGTA TTTCAAACAA AAAGGCATCT 1380
GATGCTGAAA AGGTTTATGG GGAATGCAAT CAGAAGGGTC CTGTCCCCTT TAGCCATTGC 1440
CTTCCCACAG AAAAACTGCA ATGCTGTGAG AAGATTGGGG AAGGGGTGTT TGGCGAAGTG 1500
TTTCGAACAA TTGCTGATCA CGCACCTGTA GCCATAAAAA TCATTGCTAT TGAAGGACCC 1560
GATTTAGTCA ATGGATCCCA TCAGAAAACC TTTGAGGAAA TCCTGCCAGA GATCATCATC 1620
TCCAAAGAGT TGAGCCTCTT ATCCGGTGAA GTGTGCAACC GCACTGAAGG CTTTATCGGG 1680
CTGAACTCAG TGCACTGTGT CCAGGGATCT TACCCTCCCT TGCTCCTCAA AGCCTGGGAT 1740
CACTATCATT CAACCAAAGG CTCCGCAAAT GACCGGCCTG ATTTTTTTAA AGACGACCAG 1800
CTCTTCATTG TGCTGGAATT TGAGTTTGGA GGGACTGACC TAGAGCAAAT GAAAACCAAG 1860
CTGTCTTCCT TGGCTACTGC AAAGAGCATT CTACACCAGC TCACAGCCTC CCTCGCAGTG 1920
GCGGAGGCAT CACTGCGGTT TGAGCACCGA GACTTACACT GGGGGAATGT GCTCTTAAAG 1980
AAAACCAGCC TCAAAGAACT CCACTACACC CTCAACGGGA AGAGCAGCAC CATCCCCACC 2040
CGTGGGCTGC AAGTCAGCAT CATTGACTAC ACCTTGTCCC GCTTGGAACG GGACGGGATT 2100
GTGGTTTTCT GCGACGTTTC CATGGACGAG GACCTGTTTA CCGGTGATGG CGACTACCAG 2160
TTTGACATCT ACAGGCTCAT GAAGAAGGAG AATAACAACT GCTGGGGAGA ATATCACCCT 2220
TATAGTAATG TGCTCTGGCT ACATTACCTG ACAGACAAGA TTCTGAAACA AATGACCTTC 2280
AAGAGTAAAT GTAACACTCC TGCCATGAAG CAAATCAGGA AGCAGATCCG GGAGTTCCAC 2340
AGGACAATGC TGAACTTCAG CTCTGCCGCT GACTTGCTCT GCCAGCACAG TCTATTTAAG 2400
TAA 2403
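As a consistency check on the two sequences above: a 2403-nt coding sequence corresponds to 801 codons, that is, the 800 residues given under Protein Length plus the terminal TAA stop codon. The sketch below is illustrative only and is not part of the EKPD record. It assumes the standard genetic code, and the helper names `clean_block` and `translate` are hypothetical. It translates the first nucleotide row and yields the first 20 residues of the protein sequence.

```python
from itertools import product

# Standard genetic code (assumed here), laid out in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean_block(line: str) -> str:
    """Remove the 10-character grouping and the trailing position counter."""
    return "".join(tok for tok in line.split() if not tok.isdigit())


def translate(nt: str) -> str:
    """Translate complete codons in frame 1; '*' marks a stop codon."""
    usable = len(nt) - len(nt) % 3
    return "".join(CODON_TABLE[nt[i:i + 3]] for i in range(0, usable, 3))


# First row of the nucleotide sequence, copied from this record.
first_row = "ATGGCGGCTT CGCCCCCGGG ATTTGGGAGC CGGCTTTTCC GCACTTATGG GGCTGCCGAC 60"
peptide = translate(clean_block(first_row))

# Lengths reported in this record.
cds_length, protein_length = 2403, 800
lengths_agree = cds_length == 3 * (protein_length + 1)

print(peptide)        # compare with the first 20 residues of the protein
print(lengths_agree)
```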
Domain Profile
S: 2     ekvkklGeGayGevfsvtwkgkevvlKiiplegsdv........keealkEliilkklsk  53
         + ++k+GeG++Gevf+++ ++ +v++Kii++eg+d+        +ee+l+E+ii+k+ls 
Q: 487   QCCEKIGEGVFGEVFRTIADHAPVAIKIIAIEGPDLvngshqktFEEILPEIIISKELSL  546
         679*********************************************************
S: 54    lske..nstpnFlellgakvvkgdvpkellkawdkydkekes..........dqlylvll  101
         ls e  n+t++F+ l+++++v+g++p++llkawd+y+++k+s          dql++vl+
Q: 547   LSGEvcNRTEGFIGLNSVHCVQGSYPPLLLKAWDHYHSTKGSandrpdffkdDQLFIVLE  606
         ************************************************************
S: 102   leeggtsldni..klksesqllsilqqlvlslaiaekelkfeHrDlhlgNvLikktkkke  159
         +e+ggt+l+++  kl+s ++++sil+ql++sla+ae++l+feHrDlh+gNvL+kkt+ ke
Q: 607   FEFGGTDLEQMktKLSSLATAKSILHQLTASLAVAEASLRFEHRDLHWGNVLLKKTSLKE  666
         ***********999**********************************************
S: 160   leYtldgkkiklkskgvlvtiIDftksrlvkkdkevvyedlevdeelfegkgdkqfdvyr  219
         l+Ytl+gk+++++++g++v+iID+t+sr  ++d+ vv++d+++de+lf+g+gd+qfd+yr
Q: 667   LHYTLNGKSSTIPTRGLQVSIIDYTLSR-LERDGIVVFCDVSMDEDLFTGDGDYQFDIYR  725
         ****************************.9******************************
S: 220   lmranvknewkefepktnllwltylsskllkk  251
         lm+++++n+w e++p++n+lwl+yl++k+lk+
Q: 726   LMKKENNNCWGEYHPYSNVLWLHYLTDKILKQ  757
         ******************************85
Domain Sequence
(FASTA)
QCCEKIGEGV FGEVFRTIAD HAPVAIKIIA IEGPDLVNGS HQKTFEEILP EIIISKELSL 60
LSGEVCNRTE GFIGLNSVHC VQGSYPPLLL KAWDHYHSTK GSANDRPDFF KDDQLFIVLE 120
FEFGGTDLEQ MKTKLSSLAT AKSILHQLTA SLAVAEASLR FEHRDLHWGN VLLKKTSLKE 180
LHYTLNGKSS TIPTRGLQVS IIDYTLSRLE RDGIVVFCDV SMDEDLFTGD GDYQFDIYRL 240
MKKENNNCWG EYHPYSNVLW LHYLTDKILK Q 271
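The domain coordinates in this record (Start 487, End 757) are 1-based and inclusive, so the domain length is End - Start + 1 = 271, which agrees with the domain sequence above. The following is a minimal sketch of that arithmetic, not code from EKPD. The helper names `clean_block` and `slice_1based` are hypothetical, and only the protein row covering residues 481-540 is used, so the check is limited to the first ten domain residues.

```python
def clean_block(line: str) -> str:
    """Remove the 10-residue grouping and the trailing position counter."""
    return "".join(tok for tok in line.split() if not tok.isdigit())


def slice_1based(seq: str, start: int, end: int, offset: int = 1) -> str:
    """Return residues start..end (1-based, inclusive).

    `offset` is the sequence position of seq[0]; it lets a partial
    row be sliced with full-length protein coordinates.
    """
    return seq[start - offset:end - offset + 1]


# Protein row covering residues 481-540, copied from this record.
row_481_540 = "LPTEKLQCCE KIGEGVFGEV FRTIADHAPV AIKIIAIEGP DLVNGSHQKT FEEILPEIII 540"
residues = clean_block(row_481_540)

domain_start, domain_end = 487, 757
domain_length = domain_end - domain_start + 1

# First ten residues of the domain, taken from the partial row.
first_ten = slice_1based(residues, 487, 496, offset=481)

print(domain_length)
print(first_ten)  # compare with the start of the domain sequence
```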
Keyword: ATP-binding; Complete proteome; Nucleotide-binding; Reference proteome.
Sequence Source: Ensembl
Orthology
Ortholog group
Bos taurus: EKS-BOT-00224
Caenorhabditis elegans: EKS-CAE-00189
Callithrix jacchus: EKS-CAJ-00231
Canis familiaris: EKS-CAF-00231
Cavia porcellus: EKS-CAP-00250
Ciona intestinalis: EKS-CII-00132
Danio rerio: EKS-DAR-00536
Drosophila melanogaster: EKS-DRM-00114
Felis catus: EKS-FEC-00210
Gadus morhua: EKS-GAM-00128
Gasterosteus aculeatus: EKS-GAA-00276
Gorilla gorilla: EKS-GOG-00221
Homo sapiens: EKS-HOS-00231
Loxodonta africana: EKS-LOA-00221
Microcebus murinus: EKS-MIM-00002
Mus musculus: EKS-MUM-00229
Mustela putorius furo: EKS-MUP-00223
Myotis lucifugus: EKS-MYL-00225
Nomascus leucogenys: EKS-NOL-00208
Ornithorhynchus anatinus: EKS-ORA-00196
Oryctolagus cuniculus: EKS-ORC-00201
Otolemur garnettii: EKS-OTG-00230
Pan troglodytes: EKS-PAT-00213
Petromyzon marinus: EKS-PEM-00115
Pongo abelii: EKS-POA-00217
Procavia capensis: EKS-PRC-00005
Tarsius syrichta: EKS-TAS-00003
Tupaia belangeri: EKS-TUB-00007
Xiphophorus maculatus: EKS-XIM-00283
Saccharomyces cerevisiae: EKS-SAC-00010
Schizosaccharomyces pombe: EKS-SCP-00006
Gene Ontology
GO:0005813; C:centrosome
GO:0031965; C:nuclear membrane
GO:0005819; C:spindle
GO:0005524; F:ATP binding
GO:0003677; F:DNA binding
GO:0072354; F:histone kinase activity (H3-T3 specific)
GO:0007243; P:intracellular protein kinase cascade
GO:0007064; P:mitotic sister chromatid cohesion
GO:0071459; P:protein localization to chromosome, centromeric region
GO:0090231; P:regulation of spindle checkpoint
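Each Gene Ontology line above pairs a GO identifier with a one-letter aspect code and a term name. Assuming the usual UniProt-style convention for those codes (C = cellular component, F = molecular function, P = biological process), the lines could be split into structured fields roughly as follows. The function name `parse_go_line` is hypothetical and this is only a sketch, not an EKPD tool.

```python
# Aspect codes, assuming the UniProt-style one-letter convention.
ASPECTS = {
    "C": "cellular_component",
    "F": "molecular_function",
    "P": "biological_process",
}


def parse_go_line(line: str) -> tuple[str, str, str]:
    """Split 'GO:nnnnnnn; X:term name' into (go_id, aspect, term)."""
    go_id, rest = line.strip().split("; ", 1)
    code, _, term = rest.partition(":")
    return go_id, ASPECTS[code], term


# Two lines copied from this record.
kinase = parse_go_line("GO:0072354; F:histone kinase activity (H3-T3 specific)")
localization = parse_go_line(
    "GO:0071459; P:protein localization to chromosome, centromeric region"
)

print(kinase)
print(localization)
```

Splitting on the first "; " and the first ":" only keeps commas and parentheses inside term names intact.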
KEGG
mcc:707337;
InterPros
IPR024604; DUF3635.
IPR011009; Kinase-like_dom.
IPR000719; Prot_kinase_cat_dom.
IPR017441; Protein_kinase_ATP_BS.
Pfam
PF12330; DUF3635; 1.
PF00069; Pkinase; 1.
SMARTs
Prosites
PS00107; PROTEIN_KINASE_ATP; 1.
PS50011; PROTEIN_KINASE_DOM; 1.
Prints
Created Date: 20-Feb-2013