EPS-SEM-00217
Eukaryotic Protein Kinase & Protein Phosphatase Database
TagContent
EKPD IDEPS-SEM-00217
Classification
Group/FamilyScoreE-ValueStartEndDomain Length
Asp-Based PTP/FCP33.76.0E-831138474
StatusUnreviewed
Ensembl ProteinEFJ28332
UniProt AccessionD8RHS1;
Protein Name
Protein Synonyms/Alias Putative uncharacterized protein;
Gene NameSELMODRAFT_441087
Gene Synonyms/Alias SELMODRAFT_441087;
Ensembl Information
Ensembl Gene IDEnsembl Protein IDEnsembl Transcript ID
SELMODRAFT_441087EFJ28332EFJ28332
OrganismSelaginella moellendorffii
Functional Description
Protein Length812
Protein Sequence
(FASTA)
MNKFNAAQRN LQRDRRHKKA QEKRLRLGLQ SLKLSRQSAS RTPSGKRQRK LEKKLRRAQK 60
EVLESGLVSV QDIEMICADS APDAPAASSK GFSLKLKKQR KLKLKKNKPG REKSATKEIM 120
DILPQYIGTS SSIPGDETPK ENEDKSTASP SGIFSLIKLS GSGIVFLSTP DDTDVVSVLS 180
RVLDDIRAPH TKPFSWCNRL TPVQATCASE EEAIFVTVLG LLRDDAGGIV SKTGRSNLKY 240
AVGFNSRAAE DGNTGVERDG CIAAVARAMQ EFEQQASVNL SNPEIVVAVE LIPLVGKTTP 300
LAGISLLPKE VVVKDLTRLK RNTKRMVWID DQPVYPQQPE LGIQVPMYMP QKSVDKIEWK 360
ECENDRTLVE LIPFLTALAK LDDFSMGIAA YKQGKFDGEK FNLDESDSSM ILGDDDAMED 420
VFEDDDKAVE EKEERLKAEE ERVKAQQERE KLEAEDKLLT PDSSGPLWRY IVNFIPEDKL 480
SRSKKLAAHE YISKKFKSDD YKCLIALLAR KKHLIDDLNC LGNLSVNSRA GYYDALVQVF 540
HGLEGPWDLE FILPVLRKVT CNYFYYGKGM QVTCFNSENL RDLCETTIPM LDTPQEIRFD 600
EIIPMVASLN FLRDENLLGI TVPDWFQPSS GSCSSWMWWP HFVTIDFERS LVKGELAQKL 660
ATTDTKGLLG KLSSTTCRIV PGKERDLCYK QDEGLQAGMD FFATPFEESK DKFLVCDKNL 720
VLFRLQFVDG QVVNCGLWLM TDFNSEQNAR EMAAKLCVDL GLKVNLELDY KHNVEKMKTS 780
MEEITLLSSW FFGKEHDAIV NLQAPVTVRF CQ 812
Nucleotide Sequence
(FASTA)
ATGAACAAGT TCAATGCCGC GCAGCGCAAT CTCCAGCGCG ATAGGCGCCA CAAGAAGGCT 60
CAGGAGAAGC GATTGAGGCT GGGCCTGCAG TCGCTCAAGC TGTCCAGGCA ATCGGCGTCG 120
CGCACCCCAT CGGGCAAGCG GCAGCGCAAG TTAGAGAAGA AGCTGCGCCG GGCGCAGAAG 180
GAAGTTTTGG AGAGCGGCCT CGTGTCGGTG CAAGACATTG AGATGATCTG TGCAGATTCG 240
GCTCCTGACG CGCCAGCGGC CTCCTCCAAG GGATTTTCTC TCAAGCTGAA GAAGCAAAGG 300
AAGCTCAAAT TGAAGAAGAA CAAGCCTGGA CGCGAGAAGA GTGCTACCAA GGAGATAATG 360
GACATTCTTC CTCAGTACAT TGGAACTTCC AGCAGCATTC CTGGAGACGA GACCCCGAAG 420
GAAAACGAGG ACAAGAGCAC TGCTTCTCCG AGCGGCATTT TCTCGCTTAT CAAGCTCTCC 480
GGTAGCGGCA TTGTCTTCCT TTCGACTCCA GATGATACAG ATGTTGTGAG CGTTCTATCA 540
CGAGTTCTAG ACGACATAAG AGCTCCACAC ACGAAGCCTT TCTCGTGGTG TAACAGGCTC 600
ACTCCAGTGC AGGCGACGTG CGCATCAGAG GAGGAAGCCA TCTTCGTCAC AGTTCTGGGG 660
CTCTTGAGAG ACGACGCCGG AGGAATCGTC TCAAAGACCG GAAGAAGCAA TCTCAAGTAC 720
GCGGTGGGAT TCAACAGTAG AGCTGCCGAA GATGGCAACA CTGGCGTGGA GCGAGACGGG 780
TGCATCGCAG CAGTGGCAAG AGCCATGCAA GAATTCGAGC AGCAGGCCTC TGTCAACTTG 840
TCGAATCCAG AGATCGTGGT GGCCGTGGAG CTGATTCCTC TGGTTGGAAA GACTACTCCT 900
CTGGCTGGAA TTTCTTTGCT GCCCAAGGAA GTTGTCGTAA AGGACTTGAC CCGCTTGAAA 960
AGAAACACCA AGAGGATGGT GTGGATAGAT GATCAACCTG TGTACCCTCA ACAGCCGGAA 1020
CTTGGCATTC AGGTGCCCAT GTATATGCCG CAAAAGTCAG TGGATAAGAT TGAGTGGAAG 1080
GAGTGCGAAA ACGATCGCAC GCTTGTGGAA TTGATCCCGT TCCTGACTGC GCTCGCCAAG 1140
CTAGATGATT TCTCGATGGG AATTGCGGCG TACAAGCAAG GGAAGTTTGA TGGCGAGAAA 1200
TTTAACTTGG ACGAGTCCGA CAGTTCGATG ATTTTGGGGG ATGATGATGC CATGGAGGAT 1260
GTTTTTGAAG ATGATGATAA AGCAGTAGAA GAGAAAGAAG AGAGGCTGAA AGCCGAGGAA 1320
GAGAGGGTGA AAGCCCAGCA GGAGCGCGAG AAGCTCGAGG CGGAGGACAA GCTCTTGACA 1380
CCCGACTCAT CAGGACCACT TTGGCGGTAC ATAGTCAACT TTATCCCTGA GGACAAGCTG 1440
AGTCGATCAA AGAAGCTGGC AGCCCACGAG TACATTAGCA AGAAATTCAA GAGCGACGAC 1500
TATAAGTGTC TGATTGCACT GTTAGCCAGG AAAAAGCATC TCATAGATGA CTTGAATTGC 1560
CTCGGGAATC TTTCTGTCAA CTCCAGGGCC GGGTACTATG ATGCCCTAGT GCAAGTGTTT 1620
CACGGCCTCG AAGGTCCATG GGACCTTGAA TTCATCCTGC CAGTTCTGAG AAAGGTGACG 1680
TGCAATTACT TCTACTACGG CAAGGGCATG CAGGTTACAT GCTTCAACAG CGAGAACCTT 1740
CGGGACTTAT GCGAGACCAC CATACCTATG CTTGACACAC CGCAAGAAAT CCGCTTTGAC 1800
GAAATAATTC CCATGGTTGC AAGTCTGAAT TTCCTGAGGG ATGAGAACCT CTTGGGCATT 1860
ACAGTACCTG ACTGGTTTCA GCCATCCTCC GGCTCATGTA GCAGCTGGAT GTGGTGGCCT 1920
CACTTCGTGA CCATTGACTT TGAGAGGAGC TTGGTGAAAG GAGAACTTGC CCAGAAACTT 1980
GCTACGACCG ACACAAAAGG CCTCTTGGGG AAGCTATCGT CCACCACTTG CCGGATAGTT 2040
CCGGGTAAGG AGAGGGATTT GTGCTACAAG CAAGACGAGG GGTTACAAGC AGGGATGGAT 2100
TTCTTCGCAA CACCATTTGA GGAATCAAAG GACAAGTTTC TGGTGTGTGA CAAAAATCTG 2160
GTTCTGTTTA GGCTACAGTT TGTGGATGGG CAAGTTGTAA ACTGCGGGCT CTGGCTGATG 2220
ACGGACTTCA ATTCGGAGCA AAATGCTCGA GAGATGGCAG CTAAATTGTG TGTTGATCTA 2280
GGCTTGAAGG TTAACCTGGA GTTAGATTAT AAACACAACG TGGAAAAGAT GAAAACGAGC 2340
ATGGAAGAGA TCACATTGTT ATCTTCCTGG TTCTTCGGCA AAGAACATGA TGCCATTGTA 2400
AATCTTCAAG CTCCTGTTAC TGTGAGGTTT TGCCAGTGA 2439
Domain Profile
S: 88    kyvKdLsllgrdlskvvivDdsprsfelqpdnlipiepfygds..........dkdtell  137
          +vKdL+ l+r+ +++v +Dd+p  + +qp+ +i++++++ ++          ++d++l 
Q: 311   VVVKDLTRLKRNTKRMVWIDDQPV-YPQQPELGIQVPMYMPQKsvdkiewkecENDRTLV  369
         347********************8.*************9999999999999999999***
S: 138   kllpflealeksddv  152
         +l+pfl+al+k dd+
Q: 370   ELIPFLTALAKLDDF  384
         **********88887
Domain Sequence
(FASTA)
VVVKDLTRLK RNTKRMVWID DQPVYPQQPE LGIQVPMYMP QKSVDKIEWK ECENDRTLVE 60
LIPFLTALAK LDDF 74
KeywordComplete proteome.
Sequence SourceEnsembl
Orthology
Ortholog group
Gene Ontology
KEGG
smo:SELMODRAFT_441087;
InterPros
IPR023214; HAD-like_dom.
IPR004274; NIF.
Pfam
PF03031; NIF; 1.
SMARTs
Prosites
Prints
Created Date20-Feb-2013