EKS-CAJ-00507
Eukaryotic Protein Kinase & Protein Phosphatase Database
TagContent
EKPD IDEKS-CAJ-00507
Classification
Group/FamilyScoreE-ValueStartEndDomain Length
Atypical/TAF1N/AN/AN/AN/AN/A
StatusUnreviewed
Ensembl ProteinENSCJAP00000043164
UniProt AccessionF6U8D3;
Protein Name
Protein Synonyms/Alias
Gene NameTAF1
Gene Synonyms/Alias TAF1;
Ensembl Information
Ensembl Gene IDEnsembl Protein IDEnsembl Transcript ID
ENSCJAG00000013390ENSCJAP00000024653ENSCJAT00000026074
ENSCJAG00000013390ENSCJAP00000048791ENSCJAT00000053018
ENSCJAG00000013390ENSCJAP00000043164ENSCJAT00000062007
OrganismCallithrix jacchus
Functional Description
Protein Length1702
Protein Sequence
(FASTA)
DYDEDDYDAD CEDIDCKLMP PPPPPLGPMK KDKDQDAITG EIIHIISNFS IKGKFTFKTL 60
GNSIFTYAQE VVEEVQIIKV LRFLRLFGPG KNVPSVWRSA RRKRKKKHRE LIQEEQIQEV 120
ECSVESEVSQ KSLWNYDYAP PPPPEQCLSD DEITMMAPVE SKFSQSTGDI DKVTDTKPRV 180
AEWRYGPARL WYDMLGVPED GSGFDYGFKL KKTEHEPVIK SRMVEEFRKL EESNGTDLLA 240
DENFLMVTQL HWEDDIIWDG EDVKHKGTKP QRASLAGWLP SSMTRNAMAY NVQQGFAATL 300
DDDKPWYSIF PIDNEDLVYG RWEDNIIWDA QAMPRLLEPP VLTLDPNDEN LILEIPDEKE 360
EATSNSPSKE SKKESSLKKS RILLGKTGVI KEEPQQNMSQ PEVKDPWNLS NDEYYYPKQQ 420
GLRGTFGGNI IQHSIPAVEL RQPFFPTHMG PIKLRQFHRP PLKKYSFGAL SQPGPHSVQP 480
LLKHIKKKAK MREQERQASG GGEMFFMRTP QDLTGKDGDL ILAEYSEENG PLMMQVGMAT 540
KIKNYYKRKP GKDPGAPDCK YGETVYCHTS PFLGSLHPGQ LLQAFENNLF RAPIYLHKMP 600
ETDFLIIRTR QGYYIRELVD IFVVGQQCPL FEVPGPNSKR ANTHIRDFLQ VFIYRLFWKS 660
KDRPRRIRME DIKKAFPSHS ESSIRKRLKL CADFKRTGMD SNWWVLKSDF RLPTEEEIRA 720
MVSPEQCCAY YSMIAAEQRL KDAGYGEKSF FAPEEENEED FQMKIDDEVR TAPWNTTRAF 780
IAAMKGKCLL EVTGVADPTG CGEGFSYVKI PNKPTQQKDD KEPQPVKKTV TGTDADLRRL 840
SLKNAKQLLR KFGVPEEEIK KLSRWEVIDV VRTMSTEQAR SGEGPMSKFA RGSRFSVAEH 900
QERYKEECQR IFDLQNKVLS STEVLSTDTD SSSAEDSDFE EMGKNIENML QNKKTSSQLS 960
REREEQERKE LQRMLLAAGS ATSGNNHRDD DTASVTSLNS SATGRCLKIY RTFRDEEGKE 1020
YVRCETVRKP AVIDAYVRIR TTKDEEFIRK FALFDEQHRE EMRKERRRIQ EQLRRLKRNQ 1080
EKEKLKGPPE KKPKKMKERP DLKLKCGACG AIGHMRTNKF CPLYYQTNAP PSNPVAMTEE 1140
QEEELEKTVI HNDNEELIKV EGTKIVLGKQ LIESADEVRR KSLVLKFPKQ QLPPKKKRRV 1200
GTTVHCDYLN RPHKSIHRRR TDPMVTLSSI LESIINDMRD LPNTYPFHTP VNAKVVKDYY 1260
KIITRPMDLQ TLRENVRKRL YPSREEFREH LELIVKNSAT YNGPKHSLTQ ISQSMLDLCD 1320
EKLKEKEDKL ARLEKAINPL LDDDDQVAFS FILDNIVTQK MMAVPDSWPF HHPVNKKFVP 1380
DYYKVIVSPM DLETIRKNIS KHKYQSRESF LDDVNLILAN SVKYNGPESQ YTKTAQEIVN 1440
VCYQTLTEYD EHLTQLEKDI CTAKEAALEE AELESLDPMT PGPYTPQPPD LYDTNTSLSM 1500
SRDASVFQDE SNMSVLDIPT ATPEKQVTQE GEDGDGDLAD EEEGTVQQPQ ASVLYEDLLM 1560
SEGEDDEEDA GSDEEGDNPF SAIQLSESGS DSDVGSGGIR PKQPRMLQEN TRMGMENEES 1620
MMSYEGDGGE ASHGLEDSNI SYGSYEEPDP KSNTQDTSFS SIGGYEVCSG PSVLSQVHLS 1680
EDEEDSEDFH SIAGDSDLDS DE 1702
Nucleotide Sequence
(FASTA)
GATTATGATG AAGATGACTA TGATGCTGAT TGTGAAGACA TTGATTGCAA GTTGATGCCT 60
CCTCCACCTC CACCCCTGGG ACCAATGAAG AAGGATAAGG ACCAGGATGC TATTACTGGT 120
GAAATAATTC ATATAATTTC AAATTTTAGT ATAAAAGGGA AGTTTACTTT TAAAACACTT 180
GGCAACAGTA TTTTCACATA TGCTCAAGAA GTAGTAGAAG AAGTCCAAAT TATTAAGGTA 240
TTACGCTTCC TACGTCTTTT TGGACCAGGG AAGAATGTCC CATCTGTTTG GCGGAGTGCT 300
CGGAGAAAGA GGAAAAAGAA GCACCGAGAG CTGATACAGG AAGAACAGAT CCAGGAGGTG 360
GAATGCTCAG TAGAATCAGA AGTCAGCCAG AAGTCTTTGT GGAACTATGA CTATGCTCCA 420
CCACCGCCTC CAGAGCAGTG TCTCTCTGAC GATGAAATCA CGATGATGGC TCCTGTGGAG 480
TCTAAATTTT CCCAATCAAC TGGGGATATA GATAAAGTGA CAGATACTAA ACCAAGAGTG 540
GCTGAGTGGC GTTATGGGCC TGCCCGACTG TGGTATGATA TGCTGGGTGT CCCTGAAGAT 600
GGCAGTGGGT TTGACTATGG CTTCAAACTG AAAAAGACAG AACATGAACC TGTGATAAAA 660
TCTAGAATGG TAGAGGAATT TAGGAAACTT GAGGAAAGCA ATGGCACTGA TCTTCTGGCT 720
GATGAAAATT TCCTGATGGT GACACAGCTG CATTGGGAGG ATGATATTAT CTGGGATGGG 780
GAGGATGTCA AGCACAAAGG GACAAAACCT CAGCGTGCAA GCCTGGCAGG CTGGCTTCCT 840
TCTAGCATGA CTAGGAATGC TATGGCTTAC AATGTTCAGC AAGGTTTTGC AGCCACTCTG 900
GATGATGACA AACCTTGGTA CTCCATTTTT CCCATTGACA ATGAGGATCT GGTATATGGA 960
CGCTGGGAAG ACAATATCAT TTGGGATGCT CAGGCCATGC CCCGGCTGTT GGAACCTCCT 1020
GTTTTGACAC TTGATCCCAA TGATGAGAAC CTCATTTTGG AAATTCCTGA TGAGAAGGAA 1080
GAGGCCACCT CTAACTCCCC CTCCAAGGAG AGTAAGAAGG AGTCATCTCT GAAGAAGAGT 1140
CGAATTCTCT TAGGGAAAAC AGGAGTCATC AAGGAGGAAC CACAGCAGAA CATGTCTCAG 1200
CCAGAAGTGA AAGATCCGTG GAATCTCTCC AATGATGAGT ATTATTACCC CAAGCAACAG 1260
GGTCTTCGAG GCACCTTTGG AGGGAATATT ATCCAGCACT CAATTCCTGC TGTGGAATTA 1320
CGGCAGCCCT TCTTTCCCAC ACACATGGGG CCCATCAAAC TCCGGCAGTT CCATCGCCCA 1380
CCTCTGAAAA AGTACTCATT TGGGGCACTG TCTCAGCCAG GTCCCCACTC AGTCCAACCT 1440
TTGCTAAAGC ACATCAAAAA AAAGGCCAAG ATGAGAGAAC AAGAGAGGCA AGCTTCAGGT 1500
GGTGGAGAGA TGTTTTTTAT GCGCACACCT CAGGACCTCA CAGGCAAAGA TGGAGATCTT 1560
ATTCTTGCAG AATATAGTGA GGAAAATGGA CCCTTAATGA TGCAGGTTGG CATGGCAACC 1620
AAGATAAAGA ATTATTATAA ACGGAAACCT GGAAAAGATC CTGGAGCACC GGATTGTAAA 1680
TATGGGGAAA CTGTTTACTG CCATACATCT CCTTTTCTGG GCTCTCTCCA TCCTGGCCAA 1740
TTACTGCAGG CATTTGAGAA CAACCTTTTT CGTGCTCCAA TTTATCTTCA TAAGATGCCA 1800
GAAACTGATT TCCTGATCAT TCGGACAAGA CAGGGTTACT ATATTCGGGA ATTAGTGGAT 1860
ATTTTTGTGG TTGGCCAGCA GTGTCCCTTA TTTGAAGTTC CTGGGCCTAA CTCCAAAAGG 1920
GCCAATACAC ATATTCGAGA CTTTCTACAG GTTTTTATTT ACCGTCTTTT CTGGAAGAGT 1980
AAAGATCGGC CACGGAGGAT AAGAATGGAA GATATAAAAA AAGCCTTTCC TTCCCATTCA 2040
GAAAGCAGCA TCCGGAAGAG GCTAAAGCTC TGCGCTGACT TCAAACGCAC AGGGATGGAC 2100
TCAAACTGGT GGGTGCTTAA GTCTGATTTT CGTTTACCAA CGGAAGAAGA GATCAGAGCT 2160
ATGGTGTCAC CAGAGCAGTG CTGTGCTTAT TATAGCATGA TAGCTGCAGA ACAACGACTG 2220
AAGGATGCTG GCTATGGTGA GAAATCCTTT TTTGCTCCAG AAGAAGAAAA TGAGGAAGAT 2280
TTCCAGATGA AGATTGATGA TGAAGTTCGC ACTGCTCCTT GGAACACCAC AAGGGCCTTC 2340
ATTGCTGCCA TGAAGGGCAA GTGTCTCCTA GAGGTGACTG GGGTGGCCGA TCCCACAGGA 2400
TGTGGTGAAG GATTCTCCTA TGTGAAGATT CCAAACAAAC CAACACAGCA GAAGGATGAT 2460
AAAGAGCCGC AGCCAGTGAA GAAGACAGTG ACAGGAACAG ATGCAGACCT TCGTCGCCTT 2520
TCTCTGAAAA ATGCCAAGCA ACTTCTACGT AAATTTGGTG TGCCTGAGGA AGAGATTAAA 2580
AAATTGTCCC GCTGGGAAGT GATTGATGTG GTGCGCACAA TGTCAACAGA ACAGGCTCGT 2640
TCTGGAGAGG GGCCCATGAG TAAATTTGCA CGTGGATCAA GGTTTTCTGT GGCTGAGCAT 2700
CAAGAGCGTT ACAAAGAGGA ATGTCAGCGC ATCTTTGACC TACAGAACAA GGTTCTGTCA 2760
TCAACTGAAG TCTTATCAAC TGACACAGAC AGTAGCTCAG CTGAAGATAG TGACTTTGAA 2820
GAAATGGGAA AGAACATTGA GAACATGTTG CAGAACAAGA AAACCAGCTC TCAGCTATCA 2880
CGTGAACGGG AGGAGCAGGA GCGGAAGGAA CTACAGCGAA TGCTACTGGC AGCAGGCTCA 2940
GCAACATCAG GAAACAATCA CAGAGATGAT GACACAGCTT CTGTGACTAG CCTTAACTCT 3000
TCCGCCACTG GACGCTGTCT CAAGATTTAT CGCACATTTC GAGATGAAGA GGGAAAAGAG 3060
TATGTTCGCT GTGAGACAGT CCGAAAACCA GCTGTCATTG ATGCCTATGT GCGTATACGG 3120
ACTACAAAAG ATGAAGAATT CATTCGAAAA TTTGCCCTTT TTGATGAACA ACATCGAGAA 3180
GAGATGCGAA AAGAACGGCG GAGGATTCAA GAGCAGCTGA GGCGTCTTAA GCGGAACCAG 3240
GAAAAGGAGA AGCTCAAGGG TCCTCCTGAG AAGAAGCCCA AGAAAATGAA GGAGCGTCCT 3300
GACCTAAAAC TGAAATGTGG GGCATGTGGT GCCATTGGAC ACATGAGGAC TAACAAATTC 3360
TGCCCCCTCT ATTATCAAAC AAATGCGCCA CCTTCCAACC CTGTTGCCAT GACAGAAGAG 3420
CAGGAGGAAG AGTTGGAAAA GACAGTCATT CATAATGATA ATGAAGAACT TATCAAGGTT 3480
GAAGGAACCA AAATTGTCTT GGGGAAACAG CTAATTGAGA GTGCGGATGA GGTTCGAAGA 3540
AAATCTCTGG TTCTCAAGTT TCCTAAACAG CAACTTCCTC CAAAGAAAAA ACGGCGAGTT 3600
GGAACCACTG TTCATTGTGA CTATTTGAAT AGACCTCATA AGTCCATCCA TCGGCGACGC 3660
ACAGACCCTA TGGTGACGCT ATCATCCATC TTGGAGTCCA TCATCAATGA CATGAGAGAT 3720
CTTCCAAATA CATATCCTTT CCACACTCCA GTTAATGCAA AGGTTGTAAA GGACTACTAC 3780
AAAATCATCA CTCGGCCAAT GGACCTGCAA ACACTTCGTG AAAATGTGCG TAAACGCCTC 3840
TACCCATCTC GGGAAGAGTT CAGAGAGCAT CTGGAGCTAA TTGTGAAAAA TAGTGCAACC 3900
TACAATGGGC CAAAACACTC ATTGACTCAG ATCTCTCAAT CTATGCTGGA TCTCTGTGAT 3960
GAAAAACTCA AAGAGAAAGA AGACAAATTA GCTCGTTTAG AGAAAGCTAT CAACCCCTTG 4020
CTGGATGATG ATGACCAGGT GGCATTTTCT TTCATTCTGG ACAACATTGT CACCCAGAAA 4080
ATGATGGCAG TTCCAGATTC TTGGCCATTT CATCACCCAG TTAATAAGAA GTTTGTTCCA 4140
GATTATTACA AAGTGATTGT CAGTCCAATG GATTTAGAGA CCATACGTAA GAACATCTCC 4200
AAACACAAGT ATCAGAGTCG GGAGAGCTTT CTAGATGATG TAAACCTTAT TCTGGCCAAC 4260
AGTGTTAAGT ATAATGGGCC TGAGAGTCAA TATACTAAGA CTGCTCAGGA GATTGTGAAC 4320
GTCTGTTACC AGACATTGAC TGAGTATGAT GAGCATTTGA CTCAACTCGA GAAGGATATT 4380
TGTACAGCTA AAGAAGCAGC TTTGGAGGAA GCAGAATTAG AAAGCCTGGA CCCAATGACC 4440
CCAGGGCCCT ACACACCTCA GCCTCCTGAT TTGTATGATA CCAACACATC CCTCAGTATG 4500
TCTCGAGATG CCTCTGTATT TCAAGATGAG AGCAATATGT CGGTCTTGGA TATTCCCACT 4560
GCCACTCCAG AAAAGCAGGT AACACAGGAA GGTGAAGATG GAGATGGTGA TCTTGCAGAT 4620
GAAGAGGAAG GAACTGTACA GCAGCCTCAA GCCAGTGTCC TGTATGAGGA TTTGCTTATG 4680
TCTGAAGGAG AAGATGATGA GGAAGATGCT GGGAGTGATG AAGAAGGAGA CAATCCTTTC 4740
TCTGCTATCC AGCTGAGTGA AAGTGGAAGT GACTCTGATG TGGGATCTGG TGGAATAAGA 4800
CCCAAACAAC CCCGCATGCT TCAGGAAAAC ACAAGGATGG GCATGGAAAA TGAAGAAAGC 4860
ATGATGTCCT ATGAGGGAGA CGGTGGGGAG GCTTCCCATG GTTTGGAGGA TAGCAACATC 4920
AGTTATGGGA GCTATGAGGA GCCTGATCCC AAGTCGAACA CCCAAGACAC AAGCTTCAGC 4980
AGCATCGGTG GGTATGAGGT ATGCTCTGGG CCGAGCGTAC TAAGCCAGGT CCACCTGTCA 5040
GAGGACGAGG AGGACAGTGA GGATTTCCAC TCCATTGCTG GGGATAGTGA CTTGGACTCT 5100
GATGAATGA 5109
Domain Profile
N/A
Domain Sequence
(FASTA)
N/A
KeywordBromodomain; Complete proteome; Reference proteome.
Sequence SourceEnsembl
Orthology
Ortholog group
Ailuropoda melanoleuca"; ?>Anolis carolinensis"; ?>Bos taurus"; ?>Caenorhabditis elegans"; ?>Canis familiaris"; ?>Ciona intestinalis"; ?>Ciona savignyi"; ?>Danio rerio"; ?>Drosophila melanogaster"; ?>Equus caballus"; ?>Felis catus"; ?>Gallus gallus"; ?>Gasterosteus aculeatus"; ?>Gorilla gorilla"; ?>Homo sapiens"; ?>Ictidomys tridecemlineatus"; ?>Latimeria chalumnae"; ?>Loxodonta africana"; ?>Macaca mulatta"; ?>Meleagris gallopavo"; ?>Monodelphis domestica"; ?>Mus musculus"; ?>Mustela putorius furo"; ?>Myotis lucifugus"; ?>Nomascus leucogenys"; ?>Oreochromis niloticus"; ?>Ornithorhynchus anatinus"; ?>Oryctolagus cuniculus"; ?>Oryzias latipes"; ?>Otolemur garnettii"; ?>Pelodiscus sinensis"; ?>Pongo abelii"; ?>Rattus norvegicus"; ?>Sarcophilus harrisii"; ?>Taeniopygia guttata"; ?>Takifugu rubripes"; ?>Tetraodon nigroviridis"; ?>Xenopus tropicalis"; ?>Saccharomyces cerevisiae"; ?>Schizosaccharomyces pombe"; ?>
EKS-AIM-00466
EKS-ANC-00493
EKS-BOT-00495
EKS-CAE-00438
EKS-CAF-00500
EKS-CII-00276
EKS-CIS-00253
EKS-DAR-00926
EKS-DRM-00234
EKS-EQC-00492
EKS-FEC-00476
EKS-GAG-00419
EKS-GAA-00606
EKS-GOG-00490
EKS-HOS-00514
EKS-ICT-00471
EKS-LAC-00513
EKS-LOA-00505
EKS-MAM-00498
EKS-MEG-00407
EKS-MOD-00487
EKS-MUM-00537
EKS-MUP-00494
EKS-MYL-00492
EKS-NOL-00450
EKS-ORN-00629
EKS-ORA-00434
EKS-ORC-00472
EKS-ORL-00582
EKS-OTG-00511
EKS-PES-00439
EKS-POA-00480
EKS-RAN-00510
EKS-SAH-00469
EKS-TAG-00565
EKS-TAR-00623
EKS-TEN-00624
EKS-XET-00652
EKS-SAC-00129
EKS-SCP-00124
Gene Ontology
GO:0005669; C:transcription factor TFIID complex
GO:0003677; F:DNA binding
GO:0006352; P:DNA-dependent transcription, initiation
GO:0006355; P:regulation of transcription, DNA-dependent
KEGG
InterPros
IPR001487; Bromodomain.
IPR018359; Bromodomain_CS.
IPR011177; TAF1_animal.
IPR009067; TAF_II_230-bd.
IPR022591; TFIID_sub1_DUF3591.
Pfam
PF00439; Bromodomain; 2.
PF12157; DUF3591; 1.
PF09247; TBP-binding; 1.
SMARTs
SM00297; BROMO; 2.
Prosites
PS00633; BROMODOMAIN_1; 2.
PS50014; BROMODOMAIN_2; 2.
Prints
PR00503; BROMODOMAIN.
Created Date20-Feb-2013