EKS-CAF-00500
Eukaryotic Protein Kinase & Protein Phosphatase Database
Tag: Content
EKPD ID: EKS-CAF-00500
Classification
Group/Family    Score  E-Value  Start  End  Domain Length
Atypical/TAF1   N/A    N/A      N/A    N/A  N/A
Status: Unreviewed
Ensembl Protein: ENSCAFP00000025204
UniProt Accession: J9NZ21
Protein Name
Protein Synonyms/Alias
Gene Name: TAF1
Gene Synonyms/Alias: TAF1
Ensembl Information
Ensembl Gene ID      Ensembl Protein ID   Ensembl Transcript ID
ENSCAFG00000017103   ENSCAFP00000038541   ENSCAFT00000043925
ENSCAFG00000017103   ENSCAFP00000025204   ENSCAFT00000027104
Organism: Canis familiaris
Functional Description
Protein Length: 1893
Protein Sequence
MGPGWDLLLP VAAGDTTSAI MSDTDSDEDS AGGGPFSLTG FLFGNINGAG QLEGESVLDD 60
ECKKHLAGLG ALGLGSLITE LTANEELAGT DGALVNDEGW IRSTEDAVDY SDITEVAEDE 120
SRRYQQTMGS LQPLCHSDYD EDDYDADCED IDCKLMPPPP PPPGPMKKDK DQDGLTGVSE 180
DGEGIILPSI IAPSSLASEK VDFSSSSDSE SEMGPQEASQ AESEDGKLTL PLAGIMQHDA 240
TKLLPSVTEL FPEFRPGKVL RFLRLFGPGK NVPSVWRSAR RKRKKKHREL IQEEQIQEVE 300
CSVESEVNKK SLWNYDYAPP PPPEQCLSDD EITMMAPVES KFSQSTGDID KVTDTKPRVA 360
EWRYGPARLW YDMLGVPEDG SGFDYGFKLR KMDHEPVIKC RMMEDLRKFE ENNGSDLLAD 420
ENFLMVTQLH WEDDIIWDGE DVKHKGTKPQ RASLAGWLPS SMTRNAMAYN VQQGFAATLD 480
DDKPWYSIFP IDNEELVYGR WEDNIIWDAQ AMPRLLEPPV LTLDPNDENL ILEIPDEKEE 540
ATSNSPSKEN KKESSLKKSR ILLGKTGVIK EEPQQNMSQP EVKDPWNLSN DEYYYPKQQG 600
LRGTFGGNII QHSIPAVELR QPFFPTHMGP IKLRQFHRPP LKKYSFGALS QPGPHSVQPL 660
LKHIKKKAKM REQERQASGG GEMFFMRTPQ DLTGKDGDLI LAEYSEENGP LMMQVGMATK 720
IKNYYKRKPG KDPGAPDCKY GETVYCHTSP FLGSLHPGQL LQAFENNLFR APIYLHKMPE 780
TDFLIIRTRQ GYYIRELVDI FVVGQQCPLF EVPGPNSKRA NTHIRDFLQV FIYRLFWKSK 840
DRPRRIRMED IKKAFPSHSE SSIRKRLKLC ADFKRTGMDS NWWVLKSDFR LPTEEEIRAM 900
VSPEQCCAYY SMIAAEQRLK DAGYGEKSFF APEEENEEDF QMKIVNEVRT APWNTTRAFI 960
AAMKGKCLLE VTGVADPTGC GEGFSYVKIP NKPTQQKDDK EPQPVKKTVT GTDADLRRLS 1020
LKNAKQLLRK FGVPEEEIKK LSRWEVIDVV RTMSTEQARS GEGPMSKFAR GSRFSVAEHQ 1080
ERYKEECQRI FDLQNKVLSS TEILSTDTDS SSAEDSDFEE MGKNIENMLQ NKKTSSQLSR 1140
EREEQERKEL QRMLLAAGSA ASGNNHRDDD TASVTSLNSS ATGRCLKIYR TFRDEEGKEY 1200
VRCETVRKPA VIDAYVRIRT TKDEEFIRKF ALFDEQHREE MRKERRRIQE QLRRLKRNQE 1260
KEKLKGPPEK KPKKMKERPD LKLKCGACGA IGHMRTNKFC PLYYQTNAPP SNPVAMTEEQ 1320
EEELEKTVIH NDNEELIKVE GTKIVLGKQL IESADEVRRK SLVLKFPKQQ LPPKKKRRVG 1380
TTVHCDYLNR PHKSIHRRRT DPMVTLSSIL ESIINDMRDL PNTYPFHTPV NAKVVKDYYK 1440
IITRPMDLQT LRENVRKRLY PSREEFREHL ELIVKNSATY NGPKHSLTQI SQSMLDLCDE 1500
KLKEKEDKLA RLEKAINPLL DDDDQVAFSF ILDNIVTQKM MAVPVSWPFH HPVNKKFVPD 1560
YYKVIISPMD LETIRKNISK HKYQSRESFL DDVNLILANS VKYNVGPESQ YTKTAQEIVN 1620
VCYQTLTEYD EHLTQLEKDI CTAKEAALEE AELESLDPMT PGPYTPQPPD LYDTNTSLSM 1680
SRDASVFQDE SNMSVLDIPT ATPEKQVTQE GEDADGDLAD EEEGSMQQPQ ASVLYEDLLM 1740
SEGEDDEEDA GSDEEGDNPF SAIQLSESGS DSDVGSSGIR PKQPRMLQEN TRMGMENEES 1800
MMSYEGDGGE ASHGLEDSNM SYGSYEEPDP KSNTQDTSFS SIGGYEVSEE EEDEEEEQRS 1860
GPSVLSQVHL SEDEEDSEDF HSIAGDSDLD SDE 1893
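The sequence block above is laid out in groups of ten residues, with a running position count at the end of each line. A minimal Python sketch for recovering the plain sequence from that layout follows. The function name `clean_sequence_block` is hypothetical and is not part of any EKPD tooling; the sample reuses only the first and last lines of the block as printed in this record.

```python
import re

def clean_sequence_block(block: str) -> str:
    """Strip position counters and spacing from a numbered sequence block."""
    # Keep letters only: this drops the 10-residue spacing, the trailing
    # position counter on each line, and the line breaks.
    return re.sub(r"[^A-Za-z]", "", block).upper()

# First and last lines of the protein block in this record.
sample = """MGPGWDLLLP VAAGDTTSAI MSDTDSDEDS AGGGPFSLTG FLFGNINGAG QLEGESVLDD 60
GPSVLSQVHL SEDEEDSEDF HSIAGDSDLD SDE 1893"""

residues = clean_sequence_block(sample)
# 60 residues from the first line plus 33 from the last line.
print(len(residues))
```

Applied to the full block, the same function should return a string whose length can be compared against the Protein Length field.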
Nucleotide Sequence
ATGGGACCTG GCTGGGATTT GCTGCTGCCT GTAGCCGCCG GAGACACTAC CAGCGCCATC 60
ATGTCAGACA CGGACAGCGA TGAAGATTCT GCTGGAGGCG GCCCATTTTC TCTAACCGGT 120
TTCCTTTTTG GCAACATCAA TGGAGCCGGG CAGCTGGAGG GGGAAAGCGT CTTGGACGAT 180
GAGTGTAAGA AGCACTTGGC AGGCTTGGGG GCTTTGGGTC TGGGCAGCCT GATCACTGAA 240
CTCACGGCAA ACGAAGAATT GGCTGGGACC GATGGTGCCC TGGTAAATGA TGAAGGATGG 300
ATCAGGAGTA CAGAAGATGC TGTGGACTAT TCAGACATCA CTGAGGTAGC AGAAGATGAA 360
AGCCGAAGAT ATCAGCAGAC AATGGGGAGC TTGCAGCCCC TTTGCCACTC AGATTATGAT 420
GAAGATGACT ATGATGCCGA TTGTGAAGAC ATTGATTGCA AGTTGATGCC TCCTCCACCT 480
CCACCTCCAG GACCAATGAA GAAAGATAAG GACCAGGATG GTCTTACCGG TGTGTCTGAG 540
GATGGAGAAG GCATCATCTT GCCCTCCATC ATTGCCCCTT CCTCTTTGGC CTCAGAGAAA 600
GTGGACTTCA GTAGTTCCTC TGACTCAGAA TCTGAGATGG GACCTCAGGA AGCATCACAG 660
GCAGAATCTG AGGATGGAAA GCTGACCCTA CCATTGGCTG GGATTATGCA ACACGATGCC 720
ACCAAGCTGT TGCCAAGTGT CACAGAACTT TTCCCGGAAT TTCGGCCTGG GAAGGTATTA 780
CGCTTCCTTC GTCTTTTCGG ACCAGGGAAG AATGTCCCAT CTGTTTGGCG GAGTGCTCGG 840
AGAAAGAGGA AGAAGAAGCA CCGTGAGCTG ATACAGGAAG AACAGATCCA GGAGGTGGAG 900
TGCTCAGTAG AGTCCGAAGT CAACAAGAAG TCTTTATGGA ATTATGACTA TGCTCCACCA 960
CCACCTCCAG AGCAGTGTCT CTCTGATGAT GAAATTACAA TGATGGCTCC TGTTGAATCA 1020
AAGTTTTCCC AGTCAACTGG AGATATAGAT AAAGTGACAG ATACCAAACC AAGAGTGGCT 1080
GAGTGGCGTT ATGGGCCTGC TCGACTGTGG TATGATATGC TGGGTGTCCC TGAAGATGGC 1140
AGTGGGTTTG ATTATGGCTT CAAACTGAGA AAAATGGACC ATGAGCCTGT GATAAAATGT 1200
AGAATGATGG AGGACTTGAG GAAATTTGAG GAAAACAATG GCTCTGACCT ACTGGCTGAT 1260
GAAAACTTCC TGATGGTGAC ACAGCTGCAC TGGGAGGATG ATATCATCTG GGATGGGGAG 1320
GATGTCAAGC ACAAAGGAAC AAAGCCTCAG CGTGCAAGCC TGGCAGGCTG GCTTCCTTCC 1380
AGCATGACTA GGAATGCCAT GGCCTACAAT GTTCAGCAAG GTTTTGCAGC CACCCTGGAT 1440
GATGACAAAC CTTGGTACTC CATCTTTCCC ATTGACAACG AGGAGCTGGT ATATGGACGC 1500
TGGGAGGACA ATATCATTTG GGATGCTCAG GCCATGCCCA GGCTGTTGGA GCCTCCTGTT 1560
TTGACACTTG ATCCCAATGA TGAGAACCTT ATTTTGGAAA TTCCTGATGA AAAGGAAGAG 1620
GCCACTTCTA ACTCCCCCTC CAAGGAAAAT AAGAAAGAAT CATCTCTGAA GAAGAGCCGA 1680
ATTCTCTTAG GGAAAACAGG GGTCATCAAG GAGGAACCAC AGCAGAACAT GTCTCAGCCA 1740
GAAGTGAAAG ATCCATGGAA TCTCTCCAAT GATGAATATT ACTACCCCAA GCAACAGGGT 1800
CTTCGAGGCA CCTTTGGAGG AAATATTATC CAGCACTCAA TTCCTGCAGT GGAATTACGA 1860
CAGCCCTTCT TTCCTACCCA TATGGGGCCC ATCAAACTTC GGCAGTTCCA TCGTCCACCC 1920
CTGAAGAAAT ATTCCTTCGG GGCACTCTCT CAGCCAGGTC CCCACTCAGT CCAGCCCTTG 1980
CTGAAGCACA TCAAAAAGAA GGCTAAGATG AGAGAACAAG AGAGACAAGC TTCTGGTGGT 2040
GGAGAGATGT TTTTTATGCG CACACCTCAG GACCTCACAG GCAAAGATGG AGATCTTATT 2100
CTTGCAGAAT ACAGTGAGGA AAATGGACCA TTAATGATGC AGGTTGGCAT GGCAACCAAG 2160
ATAAAAAACT ATTATAAGCG GAAACCTGGA AAAGATCCTG GGGCCCCAGA TTGTAAATAT 2220
GGAGAAACTG TTTACTGTCA TACATCTCCT TTCCTAGGCT CTCTTCATCC TGGTCAATTA 2280
CTGCAGGCAT TTGAGAACAA TCTCTTTCGT GCCCCAATTT ATCTTCATAA GATGCCAGAA 2340
ACTGACTTCC TTATTATTCG GACAAGACAA GGTTACTACA TTCGGGAGTT AGTGGATATT 2400
TTTGTGGTCG GCCAGCAATG CCCCTTGTTT GAAGTTCCTG GGCCCAACTC CAAAAGAGCT 2460
AATACACATA TTCGAGACTT TCTCCAGGTT TTTATTTACC GCCTCTTCTG GAAGAGTAAA 2520
GATCGGCCAA GGCGGATCCG AATGGAAGAT ATAAAAAAAG CCTTTCCTTC TCACTCAGAA 2580
AGTAGCATCC GCAAGAGGCT AAAGCTCTGC GCTGACTTCA AACGCACAGG GATGGACTCA 2640
AACTGGTGGG TGCTAAAGTC AGATTTTCGT TTACCAACTG AAGAGGAGAT CAGAGCTATG 2700
GTGTCTCCAG AGCAGTGCTG TGCTTATTAT AGCATGATAG CTGCAGAGCA GCGACTAAAG 2760
GATGCTGGCT ATGGTGAGAA GTCCTTTTTT GCTCCAGAAG AAGAAAATGA GGAAGATTTC 2820
CAGATGAAGA TTGTAAATGA GGTTCGCACT GCTCCTTGGA ATACCACAAG GGCTTTCATT 2880
GCTGCCATGA AGGGCAAGTG TCTCTTGGAG GTGACTGGGG TGGCAGATCC CACAGGGTGT 2940
GGTGAAGGAT TCTCCTATGT CAAGATTCCA AACAAACCAA CACAGCAGAA GGATGATAAG 3000
GAGCCTCAGC CAGTGAAGAA GACAGTGACA GGAACAGATG CAGATCTCCG TCGCCTCTCA 3060
CTGAAAAATG CCAAGCAGCT CCTACGTAAA TTTGGTGTGC CTGAGGAAGA GATTAAAAAG 3120
CTGTCCCGCT GGGAAGTGAT TGATGTGGTA CGCACAATGT CAACAGAGCA AGCCCGCTCT 3180
GGAGAGGGGC CCATGAGTAA ATTTGCCCGT GGATCAAGGT TTTCTGTGGC TGAGCATCAA 3240
GAGCGTTACA AAGAGGAATG TCAGCGCATC TTTGATCTAC AGAACAAGGT TTTGTCATCA 3300
ACTGAAATAT TGTCAACTGA CACAGACAGC AGCTCAGCCG AAGACAGTGA CTTTGAAGAA 3360
ATGGGAAAGA ACATTGAGAA CATGTTGCAG AATAAGAAAA CCAGCTCTCA GCTGTCACGT 3420
GAACGAGAGG AGCAGGAGCG GAAGGAACTG CAGCGGATGC TACTGGCAGC AGGTTCTGCA 3480
GCATCAGGAA ACAATCACAG AGATGATGAT ACAGCTTCTG TGACCAGCCT TAACTCTTCT 3540
GCCACTGGCC GTTGTCTCAA AATTTATCGC ACATTTCGAG ATGAAGAAGG GAAAGAGTAT 3600
GTTCGCTGTG AGACAGTCCG AAAGCCTGCT GTCATTGATG CCTATGTGCG CATACGGACC 3660
ACAAAAGATG AGGAATTCAT TCGAAAGTTT GCCCTTTTTG ATGAACAGCA TCGAGAAGAG 3720
ATGCGAAAGG AACGGCGGAG GATTCAAGAG CAACTTAGGC GGCTCAAGCG GAATCAGGAA 3780
AAGGAGAAGC TTAAGGGTCC CCCTGAGAAG AAGCCTAAAA AAATGAAGGA GCGTCCTGAT 3840
CTAAAACTGA AATGTGGAGC TTGTGGTGCC ATTGGGCACA TGAGGACAAA CAAGTTCTGC 3900
CCCCTCTACT ATCAAACAAA TGCTCCACCT TCCAACCCTG TTGCCATGAC AGAGGAGCAG 3960
GAGGAGGAGT TGGAAAAGAC AGTCATTCAT AATGATAATG AAGAACTTAT CAAGGTTGAA 4020
GGGACCAAGA TCGTCCTGGG AAAACAGCTA ATTGAGAGTG CAGATGAGGT TCGAAGAAAA 4080
TCTCTGGTTC TCAAGTTCCC TAAACAGCAA CTTCCTCCCA AGAAGAAACG GCGAGTTGGA 4140
ACCACTGTTC ACTGTGACTA CTTAAATAGA CCTCATAAAT CCATCCACCG GCGCCGGACA 4200
GACCCAATGG TGACATTGTC CTCCATCTTG GAGTCTATCA TCAATGACAT GAGAGATCTT 4260
CCAAATACCT ACCCTTTCCA CACTCCAGTC AATGCAAAGG TTGTAAAGGA CTACTACAAA 4320
ATCATCACTC GACCAATGGA CTTACAAACA CTCCGTGAAA ATGTGCGCAA ACGCCTTTAC 4380
CCATCTCGGG AAGAGTTCAG AGAACATTTG GAGCTCATTG TGAAAAATAG TGCAACCTAT 4440
AATGGTCCGA AGCACTCATT GACCCAGATC TCTCAATCTA TGCTGGATCT CTGTGATGAA 4500
AAACTCAAAG AGAAAGAAGA TAAATTGGCT CGTTTAGAGA AAGCTATCAA CCCCTTGCTG 4560
GATGATGATG ACCAAGTAGC ATTTTCTTTC ATTCTGGACA ACATTGTCAC CCAGAAAATG 4620
ATGGCAGTTC CAGTTTCTTG GCCATTTCAT CACCCAGTTA ATAAGAAGTT TGTTCCAGAT 4680
TATTACAAAG TGATTATCAG TCCAATGGAT TTAGAGACTA TACGTAAGAA TATCTCCAAG 4740
CACAAGTACC AGAGTCGGGA GAGCTTTCTG GATGATGTCA ACCTTATTCT GGCCAACAGT 4800
GTTAAGTATA ATGTAGGGCC TGAAAGTCAG TATACTAAGA CTGCTCAGGA GATTGTAAAT 4860
GTCTGTTACC AGACATTGAC TGAGTATGAT GAACATTTGA CTCAACTTGA GAAGGATATT 4920
TGTACAGCTA AGGAAGCGGC TTTGGAGGAA GCAGAATTAG AAAGCTTGGA CCCAATGACC 4980
CCAGGGCCTT ATACACCTCA GCCTCCTGAT TTGTATGACA CCAACACATC CCTCAGTATG 5040
TCCCGAGATG CCTCTGTATT TCAAGATGAG AGCAATATGT CTGTCCTGGA TATTCCCACT 5100
GCCACTCCAG AAAAGCAGGT GACACAGGAA GGTGAAGATG CAGATGGTGA TCTTGCAGAT 5160
GAAGAGGAAG GAAGTATGCA ACAGCCTCAA GCCAGTGTCC TGTATGAGGA TTTGCTTATG 5220
TCTGAAGGAG AAGATGATGA AGAAGATGCT GGGAGTGATG AAGAAGGAGA CAATCCTTTC 5280
TCTGCTATCC AACTGAGTGA AAGTGGAAGT GACTCTGATG TTGGATCTAG TGGGATAAGA 5340
CCCAAACAGC CCCGCATGCT TCAGGAAAAC ACAAGGATGG GCATGGAAAA TGAAGAAAGC 5400
ATGATGTCCT ATGAGGGAGA TGGTGGGGAG GCTTCTCATG GTTTGGAGGA TAGCAATATG 5460
AGTTATGGGA GCTATGAGGA GCCTGATCCC AAGTCGAACA CCCAAGATAC AAGCTTCAGC 5520
AGCATCGGTG GGTATGAGGT ATCAGAGGAA GAAGAAGATG AGGAGGAAGA GCAGCGCTCT 5580
GGGCCAAGTG TACTAAGCCA GGTCCACCTG TCAGAGGATG AGGAGGACAG TGAGGATTTC 5640
CACTCCATTG CTGGGGACAG TGATTTGGAC TCTGATGAAT GA 5682
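The nucleotide length is consistent with the protein length reported in this record: 1893 codons plus one stop codon give 3 × 1894 = 5682 nucleotides. The sketch below checks that arithmetic and translates the first ten codons and the final codon as printed above. It assumes the standard genetic code, which this record does not state explicitly, and `translate` is a hypothetical helper.

```python
from itertools import product

# Standard genetic code with codons enumerated in TCAG order.
# Assumption: the record does not say which translation table applies.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon; '*' marks a stop."""
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds) - 2, 3))

protein_length = 1893   # "Protein Length" field of this record
cds_length = 5682       # final position counter of the nucleotide block
assert cds_length == 3 * (protein_length + 1)

# First three ten-base groups of the nucleotide block, spacing removed.
first_30 = "ATGGGACCTGGCTGGGATTTGCTGCTGCCT"
print(translate(first_30))   # MGPGWDLLLP
print(translate("TGA"))      # *
```

The translated prefix matches the first ten residues of the protein sequence, and the final codon TGA is a stop, which accounts for the three extra nucleotides.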
Domain Profile
N/A
Domain Sequence
N/A
Keyword: Bromodomain; Complete proteome; Reference proteome.
Sequence Source: Ensembl
Orthology
Ortholog group
Species                     EKPD ID
Ailuropoda melanoleuca      EKS-AIM-00466
Anolis carolinensis         EKS-ANC-00493
Bos taurus                  EKS-BOT-00495
Caenorhabditis elegans      EKS-CAE-00438
Callithrix jacchus          EKS-CAJ-00507
Ciona intestinalis          EKS-CII-00276
Ciona savignyi              EKS-CIS-00253
Danio rerio                 EKS-DAR-00926
Drosophila melanogaster     EKS-DRM-00234
Equus caballus              EKS-EQC-00492
Felis catus                 EKS-FEC-00476
Gallus gallus               EKS-GAG-00419
Gasterosteus aculeatus      EKS-GAA-00606
Gorilla gorilla             EKS-GOG-00490
Homo sapiens                EKS-HOS-00514
Ictidomys tridecemlineatus  EKS-ICT-00471
Latimeria chalumnae         EKS-LAC-00513
Loxodonta africana          EKS-LOA-00505
Macaca mulatta              EKS-MAM-00498
Meleagris gallopavo         EKS-MEG-00407
Monodelphis domestica       EKS-MOD-00487
Mus musculus                EKS-MUM-00537
Mustela putorius furo       EKS-MUP-00494
Myotis lucifugus            EKS-MYL-00493
Nomascus leucogenys         EKS-NOL-00450
Oreochromis niloticus       EKS-ORN-00629
Ornithorhynchus anatinus    EKS-ORA-00434
Oryctolagus cuniculus       EKS-ORC-00472
Oryzias latipes             EKS-ORL-00582
Otolemur garnettii          EKS-OTG-00511
Pelodiscus sinensis         EKS-PES-00439
Pongo abelii                EKS-POA-00480
Rattus norvegicus           EKS-RAN-00510
Sarcophilus harrisii        EKS-SAH-00469
Taeniopygia guttata         EKS-TAG-00565
Takifugu rubripes           EKS-TAR-00623
Tetraodon nigroviridis      EKS-TEN-00624
Xenopus tropicalis          EKS-XET-00652
Saccharomyces cerevisiae    EKS-SAC-00129
Schizosaccharomyces pombe   EKS-SCP-00124
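The identifiers in the ortholog list share the shape of this record's own ID: a three-letter prefix, a three-letter code, and a five-digit serial number. The sketch below splits an identifier on that pattern. Reading the middle field as a species code is an inference from the identifiers in this record (for example `CAF` in the ID of this Canis familiaris entry), not a documented EKPD rule, and `parse_ekpd_id` and `EkpdId` are hypothetical names.

```python
import re
from typing import NamedTuple

class EkpdId(NamedTuple):
    prefix: str   # e.g. "EKS"
    code: str     # three-letter code, assumed here to denote the species
    serial: int   # numeric part, leading zeros dropped

# Pattern inferred from the IDs listed in this record only.
ID_PATTERN = re.compile(r"([A-Z]{3})-([A-Z]{3})-(\d{5})")

def parse_ekpd_id(text: str) -> EkpdId:
    """Split an ID such as 'EKS-HOS-00514' into its three fields."""
    match = ID_PATTERN.fullmatch(text.strip())
    if match is None:
        raise ValueError(f"not in the expected ID shape: {text!r}")
    return EkpdId(match.group(1), match.group(2), int(match.group(3)))

print(parse_ekpd_id("EKS-HOS-00514"))
```

Identifiers that do not fit the pattern raise `ValueError` rather than being silently truncated, which keeps malformed lines visible when the list is processed in bulk.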
Gene Ontology
GO:0005669; C:transcription factor TFIID complex
GO:0003677; F:DNA binding
GO:0006352; P:DNA-dependent transcription, initiation
GO:0006355; P:regulation of transcription, DNA-dependent
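Each Gene Ontology line above pairs an accession with a one-letter aspect code and a term name. The letters follow the common convention of C for cellular component, F for molecular function, and P for biological process. A small sketch of splitting such a line, using the hypothetical helper `parse_go_line`:

```python
from typing import Tuple

# One-letter aspect codes as used in the lines above.
ASPECTS = {
    "C": "cellular component",
    "F": "molecular function",
    "P": "biological process",
}

def parse_go_line(line: str) -> Tuple[str, str, str]:
    """Return (accession, aspect, term name) for one GO line."""
    # Split once on the first ';' so commas or colons in the term name survive.
    accession, _, rest = line.partition(";")
    aspect_code, _, name = rest.strip().partition(":")
    return accession.strip(), ASPECTS[aspect_code], name.strip()

print(parse_go_line("GO:0006352; P:DNA-dependent transcription, initiation"))
```

Using `partition` instead of `split` keeps term names that themselves contain commas, such as the two transcription terms above, in one piece.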
KEGG
InterPros
IPR001487; Bromodomain.
IPR018359; Bromodomain_CS.
IPR011177; TAF1_animal.
IPR009067; TAF_II_230-bd.
IPR022591; TFIID_sub1_DUF3591.
Pfam
PF00439; Bromodomain; 2.
PF12157; DUF3591; 1.
PF09247; TBP-binding; 1.
SMARTs
SM00297; BROMO; 2.
Prosites
PS00633; BROMODOMAIN_1; 2.
PS50014; BROMODOMAIN_2; 2.
Prints
PR00503; BROMODOMAIN.
Created Date: 20-Feb-2013