EKS-TAG-00565
Eukaryotic Protein Kinase & Protein Phosphatase Database
TagContent
EKPD IDEKS-TAG-00565
Classification
Group/FamilyScoreE-ValueStartEndDomain Length
Atypical/TAF1N/AN/AN/AN/AN/A
StatusUnreviewed
Ensembl ProteinENSTGUP00000007224
UniProt AccessionH0Z9H1;
Protein Name
Protein Synonyms/Alias
Gene NameTAF1
Gene Synonyms/Alias TAF1;
Ensembl Information
Ensembl Gene IDEnsembl Protein IDEnsembl Transcript ID
ENSTGUG00000006988ENSTGUP00000007224ENSTGUT00000007295
OrganismTaeniopygia guttata
Functional Description
Protein Length1889
Protein Sequence
(FASTA)
MSDSESEEEA DGGRAEPFSL AGFLFGNINE AGQLEGDSVL DKESKKHLAG LGVLGLGNLI 60
TEITASEEES AESEGAHLDE EGWVKSTEDA VDYSDITEVA EDESRRYKQA MGSLQPGRRP 120
DEDEDDYDAD CEDIDSKLMP PPPPPPVPGK KEDEKDAAAI VSEDGDGIIL PSIIAPSSAA 180
SDKVDFSSSS DSESEMGPQE ARQAESKEGK LTLPLAGIMQ RDATKQLPSV TELFPEFRPG 240
KVLRFLRLFG PGKNVPSVWR SARRKRKKKH RELAQEVQIQ EGEVVVESGV EGKSPWEYEF 300
AAPPPPEQCL SDDEITMMAP VESKFSQSTG DIDKVTDTKP KVAEWRYGPA QLWYDMLGIP 360
EDGSGFDYGF KLKEKNEEEA KGHTEEGVRG EPWGKKDDLL ADEHFLMVTQ LQWEDDVIWN 420
GEDVKHKGTK TQRASLAGWL PSSMTRNATA YNAQQEGVRF SLDEDKPWYS IFPIDNEELV 480
YGRWEDNIIW DDQAMETYLD PPVLTLDPND ENIILEIPDE KEEMTLNSPS KENKKESSLK 540
KSRILLGKTG VIKEEPQQNM SQPEVKDPWN LSNDEFYYPK QQGLRGTFGG NIIQHSIPAV 600
ELRQPFFPTH MGPMKLRQFH RPPLKKYSFG ALSQPGPHTV QPLLKHIKKK AKMREQERQA 660
SGGGEMFFMR TPQDLTGKDG DLILAEYSEE NAPLMMQVGM ATKIKNYYKR KPGKDPGAPD 720
CKYGETVYCH TSPFLGSLHP GQLLQAFENN LFRAPIYLHK MPETDFLIIR TRQGYYIREL 780
VDIFVVGQEC PLYEVPGPNS KRANTHIRDF LQVFIYRLFW KSRDRPRRIR MEDIKKAFPS 840
HSESSIRKRL KLCADFKRTG MDSNWWVLKP DFRLPTEEEI RAMVSPEQCC AYYSMIAAEQ 900
RLKDAGYGEK SFFAPEEENE EDFQMKIDDE VRTAPWNTTR AFIAAMKGKC LLEVTGVADP 960
TGCGEGFSYV KIPNKPTQQK DDKEPQPVKK TVTGTDADLR RLSLKNAKQL LRKFGVPEEE 1020
IKKLSRWEVI DVVRTMSTEQ ARSGEGPMSK FARGSRFSVA EHQERYKEEC QRIFDLQNKV 1080
LESTEILSTD TDSSSAEDSD FEEMGKNIEN MLQNKKTSSQ LSREREEQER KELQRMLLGE 1140
DNDKDRGKKD RRDKKGLSNS HKDDDTASVT SLNSSATGRR LKIYRTFRDE DGKEYVRCET 1200
VRKPAVIDAY CRIRTTKDEE FIRKFALFDE QHREEMRKER RRIQEQLRRL KRNQEKEKLK 1260
GPPEKKPKKM KERPDLKLKC GACGAIGHMR TNKFCPLYYQ TNAPPSNPVA MTEEQEEELE 1320
KTVIHNDNEE LIKVEGTKIV LGKQLIESAD EVRRKSLVLK FPKQQLPPKK KRRVGTTVHC 1380
DYLNRPHKSI HRRRTDPMVT LSSILEGIIN DIRDLPNTYP FHTPVNPKVV KDYYKIITRP 1440
MDLQTLRENV RKRQYPSREE FREHLELIVK NSATYNGPKH SLTQISQSML DLCDEKLKEA 1500
SSKEDKLARL EKAINPLLDD DDQVAFSFIL DNIVTQKMMA VPDSWPFHHP VNKKFVPDYY 1560
KVIANPMDLE TIRKNISKHK YQNRETFLDD VNLILANSIK YNGMYSQYTK TAQEIVNICY 1620
QTLAEYDEHL TQLERDISTA KEAALEEADL ESLDPMTPGP YTPQATPPDL YDTSTSLSVS 1680
HGASFYQDES NLSAMDTPIT TTGKQGTQEV EDADGDLADE EEGSAQQTQA SVLYEDLLMS 1740
DGEDDDDGSD EEGDNPFSSI QLSESGSDSD VEPNAVRPKQ PHVLQENTRM GMDNEESMMS 1800
YEGDGGETSH VMEDSNISYG SYEEPDPKSN TRDTSFSSIG GYEISEEEEE EEQQRSGPSV 1860
LSQVHLSEDE EDSEDFHSIA GDSDLDSDE 1889
Nucleotide Sequence
(FASTA)
ATGTCGGACT CGGAGAGCGA GGAGGAGGCG GACGGCGGCC GCGCCGAGCC CTTCTCCCTG 60
GCCGGCTTCC TCTTCGGCAA CATCAATGAG GCGGGGCAGC TGGAAGGGGA CAGCGTCCTC 120
GACAAGGAAT CCAAGAAGCA CCTGGCCGGG CTGGGCGTGC TGGGGCTGGG CAACCTCATC 180
ACCGAGATCA CCGCCAGCGA GGAGGAGAGC GCCGAGAGCG AGGGAGCGCA CCTGGACGAG 240
GAAGGCTGGG TTAAGAGCAC AGAGGACGCT GTTGATTACT CAGACATCAC TGAAGTGGCA 300
GAGGATGAGA GCCGTCGGTA CAAGCAGGCG ATGGGCAGCC TGCAGCCAGG TCGGAGGCCA 360
GATGAAGATG AGGATGATTA TGATGCTGAC TGTGAAGACA TTGATTCCAA GCTGATGCCA 420
CCACCACCAC CACCTCCAGT ACCTGGAAAA AAAGAAGATG AAAAGGATGC AGCTGCCATT 480
GTATCTGAAG ATGGAGACGG TATCATTTTG CCCTCCATCA TTGCTCCTTC CTCTGCTGCC 540
TCTGACAAGG TGGATTTCAG CAGCAGCTCT GATTCTGAGT CAGAGATGGG ACCTCAAGAA 600
GCCAGGCAGG CAGAATCCAA GGAAGGCAAA CTCACACTTC CTCTTGCAGG AATCATGCAG 660
CGAGATGCTA CCAAACAGTT GCCAAGTGTT ACAGAGCTCT TCCCAGAATT TCGACCAGGC 720
AAGGTGCTGC GTTTCCTGCG ACTCTTTGGC CCTGGGAAGA ACGTTCCGTC GGTCTGGCGA 780
AGTGCCCGGA GGAAACGCAA GAAGAAGCAT CGGGAGTTGG CGCAGGAAGT GCAGATACAG 840
GAGGGTGAAG TTGTGGTTGA GAGTGGAGTG GAAGGAAAAT CTCCTTGGGA GTATGAGTTT 900
GCTGCTCCTC CCCCTCCTGA GCAGTGCCTT TCAGATGATG AGATTACCAT GATGGCGCCT 960
GTGGAGTCCA AGTTCTCCCA GTCGACGGGT GACATAGACA AGGTGACAGA CACCAAGCCC 1020
AAGGTGGCCG AGTGGCGCTA TGGCCCTGCC CAGCTCTGGT ACGACATGCT GGGCATCCCT 1080
GAGGACGGCA GCGGCTTCGA TTATGGCTTC AAGCTGAAGG AGAAGAACGA GGAAGAGGCT 1140
AAAGGACACA CAGAGGAGGG GGTGAGAGGG GAACCATGGG GCAAAAAGGA TGATCTGCTA 1200
GCTGATGAGC ACTTCCTCAT GGTGACACAG CTCCAGTGGG AGGATGATGT CATCTGGAAT 1260
GGGGAGGATG TCAAGCACAA AGGGACTAAG ACACAGCGTG CGAGCTTGGC GGGCTGGCTG 1320
CCATCCAGCA TGACCAGAAA TGCCACTGCA TACAATGCCC AACAAGAGGG AGTACGATTT 1380
TCCCTGGATG AAGACAAGCC CTGGTACTCC ATTTTCCCAA TTGATAATGA AGAGTTAGTG 1440
TATGGCCGTT GGGAAGACAA TATCATCTGG GATGATCAGG CAATGGAAAC CTATTTGGAT 1500
CCCCCTGTCT TAACACTTGA TCCAAATGAT GAAAACATAA TTCTAGAAAT TCCTGATGAA 1560
AAGGAAGAGA TGACTTTGAA CTCTCCATCC AAGGAGAACA AGAAAGAATC TTCTCTAAAG 1620
AAGAGTCGGA TCCTGTTGGG AAAAACAGGT GTCATCAAGG AAGAACCACA GCAGAATATG 1680
TCTCAGCCAG AGGTGAAGGA TCCCTGGAAC CTCTCCAATG ATGAGTTTTA CTACCCCAAA 1740
CAGCAGGGAC TTCGAGGAAC CTTTGGAGGC AACATCATCC AGCACTCCAT CCCAGCAGTG 1800
GAGCTGCGCC AGCCGTTCTT TCCCACCCAC ATGGGTCCCA TGAAGCTCCG GCAATTCCAT 1860
CGGCCTCCCC TGAAGAAATA TTCCTTTGGT GCCTTGTCCC AGCCCGGGCC CCACACTGTG 1920
CAGCCTCTGC TGAAGCACAT CAAGAAGAAG GCCAAGATGA GGGAGCAGGA GCGCCAGGCC 1980
TCGGGCGGGG GGGAGATGTT CTTCATGCGC ACACCACAGG ACCTCACTGG CAAGGATGGA 2040
GATCTCATCC TTGCTGAGTA CAGTGAGGAG AACGCCCCTC TGATGATGCA GGTTGGCATG 2100
GCAACAAAGA TTAAAAATTA CTACAAGAGG AAACCTGGCA AAGATCCAGG AGCTCCAGAT 2160
TGTAAATATG GAGAGACTGT TTATTGTCAC ACTTCTCCCT TCCTGGGTTC TCTGCATCCA 2220
GGCCAGTTAC TGCAGGCATT TGAAAACAAT CTTTTCCGGG CCCCCATCTA TTTGCATAAG 2280
ATGCCTGAAA CAGATTTCCT GATCATCCGG ACAAGACAAG GTTACTACAT TCGAGAATTA 2340
GTGGATATTT TTGTAGTTGG GCAGGAGTGC CCACTCTATG AAGTACCTGG TCCCAACTCC 2400
AAGCGAGCTA ACACACATAT CAGAGACTTC CTGCAGGTTT TTATCTATCG GCTCTTCTGG 2460
AAGAGCAGGG ACCGTCCGCG GAGGATCCGC ATGGAGGACA TCAAGAAGGC CTTTCCCTCC 2520
CACTCCGAGA GCAGCATCCG CAAGCGCCTG AAGCTCTGTG CTGATTTCAA ACGCACAGGA 2580
ATGGACTCAA ACTGGTGGGT TTTAAAGCCG GATTTCAGGT TGCCAACAGA GGAGGAGATC 2640
AGAGCCATGG TGTCCCCAGA GCAGTGCTGT GCCTATTACA GCATGATTGC AGCAGAGCAA 2700
CGGCTCAAGG ATGCAGGTTA TGGGGAGAAA TCCTTCTTTG CTCCAGAAGA GGAGAATGAA 2760
GAAGATTTCC AAATGAAGAT TGATGATGAG GTGCGCACAG CTCCCTGGAA CACCACACGA 2820
GCCTTCATTG CTGCTATGAA GGGCAAGTGC CTCCTAGAGG TGACCGGTGT TGCAGATCCC 2880
ACTGGTTGTG GGGAAGGATT TTCTTATGTC AAGATTCCAA ACAAACCAAC CCAGCAGAAG 2940
GATGATAAAG AGCCTCAGCC AGTGAAGAAG ACAGTGACTG GCACAGATGC GGATCTGCGC 3000
CGACTTTCCC TCAAAAATGC CAAACAGCTT CTGCGGAAGT TTGGAGTGCC TGAGGAGGAG 3060
ATAAAGAAGC TGTCCCGGTG GGAGGTGATT GATGTGGTCC GTACGATGTC CACAGAGCAG 3120
GCACGCTCAG GGGAAGGGCC TATGAGCAAA TTCGCCCGTG GCTCTCGGTT CTCTGTGGCA 3180
GAACATCAGG AGAGATACAA GGAAGAGTGT CAGCGCATCT TCGACTTACA AAATAAAGTT 3240
TTGGAATCCA CTGAGATCCT GTCAACAGAC ACAGATAGCA GTTCAGCTGA AGACAGTGAC 3300
TTTGAAGAGA TGGGAAAGAA TATTGAGAAT ATGTTACAGA ACAAGAAAAC TAGTTCTCAG 3360
CTCTCTCGGG AAAGAGAAGA GCAGGAACGG AAGGAATTGC AAAGGATGCT CCTGGGAGAA 3420
GACAATGACA AAGACAGGGG CAAAAAGGAC AGGAGAGACA AAAAGGGGCT GTCAAACTCT 3480
CACAAGGATG ATGACACTGC CTCTGTGACT AGCCTGAATT CCTCTGCCAC CGGCCGGCGC 3540
CTGAAGATCT ATCGCACGTT CAGGGACGAG GATGGGAAGG AGTACGTGAG GTGTGAGACG 3600
GTGCGGAAAC CCGCTGTCAT CGACGCCTAC TGCCGCATAC GGACCACCAA GGATGAGGAG 3660
TTCATACGGA AATTTGCTCT GTTTGATGAG CAGCACCGTG AGGAGATGCG GAAGGAGCGG 3720
CGCAGGATCC AGGAGCAATT ACGGCGGCTA AAGCGGAACC AAGAGAAAGA GAAACTCAAG 3780
GGCCCTCCAG AAAAGAAGCC CAAGAAAATG AAAGAGCGTC CAGACTTAAA ATTAAAATGT 3840
GGAGCATGTG GTGCAATTGG GCACATGAGG ACCAATAAGT TCTGCCCTCT TTACTATCAA 3900
ACAAATGCCC CACCTTCTAA TCCTGTTGCA ATGACAGAGG AGCAGGAGGA GGAGCTGGAA 3960
AAAACAGTCA TTCACAATGA CAACGAAGAA CTCATCAAAG TTGAAGGAAC AAAAATTGTC 4020
CTGGGAAAAC AACTGATTGA GAGTGCTGAT GAGGTTCGCA GGAAATCCCT GGTACTGAAG 4080
TTTCCTAAGC AGCAGCTTCC TCCAAAGAAG AAGCGTCGTG TAGGGACAAC GGTGCACTGC 4140
GATTATCTCA ACCGCCCCCA TAAATCCATC CACCGGCGGA GAACAGATCC CATGGTGACA 4200
CTCTCATCCA TCTTGGAGGG CATCATCAAT GACATCAGGG ATCTTCCCAA TACATACCCT 4260
TTTCACACGC CTGTAAATCC GAAAGTTGTC AAAGATTATT ATAAGATCAT TACCCGGCCC 4320
ATGGATTTAC AGACCCTGCG TGAAAATGTC CGTAAGAGGC AGTACCCATC CCGGGAGGAG 4380
TTCAGGGAAC ACCTGGAGCT CATTGTGAAG AACAGTGCCA CATACAATGG GCCAAAGCAC 4440
TCACTGACAC AGATATCCCA GTCCATGCTC GACCTGTGTG ATGAAAAGCT AAAAGAGGCA 4500
AGTTCTAAGG AAGATAAACT GGCTCGATTA GAAAAAGCAA TTAATCCTCT CCTGGATGAT 4560
GATGATCAAG TGGCCTTCTC CTTCATTTTG GATAACATTG TGACTCAAAA GATGATGGCA 4620
GTTCCAGATT CTTGGCCGTT TCACCATCCA GTTAACAAAA AGTTTGTTCC TGATTATTAC 4680
AAGGTGATTG CTAATCCAAT GGATCTGGAG ACCATCCGCA AGAATATCTC CAAACACAAA 4740
TACCAGAACA GGGAGACTTT CCTGGATGAT GTTAACCTCA TCCTTGCCAA CAGCATTAAG 4800
TACAATGGTA TGTACAGCCA GTACACAAAA ACAGCCCAGG AGATTGTAAA TATCTGTTAC 4860
CAGACTTTAG CTGAGTATGA TGAGCATCTG ACTCAACTTG AGAGAGACAT CTCCACTGCT 4920
AAAGAAGCAG CACTGGAGGA GGCAGATCTG GAAAGTCTTG ATCCTATGAC CCCTGGTCCC 4980
TACACTCCAC AGGCAACACC ACCAGATTTG TATGATACCA GCACTTCCCT CAGTGTGTCC 5040
CATGGTGCTT CATTCTATCA AGATGAAAGC AATTTGTCTG CTATGGACAC TCCCATCACT 5100
ACCACAGGGA AGCAAGGAAC TCAGGAAGTT GAAGATGCAG ATGGTGACCT TGCTGATGAA 5160
GAAGAAGGAT CAGCGCAGCA GACCCAGGCC AGTGTCCTGT ATGAAGATTT GCTTATGTCT 5220
GATGGAGAGG ATGATGATGA TGGGAGTGAT GAAGAGGGAG ATAATCCTTT CTCGTCTATC 5280
CAGCTGAGTG AGAGTGGCAG TGACTCAGAT GTGGAGCCCA ATGCAGTGAG ACCCAAACAA 5340
CCTCATGTTC TTCAAGAGAA CACACGGATG GGCATGGACA ATGAAGAAAG CATGATGTCC 5400
TATGAAGGAG ATGGTGGGGA GACATCTCAT GTTATGGAGG ACAGCAATAT CAGTTATGGC 5460
AGCTACGAAG AACCAGACCC AAAGTCCAAC ACAAGAGACA CAAGTTTCAG CAGCATTGGA 5520
GGCTATGAGA TCTCAGAAGA GGAGGAAGAG GAAGAGCAGC AGCGGTCTGG GCCAAGTGTG 5580
TTAAGTCAGG TCCATCTGTC TGAGGATGAG GAGGACAGCG AGGACTTTCA CTCCATCGCA 5640
GGAGACAGTG ACCTGGACTC AGATGAATGA 5670
Domain Profile
N/A
Domain Sequence
(FASTA)
N/A
KeywordBromodomain; Complete proteome; Reference proteome.
Sequence SourceEnsembl
Orthology
Ortholog group
Ailuropoda melanoleuca"; ?>Anolis carolinensis"; ?>Bos taurus"; ?>Caenorhabditis elegans"; ?>Callithrix jacchus"; ?>Canis familiaris"; ?>Ciona intestinalis"; ?>Ciona savignyi"; ?>Danio rerio"; ?>Drosophila melanogaster"; ?>Equus caballus"; ?>Felis catus"; ?>Gallus gallus"; ?>Gasterosteus aculeatus"; ?>Gorilla gorilla"; ?>Homo sapiens"; ?>Ictidomys tridecemlineatus"; ?>Latimeria chalumnae"; ?>Loxodonta africana"; ?>Macaca mulatta"; ?>Meleagris gallopavo"; ?>Monodelphis domestica"; ?>Mus musculus"; ?>Mustela putorius furo"; ?>Myotis lucifugus"; ?>Nomascus leucogenys"; ?>Oreochromis niloticus"; ?>Ornithorhynchus anatinus"; ?>Oryctolagus cuniculus"; ?>Oryzias latipes"; ?>Otolemur garnettii"; ?>Pelodiscus sinensis"; ?>Pongo abelii"; ?>Rattus norvegicus"; ?>Sarcophilus harrisii"; ?>Takifugu rubripes"; ?>Tetraodon nigroviridis"; ?>Xenopus tropicalis"; ?>Saccharomyces cerevisiae"; ?>Schizosaccharomyces pombe"; ?>
EKS-AIM-00466
EKS-ANC-00493
EKS-BOT-00495
EKS-CAE-00438
EKS-CAJ-00507
EKS-CAF-00500
EKS-CII-00276
EKS-CIS-00253
EKS-DAR-00926
EKS-DRM-00234
EKS-EQC-00492
EKS-FEC-00476
EKS-GAG-00419
EKS-GAA-00606
EKS-GOG-00490
EKS-HOS-00514
EKS-ICT-00471
EKS-LAC-00513
EKS-LOA-00505
EKS-MAM-00498
EKS-MEG-00407
EKS-MOD-00487
EKS-MUM-00537
EKS-MUP-00494
EKS-MYL-00492
EKS-NOL-00450
EKS-ORN-00629
EKS-ORA-00434
EKS-ORC-00472
EKS-ORL-00582
EKS-OTG-00511
EKS-PES-00439
EKS-POA-00480
EKS-RAN-00510
EKS-SAH-00469
EKS-TAR-00623
EKS-TEN-00624
EKS-XET-00652
EKS-SAC-00129
EKS-SCP-00124
Gene Ontology
GO:0045120; C:pronucleus
GO:0005669; C:transcription factor TFIID complex
GO:0003677; F:DNA binding
GO:0006352; P:DNA-dependent transcription, initiation
GO:0006355; P:regulation of transcription, DNA-dependent
KEGG
InterPros
IPR001487; Bromodomain.
IPR018359; Bromodomain_CS.
IPR011177; TAF1_animal.
IPR009067; TAF_II_230-bd.
IPR022591; TFIID_sub1_DUF3591.
Pfam
PF00439; Bromodomain; 2.
PF12157; DUF3591; 1.
PF09247; TBP-binding; 1.
SMARTs
SM00297; BROMO; 2.
Prosites
PS00633; BROMODOMAIN_1; 2.
PS50014; BROMODOMAIN_2; 2.
Prints
PR00503; BROMODOMAIN.
Created Date20-Feb-2013