EKS-EQC-00492
Eukaryotic Protein Kinase & Protein Phosphatase Database
TagContent
EKPD IDEKS-EQC-00492
Classification
Group/FamilyScoreE-ValueStartEndDomain Length
Atypical/TAF1N/AN/AN/AN/AN/A
StatusUnreviewed
Ensembl ProteinENSECAP00000005787
UniProt AccessionF7DCN3;
Protein Name
Protein Synonyms/Alias
Gene NameTAF1
Gene Synonyms/Alias TAF1;
Ensembl Information
Ensembl Gene IDEnsembl Protein IDEnsembl Transcript ID
ENSECAG00000006365ENSECAP00000005777ENSECAT00000007820
ENSECAG00000006365ENSECAP00000005787ENSECAT00000007830
OrganismEquus caballus
Functional Description
Protein Length1893
Protein Sequence
(FASTA)
MGPDWDLLLP TAAAGTAAAI MSDTDSDEDS GGGGSFSLTG FLFGNINGAG QLEGESVLDD 60
ECKKHLAGLG ALGLGSLITE LTANEELTGI DSGLVNDEGW IRSTEDAVDY SDINEVAEDE 120
SRRYQQTLGS LQPLCRSDYD EDDYDADCED IDCKLMPPPP PPPGPMKKDK DQDAITGVSE 180
DGEGIILPSI IAPSSLASEK VDFSSSSDSE SEMGPQEATQ ADSEDGKLTL PLAGIMQHDA 240
TKLLPSVTEL FPEFRPGKVL RFLRLFGPGK NVPSVWRSAR RKRKKKHREL IQEEQAQEVE 300
CSVESEVNQK SLWNYDYAPP PPPEQCLSDD EITMMAPMES KFSQSTGDID KVTDTKPRVA 360
EWRYGPAQLW YDMLGVPEDG SGFDYGFKLR KMEHEPVIKC RMIEDFRKLE ENIGTDLLAD 420
ENFLMVTQLH WEDDIIWDGE DVKHKGTKPQ RASLAGWLPS SMTRNAMAYN VQQGFAATLD 480
DDKPWYSIFP IDNEELVYGR WEDNIIWDAQ AMPHLLEPPV LTLDPNDENL ILEIPDEKEE 540
ATSNSPSKES KKESSLKKSR ILLGKTGVIK EEPQQNMSQP EVKDPWNLSN DEYYYPKQQG 600
LRGTFGGNII QHSIPAVELR QPFFPTHMGP IKLRQFHRPP LKKYSFGALS QPGPHSVQPL 660
LKHIKKKAKM REQERQASGG GEMFFMRTPQ DLTGKDGDLI LAEYSEENGP LMMQVGMATK 720
IKNYYKRKPG KDPGAPDCKY GETVYCHTSP FLGSLHPGQL LQAFENNLFR APIYLHKMPE 780
TDFLIIRTRQ GYYIRELVDI FVVGQQCPLF EVPGPNSKRA NTHIRDFLQV FIYRLFWKSK 840
DRPRRIRMED IKKAFPSHSE SSIRKRLKLC ADFKRTGMDS NWWVLKSDFR LPTEEEIRAM 900
VSPEQCCAYY SMIAAEQRLK DAGYGEKSFF APEEENEEDF QMKIDDEVRT APWNTTRAFI 960
AAMKGKCLLE VTGVADPTGC GEGFSYVKIP NKPTQQKDDK EPQPVKKTVT GTDADLRRLS 1020
LKNAKQLLRK FGVPEEEIKK LSRWEVIDVV RTMSTEQARS GEGPMSKFAR GSRFSVAEHQ 1080
ERYKEECQRI FDLQNKVLSS TEVLSTDTDS SSAEDSDFEE MGKNIENMLQ NKKTSSQLSR 1140
EREEQERKEL QRMLLAAGSA ASGNNHRDDD TASVTSLNSS ATGRCLKIYR TFRDEEGKEY 1200
VRCETVRKPA VIDAYVRIRT TKDEEFIRKF ALFDEQHREE MRKERRRIQE QLRRLKRNQE 1260
KEKLKGPPEK KPKKMKERPD LKLKCGACGA IGHMRTNKFC PLYYQTNAPP SNPVAMTEEQ 1320
EEELEKTVIH NDNEELIKVE GTKIVLGKQL IESADEVRRK SLVLKFPKQQ LPPKKKRRVG 1380
TTVHCDYLNR PHKSIHRRRT DPMVTLSSIL ESIINDMRDL PNTYPFHTPV NAKVVKDYYK 1440
IITRPMDLQT LRENVRKRLY PSREEFREHL ELIVKNSATY NGPKHSLTQI SQSMLDLCDE 1500
KLKEKEDKLA RLEKAINPLL DDDDQVAFSF ILDNIVTQKM MAVPDSWPFH HPVNKKFVPD 1560
YYKVIVSPMD LETIRKNISK HKYQSRESFL DDVNLILANS VKYNGPESQY TKTAQEIVNV 1620
CYQTLTEYDE HLTQLEKDIC TAKEAALEEA ELESLDPMTP GPYTPQPPDL YDTNTSLSMS 1680
RDASVFQDES NMSVLDIPTA TPEKQVAQEG EDADGDLADE EEGTVQQPQA SVLYEDLLMS 1740
EGEDDEEDAG SDEEGDNPFS AIQLSESGSD SDVGSGGIRP KQPRMLQENT RMGMENEESM 1800
MSYEGDGGEA SHGLEDSNIS YGSYEEPDPK SNTQDTSFSS IGGYEVSEEE EDEEEEEQRS 1860
GPSVLSQVHL SEDEEDSEDF HSIGGDSDLD SDE 1893
Nucleotide Sequence
(FASTA)
ATGGGACCAG ACTGGGATTT GCTGCTGCCC ACAGCCGCTG CCGGCACCGC AGCCGCCATC 60
ATGTCAGATA CCGACAGCGA TGAAGATTCT GGTGGAGGCG GCTCGTTTTC TTTAACCGGT 120
TTCCTTTTCG GCAACATCAA TGGAGCTGGG CAGCTGGAGG GGGAAAGCGT CTTGGATGAT 180
GAATGTAAGA AGCACTTGGC AGGCTTGGGG GCTTTGGGGC TGGGCAGCCT GATCACTGAA 240
CTCACGGCAA ATGAAGAATT GACTGGGATT GACAGTGGCC TGGTAAATGA TGAAGGATGG 300
ATCAGGAGTA CAGAAGATGC TGTGGACTAT TCAGACATCA ATGAGGTGGC AGAAGATGAA 360
AGCCGAAGAT ACCAGCAGAC ATTGGGGAGC TTGCAGCCCC TTTGCCGTTC AGATTATGAT 420
GAAGATGACT ATGATGCTGA TTGTGAAGAC ATTGATTGCA AGTTGATGCC TCCTCCACCT 480
CCACCCCCAG GACCAATGAA GAAGGATAAG GACCAGGATG CTATTACCGG TGTGTCTGAG 540
GATGGAGAAG GCATCATCTT GCCCTCCATC ATTGCCCCTT CCTCTTTGGC CTCAGAGAAA 600
GTGGACTTCA GTAGTTCCTC TGACTCAGAA TCTGAGATGG GACCTCAGGA AGCAACACAG 660
GCAGACTCGG AGGATGGAAA GCTGACCCTA CCATTGGCTG GGATTATGCA GCATGATGCC 720
ACCAAGCTGT TGCCAAGTGT CACAGAACTT TTCCCAGAAT TTCGGCCTGG GAAGGTGTTA 780
CGCTTCCTCC GTCTTTTTGG ACCAGGGAAG AATGTCCCAT CTGTTTGGCG GAGTGCTCGG 840
AGAAAGAGGA AAAAGAAGCA CCGTGAGCTG ATACAGGAAG AGCAGGCCCA GGAGGTGGAG 900
TGCTCAGTAG AATCAGAAGT CAACCAGAAG TCTTTGTGGA ACTACGACTA CGCTCCACCA 960
CCACCTCCAG AACAGTGTCT CTCTGATGAT GAAATCACAA TGATGGCTCC CATGGAGTCC 1020
AAGTTTTCCC AGTCAACTGG AGATATAGAT AAAGTAACAG ATACCAAACC AAGAGTGGCT 1080
GAGTGGCGTT ATGGGCCTGC CCAACTGTGG TATGATATGC TGGGTGTCCC TGAAGATGGC 1140
AGTGGGTTTG ACTATGGCTT CAAACTGAGA AAGATGGAAC ATGAACCTGT GATAAAATGT 1200
AGAATGATAG AGGACTTTAG GAAACTTGAA GAAAACATTG GCACTGATCT TCTGGCTGAT 1260
GAAAACTTTC TGATGGTGAC ACAGCTGCAT TGGGAGGATG ATATCATCTG GGATGGGGAG 1320
GATGTCAAAC ACAAAGGGAC AAAACCTCAG CGTGCAAGCC TGGCAGGCTG GCTTCCTTCC 1380
AGTATGACTA GGAATGCCAT GGCTTACAAT GTTCAGCAAG GTTTTGCAGC CACCCTGGAT 1440
GATGACAAGC CTTGGTACTC CATTTTTCCC ATTGACAATG AGGAGCTGGT ATATGGACGC 1500
TGGGAGGACA ATATCATTTG GGATGCTCAG GCCATGCCCC ACCTGTTGGA GCCTCCTGTT 1560
TTGACACTTG ATCCCAATGA TGAGAACCTC ATTTTGGAAA TTCCTGATGA GAAGGAAGAG 1620
GCCACTTCTA ACTCCCCCTC CAAGGAGAGT AAGAAGGAAT CATCTCTGAA GAAGAGTCGA 1680
ATTCTCTTAG GGAAAACAGG CGTCATTAAG GAGGAACCAC AGCAGAACAT GTCTCAGCCA 1740
GAAGTGAAAG ATCCATGGAA TCTCTCCAAT GATGAATATT ACTACCCCAA ACAACAGGGT 1800
CTTCGAGGCA CCTTTGGAGG AAATATTATC CAGCACTCAA TTCCTGCTGT GGAATTACGG 1860
CAGCCCTTCT TTCCCACCCA CATGGGGCCC ATCAAACTCC GGCAGTTCCA TCGCCCACCT 1920
CTGAAGAAGT ACTCTTTTGG CGCGCTGTCT CAGCCAGGTC CCCACTCAGT CCAACCCTTG 1980
CTAAAGCACA TCAAAAAGAA GGCTAAGATG AGAGAACAAG AGAGGCAAGC TTCGGGTGGT 2040
GGAGAGATGT TTTTTATGCG CACACCTCAG GACCTTACAG GCAAAGATGG AGATCTTATT 2100
CTTGCAGAAT ACAGTGAGGA AAATGGACCA TTAATGATGC AGGTTGGCAT GGCAACAAAG 2160
ATAAAAAACT ACTATAAGCG GAAACCTGGA AAAGATCCCG GAGCCCCAGA TTGTAAATAT 2220
GGAGAAACTG TTTACTGCCA TACGTCTCCT TTCCTGGGCT CTCTCCATCC TGGCCAATTA 2280
CTGCAGGCGT TTGAGAACAA CCTTTTTCGT GCCCCAATTT ATCTTCATAA GATGCCAGAA 2340
ACCGACTTCC TTATCATTCG AACAAGACAA GGGTACTATA TTCGAGAATT AGTGGATATT 2400
TTTGTGGTTG GTCAGCAGTG CCCCTTGTTT GAAGTTCCTG GACCCAACTC CAAAAGGGCC 2460
AACACACATA TTCGAGACTT TCTCCAGGTT TTTATTTACC GCCTCTTCTG GAAGAGCAAA 2520
GATCGGCCAC GGCGGATCCG AATGGAAGAT ATAAAAAAAG CCTTTCCCTC CCATTCGGAA 2580
AGCAGCATTC GGAAGAGGCT AAAGCTCTGC GCTGACTTCA AACGCACAGG GATGGACTCA 2640
AACTGGTGGG TGCTGAAGTC TGATTTTCGT TTACCGACTG AAGAGGAGAT CAGAGCTATG 2700
GTGTCTCCAG AGCAGTGTTG TGCCTACTAT AGCATGATAG CTGCAGAGCA GCGATTGAAG 2760
GATGCTGGCT ATGGTGAGAA GTCCTTTTTT GCTCCAGAAG AAGAAAATGA GGAAGATTTC 2820
CAGATGAAGA TTGATGATGA GGTTCGCACT GCTCCTTGGA ACACCACAAG GGCTTTCATT 2880
GCTGCCATGA AGGGCAAGTG TCTCCTAGAG GTGACTGGGG TGGCAGATCC CACAGGGTGT 2940
GGTGAAGGAT TCTCCTATGT GAAGATTCCA AACAAACCAA CACAGCAGAA GGATGATAAG 3000
GAGCCTCAGC CAGTGAAGAA GACAGTGACA GGAACAGATG CAGACCTCCG TCGCCTTTCT 3060
CTGAAAAATG CCAAGCAGCT TCTGCGTAAA TTTGGTGTGC CTGAGGAAGA GATTAAAAAG 3120
TTGTCCCGCT GGGAAGTGAT CGATGTGGTA CGCACAATGT CAACAGAACA GGCCCGTTCT 3180
GGAGAGGGCC CCATGAGTAA ATTTGCCCGT GGATCAAGGT TTTCTGTGGC TGAGCATCAA 3240
GAGCGTTACA AAGAGGAATG TCAGCGCATC TTTGACCTAC AGAACAAGGT TTTGTCCTCA 3300
ACCGAAGTCT TATCAACTGA CACAGACAGC AGCTCAGCTG AAGACAGTGA CTTTGAAGAA 3360
ATGGGAAAGA ACATTGAGAA CATGTTGCAG AATAAGAAAA CCAGCTCTCA GCTATCACGT 3420
GAACGGGAGG AGCAGGAGCG GAAGGAACTA CAGCGGATGC TACTGGCAGC AGGTTCCGCA 3480
GCATCAGGAA ACAATCACAG AGATGATGAC ACAGCTTCTG TGACTAGCCT TAATTCTTCT 3540
GCCACTGGCC GTTGTCTCAA GATTTATCGC ACATTTCGAG ACGAAGAGGG GAAAGAGTAT 3600
GTTCGCTGTG AGACGGTCCG AAAGCCAGCT GTCATTGATG CCTATGTGCG CATACGGACC 3660
ACAAAGGATG AGGAATTCAT TCGAAAATTT GCCCTTTTTG ATGAACAGCA TCGAGAAGAG 3720
ATGCGAAAAG AACGGCGGAG GATTCAGGAG CAACTGAGGC GGCTTAAACG GAACCAGGAG 3780
AAGGAGAAGC TTAAGGGTCC TCCTGAGAAG AAGCCCAAAA AAATGAAGGA GCGTCCTGAC 3840
CTAAAACTGA AATGTGGGGC TTGTGGTGCC ATTGGGCACA TGAGGACAAA CAAATTCTGC 3900
CCCCTCTATT ATCAAACAAA TGCTCCACCT TCCAACCCTG TTGCCATGAC AGAAGAGCAG 3960
GAGGAGGAGT TGGAAAAGAC AGTCATTCAT AATGATAATG AAGAACTTAT CAAGGTTGAA 4020
GGGACCAAGA TTGTCTTGGG GAAACAGCTA ATTGAGAGTG CAGATGAGGT TCGCAGAAAA 4080
TCTCTGGTTC TCAAGTTCCC TAAACAGCAA CTTCCTCCTA AGAAGAAACG ACGAGTTGGA 4140
ACCACTGTTC ACTGTGACTA TTTGAATAGA CCTCATAAGT CCATCCACCG GCGTCGGACA 4200
GACCCCATGG TGACATTGTC ATCCATCTTG GAGTCTATCA TCAATGACAT GAGAGATCTT 4260
CCAAATACAT ACCCTTTCCA CACTCCAGTC AATGCGAAGG TTGTAAAGGA CTACTACAAA 4320
ATCATCACTC GACCAATGGA CTTACAAACA CTCCGTGAAA ATGTGCGTAA ACGCCTCTAC 4380
CCATCTCGGG AAGAGTTCAG AGAGCATTTG GAGCTAATTG TGAAGAATAG CGCAACCTAC 4440
AATGGGCCAA AGCACTCATT GACCCAGATC TCTCAATCCA TGCTGGATCT CTGTGATGAA 4500
AAACTCAAAG AGAAAGAAGA CAAATTGGCA CGTTTGGAGA AAGCCATCAA CCCCTTGCTG 4560
GATGATGATG ACCAAGTGGC ATTTTCTTTC ATTCTGGATA ACATTGTCAC CCAGAAAATG 4620
ATGGCAGTTC CAGATTCTTG GCCATTTCAT CACCCAGTTA ATAAGAAGTT TGTTCCAGAT 4680
TATTACAAAG TGATCGTCAG TCCAATGGAT TTAGAGACTA TACGTAAGAA TATCTCCAAG 4740
CACAAGTATC AGAGTCGAGA GAGCTTTCTA GATGATGTCA ACCTCATTCT GGCCAACAGT 4800
GTTAAGTACA ATGGGCCTGA GAGTCAGTAT ACTAAGACTG CCCAGGAGAT TGTGAATGTC 4860
TGTTACCAGA CGTTGACTGA GTATGATGAG CATTTGACTC AACTTGAGAA GGATATTTGT 4920
ACAGCTAAGG AAGCAGCTTT GGAGGAAGCA GAATTAGAAA GCTTGGACCC AATGACCCCA 4980
GGGCCTTACA CACCTCAGCC TCCTGATTTG TATGATACCA ACACATCCCT CAGTATGTCT 5040
CGAGATGCCT CTGTATTTCA AGATGAGAGC AATATGTCTG TCCTGGATAT TCCCACTGCC 5100
ACTCCAGAAA AGCAGGTGGC ACAGGAAGGT GAAGATGCAG ATGGTGATCT TGCAGACGAA 5160
GAGGAAGGAA CTGTGCAACA GCCTCAAGCA AGTGTCCTGT ATGAGGATTT GCTTATGTCT 5220
GAAGGAGAAG ATGATGAGGA AGATGCTGGG AGTGATGAAG AAGGAGATAA TCCTTTCTCT 5280
GCTATCCAGC TGAGTGAAAG CGGAAGTGAC TCTGATGTGG GATCTGGTGG GATAAGACCC 5340
AAACAGCCCC GCATGCTTCA GGAGAACACA AGGATGGGCA TGGAAAATGA AGAAAGCATG 5400
ATGTCCTATG AGGGAGACGG TGGGGAGGCT TCTCATGGTT TGGAGGATAG CAACATCAGT 5460
TATGGGAGCT ATGAGGAGCC TGATCCCAAG TCGAACACCC AAGACACAAG CTTCAGCAGC 5520
ATCGGTGGGT ATGAGGTATC AGAGGAGGAA GAAGATGAGG AGGAGGAAGA GCAGCGCTCT 5580
GGGCCGAGCG TACTAAGCCA GGTCCACCTG TCAGAGGACG AGGAGGACAG TGAGGATTTC 5640
CACTCGATTG GTGGGGACAG TGACTTGGAC TCTGATGAAT GA 5682
Domain Profile
N/A
Domain Sequence
(FASTA)
N/A
KeywordBromodomain; Complete proteome; Reference proteome.
Sequence SourceEnsembl
Orthology
Ortholog group
Ailuropoda melanoleuca"; ?>Anolis carolinensis"; ?>Bos taurus"; ?>Caenorhabditis elegans"; ?>Callithrix jacchus"; ?>Canis familiaris"; ?>Ciona intestinalis"; ?>Ciona savignyi"; ?>Danio rerio"; ?>Drosophila melanogaster"; ?>Felis catus"; ?>Gallus gallus"; ?>Gasterosteus aculeatus"; ?>Gorilla gorilla"; ?>Homo sapiens"; ?>Ictidomys tridecemlineatus"; ?>Latimeria chalumnae"; ?>Loxodonta africana"; ?>Macaca mulatta"; ?>Meleagris gallopavo"; ?>Monodelphis domestica"; ?>Mus musculus"; ?>Mustela putorius furo"; ?>Myotis lucifugus"; ?>Nomascus leucogenys"; ?>Oreochromis niloticus"; ?>Ornithorhynchus anatinus"; ?>Oryctolagus cuniculus"; ?>Oryzias latipes"; ?>Otolemur garnettii"; ?>Pelodiscus sinensis"; ?>Pongo abelii"; ?>Rattus norvegicus"; ?>Sarcophilus harrisii"; ?>Taeniopygia guttata"; ?>Takifugu rubripes"; ?>Tetraodon nigroviridis"; ?>Xenopus tropicalis"; ?>Saccharomyces cerevisiae"; ?>Schizosaccharomyces pombe"; ?>
EKS-AIM-00466
EKS-ANC-00493
EKS-BOT-00495
EKS-CAE-00438
EKS-CAJ-00507
EKS-CAF-00500
EKS-CII-00276
EKS-CIS-00253
EKS-DAR-00926
EKS-DRM-00234
EKS-FEC-00476
EKS-GAG-00419
EKS-GAA-00606
EKS-GOG-00490
EKS-HOS-00514
EKS-ICT-00471
EKS-LAC-00513
EKS-LOA-00505
EKS-MAM-00498
EKS-MEG-00407
EKS-MOD-00487
EKS-MUM-00537
EKS-MUP-00494
EKS-MYL-00493
EKS-NOL-00450
EKS-ORN-00629
EKS-ORA-00434
EKS-ORC-00472
EKS-ORL-00582
EKS-OTG-00511
EKS-PES-00439
EKS-POA-00480
EKS-RAN-00510
EKS-SAH-00469
EKS-TAG-00565
EKS-TAR-00623
EKS-TEN-00624
EKS-XET-00652
EKS-SAC-00129
EKS-SCP-00124
Gene Ontology
GO:0045120; C:pronucleus
GO:0005669; C:transcription factor TFIID complex
GO:0003677; F:DNA binding
GO:0006352; P:DNA-dependent transcription, initiation
GO:0006355; P:regulation of transcription, DNA-dependent
KEGG
InterPros
IPR001487; Bromodomain.
IPR018359; Bromodomain_CS.
IPR011177; TAF1_animal.
IPR009067; TAF_II_230-bd.
IPR022591; TFIID_sub1_DUF3591.
Pfam
PF00439; Bromodomain; 2.
PF12157; DUF3591; 1.
PF09247; TBP-binding; 1.
SMARTs
SM00297; BROMO; 2.
Prosites
PS00633; BROMODOMAIN_1; 2.
PS50014; BROMODOMAIN_2; 2.
Prints
PR00503; BROMODOMAIN.
Created Date20-Feb-2013