EKS-SAC-00129
Eukaryotic Protein Kinase & Protein Phosphatase Database
Tag: Content
EKPD ID: EKS-SAC-00129
Classification
Group/Family | Score | E-Value | Start | End | Domain Length
Atypical/TAF1 | N/A | N/A | N/A | N/A | N/A
Status: Reviewed
Ensembl Protein: YGR274C
UniProt Accession: P46677; D6VV51;
Protein Name: Transcription initiation factor TFIID subunit 1
Protein Synonyms/Alias: TAFII-130; TAFII-145; TBP-associated factor 1; TBP-associated factor 145 kDa;
Gene Name: TAF1
Gene Synonyms/Alias: TAF1; TAF130, TAF145; YGR274C; G9374;
Ensembl Information
Ensembl Gene ID | Ensembl Protein ID | Ensembl Transcript ID
YGR274C | YGR274C | YGR274C
Organism: Saccharomyces cerevisiae
Functional Description: Functions as a component of the DNA-binding general transcription factor complex TFIID. Binding of TFIID to a promoter (with or without TATA element) is the initial step in pre-initiation complex (PIC) formation. TFIID plays a key role in the regulation of gene expression by RNA polymerase II through different activities such as transcription activator interaction, core promoter recognition and selectivity, TFIIA and TFIIB interaction, chromatin modification (histone acetylation by TAF1), facilitation of DNA opening and initiation of transcription.
Protein Length: 1066
Protein Sequence
(FASTA)
MVKQQGSGKT NLANEDEAYE AIFGGEFGSL EIGSYIGGDE GANSKDYTEH LPDAVDFEDE 60
DELADDDDDL PEESDANLHP AMMTMGAYDD VNENGAVLGI DSNSLNMQLP EINGDLSQQF 120
ILEDDGGTPA TSNALFMGMD ANEIHLATET GVLDGSGANE IGHSQLSIGG VNGNDMSING 180
GFIMEPDMSD GKHKKATKLD LINHEKYLLK KYFPDFEKGK ILKWNKLIYR RSVPYHWHSE 240
ISRVKKPFMP LNLKFKVQQD DKRLFNSRTI SYVAPIYQGK NNLLQSNSSA SRRGLIHVSI 300
DELFPIKEQQ KKRKIIHDEK TISEDLLIAT DDWDQEKIIN QGTSSTATLA DSSMTPNLKF 360
SGGYKLKSLI EDVAEDWQWD EDMIIDAKLK ESKHAELNMN DEKLLLMIEK TNNLAQQKQQ 420
LDSSNLILPL NETILQQKFN LSNDDKYQIL KKTHQTKVRS TISNLNIQHS QPAINLQSPF 480
YKVAVPRYQL RHFHRENFGS HIRPGTKIVF SKLKARKRKR DKGKDVKESF STSQDLTIGD 540
TAPVYLMEYS EQTPVALSKF GMANKLINYY RKANEQDTLR PKLPVGETHV LGVQDKSPFW 600
NFGFVEPGHI VPTLYNNMIR APVFKHDISG TDFLLTKSSG FGISNRFYLR NINHLFTVGQ 660
TFPVEEIPGP NSRKVTSMKA TRLKMIIYRI LNHNHSKAIS IDPIAKHFPD QDYGQNRQKV 720
KEFMKYQRDG PEKGLWRLKD DEKLLDNEAV KSLITPEQIS QVESMSQGLQ FQEDNEAYNF 780
DSKLKSLEEN LLPWNITKNF INSTQMRAMI QIHGVGDPTG CGEGFSFLKT SMKGGFVKSG 840
SPSSNNNSSN KKGTNTHSYN VAQQQKAYDE EIAKTWYTHT KSLSISNPFE EMTNPDEINQ 900
TNKHVKTDRD DKKILKIVRK KRDENGIIQR QTIFIRDPRV IQGYIKIKEQ DKEDVNKLLE 960
EDTSKINNLE ELEKQKKLLQ LELANLEKSQ QRRAARQNSK RNGGATRTEN SVDNGSDLAG 1020
VTDGKAARNK GKNTTRRCAT CGQIGHIRTN KSCPMYSSKD NPASPK 1066
Nucleotide Sequence
(FASTA)
ATGGTAAAGC AGCAGGGATC CGGCAAGACC AACTTGGCCA ACGAAGATGA AGCATATGAA 60
GCTATTTTTG GCGGAGAGTT TGGCTCTTTA GAAATCGGGT CATACATTGG CGGGGATGAA 120
GGTGCCAATT CAAAGGACTA TACGGAGCAT TTGCCGGATG CTGTAGATTT TGAAGATGAA 180
GATGAACTTG CTGATGACGA TGACGATTTG CCAGAAGAAT CTGATGCTAA TTTGCATCCA 240
GCTATGATGA CTATGGGCGC GTATGATGAT GTAAACGAGA ACGGTGCCGT ACTCGGTATC 300
GACTCAAATA GTTTGAATAT GCAACTGCCT GAAATTAATG GTGATTTGTC TCAACAGTTT 360
ATTTTGGAGG ATGATGGGGG TACTCCCGCA ACTAGCAATG CTTTGTTTAT GGGAATGGAT 420
GCAAATGAAA TTCATCTCGC CACTGAAACT GGAGTTCTTG ATGGTAGTGG CGCAAATGAA 480
ATTGGGCATT CTCAACTTTC CATTGGTGGC GTTAATGGAA ATGATATGTC GATAAATGGT 540
GGATTTATCA TGGAACCAGA TATGTCAGAT GGCAAGCATA AGAAAGCCAC CAAATTAGAC 600
TTGATAAACC ATGAGAAGTA TCTTCTAAAA AAATACTTTC CTGATTTTGA AAAGGGTAAA 660
ATTTTAAAAT GGAACAAGCT GATTTATAGA AGATCTGTTC CTTATCATTG GCACAGTGAA 720
ATATCTAGGG TAAAGAAACC GTTTATGCCT TTAAATTTGA AATTCAAGGT TCAACAGGAT 780
GATAAGAGGC TATTCAACTC AAGGACAATA TCTTACGTCG CTCCGATTTA TCAAGGGAAA 840
AACAATTTAC TTCAAAGTAA CTCTTCTGCA TCCCGAAGAG GTTTAATTCA TGTTTCCATT 900
GATGAACTTT TCCCTATCAA AGAGCAACAA AAAAAAAGAA AGATTATTCA TGATGAAAAG 960
ACCATATCGG AAGACTTGCT TATTGCCACT GATGATTGGG ACCAAGAAAA AATCATCAAC 1020
CAAGGTACTT CATCAACAGC AACCTTGGCG GATTCGTCTA TGACACCCAA CTTAAAGTTC 1080
TCCGGCGGTT ATAAATTGAA GAGCTTGATT GAGGATGTTG CTGAAGATTG GCAGTGGGAT 1140
GAAGACATGA TCATTGATGC AAAATTGAAA GAGTCTAAAC ATGCTGAATT AAATATGAAT 1200
GACGAAAAAC TGTTGCTGAT GATTGAGAAA ACAAATAATT TAGCGCAGCA AAAGCAACAA 1260
CTGGATAGTA GTAACTTGAT ATTGCCCCTC AACGAAACTA TTTTGCAACA AAAATTCAAT 1320
TTATCTAATG ATGATAAATA TCAAATCTTG AAAAAAACCC ATCAAACAAA AGTTCGTTCT 1380
ACCATCTCGA ACTTAAATAT TCAGCATTCT CAACCTGCGA TTAATTTACA ATCTCCATTT 1440
TACAAAGTGG CTGTTCCTAG ATACCAGTTG AGGCATTTCC ACCGTGAAAA CTTTGGTTCG 1500
CACATCAGAC CAGGTACTAA AATTGTCTTC AGTAAGTTAA AAGCGCGTAA AAGGAAAAGA 1560
GATAAAGGTA AAGATGTCAA AGAATCATTT TCTACATCTC AAGATCTAAC CATCGGGGAT 1620
ACTGCTCCTG TTTATTTAAT GGAATATTCT GAACAAACAC CGGTAGCTTT GTCTAAATTT 1680
GGTATGGCCA ATAAATTAAT TAATTATTAT CGGAAAGCCA ATGAACAAGA TACTCTAAGG 1740
CCCAAATTGC CTGTTGGCGA AACTCACGTT TTGGGAGTTC AAGACAAATC ACCATTTTGG 1800
AACTTTGGAT TTGTTGAACC TGGTCATATC GTCCCCACAT TATACAATAA CATGATTAGA 1860
GCACCCGTTT TCAAACATGA TATATCAGGA ACAGATTTTC TTCTGACAAA AAGTTCCGGA 1920
TTTGGTATAA GCAATCGATT TTACTTACGT AATATTAACC ATCTTTTTAC GGTTGGACAA 1980
ACTTTTCCTG TCGAGGAGAT TCCCGGACCT AATTCAAGAA AGGTGACATC AATGAAAGCT 2040
ACAAGACTGA AAATGATTAT TTATAGAATT CTAAACCATA ATCACAGCAA GGCAATTTCT 2100
ATTGATCCCA TTGCAAAGCA CTTTCCCGAC CAAGATTATG GACAAAACAG ACAAAAAGTG 2160
AAGGAGTTCA TGAAATATCA AAGAGATGGT CCTGAAAAAG GTCTGTGGAG GCTTAAAGAT 2220
GATGAAAAAT TGTTAGACAA TGAAGCAGTG AAAAGTTTAA TTACGCCAGA GCAGATCAGC 2280
CAAGTTGAAT CAATGAGTCA GGGCTTACAA TTCCAAGAAG ATAATGAAGC ATATAATTTT 2340
GATTCTAAGC TAAAATCTCT AGAAGAAAAC TTGTTACCAT GGAACATTAC AAAAAATTTT 2400
ATTAATTCAA CACAAATGCG AGCTATGATA CAAATACATG GTGTTGGTGA CCCAACGGGC 2460
TGTGGTGAAG GTTTTTCTTT CTTGAAAACT TCAATGAAAG GTGGTTTTGT TAAGTCTGGT 2520
TCTCCTTCCA GTAATAACAA TAGTTCAAAC AAAAAAGGCA CTAACACGCA TAGCTACAAT 2580
GTGGCTCAAC AGCAAAAAGC TTACGACGAA GAAATTGCGA AGACCTGGTA TACACATACA 2640
AAATCGTTGA GCATAAGCAA TCCTTTTGAG GAAATGACCA ATCCTGATGA GATTAATCAG 2700
ACCAACAAGC ATGTTAAGAC GGATAGAGAT GATAAGAAAA TTCTGAAGAT TGTTAGAAAG 2760
AAAAGAGATG AAAATGGTAT AATTCAAAGG CAGACAATTT TCATAAGAGA TCCTAGGGTC 2820
ATTCAAGGAT ATATAAAAAT CAAAGAACAG GATAAAGAAG ATGTTAACAA ATTGTTAGAG 2880
GAGGACACTT CAAAGATAAA TAACTTGGAA GAACTTGAGA AACAGAAAAA ACTGTTGCAA 2940
CTAGAATTAG CTAATTTGGA AAAATCACAG CAACGTAGAG CAGCAAGGCA AAATTCGAAG 3000
AGAAATGGTG GTGCCACGAG AACAGAAAAC TCTGTGGATA ATGGTAGCGA CCTTGCCGGT 3060
GTAACTGACG GGAAAGCAGC CAGGAATAAA GGTAAGAACA CTACAAGGAG ATGTGCTACA 3120
TGCGGACAAA TCGGGCACAT TAGAACAAAT AAATCTTGTC CAATGTATAG CAGTAAAGAT 3180
AACCCTGCTT CACCAAAGTA G 3201
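Both sequences above are laid out in 10-character blocks with a running position counter at the end of each line. The following is a minimal sketch, not part of the database, of how such lines could be joined into a contiguous string and how the two stated lengths can be cross-checked. The helper names `join_blocks` and `cds_matches_protein` are hypothetical, and the check assumes the nucleotide sequence is a coding sequence that ends in a single stop codon.

```python
import re


def join_blocks(lines):
    """Join sequence lines printed as 10-character blocks plus a trailing counter."""
    parts = []
    for line in lines:
        # Remove the trailing running counter (e.g. " 60"), then the block spaces.
        body = re.sub(r"\s+\d+\s*$", "", line)
        parts.append(body.replace(" ", ""))
    return "".join(parts)


def cds_matches_protein(cds_len, protein_len):
    """True if the CDS length is 3 nt per residue plus one stop codon (assumption)."""
    return cds_len == 3 * (protein_len + 1)


# Last protein line and last nucleotide line of this record.
protein_tail = join_blocks(["VTDGKAARNK GKNTTRRCAT CGQIGHIRTN KSCPMYSSKD NPASPK 1066"])
cds_tail = join_blocks(["AACCCTGCTT CACCAAAGTA G 3201"])

print(len(protein_tail))                 # 46 residues on the final protein line
print(cds_tail[-3:])                     # TAG, a stop codon
print(cds_matches_protein(3201, 1066))   # True: 3 * (1066 + 1) = 3201
```

Under that assumption the record is internally consistent: 1066 residues plus one stop codon account for all 3201 nucleotides.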
Domain Profile
N/A
Domain Sequence
(FASTA)
N/A
Keyword: Acyltransferase; Chromatin regulator; Coiled coil; Complete proteome; Direct protein sequencing; Nucleus; Phosphoprotein; Reference proteome; Transcription; Transcription regulation; Transferase.
Sequence Source: Ensembl
Orthology
Ortholog group
EKS-AIM-00466; Ailuropoda melanoleuca
EKS-ANC-00493; Anolis carolinensis
EKS-BOT-00495; Bos taurus
EKS-CAE-00438; Caenorhabditis elegans
EKS-CAJ-00507; Callithrix jacchus
EKS-CAF-00500; Canis familiaris
EKS-CIS-00253; Ciona savignyi
EKS-DAR-00926; Danio rerio
EKS-DRM-00234; Drosophila melanogaster
EKS-EQC-00492; Equus caballus
EKS-FEC-00476; Felis catus
EKS-GAG-00419; Gallus gallus
EKS-GAA-00606; Gasterosteus aculeatus
EKS-GOG-00490; Gorilla gorilla
EKS-HOS-00514; Homo sapiens
EKS-ICT-00471; Ictidomys tridecemlineatus
EKS-LAC-00513; Latimeria chalumnae
EKS-LOA-00505; Loxodonta africana
EKS-MAM-00498; Macaca mulatta
EKS-MEG-00407; Meleagris gallopavo
EKS-MOD-00487; Monodelphis domestica
EKS-MUM-00537; Mus musculus
EKS-MUP-00494; Mustela putorius furo
EKS-MYL-00492; Myotis lucifugus
EKS-NOL-00450; Nomascus leucogenys
EKS-ORN-00629; Oreochromis niloticus
EKS-ORA-00434; Ornithorhynchus anatinus
EKS-ORC-00472; Oryctolagus cuniculus
EKS-ORL-00582; Oryzias latipes
EKS-OTG-00511; Otolemur garnettii
EKS-PES-00439; Pelodiscus sinensis
EKS-POA-00480; Pongo abelii
EKS-RAN-00510; Rattus norvegicus
EKS-SAH-00469; Sarcophilus harrisii
EKS-TAG-00565; Taeniopygia guttata
EKS-TAR-00623; Takifugu rubripes
EKS-TEN-00624; Tetraodon nigroviridis
EKS-XET-00652; Xenopus tropicalis
EKS-SCP-00124; Schizosaccharomyces pombe
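The ortholog IDs in this record share the layout `EKS-<three-letter code>-<five digits>`. In every entry listed here the code matches the first two letters of the genus plus the first letter of the species epithet. The sketch below illustrates that observed pattern. It is an inference from this record only, not a documented EKPD rule, and `species_code` and `split_ekpd_id` are hypothetical helpers.

```python
def species_code(binomial):
    """Three-letter code: first two letters of the genus + first letter of the species."""
    genus, species = binomial.split()[:2]  # ignore a subspecies name such as "furo"
    return (genus[:2] + species[0]).upper()


def split_ekpd_id(ekpd_id):
    """Split an ID such as 'EKS-HOS-00514' into (prefix, code, serial number)."""
    prefix, code, serial = ekpd_id.split("-")
    return prefix, code, int(serial)


# A few species/ID pairs taken from the ortholog list of this record.
examples = {
    "Homo sapiens": "EKS-HOS-00514",
    "Mustela putorius furo": "EKS-MUP-00494",
    "Schizosaccharomyces pombe": "EKS-SCP-00124",
    "Saccharomyces cerevisiae": "EKS-SAC-00129",
}

for name, ekpd_id in examples.items():
    _, code, _ = split_ekpd_id(ekpd_id)
    print(name, code, species_code(name) == code)
```

A check like this can help confirm that a species list and an ID list still line up after they have been extracted separately.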
Gene Ontology
GO:0005669; C:transcription factor TFIID complex
GO:0003682; F:chromatin binding
GO:0004402; F:histone acetyltransferase activity
GO:0003676; F:nucleic acid binding
GO:0032947; F:protein complex scaffold
GO:0017025; F:TBP-class protein binding
GO:0008270; F:zinc ion binding
GO:0006355; P:regulation of transcription, DNA-dependent
GO:0051123; P:RNA polymerase II transcriptional preinitiation complex assembly
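Each Gene Ontology line above has the form `GO:<id>; <aspect letter>:<term>`, where the letters C, F and P follow the usual convention for cellular component, molecular function and biological process. A minimal sketch of splitting such a line into its parts, with the hypothetical helper `parse_go_line`:

```python
ASPECTS = {
    "C": "cellular component",
    "F": "molecular function",
    "P": "biological process",
}


def parse_go_line(line):
    """Parse 'GO:0005669; C:term name' into (go_id, aspect, term)."""
    go_id, rest = line.split(";", 1)
    # Split on the first colon only, so colons inside a term name are kept.
    letter, term = rest.strip().split(":", 1)
    return go_id.strip(), ASPECTS[letter], term.strip()


print(parse_go_line("GO:0004402; F:histone acetyltransferase activity"))
# ('GO:0004402', 'molecular function', 'histone acetyltransferase activity')
```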
KEGG
sce:YGR274C;
InterPros
IPR022591; TFIID_sub1_DUF3591.
IPR001878; Znf_CCHC.
Pfam
PF12157; DUF3591; 1.
SMARTs
SM00343; ZnF_C2HC; 1.
Prosites
Prints
Created Date: 20-Feb-2013