The genetic material present in the nucleus of eukaryotic cells is packaged into chromatin, which functions as a dynamic scaffold for the regulation of various nuclear processes including DNA transcription, replication, repair, chromosome segregation, and apoptosis. The basic, repetitive unit of chromatin is the nucleosome core particle, a nucleoprotein structure that contains 147 base pairs of DNA wrapped around a histone octomeric core of a histone H3/H4 heterotetramer and two histone H2A/H2B heterodimers. Together with the H1 linker histone, nucleosome core particles form arrays of compacted higher-order chromatin structures that allow the entire human genome with a linearized length of more than 1 meter to be packed into a nucleus with a micrometer-scale diameter. Compacted chromatin restricts the access of many DNA regulatory proteins, thus neccesitating the need for chromatin regulatory proteins to facilitate DNA regulatory activities. Of the proteins that regulate chromatin function, the enzymes that post-translationally modify the histone proteins, particularly on the N-terminal histone tail regions, form the largest family. Histone acetyltransferases (HATs) were the first family of histone modification enzymes characterized at the molecular, biochemical and structural levels.
HAT enzymes catalyze the transfer of an acetyl group from the co-factor acetyl-coenzyme A (acetyl-CoA) to the ε-amine of a substrate lysine side chain. Since the isolation and characterization of the Tetrahymena Gcn5 HAT in 1996 [
1], a large number of HATs have now been identified and characterized. To date, nuclear HATs can be categorized into four major families based on primary sequence homology: Gcn5/PCAF (General control nonrepressed protein 5 and p300 and CBP associated factor); MYST (named for the founding members MOZ, Ybf2/Sas3, Sas2, and Tip60); p300/CBP (protein of 300 kDa and CREB Binding Protein); and Rtt109 (Regulator of Ty1 Transposition gene product 109). Other putative HATs that have been reported include ATF-2 [
2], TAF250 [
3], nuclear steroid receptor coactivators [
4,
5], and more recently the circadian CLOCK protein [
6], but their enzymatic properties have not been rigorously characterized. In this review, we will describe the two HAT families that have been most recently characterized, p300/CBP and Rtt109, with a particular focus on their strucutres and chemistry with implications for HAT evolution and function.
A brief review of p300/CBP and Rtt109 biology
The p300 and CBP (CREB binding protein) paralogs were originally identified as binding partners of the adenovirus early-region 1A (E1A) protein [
7,
8], and the cAMP-regulated enhancer (CRE) binding proteins [
9], respectively. These proteins were later shown to function as transcriptional adaptors, and, about a decade ago, they were shown to harbor HAT activity [
10,
11]. p300 and CBP HAT domains have >90% sequence identity and are conserved in metazoans with many overlapping functions and will henceforth be referred to as p300/CBP. p300/CBP has multiple domains and, in addition to the enzymatic HAT domain, contains other protein interaction domains including three cysteine-histidine rich domains (CH1, CH2 and CH3), a KIX domain, a bromodomain, and a steroid receptor coactivator interaction domain (SID, also the SRC-1 interaction domain). p300/CBP not only catalyzes the acetylation of all four core histones, but has been reported to acetylate over 70 other proteins and itself (
Table S1). Therefore, p300/CBP appears to be a versatile, and perhaps rather general, transcriptional integrator.
Abberrent p300/CBP activity has also been observed in various diseases. For example, Rubinstein-Taybi syndrome (RTS) is a developmental disorder caused by heterozygous germline mutations in the CBP or p300 genes [
12,
13]. p300/CBP gene mutations have been detected in human tumors with loss-of-function point mutations found in colorectal, breast, ovarian, oral, gastric, lung, and pancreatic carcinomas [
14–
16]. p300/CBP involved chromosome translocations have been observed in acute myeloid leukemia (AML) and in hematological disorders such as myeloid/lymphoid or mixed lineage (MLL) [
17]. p300/CBP also functions as a transcriptional cofactor for proteins involved in tumorigenic pathways, including oncoproteins (such as myb, jun, and fos), transforming viral proteins (such as E1A, E6 and large T antigen), and tumor-suppressor proteins (such as p53, E2F, Rb, Smads, and BRCA1) [
17]. p300/CBP may also contribute to respiratory epithelial carcinogenesis via cyclin D1, cyclooxygenase-2 (COX-2), and activator protein-1 (AP-1) [
18]. More recent studies have linked p300/CBP to other diseases like progressive neurodegenerative diseases (such as Huntington Disease) [
19], Kennedy’s disease [
20], Alzheimer’s disease [
21], cardiac disease [
22], and fibrosis [
23].
In contrast to the exclusive presence of p300/CBP sequence homologues in metazoans, Rtt109 sequence homologues are only present in fungi. Rtt109 was first identified to be a budding yeast regulator of Ty1 transposition that also plays an important role in the DNA damage response to genotoxic agents [
24,
25]. Recent genomic and proteomic screens as well as a more directed effort led to the observation that Rtt109 promotes genome stability and resistance to a variety of DNA-damaging agents through the direct acetylation of histone Lys56 of histone H3 (H3K56), a residue positioned at the DNA entry and exit points of the nucleosome core particle, during S-phase [•26, •
27,
•28,
•29]. Rtt109 was subsequently reported to also acetylate H3K9 [
•30].
Similar to the p300/CBP family, Rtt109 has no detectable sequence homology to other known HATs. Unlike other HATs, Rtt109 does not contain additional protein interaction domains and requires the presence of one of two histone chaperone proteins, Asf1 or Vps75, for HAT activity [
•27,
•29,
•30,
31,
32].
Structure and chemistry of p300/CBP
In early 2008, we reported on the crystal structure of the p300 HAT domain in complex with a Lys-CoA bisubstrate inhibitor and accompanying biochemical analysis of the enzyme [
••33]. The overall fold of the p300 HAT domain consists of a central β-sheet composed of 7 β-strands surrounded by 9 α-helices and several loops (). Despite its sequence divergence from other HATs, a central core region of the p300 HAT domain, associated with acetyl-CoA cofactor binding, overlays well with the corresponding regions of the Gcn5/PCAF [
34] and MYST [
35] HATs (). This structurally conserved core region corresponds to the A, B and D sequence motifs of the GNAT (Gcn5 related N-acetyltransferase) superfamily of proteins reported by Neuwald and Landsman [
36]. In contrast, the structural elements that flank the central core diverge significantly between the three HAT familes. Two other features also distinguish p300 from other HATs. First, the acetyl-CoA binding site of p300 is more buried than other HATs. This is largely due to the presence of a p300-specific L1 loop that covers one side of the cofactor and contributes to about one third of the protein-cofactor interactions
(). This loop was also shown to be important for p300 catalysis. Second, the surface charge of the p300 HAT domain, particularly proximal to the lysine substrate binding site, is largely electronegative, compared to the more neutral character of the corresponding substrate binding sites of other HATs. Indeed, two shallow electronegative patches, one harboring the lysine moiety of the Lys-CoA inhibitor (P1) and another located about 10 Å away (P2) were shown to be mutationally sensistive for histone binding and catalysis (). Correlating with the importance of the P1 and P2 pockets, positively-charged residues are typically present within 3–4 amino acid residues (about 10 Å) upstream or downstream of the acetylated lysine residues of known p300/CBP substrates [
37]. The promiscuous substrate specificity of p300 relative to other HATs can be accounted for by such a shallow and presumably versatile P2 pocket governing substrate recognition. An interesting property of the p300/CBP HATs is the presence of a highly basic autoacetylation loop that is hyperacetylated in the activated form of the p300/CBP HAT [
•38] and that has been proteolytically cleaved in the current p300 HAT domain structure [
••33]. We propose that this highly basic loop sits in the electronegative substrate binding site to block lysine substrate binding and is released from this site upon autoacetylation, as schemetically depicted in .
Recent enzymatic studies on the p300 HAT domain is consistent with a Theorell-Chance (or “hit and run”) catalytic mechanism that is distinct from other HATs. This mechanism, as it applies to p300, invokes that during catalysis, there is no stable ternary complex formed during the reaction. Rather, the p300 protein first forms a stable complex with the acetyl-CoA cofactor and then the positively charged substrate lysine side chain from the protein substrate transiently binds to the negatively charged P1 pocket and departs immediately after acetyl transfer. Such a mechanism differs from the classical ordered Bi-Bi ternary complex mechansim employed by the Gcn5/PCAF [
39] and the ternary [
40] and ping-pong mechanisms [
35] that have been proposed for the Esa1 member of the MYST family of HATs. The Theorell-Chance mechanism proposed for p300/CBP is consistent with the broad substrate specificity of p300 as it does not require a specific substrate binding pocket. Structure-based p300 mutagenesis studies also identify two critical catalytic residues: Tyr 1467 that likely funcitons as a general acid to protonate the CoA leaving group, and Trp1436 that likely helps steer the substrate lysine into its binding site (). Interestingly, p300 does not critically depend on a specific Asp/Glu general base for catalysis, similar to serotonin N-acetyltransferase [
41], but unlike the Gcn5/PCAF [
34,
42] and MYST [
35] HATs that employ glutamate residues for this purpose.
The structure of p300 allows for a rationalization of several p300/CBP-inactivating mutations that have been observed in various cancers [
14–
16]. For example, the catalytic Trp1436 residue is mutated to cysteine in lung cancer, as is Arg1410, which plays a key role in acetyl-CoA cofactor binding. In addition, Asp1399, a residue important for the appropriate conformation of the L1 loop (), is mutated to a tyrosine residue in colon cancer.
Structure and chemistry of Rtt109
Shortly after the structural determination of the p300 HAT domain, we reported on the crystal structure of the Rtt109 HAT domain bound to the acetyl-CoA cofactor [
••43]. Despite its sequence divergence from other HATs, the Rtt109 structure reveals a strucutrally conserved core region for CoA binding. Unexpectedly, the rest of the Rtt109 domain also looks remarkably similar to p300 with an overall r.m.s.d of 3.5 Å for 236 out of 354 Ca atoms (). This similarity extends to the L1 loop where an Asp89-Lys210 salt-bridge (reminiscent of the Asp1399-His1451 salt-bridge in p300) is essential for holding the loop in position for acetyl-CoA cofactor binding (). Given that the Rtt109 and p300/CBP HAT family only exists in fungi and metazoans, respectively, these structural similarities suggest an evolutionary link between p300/CBP and Rtt109.
Despite the structural similarities between p300 and Rtt109, several biochemical properties of the two enzymes suggest that they diverge significantly at the functional level. Specifically, the surface charge of Rtt109 proximal to the lysine binding site () is unlike the electronegative surface of p300 () and, in fact, more like the apolar surfaces of the Gcn5/PCAF and MYST family of HATs. Indeed, unlike the broad substrate specificity of p300/CBP, Rtt109 appears to be quite selective for H3K56 [
•27,
•28,
•29] and H3K9 [
•30]. The potent p300 inhibitor, Lys-CoA [
44] does not show detectable inhibition of Rtt109 [
••43]. The mechanism of acetyl transfer catalyzed by Rtt109 also appears to be distinct from that of p300, as well as the Gcn5/PCAF and MYST HATs. Similar to p300, Rtt109 does not contain an appropriately positioned glutamate residue in the active site that might function as a general base for catalysis, as seen for the Gcn5/PCAF HATs. However, dissimilar to p300, the two important catalytic residues of p300, Trp1436 and Tyr1467, are not structurally conserved in Rtt109 (). Interestingly, although tyrosine and tryptophan residues of Rtt109 (Tyr199 and Trp222) have been shown to be important for catalysis, they are in different positions in the three-dimensional structure from the corresponding catalytic residues in p300 and appear to play different catalytic roles (). Kinetic analysis of Rtt109 also reveals an unusual sigmoidal histone H3 substrate concentration dependence at fixed acetyl-CoA that fits to a Hill equation with a Hill coefficient of 3, suggesting a high degree of cooperativity of the histone substrate in the acetylation reaction. Although the delineation of the precise catalytic mechanism employed by Rtt109 requires further study, it is already clear that Rtt109 employs a catalytic mechanism that differs from other HATs and is consistent with a more complex nature of protein substrate recognition.
Like p300, Rtt109 has been shown to acetylate itself [
45]. Indeed the structure of the Rtt109/Ac-CoA complex reveals that a partially buried lysine residue (K290), located about 8 Å away from the active site of the enzyme, is acetylated (). Moreover, mass spectrometry data on bacterially produced recombinant and yeast produced Rtt109 protein reveals that this lysine is autoacetylated
in vitro and in yeast cells [
••43]. Although the functional consequence of this autoacetylation is still unclear, it is interesting to note that in addition to Rtt109 and p300/CBP, other HATs, including Esa1 [
40] and PCAF [
46] have been shown to undergo autoacetylation. These observations point to the possibility that the autoacetylation of HATs may play conserved functional roles in regulating HAT funciton.
A unique feature of Rtt109 that distinguishes it from p300 and other HATs, is that it requires the association of histone chaperones, Asf1 and/or Vps75 for catalysis [
•27,
•29,
•30,
31]. We suggest that the binding of these chaperones may help present the H3 substrate to Rtt109 protein, which is consistent with the observation that the addition of excess Vps75 to a stoichiometric Rtt109/Vps75 complex inhibits histone H3 acetylation by Rtt109 [
••43]. The histone chaperone requirement for Rtt109 HAT activity may be related to the fact that Rtt109, unlike other HATs, does not contain other associated domains to interact with other chromatin modifiers.
During the review of this manuscript, two additional manuscripts describing analogous crystal structures of Rtt109 were reported [
••51,
••52]. Both studies utilized protein constructs with internal loop deletions similar to what we had described (delta 130–179) and reveal essentially the same Rtt109 structure. Moreover, the study by Stavropoulos
et al. also confirmed that this particular loop region is essential for Vps75 histone chaperone binding [
••43,
••51]. Also similiar to the initial Rtt109 structure report is the finding that K290 of Rtt109 is acetylated [
••51,
••52]. Interestingly, while our study found that a K290R mutant is wild-type for both H3K56 acetylation and sensitivity to genotoxic stress
in vivo, the more recent studies show that mutation of K290 reduces the ability of Rtt109 to acetylate H3K56
in vitro [
••52] and
in vivo [
••51]. These apparently conflicting results may be related to the fact that we employed a K290R mutant in the context of an intact Rtt109 protein using a W303 yeast strain [
••43], while Stavropoulos
et al. employed K290 mutants in the context of a (delta 128–170) loop deletion using a different BY4741 yeast strain [
••51]. Lin and Yuan also found that the K290R Rtt109 mutant is inactive
in vitro towards an Asf1/H3-H4 substrate [
••52]. Additional experiments are clearly needed to resolve the importance of K290 acetylation in Rtt109 function.