We analyzed the binding of the Cbf1p protein and its deletion mutant
containing only the DNA binding domain to 16 different centromeric
DNA fragments representing a degenerate set of specific binding
sites. Each of the 337 bp DNA centromere fragments used in this
study contains a single CDEI binding site of the consensus sequence
5′-RTCACRTG-3′ specific
for
CEN1 to
CEN16, respectively.
Cbf1p binds to these sites as a homodimer. In addition, the DNA
fragments carry the CDEIII region, which represents the binding
site of the hetero-oligomeric protein complex Cbf3 (
5).
The binding constant of Cbf1p to DNA sequences other than the consensus
site was determined to be 2 × 10
–4 M
in competition experiments (
26).
Assuming 329 independent non-specific binding sites per centromere fragment,
one can estimate their contribution to the binding constant as two
orders of magnitude off the constants for specific binding (Table ). Thus, non-specific binding to the fragments
can be neglected.
The binding constants of full-length Cbf1p agree with literature
values obtained for related proteins. Studies by Baker
et
al. (
30) revealed that the
Cbf1p binding equilibrium constant
KE to
CEN3 DNA is 3 × 10
8 M
–1 (±40%) which corresponds (equation
4) to a
Kd of 3.3 nM.
Baker
et al. (
30)
had purified the Cbf1p protein from yeast, thus the endogenous protein from
yeast exhibits a very similar binding constant when compared to
the binding constant of 2.5 nM obtained for the recombinant Cbf1
protein used in this work. The binding constant of the human upstream
stimulatory factor USF, also containing a bHLH DNA binding motif,
to its binding sequence 5′-dCACGTG-3′ was determined to 1.3 nM (
31).
The 16
CEN DNAs offer a variation of binding
sequences. It could be speculated that the binding constants are
related to the particular CDEI DNA sequence suggesting similar binding constants
for those
CEN fragments with 5′-dRTCACATG-3′ or with 5′-dRTCACGTG-3′. However, this is not observed. For example,
within the CDEI sequence group 5′-dRTCACATG-3′ we find the strongest (
CEN11)
and the weakest (
CEN8) binding by full-length Cbf1p.
Very weak (
CEN8) and very strong (
CEN3)
binding within this DNA sequence group is also observed for Cbf1pΔN209p. Thus, DNA sequences next to
the consensus binding site seem to influence the DNA binding of Cbf1p
and Cbf1ΔN209p. Similar findings have
been reported previously by Wilmen
et al. (
20) who showed that Cbf1p binding
in
vitro as well as full
in vivo centromere function requires
a 10 bp recognition sequence including the 8 bp CDEI. Moreover,
these studies indicated that Cbf1p interaction with CDEI is different
in vitro and
in vivo suggesting
that the CDEI–Cbf1p complex interacts with other centromere
components in the
CEN complex (
20).
Cbf1p and Cbf1ΔN209p contain the
same bHLHzip DNA binding motif. Nevertheless, both proteins bind
differently to the 16 DNA sequences indicating that the N-terminal
domain of the protein has an influence on binding. The relative
value of the binding constants of the two proteins (Table ) varies from 1.2 (CEN5)
to 4.4 (CEN4) showing that the N-terminal part contributes
to the binding in a DNA sequence specific way either by interacting
with CDEI-surrounding DNA sequences or by modulating the Cbf1 protein
structure and thus stabilizing the various interactions between
the protein and the 16 different CDEI elements.
Cbf1p and Cbf1ΔN209p bind to the
degenerate set of the 16
CEN DNAs with
binding constants varying by a factor of 2.2 and 3.8, respectively.
Since,
in vivo, both the Cbf1p–CDEI and
the Cbf1ΔN209p–CDEI complexes
exert the same biological function (
15),
this system gives us an idea of the spectrum of binding strengths
of a particular protein–DNA interaction tolerated by the
cell. We are not aware of reports on other biological systems with
such a DNA sequence and binding strength variation but common function
within one cell. It is likely that a variety of protein–protein
and protein–DNA interactions are necessary for the formation
of fully assembled centromere complexes (
23).
Thus, the binding strength variations observed in our system represent
only one contributing part of the biologically active complex. Obviously,
a binding strength variation of a factor of 3.8 appears to be fully
tolerated
in vivo.
The mobility of DNA molecules in native polyacrylamide gels
depends on their 3D structure. All 16
CENDNAs of
S.cerevisiae display different degrees of reduced
mobility in gels (
9). Here we
show that all 16
CEN DNA–Cbf1p complexes
display a migration pattern that is clearly distinct from the pattern
observed for free
CEN DNA. It is conceivable that
the migration characteristics of the complexes are a result of intrinsic
CEN DNA curvature (
9)
combined with bending of CDEI by Cbf1p (
22).
The intrinsic curvature of
CEN DNAs may also influence
the accessibilty of Cbf1 proteins to CDEI, which could also be responsible
for the variety of binding constants.
Yeast mutants in which
Cbf1 is replaced by
Cbf1Δ
N209 show
no difference in their growth rate when compared to wild-type (
32). Obviously, for centromere function
the N-terminal protein domain does not seem to be essential and
thus for the biological function a considerable difference in the
binding constant of Cbf1p to the 16 different DNA sites, as well
as differences in the Cbf1p–
CEN DNA structure
are tolerated
in vivo. It was shown previously
that binding of Cbf1p to its binding motif in the methionine promoter
can be stimulated by the transcription factor Met28 (
13).
It is possible that similar mechanisms act at centromeres. There
are more than 12 established kinetochore proteins contributing to
structure and function of the centromeres in budding yeast (
23). Based on biochemical and genetic
data, the centromere proteins Mif2p, Cse4p, Mcm21p and the Cbf3
subunits p110 and p64 have been proposed as Cbf1p-interacting proteins
(
18,
23,
24,
33). These
proteins may function at the centromere to influence Cbf1p–CDEI
complex formation. This has been reported recently for the p64 and
p23 CBF3 subunits, which stimulate the DNA binding activity of Cbf1ΔN209p (
25).
p64 stimulated the
CEN DNA binding activity of
both Cbf1ΔN209p (
25)
and full-length Cbf1p with similar efficiencies (Fig. ). If this interaction does also occur
in
vivo, we speculate that the observed differences in binding
affinities of Cbf1ΔN209p to different
CENs are tolerated because p64 shifts the affinity
of Cbf1ΔN209p to
CEN DNA
into a range that makes these differences negligable. Since Cbf1p
does not bind to CDEI in the absence of CBF3
in vivo (
33), the observed p64 activity may be
a critical event that decides whether Cbf1p will bind to CDEI or
not.
Taken together the results presented here provide further evidence
that the structure of the S.cerevisiae centromere complex
is determined by a dynamic network of specific DNA–protein
and protein–protein interactions.