|Home | About | Journals | Submit | Contact Us | Français|
Advances in x-ray light sources and detectors have created opportunities for advancing our understanding of structure and structural dynamics for supramolecular assemblies in solution by combining x-ray scattering measurement with coordinate-based modeling methods. In this review the foundations for x-ray scattering are discussed and illustrated with selected examples demonstrating the ability to correlate solution x-ray scattering measurements to molecular structure, conformation, and dynamics. These approaches are anticipated to have a broad range of applications in natural and artificial photosynthesis by offering possibilities for structure resolution for dynamic supramolecular assemblies in solution that can not be fully addressed with crystallographic techniques, and for resolving fundamental mechanisms for solar energy conversion by mapping out structure in light-excited reaction states.
Key design features of natural photosynthesis are the hierarchal, modular architectures with module-specific protein host-cofactor guest chemistries. This partitioning of photochemical function allows the individual tasks of solar energy conversion such as light-harvesting, primary charge separation, charge accumulation, and terminal water-splitting reductive chemistries to be optimized within individual functional modules, which can then be integrated, regulated, and repaired for overall optimal solar energy conversion throughput. Advances in supramolecular chemistry are allowing synthetic molecular devices to be made that mimic aspects of the hierarchal, modular designs found in biology. (Gust et al. 2001; Wasielewski 2006) Central challenges in solar energy research lie in deciphering the fundamental structural bases for photosynthetic function in biological and biomimetic modular architectures.
Three dimensional structure determinations of proteins are generally obtained through x-ray crystallography or NMR techniques. While crystallography provides the most detailed and accurate measurement of molecular structure, structural information is necessarily confined to the crystalline state. Conformational dynamics and preferred conformations for proteins in crystals can differ from those in non-crystalline states. The differences in conformational preference between liquid and solid state are particularly critical for problems concerning protein-protein interactions. NMR approaches are employed for determination of biomolecular structures in solution, but as discussed below structural determination can frequently be limited by the number of atom pair distance constraints resolved by NMR measurements.
X-ray scattering approaches offer additional opportunities to resolve links between the structural hierarchies and function in natural and artificial photosynthesis that are complementary to crystallographic and NMR measurements. Advantages of scattering technique for structure characterization are that they provide tools for measuring atomic pair distances for molecules in solution and other non-crystalline media that are directly relevant to photosynthetic function, and with a resolution range that extends from single bond distances to the submicron range. Further, developments in synchrotron and pulsed x-ray facilities are offering x-ray light sources of exceptional brilliance, energy tunability, and stability(Hellemans 1997; Riekel et al. 1996; Winick 1998). These extended capabilities include opportunities to record scattering patterns to high spatial resolution (Fischetti et al. 2004; Makowski et al. 2008; O’Donnell et al. 2007; Zuo et al. 2006), with element-specific sensitivity (anomalous scattering), and with ultrafast time resolution using laser pump x-ray probe scattering methods (Cammarata et al. 2008; Ihee et al. 2005; Plech et al. 2004).
An inherent limitation in scattering is that the measurement provides information only on the distribution of atom pair distances but does not directly provide information on three dimensional structure. Emerging techniques are demonstrating opportunities to achieve solution-state biomolecular structure refinement using scattering data as a fitting constraint in conjunction with model-based conformational searches (Bernadó et al. 2007; Kojima et al. 2004; Petoukhov et al. 2002; Petoukhov and Svergun 2005), and in combination with crystallographic data (Putnam et al. 2007), and NMR distance constraints (Gabel et al. 2006; Grishaev et al. 2005a; Schwieters and Clore 2006; Zuo et al. 2008). Used in this way, the x-ray scattering measurement when combined with complementary structural information and coordinate-based modeling provides a method to achieve atomic-scale insight into the structure and conformational dynamics of complex supramolecular architectures in solution. In this educational review we provide examples demonstrating opportunities for photosynthetic supramolecular structure resolution using coordinate-based analyses of solution x-ray scattering data.
An outline of the x-ray scattering experiment is shown in Figure 1. A solution sample in capillary flow cell is placed in the path of a collimated or long focused monochromatic X-ray beam from a synchrotron light source that typically has a cross sectional dimensions of 0.3 mm × 0.5 mm or less (Seifert et al. 2000). The transmitted beam is blocked by a beam stop, while the 2-dimensional scattering pattern is measured by an area detector. For solution samples the isotropic 2-dimensional scattering image is radially averaged as a function of the scattering angle 2Θ with respect to the transmitted beam to produce a 1-dimensional reciprocal space scattering pattern of scattered X-ray intensity plotted as a function of angle. Typically thin-walled capillaries with X-ray pathlengths of 1.0 mm to 1.5 mm are used. The pathlength combined with the X-ray cross-section results in a sample irradiated volume on the order of 0.25 μl or less. At an undulator beam line at the Advanced Photon Source (Argonne National Laboratory), we find that high quality scattering patterns can be recorded for photosynthetic samples with data acquisition times that range from a few seconds to minutes (Svensson et al. 2004; Tiede et al. 2002; Zuo et al. 2006), depending upon the sample concentration and the spatial resolution of the measurement as discussed below. In addition, we have found that radiation damage can be avoided by flowing samples at a rate of 50 μl/min. Hence, high quality synchrotron X-ray scattering patterns can typically be recorded for samples with a total volume of 0.1 ml or less, and with protein or photosynthetic complex concentrations in the 0.01 mM to 10 mM range. More in-depth descriptions of x-ray scattering experiments, data processing, and analyses can be found in several excellent recent reviews (Koch 2006; Lipfert and Doniach 2007; Putnam et al. 2007; Tsuruta and Irving 2008).
Fundamental aspects of x-ray scattering can be understood by considering X-ray scattering for a hypothetical diatomic molecule with atomic spacing, r, shown in Figure 2. As a result of the interactions between the oscillating electric fields in the X-rays and electrons surrounding each atom, a small fraction of the impinging X-ray intensity is scattered coherently. The magnitude and angular dependence of this scattering are described by the atomic scattering factors that are in turn dependent upon the atomic number and ionization state of each atom (Warren 1990). Coherent scattering changes only the direction of the X-rays, not the phase or energy. However, the pathlengths taken by the X-rays scattered from each atom to arrive at a detector positioned at an angle 2θ with respect to the transmitted beam will differ, as illustrated in Figure 2. As a result, the combined X-rays scattered from each atom will vary between constructive (in-phase) and destructive (out-of-phase) interference as a function of the measured scattering angle, usually defined in terms of the vector q= (4π/λ)sinθ, where λ is the X-ray wavelength and θ is the half scattering angle. The function describing the pair-wise interference from N atoms in a molecule is (Svergun et al. 1995):
where Aj is the atomic scattering amplitude for the jth atom, rj,k is the distance between the jth and kth atoms. Eq. 1 has the format of a Fourier transform indicating that information on inter-atomic distance, r, is converted to a frequency pattern in scattered X-ray intensity when measured as a function of the scattering vector, q, often referred to as the reciprocal space parameter where q has the units of inverse distance. For randomly oriented molecules in solution, Eq. 1 must be averaged over all orientations of the vectors q and r, and the atomic scattering amplitudes must include methods to account for the electron density background of the solvent and the solvent excluded volumes occupied by each of the atoms (Svergun et al. 1995; Zhang and Friesner 1998). Programs CRYSOL (Svergun et al. 1995) and solX (Zhang and Friesner 1998; Zuo et al. 2006) have been developed for calculation of solution scattering patterns based on coordinate molecules and are available on request.
Molecular scattering patterns can be understood to arise from the sum of scattering from all pairs of atoms. Figure 3A illustrates how the reciprocal space solution scattering patterns change for individual atom pairs with inter-atomic spacings of 5 Å, 10 Å, and 50 Å. Figure 3B shows the corresponding inverse Fourier transforms for each or the scattering curves to produce pair distribution functions, PDF, (Putnam et al. 2007; Semenyuk and Svergun 1991) that reconstruct the originating real-space atom pair distances. Corresponding scattering and PDF patterns for multi-atom molecules would sum traces for all of the composite individual atom pairs.
The individual atom-pair scattering patterns demonstrate the angle dependent spatial resolution contained in a scattering measurement. At small angle the pathlengths for scattering from each atom converge, causing interference and spatial resolution (d = 2π/q) to be lost, and the amplitude of the scattered X-rays asymptotically approaches the value at zero scattering angle, I(0). The scattered intensity at zero angle is proportional to the concentration weighted square of the electron density contributed by all of the atoms in the complex: I(0)= conc*V2 (ρ – ρo)2 where V, the total solvent excluded volume for the molecule, ρ, the average electron density of the molecule, and ρo, the average electron density of the solvent (Guinier and Fournet 1955) Measurement of I(0) for a molecule can be used to determine the molecular weight (Lipfert and Doniach 2007). In the small angle region, the asymptotic approach of scattered intensity to I(0) follows the Guinier relationship(Guinier and Fournet 1955; Lipfert and Doniach 2007; Putnam et al. 2007): , where Rg is the radius of gyration corresponding to the mean electron density-weighted atomic distance from the center of mass. In this region, a plot of the logarithm of the scattered intensity versus q2 has a linear format as shown in Figure 4, illustrated by Guinier plots for the calculated diatomic scattering curves. The Guinier plots have slopes corresponding to the radii of gyration, in this case 2.5 Å, 5 Å, and 25 Å, respectively, and a common I(0) intercept. In photosynthesis research, examples of the use of measurements of I(0) and Rg include the determination of the aggregation state of recombinantly expressed manganese stabilizing protein from PSII (Svensson et al. 2002; Svensson et al. 2004), characterization of the aggregation state of reaction centers in crystallization mixtures (Marone et al. 1998), and determine the stoichiometry of cytochrome c2 docking in electron transfer complexes with the bacterial reaction center (Tiede et al. 2000).
At high angles an oscillatory interference pattern can be seen to develop in the model atom pair scattering shown in Figure 3A. The onset of the oscillatory interference and the corresponding frequencies provide a precise measure of interatomic spacings that can be obtained by Fourier transform. The high angle scattering features are of interest since they provide an atomic scale measurement of molecular structure that can be carried out in solution. However, the ability to utilize high-resolution scattering data for structural analysis depends on a combination of factors, including the size, complexity, and dynamics of the molecular system. In the following sections, we will give examples of scattering characteristics for molecular systems that demonstrate how characteristics of hierarchical molecular structure are reflected in wide angle X-ray scattering data, and describe progress and prospects for using scattering data for solution structure resolution.
Features of atomic coordinate-based scattering for a small molecule transition metal complex can be illustrated by comparing experimental and calculated x-ray scattering for an aqueous solution of ruthenium-tris(2,2′-bipyridyl) dichloride, Ru(bpy)32+, in Figure 5. Ru(bpy)32+ is a light-absorbing molecule used widely as a photosensitizer in dye-sensitized solar cells and artificial photosynthesis. Calculated scattering patterns are shown using two different sets of atomic volumes. The trace in blue shows a calculation using the atomic volumes use typically used for calculation of x-ray scattering for proteins in aqueous solution (Fraser et al. 1978; Svergun et al. 1995), while the trace in green shows a calculation using volumes that we have found empirically to be useful for simulation of scattering for a variety of molecules in organic solvents (Lee et al. 2008; O’Donnell et al. 2007; Tiede et al. 2004). The atomic volumes used for molecules in non-aqueous solvents have volumes for aromatic carbon atoms reduced by 25% from the values used for aqueous solvent, and expansion of the volumes around polar and charged groups, analogous to the atomic volumes found in the inner, anhydrous core of proteins (Pontius et al. 1996). The experimental scattering pattern for an aqueous solution of 10 mM Ru(bpy)32+ containing 0.1 M KH2PO4 is shown by the red trace in Figure 5. The q-range measured in this experiment was limited by the dimensions of the detector (Mar CCD 165) and the single fixed distance used between the sample and the detector. This causes the sudden truncation of scattering data at q~ 2.8 Å−1. The small angle limit is determined by the size and position of the X-ray beam stop.
A comparison of the two calculated curves shows that adjustments in the atomic volumes have the greatest effect in the small angle region where the loss of spatial resolution makes the scattering insensitive to the details of atomic structure, but is primarily sensitive to the size, shape, and the average electron density contrast of the solute molecule with the solvent. A characteristic feature of the experimental scattering pattern is the bend, or broad peak, seen in the small angle region (q ~ 0.7 Å−1). Maximum of this type in small angle scattering can arise from repulsive interactions that establish a minimum spacing between charged molecules in solution (Ducruix et al. 1996; Tiede et al. 2002). However, inter-molecular electrostatic interactions and their effects on scattering patterns are modulated by changing solution ionic strength and molecular concentration. The Ru(bpy)32+ scattering was found to be insensitive to these changes, indicating that the peak in the small angle scattering is a property of intra-molecular scattering.
The calculated scattering patters show that low angle peaks can be understood to arise from a partial electron density match between the solvent and the Ru(bpy) 32+. The reduction of atomic volumes for the aromatic carbon atoms causes all parts of the Ru(bpy) 32+ molecule to have a higher electron density compared to the solvent water, the I(0) calculated for the molecule increases, and the calculated scattering pattern follows a straightforward Guinier relationship in the small angle region. With the standard atomic volumes, the bipyridyl groups have a lower contrast with the water solvent, and amplitude of the small angle scattering, I(0), is reduced. Both curves converge to the same pattern at high angle, exhibiting an oscillatory pattern that is the reciprocal space representation of the sum of the atom pair distances associated with the bipyridyl ligands. Measurement of the interference pattern can be extended to higher angle either by changing the X-ray wavelength or by re-positioning the area detector subtend a larger portion of the high angle scattering. Because of the fixed dimensions of the beam stop and detector, the shift to record high angle scattering data will occur with loss of the small angle scattering data. Recording of scattering patterns across an extended q-range requires multiple measurements with different data acquisition geometries. In cases where the scattering patterns have been measured to q ≥ 6 Å−1, corresponding to a spatial resolution of better than 1 Å, PDF analysis allows the individual metal-ligand atom pair distances to be resolved, and structure of the transition metal complex to be determined in solution (Megyes et al. 2008; O’Donnell et al. 2007; Vaughan et al. 1950).
The scattering calculations for Ru(bpy) 32+ illustrate that atomic coordinate structure and the solvent excluded volumes make distinguishable, angle-dependent contributions to solution scattering. In addition, the experimental data shows a peak near q=1.8 Å−1 that falls within the solvent envelope. The experimental Ru(bpy) 32+ scattering pattern is measured as the difference between scatting for the Ru(bpy) 32+ solution and the aqueous buffer background (O’Donnell et al. 2007). The normalized subtraction of the buffer removes contributions of bulk solvent from the Ru(bpy) 32+ solution scattering. The residual peak has a position and shape different than the bulk solvent, and is suggestive of structure within a solvation layer associated with Ru(bpy)32+ or the accompanying counter ions. This kind of solvation layer structure can often be resolved in transition metal complex (Megyes et al. 2008; Megyes et al. 2005) and supramolecular (Lee et al. 2006; O’Donnell et al. 2007) scattering.
Wide-angle x-ray scattering patterns for proteins can be expected to be complicated by the much larger ensemble of atom pair distance correlations than is seen in photosynthetic model compounds. Scattering for a protein can generally be resolved into three regions. As discussed above, scattering in the small angle Guinier region provides information on the size, shape, and organization of domains in a multimeric protein (Koch 2006; Lipfert and Doniach 2007; Petoukhov and Svergun 2005; Putnam et al. 2007). Scattering in the q-region with a spatial resolution of 20 Å to 6 Å (0.2 Å−1 < q < 1.0 Å−1) provides a measure of the tertiary or protein fold structure, while scattering with a spatial resolution of 6 Å to 2 Å (1.0 Å−1 < q < 3.0 Å−1) provides a measure of secondary protein structure as reflected by characteristic hydrogen bonding and near-neighbor repeat distances in α-helix and β-sheet structures. Numerous groups have noted these scattering features and the fact that they provide a fingerprint for protein domain structure (Fedorov and Denesyuk 1978; Hirai et al. 2004; Makowski et al. 2008; Svergun et al. 2001; Tiede et al. 2002). For example, we have used x-ray scattering in combination with FTIR measurements to identify the β-sheet domain structure of the isolated manganese stabilizing protein from photosystem II, and demonstrated the consistency of the structure within crystalline photosystem II and in solution (Svensson et al. 2002; Svensson et al. 2004).
An example of x-ray scattering for a protein is illustrated in Figure 6 showing a comparison of experimental (in solution; top) and calculated (from crystals; bottom) scattering patterns for the photoactive yellow protein, PYP (Philip et al. 2008). Scattering patterns are shown measured for PYP samples in the dark (blue lines; also see (Kim et al. 2004)) and during continuous blue light illumination during a 10 second scattering measurement (red lines D. M. Tiede and W.D. Hoff, unpublished). The experimental light and dark scattering patterns show readily reversible changes (see e.g. at q = 0.25 Å-1 and q = 1.8) reflecting significant changes in secondary and tertiary protein structures. Comparable light-induced solution scattering results were also published previously with a lower 6 Å spatial resolution (Kamikubo et al. 2007).
The calculated scattering patterns used crystal coordinates obtained for the dark adapted crystal (Borgstahl et al. 1995) and for crystals in which the photo-stationary state (50% population) was produced on the milliseconds time-scale following laser excitation (Genick et al. 1997). Calculations from both crystal structures produce nearly identical scattering patterns, indicating that the light-induced structural changes are much smaller in crystals than in solution. The calculated scattering patterns show a pair of peaks in the protein tertiary structure region that correspond nicely to those observed for the dark-adapted solution sample, although the solution experiment shows “flatter” features possibly reflective of a dynamic conformational ensemble as described below. In the protein secondary region scattering features measured in experiment with the dark-adapted sample also align with features calculated from crystal structures, but the experiment shows a broader peak, particularly near the 3.1 Å spatial resolution position. This may be indicative of the presence of a solvation layer that is measured in experiment but is absent in the present model calculation, akin to the interactions observed in the Ru(bpy)32+ described above.
The light-generated state in solution (red line) produces changes throughout the PYP scattering pattern, including small angle, Rg region, a substantial flattening of the tertiary scattering peak, and changes in the tertiary protein structure scattering peak (see also (Kamikubo et al. 2007)), and in the secondary structure and protein-solvent scattering peak. These multiple length-scale changes are indicative of the light-induced “protein-quake” that propagates throughout the protein (Xie et al. 2001). The scattering data are a direct measure of the amplitude of this quake and demonstrate that the quake is much larger in solution than in crystals (small differences are seen between scattering patterns calculated for the light and dark crystal structures, but these differences are not easily seen with the scale of the plots in Figure 6), and possibly correlates with the altered photocycle function observed in crystals (Yeremenko et al. 2006). Laser-initiated, time-resolved wide-angle solution scattering has been extended to the ultrafast timescale using synchrotron-based pump-probe techniques allowing the structural cascade initiated byphoto-deligation of CO to be followed for hemoglobin in solution, and quantitatively compared to crystal coordinated data (Cammarata et al. 2008). Time-resolved scattering studies spanning the ultrafast to seconds time scales have clear applications for resolving the connections between light-induced structural dynamics and photosynthetic function.
The determination of the structure and dynamics of macromolecules in non-crystalline media is critical for understanding in-situ function. In solution, macromolecules exist in a number of different conformational states governed by a typically complex potential energy surface. This ensemble of states is in contrast to the picture of the single, rigid structure produced by x-ray crystallography. One of the strengths of the x-ray scattering technique is that it provides a direct, all-atom measure of the configurational ensemble present in solution that can be quantitatively compared to coordinate models of the static crystal structure and dynamics simulations of the ensemble. Multiple computational methods are available for generating molecular conformations.(Christen and van Gunsteren 2008; Howard and Kollman 1988; Tama and Brooks 2006; Vengadesan and Gautham 2005) Of these, when dynamic evolution of molecular motions is of interest, molecular dynamics is the method of choice. Molecular dynamics (MD) is capable of producing weighted ensembles with accompanying dynamic information, provided that empirical force fields and simulation times are adjusted to representatively sample thermally accessible conformational states. However, the validation of MD simulation results has been a long-standing problem.(van Gunsteren et al. 2006)
Several studies have demonstrated the ability to use experimental X-ray scattering as a discriminating benchmark for evaluating the accuracy of molecular simulation to represent real world configurational ensembles. For example, small angle scattering data parameters have been tested against those calculated from the simulated ensembles produced byMD simulation (Kim et al. 2008; Sivaramakrishnan et al. 2008; Vigil et al. 2001; von Ossowski et al. 2005) and other configurational search methods (Bernadó et al. 2007; Petoukhov et al. 2002; Petoukhov and Svergun 2005). The sensitivity of scattering data to conformations contained within the simulated ensembles have led to the development of structural refinement algorithms that use small angle scattering data in restrained MD simulations (Kojima et al. 2008; Kojima et al. 2004). The experimental evaluation of MD simulations have been further extended by comparing simulated ensembles with wide-angle scattering features for DNA (Zuo et al. 2006), a transition metal complex (Megyes et al. 2008), and light-harvesting porphyrin supramoleuclar assembly (Mardis et al. 2009). The extension of the combined MD and x-ray scattering analyses to wide angle permits a higher resolution, atomic-scale comparison between experimental and simulated ensembles.
As an example of a solution state conformational ensemble characterization by combined x-ray scattering and MD analyses, we summarize here the results for the light-harvesting, hexameric, diphenylethyne-linked porphyrin array in solution (Mardis et al. 2009). This specific example will be used to illustrate the synergies that exist between MD simulation and solution-phase x-ray scattering.
X-ray scattering measured for the porphyrin array in dilute toluene solutions at room temperature were found to exhibit broadening of scattering features arising from a distribution of the solution conformers (Tiede et al. 2004). An example is shown in Figure 7 that compares part of an experimentally measured scattering pattern (red trace) to a scattering pattern (blue dots) calculated from a single energy minimized structure (Figure 8A and B). The portion of the scattering pattern shown in Figure 7 covers the 0.1 Å−1 < q < 1.0 Å−1 region, and probes structure across the 60 Å to 6 Å spatial resolution range (d = 2π/q). This distance range provides a direct measure of the hexameric ring conformation. While a correspondence is seen between the number and approximate positions of peaks in the experimental and calculated patterns, the amplitude of the oscillations measured in experiment is markedly damped compared to that seen in patterns calculated for single supramolecular conformers. In addition, slight differences in the oscillatory peak patterns can be seen between the patterns calculated for the energy-minimized structure and experiment. This has been shown to correspond to a 1.4 Å compression of the overall dimension of the hexameric ring measured in experiment compared to the energy-minimized structure (Tiede et al. 2004). Observation that small distortions in the porphyrin array conformers produce significant shifts in calculated inference peak positions and amplitudes in scattering patterns suggests that the dampening of experimental scattering features could arise from a distribution of porphyrin array conformers in solution, and that dampening in experimental scattering data provides a monitor of the magnitude of this configurational dispersion.
Molecular dynamics simulations were used to create ensembles of porphyrin array conformer using atomic force fields based on the CHARMM 22 force field (Mardis et al. 2009). Figure 8(C and D) shows representatives of an ensemble of 1700 conformers populated during a 2.05 ns constant pressure simulation in a bath of explicit solvent toluene molecules. The scattering profile for each member of the ensemble is shown plotted in Figure 7 using grey lines and the overall ensemble scattering is obtained as the average of the individual scattering patterns (black line). The ensemble average simulates a scattering pattern that can be experiment. Compared to scattering calculated from a single energy-minimized single conformer, the scattering pattern calculated from the MD ensemble can be seen to provide a better model for the experimental scattering data, notably by a close matching of the position of the first oscillatory peak and by the strong dampening of oscillatory features, particularly for q values above 0.5 Å−1 (Mardis et al. 2009).
The thermal fluctuations present in solution result in distance-dependent structural fluctuations that are directly detected in an x-ray scattering experiment by q-dependent broadening of x-ray scattering features. A superposition of these structural fluctuations in the simulated ensemble is shown in Figure 8. These conformations are obviously much less ordered than the energy minimized structure shown in Figure 8. Their corresponding scattering, shown as a grey line in Figure 7, form a shaded area that is the reciprocal space representation of distance-dependent atomic pair disorder simulated in solution. Careful investigation of this ensemble shows large amplitude motions that include porphyrin ring “tipping” around the porphyrin linkage axes and extended hexameric porphyrin array “breathing” motions involving torsional distortions collectively distributed along porphyrin and diphenylethyne groups. These motions result in dampening of the scattering profile.
While the overall agreement between MD and experiment is encouraging, mismatches in the magnitudes of experimental and simulated dampening, particularly in the insufficient dampening of the first two oscillatory peaks in the 0.2 Å−1 < q < 0.5 Å−1 region, show that large amplitude, hexamer array breathing-type motions are significantly under-represented in the simulated ensembles. Increases in the simulation temperature and time were found not to substantially shift the simulation ensemble. These shortfalls suggest that the force field parameters may be unrealistically stiff, or that accessible timescales for simulation do not allow the full conformational landscape to be explored (Mardis et al. 2009). Accelerated molecular dynamics (Hamelberg et al. 2007; Markwick et al. 2007; Yang et al. 2007), course graining (Marrink et al. 2004; Tozzini 2005), and multiscale (Schatz 2007) methods have been developed to provide more accurate simulation methods for modeling complex supramolecular dynamic landscapes. The scattering analysis described here provides a means to evaluate the ability of these advanced simulation methods to accurately model solution-phase ensembles compared to other experimental information which focus on a single, low-energy structure. This comparison between experimental solution-phase scattering and MD simulation provides a means not only to interpret scattering data in terms of an explicit atomic model, but more generally demonstrates the use of solution x-ray scattering as an experimental benchmark for the development of simulation methods that more accurately predict configurational dynamics of supramolecular assemblies.
Numerous approaches are being developed for enhancing macromolecular structure resolution in the solution by combining x-ray scattering data with coordinate data from complementary structural techniques. For example, crystal coordinate data has been used in combination with configuration search algorithms to derive coordinate models consistent with solution scattering data (Putnam et al. 2007). Numerous methods have also been developed to exploit the complementarities of NMR and x-ray scattering data from both biomolecular and transition metal complexes. The biological NMR structural method is mainly based on the nuclear Overhauser effect (NOE) that measures hydrogen-hydrogen distances in short ranges, i.e., < 5 Å. As a result global biomolecular structure is often under-determined because the number of detectable NOE-based distance restraints is far less than the number of freedom degree of atoms in the molecules. This has been a particularly significant problem in NMR structure determination in DNA and RNA. For example, the overall conformation and details of base rise parameters for a model dodecamer DNA were found to vary significantly in different NMR structure refinements using the identical NMR dataset, demonstrating that the set of NMR distance constraints was not sufficient to uniquely determine the structure (Kuszewski et al. 2001; Zuo and Tiede 2005).
The global, all atom nature of the x-ray scattering measurement provides a useful complement to the predominately short range distance constraints measured by NMR and thus x-ray scattering data can provide additional constraints in structure refinement fitting procedures. Small-angle scattering data have been employed in the structure refinements as a global dimension restraint for NMR structure refinement for proteins and polynucleic acids (Gabel et al. 2006; Grishaev et al. 2005b; Grishaev et al. 2008; Lee et al. 2007). In general, the inclusion of scattering data as fitting constraints have been found to improve global structure and refinement convergence, and are a benefit particularly for biomolecules with elongated or flexible structures because the number of available NOE distance restraints tends to be relatively small compared to the total number of atoms. High angle X-ray scattering data have also been used in structural refinement of a model dodecamer DNA (Schwieters and Clore 2006).
Another increasingly important application of scattering data is for the resolution of global structure for multiple domain or multiple component supramolecular complexes. Frequently the structures of subunits can be determined using crystallographic or NMR approaches. However, structure determination for complex, multi-component architectures has become a major challenge for structural biology. Complex architecture determination can be described as the problem of determining relative translational positions and orientations of subunits within the complex. Approaches have been developed that use small angle X-ray scattering, SAXS, data to screen all possible arrangements of subunits within both translational and rotational spaces (Bernadó et al. 2007; Petoukhov and Svergun 2005; Svergun and Koch 2003). These approaches often are computationally expensive and result in ambiguous structures because the degeneracy in SAXS data. Similarly, NMR methods have been developed to resolve the relative positions of subunits within a complex by measuring inter-subunit NOEs. This approach requires sophisticated sample preparation and isotope-label filter/edited NOE experiments to define the interface between subunits. As a result, the inter-subunit NOE approach is costly, time-consuming, and often difficult to obtain a sufficient number of inter-subunit to unambiguously define the interface.
To relieve these limitations, another NMR technique, residual dipolar coupling (RDC), has been introduced that measures the orientations of bonds comprising of active nuclei, such as 13C-H and 15N-H in isotope-labeled biomolecules relative to the external magnetic field (Grishaev et al. 2008; Zuo et al. 2008). With knowledge of individual subunit structure, RDC measurements on an assembled complex provide a measure of the relative orientations of bonds, and therefore, the relative orientations of subunits. Use of RDC data can dramatically reduce the configurational search load for structure refinement because one set of RDC data yields only four possible discrete orientations for each subunit. The combination of SAXS and RDC data constraints simplifies the structural search and also dramatically reduces the degeneracy in both data, or even completely lifts the degeneracy in many cases.
The joint use of SAXS and RDC for determining the bimolecular complex structure has been demonstrated by using a tetraloop-receptor RNA:RNA homodimer with 86 nucleotides as an example (Zuo et al. 2008). The structure of the monomeric subunit of this dimer is highly asymmetric and elongated, determined solely using NMR data. The RDC measurements provide four possible orientations for the two subunits within the dimer, shown in Figure 9A. Each of these four subunit orientations can be grid searched with respect to the coordinates (,θ,r) of an inter-subunit position vector, r, that connects the center of masses for the two subunits, Figure 9A. At each rigid-body grid step, +iΔ, θ+jΔθ, r+kΔr, the scattering pattern for the trial dimer complex is calculated and ranked with respect to experimental SAXS data. This search can be expedited by including Rg, steric hindrance, Dmax (largest distance within the complex obtained from experimental PDF analysis), and symmetry filters. The resulting best fit dimer complex structure is shown in green in Figure 9B. This structure has an identical dimer interface (rmsd 0.4 Å) as the one determined using NMR data containing 72 inter-subunit NOEs alone, and shown by the red structure in Figure 9B. This structure can be further refined by using a combination of SAXS data and NMR distance constraints to achieve the structure shown in cyan in Figure 9B that differs from the structure obtained by NMR data alone by a backbone rmsd of 3.2 Å. This new structure illustrates the deficiency of obtaining a solution structure determined solely from NMR data without including SAXS as global restraints (Zuo et al. 2008).
This method can be readily modified to solve structures for multi-component complexes. We expect this methodology could become a routine method for rapid structure determination of modular architectures composed of structurally determined building blocks. The combined NMR and x-ray scattering refinement analyses gains relevance because both sets of data can be recorded using identical conditions, and they provide fully complementary data.
The examples cited in this review demonstrate the opportunityto gain an atomic-scale understanding of structure and structural dynamics for supramolecular assemblies in solution by combining x-ray scattering with coordinated based modeling. These approaches can be expected to have a broad range of applications for structure-function determination, particularly for a variety of structurally dynamic supramolecular assemblies in natural and artificial photosynthesis. In addition, the emerging ultrafast time resolved scattering techniques offer entirely new possibilities to resolve fundamental mechanisms for solar energy conversion by mapping out structure in light-excited reaction states.
This work was supported by the Office of Science, Basic Energy Sciences, U. S. Department of Energy under contract numbers DE-AC02-06CH11357 (D.M.T. and work at APS Sector 12), National Science Foundation IL-LSAMP grant HRD-0413000 and National Institutes of Health Grant 1SC2GM083717 (K.L.M.). The software program, solX, used for coordinated based x-ray scattering calculations is available by request to D.M.T. or X.Z.