|Home | About | Journals | Submit | Contact Us | Français|
Four-dimensional (4D) NOESY spectra provide unambiguous distance information at a resolution that can not be achieved in fewer dimensions and thus increase the quality of biomolecular structure determination substantially. Since the degree of chemical shift degeneracy increases with protein size, the use of 4D NOESY spectra is particularly important for large proteins. The potential high resolution in 4D spectra can not be achieved in a reasonable time with conventional acquisition routines that sample the Nyquist grid uniformly. It can however be obtained with non-uniform sampling of the data grid, but optimal processing of such data has not yet been established. Here we describe a processing method for a pair of sparsely sampled 4D NOESY spectra, a methyl–methyl and an amide–methyl NOESY, recorded on a perdeuterated protein with protonated isoleucine, leucine and valine methyl groups. The coupled multi-dimensional decomposition (Co-MDD) of these two spectra together with a 2D template spectrum results in a substantial increase in sensitivity, evidenced by 50–100% additional cross peaks, when compared to alternative processing schemes. At the same time, Co-MDD allows the use of low sparse levels of 10–15% of the full data grid for NOESY spectra. For the 283-residue integral human membrane protein VDAC-1, which has a rotational correlation time of about 70 ns in detergent micelles, the two 4D Co-MDD NOESYs yielded a total of 366 NOEs, resulting in 139 unambiguous upper limit distance constraints for the structure calculation.
The nuclear Overhauser effect provides the core data for the structure determination of large biomolecules.1 Cross peaks in NOESY spectra reveal short distances in the spatial structure of the molecule and can thus be used to derive constraints for structure calculations.2 It is crucial for these calculations that cross peaks are well-resolved for unambiguously identifying the connected protons. Three-dimensional NOESY spectra are widely used, but for large systems it is advantageous to resolve the NOEs in four dimensions, since this drastically reduces the ambiguity of the constraints, leading to better structures.3-6
With conventional acquisition and processing schemes, the achievable resolution in 4D NOESYs is low, due to the long acquisition times of 50–100 days that would be needed to sample the time domain at the high resolution achievable with modern high-field spectrometers. This obstacle can be circumvented by sampling data only at a fraction of the grid points and using processing techniques other than the standard fast Fourier transformation.7-18 Multidimensional decomposition (MDD) has been used to process a single 4D methyl–methyl NOESY of a protein in D2O solution, sampled non-uniformly at a sparse level of 30%.19
In a previous work, we have shown that coupled (Co)-MDD processing of several spectra can be used to build a model of a hyper-dimensional (HD) spectrum.20 An HD spectrum has a dimensionality higher than that of any of the input experiment. Each additional spectrum in the input HD set contributes at least one new spectral dimension. Thus, in addition to the direct experimentally established correlations, HD spectra contain indirect spin correlations.
In the present work, we explore the ability to use Co-MDD for improving the quality of individual input spectra. By exploiting prior knowledge about shared dimensions, the coupled processing allows high sensitivity in one spectrum to enhance the sensitivity in other spectra. We describe the use of Co-MDD for processing a non-uniformly sampled 4D methyl–methyl NOESY and a non-uniformly sampled 4D amide–methyl NOESY, together with a regularly sampled 2D [13C,1H]-HMQC. The Co-MDD scheme also enables the use of very low sparse levels of 10–15% for the individual experiments. The experiments are recorded in H2O solution, since the exchangeable amide signals are needed and consequently, proper handling of the water magnetization is important in the pulse sequence. We describe the technical details of acquisition and processing with the new method on the background of the previously published de novo structure determination of the integral human membrane protein VDAC-1 in detergent micelles, during which these experiments were used (PDB code 2K4T, BMRB code 16381).21, 22 In LDAO micelles in aqueous solution, VDAC-1 has a rotational correlation time of 70 ns, as determined by the TRACT method.23 In the meantime, two crystal structures of VDAC-1 have become available, providing an unbiased reference point for analyzing the 4D NOESY experiments.24, 25 We compare the Co-MDD NOESYs with spectra resulting from the same raw data using alternative processing schemes and analyze the completeness and correctness of the observed NOE cross peaks with the three-dimensional protein structure.
Human VDAC-1(1–283)-Leu-Glu-His6 was recombinantly expressed in BL21(DE3) cells on [U-99%-2H,15N]-M9 medium containing 50 mg/L [4-13C;3,3-2H2]-α-ketobutyrate and 100 mg/L [4,4′-13C2;3-2H]-α-ketoisovalerate.26 The protein was isolated, purified and refolded into perdeuterated lauryldimethylamine oxide (LDAO) micelles.21 The final sample conditions were 25 mM Na·PO4, 5 mM DTT, pH 6.8, 310 mM [U-99%-2H]-LDAO, 0.5 mM [U-99%-2H,15N; 99%-1Hδ,13Cδ-IL; 99%-1Hγ,13Cγ-V]-VDAC-1. Perdeuterated LDAO was purchased from FB Reagents, Cambridge, MA (www.fbreagents.com), all other isotopes from Cambridge isotopes, Andover, MA.
All experiments were recorded on a Bruker Avance 900 spectrometer equipped with a cryogenic probe. A 3D [1H,1H]-NOESY-13C-HMQC and a 2D [13C,1H]-HMQC were recorded with conventional sampling and processing. The 4D 13C-HMQC-[1H,1H]-NOESY-13C-HMQC27, 19 was collected following a random sampling schedule corresponding to 11.5 % of the full time domain grid defined by (24, 44, 96, 1024) complex points (11620 of 101376 time domain points) for (ω1(1H), ω2(13C), ω3(13C), ω4(1H)) with sweep widths of (1200 Hz, 3400 Hz, 3400 Hz, 12626 Hz). The 4D 15N-HMQC-[1H,1 H]-NOESY-13C-HMQC (Fig. 1) was collected following a random sampling schedule corresponding to 15% of the full data grid defined by (28, 28, 96, 1024) complex points (11200 of 75264 time domain points) for (ω1(1H), ω2(15N), ω3(13C), ω4(1H)) with spectral widths of (3600 Hz, 2900 Hz, 3400 Hz, 12626 Hz). The sample schedules were created using the nussampler routine from the MDDNMR software package with non-biased probability density.
Since the two spectra are recorded here for a large protein in H2O solution, water flip-back pulses were used.28 The preferred way for water handling during the long NOE mixing time of 200 ms in both NOESY experiments is to utilize radiation damping. This is achieved by phase shifting two 1H-pulses by 45° relative to the first one (Fig. 1), ensuring that the water magnetization is never in the –z state at the onset of the NOESY mixing time.29 For both pulse sequences, the intrinsic water flip back rates on a 900 MHz spectrometer with a cryogenic probe were found to be 80–90%, resulting in a steady state water flip back of 60–70%.30 Throughout the experiments, the 1H carrier frequency remained on the water resonance and the indirect proton dimension ω1 with the chemical shift ranges of 0.0–1.3 ppm and 6.7–10.7 ppm were aliased three and one time around the water resonance for the 13C-HMQC–[1H,1H]-NOESY–13C-HMQC and the 15N-HMQC–[1H,1H]-NOESY–13C-HMQC, respectively. Further, we found that it was not beneficial to use the available constant time periods on 1H, since placing a pair of 180° pulses on 1H and 15N in the [15N,1H]-HMQC element converts CRINEPT to INEPT type transfer, resulting in a signal loss of 20–30%.31, 32
Coupled MDD processing was performed using the MDDNMR software package 33, which is available on request from Vladislav Orekhov. The data flow chart for the coupled processing of two 4D NOESY and one 2D HMQC spectra is shown in Figure 2. First, the acquisition dimension is processed using the nmrPipe software 34 for all three spectra: The time domain signal is multiplied by a shifted squared sine-bell weighting function, the number of points is doubled by zero-filling, Fourier transform is performed and the region of interest is extracted (for VDAC-1 the range 1.3 – 0.2 ppm). The resulting data sets are CCS (t1,t2,t3,ω4) for the 4D 13C-HMQC-[1H,1H]-NOESY-13C-HMQC, NCS (t1,t2,t3,ω4) for the 4D 15N-HMQC-[1H,1H]-NOESY-13C-HMQC and HMQCS (t3,ω4) for the 2D [13C,1H]-HMQC. The two dimensions of the 2D [13C,1H]-HMQC spectrum are termed t3 and ω4 to indicate that the line shapes observed in these two dimensions correspond to dimensions t3 and ω4 of the two 4D NOESY spectra. The spectra span four evolution times t1, t2, t3, and one frequency domain ω4.
For co-processing of the HMQCS (t3,ω4) and CCS (t1,t2,t3,ω4) spectra, these experimental data sets are approximated by an MDD model:
The summations run over all diagonal and cross-peaks, with the indices γ and β distinguishing methyl groups of origination and destination of magnetization, respectively. The normalized one-dimensional functions L describe the time or frequency domain signals of the methyl nuclei 1H or 13C and thus encode their peak positions, phases, and line-profiles. The peak intensities are subsumed in the coefficients θβ and λγ,β.
Substituting, Zβt1,t2 = Σγ λγ,β LHγ (t1) LCγ (t2), Cβt3 = LCβ (t3), and Hβω4 = Lhβ (ω4), Eqs 1 take the form
In Eqs. 2 the factors Cβt3 and Hβω4 correspond to 13C and 1H profiles, or “shapes”, associated with methyl group β. A shape Zβt1,t2 can be thought of as a two-dimensional [1H;13C] time domain correlation spectrum. A 2D Fourier transform of such a shape would contain a diagonal peak and cross-peaks from all methyl protons γ that are proximal to methyl β.
The evolution times and spectral frequencies are defined on a grid with regular intervals Δt1, Δt2, Δt3, and Δω4
where indexes p, q, m, n run from 1 to the maximal values P, Q, M, N defined by the data sizes in dimensions 1–4, respectively. Thus, the shapes Cβt3, Hβω4 are vectors and Zβt1,t2 are matrices with elements Cβm, Hβn, and Zβp,q, respectively. In practice, the NMR time domain signal is comprised of both cosine- and sine-modulated components, facilitating quadrature detection. Thus, sizes of the real, time domain shapes P, Q, and M have twice as many points as the number of time intervals in corresponding spectral dimensions. MDD operates only with one-dimensional shapes and, as we described it in our previous publications19, it is convenient to think about the matrix Z as a vector with elements Zk. The index k spans all combinations of indexes of p and q and runs from 1 to K=PQ. Similar to Zk, we can write Zβt1,t2 as a function of one argument ζ, which spans the combination of evolution times t1,t2 on a defined grid (Fig. 2B). After normalizing vectors Zβζ and putting the normalization factor to element α=2 of new vector Aβ, Eq. 2b converts to
The new index α=1 and α=2 distinguishes normalization factors for the 2D and 4D experiments, respectively (Fig. 2B). Eq. 2a is multiplied by an arbitrary element of the shape Zβ, using a new normalization Aβα=1 = θβ/ZβζḲ1 and can thus rewritten as
where in the Z dimension, all data points from MQS (t3,ω4) spectrum are assigned to point ζḲ1. In theory, any point from the Zβ shapes can be used. However, for stability of the MDD calculations, we take the point with the index K + 1, which is not present in the CCS (ζ,t3,ω4) data set. Later on for the reconstruction of the complete 4D spectrum, this point in the Zβ shapes is discarded. The two experiments on the left sides of Eqs. 4 and corresponding models on the right sides can be combined so that the resulting four dimensional data set COS is described by a common model:
The experimental data COS (α, ζ, t3, ω4) has the size of 2 × (K+1) × M × N and is sparse because of the non-uniform sampling of the 4D spectrum and because index α separates the two experiments. It serves as input for MDD calculations, which is performed independently for overlapping sub-regions along ω4, resulting in the calculation of the complete vectors A,Z,C, and H.35 For VDAC-1, nine sub-regions of about 0.11 ppm each, cover the range 1.3–0.2 ppm. For each sub-region, MDD is performed with the number of components Nc = 1.5 · Np + 4, where Np is the number of peaks automatically picked in this sub-region in the 2D [13C,1H]-HMQC spectrum using the nmrPipe peak picking routine. The number of component for the individual regions varied from 5 to 70. All MDD calculations are performed with the default λ value of 0.005 and with 200 iterations. For each sub-region the four-dimensional decomposition results in a set of components enumerated by the index β. Each component consists of four complete real one-dimensional shapes: βAα, - shapes contain scaling factors for the peaks in the 4D and 2D spectra; βZζ, βCt3 – time domain shapes correspond to the dimensions t1,2(1H,13C) and t3(13C) of the 4D 13C-HMQC-[1H,1H]-NOESY-13C-HMQC spectrum; and βHω4 - frequency domain shapes for the dimension ω4(1H). Using the procedure described above, the 4D 15N-HMQC-[1H,1H]-NOESY-13C-HMQC spectrum can be arranged to data set NCS(ζ,t3,ω4) and described by the MDD model as
where shapes βVζ correspond to the dimensions t1,2(1H, 15N), i.e. they are analogous to the shapes βZζ in Eq. 4a. In the decomposition of Eq. 6, we use the shapes βCt3 and βHω4 obtained in the co-MDD decompositions of the data set in Eq. 5. These shapes were fixed and only shapes βVζ, were allowed to be adjusted. With all shapes defined from the MDD calculations, the complete 4D NOESY data sets are reconstructed using Eqs. 4a and 6 for the 4D 13C-HMQC-[1H,1H]-NOESY-13C-HMQC and the 4D 15N-HMQC-[1H,1H]-NOESY-13C-HMQC, respectively. Reconstructions for the complete region of interest are produced by combining reconstructions for all sub-regions. The remaining time domain dimensions of the reconstructed 4D spectra are processed conventionally using Fourier transformation in dimensions 1–3 and linear prediction of 20 points in dimensions 2 and 3, resulting in sizes of 64 × 128 × 256 × 162 points.
Two other processing schemes, discrete Fourier transformation (NUS-DFT)15 and the original MDD35, 19 were applied to the same non-uniformly sampled raw data. For the processing with DFT, zero values were assigned to the missing data points. The MDD calculations were performed using the same sub-region selection, number of components and λ as for the Co-MDD scheme. For both NUS-DFT and MDD, weighting functions, zero-filling, linear prediction and Fourier transformation parameters for processing of the resulting data set were the same as for the Co-MDD scheme.
A first step for the recording of 4D NOESYs in a structure determination is the identification of suitable magnetization transfer pathways. For large proteins, the selective labeling of methyl groups in Ile, Leu and Val residues is well established,36, 26 and together with the amide moieties this labeling scheme enables four possible magnetization transfer pathways for 4D NOESY spectra, the amide–amide, methyl–methyl, methyl–amide and amide–methyl. For large systems, such as VDAC-1, the methyl–methyl pathway is the most sensitive experiment among these. The amide–amide pathway is the least sensitive and it did yield hardly any detectable cross peaks in 4D NOESYs in the case of VDAC-1. Note that in 3D NOESYs, the amide–amide NOEs were above the sensitivity limit and were crucial to establish the backbone topology of the channel (see below). From the 4D amide–methyl and 4D methyl–amide experiment, only one is required, since these spectra are transposed relative to each other. Processing with Co-MDD requires that the experiments share the dimensions ω3 and ω4, corresponding to the second heteronuclear element, and thus the 4D NOESYs methyl–methyl and amide–methyl, together with the 2D [13C,1H]-HMQC, form a suitable set of experiments for Co-MDD.
The implementation of a desired 4D NOESY pathway into a pulse sequence requires the selection of the optimal 2D heteronuclear editing elements before and after the NOESY mixing time. For methyl groups in large molecular systems, the [13C,1H]-HMQC experiment has been identified as the most sensitive experiment.27, 19 For the amide groups, we compared the [15N,1H]-HMQC with the [15N,1H]-TROSY experiment.37, 32 With the digital resolutions used in the present work, the sensitivity of the [15N,1H]-HMQC experiment was higher than the [15N,1H]-TROSY by factors in the range of 2.0–3.0 for different residues of VDAC. We thus found the 15N-HMQC–[1H,1H]-NOESY–13C-HMQC experiment to be the most favorable implementation for the amide–methyl pathway for VDAC-1 (Fig. 1).
In the following we describe why the presence of the strong diagonal signals in the NOESY spectrum helps to detect small cross peaks in the same spectrum and why a similar effect can be achieved by co-processing with a reference spectrum or by using pre-calculated reference shapes. We restrict the present discussion to rationalizing the resulting MDD spectrum of the spectrum. A description of the mathematical MDD algorithm can be found elsewhere 38. The MDD spectrum is an approximation of a three- or higher dimensional spectrum by sum of a given number of components. Each component is the outer product of three one-dimensional shapes. No assumptions are made about functional forms of the shapes. In other words, each element of a shape is considered unknown by the MDD algorithm. The algorithm ascribes components for describing the most intense features in the data. In addition to signals, experimental data may contain strong artifacts. Moreover, even genuine signals may slightly deviate from the model assumption of the outer shapes product. In general, the MDD algorithm is thus likely to allocate available components to describe the strongest artifacts and the noise intensities instead of finding the weak signals. The weak peaks may be recovered, however, when several signals share line-shapes along all but one spectral dimension. All these peaks are described by a single component. The algorithm has to allocate a component for describing at least the strongest cross-peaks in the group. The strong peaks also define the shapes, notably including the shared ones. The remaining shape, which corresponds to the dimension, where peaks in the group are distinct, contains the strong peaks and all intensities matching the shared shapes, including the weak peaks and noise baseline. A good example for this feature is the 3D 1H-15N NOESY-HSQC spectrum10, where the diagonal signal and all the cross-peaks to the corresponding amide proton are described by one component. Thus, the key to detect a weak peak is to organize the MDD calculations so that weak cross peak signals are grouped with strong (e.g. diagonal) signals. If the peaks are distinct in more than one MDD dimension, they cannot be grouped in to one component and the weak peaks are lost. Such a case occurs if the 4D NOESY spectrum would be solved directly using four-dimensional MDD. In this work, similar to our previous publications 19, 35, the above formulated “all but one dimension” rule for grouping the peaks is implemented by combining the first two dimensions from the 4D NOESY spectra into one, that is Zζ and Vζ shapes for the carbon and nitrogen resolved spectra, respectively.
The strong peaks help to detect the weak ones by allocating a component to the right place in the frequency spectrum and defining all but one line shape for these peaks. There are cases, however, when peak groups do not contain sufficiently strong signals. Examples are peak constellations related to the weakest diagonals in the 4D 13C-HMQC-[1H,1H]-NOESY-13C-HMQC spectrum or the signals in the 4D 15N-HMQC-[1H,1H]-NOESY-13C-HMQC, where diagonal signals are not available at all. Data in these spectra alone are then not sufficient to correctly determine the shared shapes. Previously we showed that weak peaks can be recovered using predefined shapes obtained in a reference experiment 20 and we use this approach in the present work for processing the 4D 15N-HMQC-[1H,1H]-NOESY-13C-HMQC. In order to obtain good results with the fixed shapes technique, the spectra need be fully compatible with each other, i.e. grids in time and frequency domains, spectral widths, carrier frequencies, signal phases, etc must be the same in both spectra for the shared dimensions. Moreover, the reference set of shapes should be able to describe all strong features in the processed spectrum, including the artifacts (if any). In this work, the nitrogen resolved 4D spectrum was successfully solved using Ct3 and Hω4 shapes from 4D 13C-HMQC-[1H,1H]-NOESY-13C-HMQC spectrum. The two 4D spectra matched very well, because their pulse sequences had equivalent [1H;13C]-HMQC blocks for the signal detection.
For processing of 4D 13C-HMQC-[1H,1H]-NOESY-13C-HMQC, the more sensitive 2D [1H;13C]-HMQC spectrum served as a reference. The MDD algorithm is only applicable to data sets of three and more dimensions and we thus co-processed the 2D and 4D spectra by combining the two experiments into one four-dimensional data set as described in the Methods section. In this way, methyl with weak diagonal peaks in the 4D experiment are grouped with their stronger signals in the 2D. The 4D 15N-HMQC-[1H,1H]-NOESY-13C-HMQC in turn was processed with the fixed from the above processing. In principle, it would be possible to coprocess the 4D 15N-HMQC-[1H,1H]-NOESY-13C-HMQC together with the 2D [1H;13C]-HMQC, or together with the 2D [1H;13C]-HMQC and the 4D 13C-HMQC-[1H,1H]-NOESY-13C-HMQC. However, due to the relatively low sensitivity of the 4D 15N-HMQC-[1H,1H]-NOESY-13C-HMQC, we found the fixed shape approach to yield the best result among these possibilities.
The 4D NOESY spectra described here were used during the de novo structure determination of the protein VDAC-1, and thus at a time when the correct 3D structure of the molecule was yet unknown. Prior to the analysis of the NOESY spectra, the sequence-specific assignment of the backbone had been established using HNCA and HNCACB experiments, supported by amino-acid specific labeling.39, 21 For the assignment of the ILV methyls, a HMCM[CG]CBCA-COSY experiment was used,40 but due to the large number of ILV residues and the resulting chemical shift degeneracy, only about 20% of the methyl groups could be assigned using this covalent correlations, while the other methyl assignments remained ambiguous.
In this situation, the two high-resolution 4D NOESY spectra, together with two conventional 3D NOESY spectra, one resolved for the methyls and one for the amides, were used. Strong interstrand 1HN–1HN NOEs were observed in the 3D 15N-resolved-[1H,1H]-NOESY, establishing a preliminary β-sheet topology, but many NOE cross peak assignments remained ambiguous from the 3D NOESYs alone due to frequent peak overlap (Fig. 3). Based on this initial structural model, the ambiguous side chain assignments could be resolved, mainly from the 4D NOESY spectra, leading to 96% complete sequence-specific resonance assignment of the methyl groups and subsequently to a large number of methyl–methyl and methyl–amide NOE contacts (Fig. 4). In particular, most methyl–methyl NOEs were resolved only in the 4D 13C-HMQC-[1H,1H]-NOESY-13C-HMQC, due to the small chemical shift dispersion of 0.7 ppm for the methyl 1H resonances, which for most cross peaks led to signal overlap with the strong diagonal peaks in the 3D 13C-resolved-[1H,1H]-NOESY.
For VDAC-1, the 28 leucine, 12 valine and 11 isoleucine residues span an NOE network around the β-barrel (Fig. 4). For all except one beta strand pairing (β7–β8), the relative orientation of the strands was supported by at least one unambiguous NOE from the 4D NOESY spectra. Of particular structural interest in this protein is the parallel pairing of strands β1–β19, a previously unknown feature of integral β-barrel membrane proteins, and the contact between the N-terminal tail and the inner barrel wall at residues 10, 143 and 150. The parallel pairing of β1– β19 was supported by six different 4D NOE contacts and the contact between residues 10, 143 and 150 by eight 4D NOEs (Fig. 4; each line in this Figure may represent more than one NOE). The availability of unambiguous high-resolution 4D NOEs thus had a high impact on confirming the topology and establishing the three-dimensional structure of VDAC-1.
For an evaluation of the sensitivity of the 4D NOESYs, we compare the Co-MDD spectra with spectra obtained using two alternative processing methods of the same non-uniformly sampled raw data: (i) a Fourier transformation of the sparse data, with all missing points set to zero (NUS-DFT) and (ii) processing using MDD for each of the two NOESY data sets individually, i.e. without coupling of the decompositions. It should be noted that methods are currently being developed that might improve the performance of the NUS-DFT method by nonlinear artifact cleaning.41, 18 The current comparison however uses the performance of the linear NUS-DFT method as a benchmark. It would be desirable also to compare the performance of the co-MDD analysis with recently introduced method of Forward Maximum Entropy Reconstruction 17. The comparison, however, is currently not feasible due to long computational times of the FM reconstruction for the presented data set.
The methyl groups of VDAC-1 can be distinguished in two groups, M1 and M2, according to the result from the three processing methods in the 4D 13C-HMQC-[1H,1H]-NOESY-13C-HMQC spectrum (Fig. 5). Group M1 was found to contain about 85% of the methyls and for these the 2D planes in the MDD and the Co-MDD spectra are of similar spectral quality, with a slight sensitivity advantage of the Co-MDD data. In the NUS-DFT spectrum however, the planes of these residues have a substantially lower sensitivity, so that many low intensity cross peaks are not distinguishable from the noise. For group M2, which comprises the remaining 15% of the methyl groups, the relative outcome of the three processing schemes is different than for group M1. The 2D planes from group M2 residues did not show peaks in the MDD spectrum, but similar sensitivity in the planes of the Co-MDD and the NUS-DFT spectra (Fig. 5). Overall, only 53 % of the cross peaks that are observed by Co-MDD are resolved by NUS-DFT in the 4D methyl–methyl spectrum.
The occurrence of the two groups can readily be rationalized with the additional observation that the residues from group M2 feature weaker diagonal peak intensities than those of group M1. The MDD method alone relies on strong diagonal peaks and can thus not detect the group M2 peaks. By coupling to the high-sensitivity 2D [13C,1H]-HMQC spectrum in the Co-MDD processing, these planes are however recovered and were in each case at least as sensitive as the NUS-DFT. In contrast, the weak performance of the NUS-DFT method for group M1 peaks with strong diagonal signals can be understood since the Fourier transformation of sparsely sampled data set corresponds to a Fourier transform of the hypothetically fully sampled data grid convoluted with a Fourier transformation of the sampling schedule. This convolution creates substantial noise from the strong diagonal peaks and other intense spectral features, such as residual water or detergent signals.
In the 4D 15N-HMQC-[1H,1H]-NOESY-13C-HMQC spectrum, which does not have any diagonal peaks, the result for all methyls resembles the outcome of the group M2 in the 4D 13C-HMQC-[1H,1H]-NOESY-13C-HMQC spectrum (Fig. 6). When processed with the MDD method alone, the ω1/ω2-planes from all methyl resonances are essentially void of cross peaks. The NUS-DFT spectrum shows a number of cross peaks for many residues (Fig. 6). If the Co-MDD method is used, which fixes the ω3(13C) and ω4(1H) shapes from the 4D 13C-HMQC-[1H,1H]-NOESY-13C-HMQC, about 20% additional cross peaks compared to the NUS-DFT are recovered from the noise (Fig. 6). For none of the residues, a decrease in sensitivity in the Co-MDD when compared to the NUS-DFT was observed.
Thus, for both the 4D 13C-HMQC-[1H,1H]-NOESY-13C-HMQC and the 4D 15N-HMQC-[1H,1H]-NOESY-13C-HMQC spectrum, processing with the Co-MDD method yielded for each methyl the best achievable results.
Other criteria for the spectral quality beside the sensitivity are the completeness and the correctness of the observed peaks. For an evaluation of the completeness of the NOEs, we use the 2.3 Å crystal structure of mouse VDAC-1 as a reference,25 which has been determined after the NMR structure of human VDAC-1. The NMR structure has been calculated using the 4D NOEs and is thus not a neutral reference point for these NOEs. Even though the crystal structure of mouse VDAC-1 was determined in bicelles and features three single amino acid mutations as compared to the NMR structure, a 3D comparison of the structures shows that the 19-stranded barrel, the relative orientation of the strands, and the contact of the N-terminal tail to the barrel wall inside at residues 10, 143 and 150 are identical in both structures.
For an assessment of the completeness we consider how well the observed NOEs compare to the NOEs expected from the 3D structure (Fig. 7). For the 4D 13C-HMQC-[1H,1H]-NOESY-13C-HMQC, the completeness is between 80% and 100% for short contacts and decreases for longer distances. A drop in percentage can be seen at 4.5 Å. This distribution is comparable with previous MDD applications and with those observed in conventional Fourier transform 3D NOESYs.42 For the 4D 15N-HMQC-[1H,1H] -NOESY-13C-HMQC, the average coverage of expected NOES in the range 2.0–4.5 Å is 31% (Fig. 7). In this spectrum, the observation of cross peaks correlated with the sensitivity of these peaks in the 3D spectra and the distribution of the distance dependence of the completeness indicates that the absence of peaks is indeed sensitivity dependent. The large effective molecular weight of VDAC-1 in detergent micelles thus approximately represents the size limit of this experiment and for systems with a smaller rotational correlation time an even higher coverage of peaks in the 4D 15N-HMQC-[1H,1H]-NOESY-13C-HMQC can be expected.
To assess the linearity of the peak intensities, we compare the intensities of the spectra with those in the DFT spectra mentioned above. Starting from the same non-linearly sampled data set, the data processing with DFT is a linear function. We observe that for both 4D spectra, a linear correlation between the two processing methods is obtained and the remaining differences are within the spectral noise (Fig. 8). It can thus be concluded that Co-MDD does not strongly disturb peak intensities, which is in line with previous similar investigations 35.
In terms of correctness we note that all observed NOE peaks were picked before assignment and structure were known. They fit the structure (Fig. 4) and are thus confirmed as real peaks and not artifacts induced by the MDD method. Overall, we found that the Co-MDD processing did not create artifacts that could be mistaken for real peaks.
Here, we have described a new method for the coupled processing of high-resolution 4D NOESY experiments, have analyzed its performance and discussed its use in a structure determination. Co-MDD is more sensitive than DFT NMR and MDD alone and recovers a substantial fraction of low-intensity peaks that are not detected by these methods. The experiments can be recorded on protein systems up to a rotational correlation time of at least 70 ns. Following the same lines of experiment design, the application of the Co-MDD method is readily applicable to other 4D NOESY data sets and to related pulse sequences.43-46 For example, on systems tumbling faster than VDAC-1, the intensity of amide–amide and methyl–amide NOEs will be above the noise limit and these spectra can then also be processed with the Co-MDD method. Due to these features and the general strength of the 4D NOESY approach, we anticipate that the method presented here will be widely used in structural studies of biomacromolecules and their complexes.
This work was supported by the grants from the Swedish Research Council (2005-2951, 2008-4299), the NIH (GM075879, GM066360, GM47467, EB002026); and the Wenner-Gren Foundation, Stockholm. S.H. was supported in part by the Swiss National Science Foundation.