Proteins carry out their functions through interactions with other molecules. Of particular interest here are assemblies of multiple proteins, which are often large, dynamic, flexible, and fragile, contributing to the difficulty of determining their structures. Even when single structure determination methods fail, however, atomic models of assemblies can be determined by combining multiple types of experimental data, including those from X-ray crystallography, nuclear magnetic resonance (NMR) spectroscopy, electron microscopy (EM), small-angle X-ray scattering (SAXS), cross-linking, mass spectrometry (MS), and affinity purification (
Alber et al., 2008;
Lasker et al., 2010a;
Robinson et al., 2007). Computational integration of diverse experimental data into an ensemble of models that best satisfy the data is not yet an entirely automated process. Therefore, visualization software, used to setup calculations, assess the results, and troubleshoot problems, is essential for the quality and efficiency of iterative integrative structure modeling.
A common structure determination approach is the fitting of crystal structures and comparative models into an EM map of the full molecular assembly. The structures can be fit as rigid bodies by sampling globally (
Fabiola and Chapman, 2005) or locally (
Goddard et al., 2007;
Pintilie et al., 2010). Methods of flexible fitting include molecular dynamics (
Trabuco et al., 2008), Monte Carlo (
Topf et al., 2008), normal mode analysis (
Tama et al., 2004), and morphing (
Wriggers, 2010;
Wriggers and Chacón, 2001). Restraints such as symmetry (
Navaza et al., 2002) and intermolecular distances (
Rossmann et al., 2001) can be incorporated into the fitting process. Which available method is best depends on many factors, including the resolution and symmetry of the density map, the availability of additional restraints, and the accuracy of component models.
SAXS profiles have been used widely for low-resolution structural characterization of molecules in solution (
Petoukhov and Svergun, 2007;
Putnam et al., 2007;
Schneidman-Duhovny et al., 2010). While a SAXS profile can be converted into an assembly envelope that can in turn be used directly for fitting component molecules (
Svergun, 1999), the SAXS measurement has a relatively low information content - the rotationally averaged scattering intensity versus the scattering angle approximately determines only the system’s radial distribution function. Thus, a good use of an experimental SAXS profile is to compare it to a profile computed from a 3D structural model that was derived from other data (
Pons et al., 2010;
Schneidman-Duhovny et al., 2010;
Svergun et al., 1995). Also, changes in assembly conformation or composition under variations of
pH, salt, temperature, cofactors, and drugs can be recognized, and candidate models ranked by comparison of experimental and model-derived SAXS profiles.
Atomic assembly models often generate invaluable testable hypotheses. For example, models predict which residues are in contact at intermolecular interfaces and thus may be essential for assembly formation and function. In models built from individual X-ray crystal structures, the sidechain conformations may not reflect those in the complete assembly, either because of the induced fit or modeling errors. Thus, analysis of sidechain rotamers is useful for assessing residue interactions in the complex (
Guharoy et al., 2010).
We have recently integrated comparative (homology) protein structure modeling by MODELLER (
Fiser et al., 2000;
Marti-Renom et al., 2000;
Sali and Blundell, 1993), multiple simultaneous fitting into EM maps by IMP MultiFit (
Lasker et al., 2010a;
Lasker et al., 2009), SAXS profile fitting by IMP FoXS (
Forster et al., 2008;
Schneidman-Duhovny et al., 2010), and evaluation of sidechain conformations from backbone-dependent and backbone-independent rotamer libraries (
Dunbrack, 2002;
Lovell et al., 2000) into the UCSF Chimera molecular visualization package. These capabilities augment over 100 tools already provided by Chimera for the interactive analysis of atomic models, density maps, and protein sequences (
Couch et al., 2006;
Goddard et al., 2005;
Goddard et al., 2007;
Meng et al., 2006;
Morris et al., 2007;
Pettersen et al., 2004;
Pintilie et al., 2010). Chimera provides graphical user interfaces to simplify setting up input data and parameters for the fitting process, evaluating results, and performing cycles of refinement for building models of macromolecular assemblies. The homology modeling, EM fitting, and SAXS calculations are launched from Chimera and executed remotely via MODELLER- and IMP-based web services (
Russel et al., 2011), with results displayed in the molecular visualization environment as they become available. The web service approach allows incorporation of improvements without the user installing new software, and can provide transparent access to more powerful computing resources. Optionally, calculations can also be performed using locally installed copies of MODELLER and IMP. MultiFit and FoXS are part of the Integrated Modeling Platform (IMP) package (
Russel et al., 2011) that performs simultaneous optimization of multiple restraint types to generate ensembles of assembly structures consistent with diverse experimental data (
Alber et al., 2008;
Lasker et al., 2010b;
Robinson et al., 2007). The Chimera user interfaces described here are a first step towards a more comprehensive graphical user interface (GUI) to control and visualize results from this suite of tools.
Next, we describe the current assembly modeling tools and then illustrate their range of capabilities on two example systems, GroEL chaperonin and ARP2/3.