Home | About | Journals | Submit | Contact Us | Français |

**|**HHS Author Manuscripts**|**PMC4731355

Formats

Article sections

- Abstract
- Theory
- Model system
- DNA hairpin force spectroscopy
- Non-uniform diffusion
- Discussion
- Supplementary Material
- References

Authors

Related links

Phys Rev Lett. Author manuscript; available in PMC 2016 January 29.

Published in final edited form as:

Published online 2011 August 23. doi: 10.1103/PhysRevLett.107.098102

PMCID: PMC4731355

NIHMSID: NIHMS752756

The publisher's final edited version of this article is available at Phys Rev Lett

See other articles in PMC that cite the published article.

To explain the observed dynamics in equilibrium single-molecule measurements of biomolecules, the experimental observable is often chosen as a putative reaction coordinate along which kinetic behavior is presumed to be governed by diffusive dynamics. Here, we invoke the splitting probability as a test of the suitability of such a proposed reaction coordinate. Comparison of the observed splitting probability with that computed from the kinetic model provides a simple test to reject poor reaction coordinates. We demonstrate this test for a force spectroscopy measurement of a DNA hairpin.

A variety of new experimental techniques have made it possible to monitor the conformational fluctuations of single biological macromolecules under both equilibrium and nonequilibrium conditions. These experiments aim to probe the statistical dynamics and conformational substates relevant to folding and function. In a typical experiment, such as observation of the resonant energy transfer efficiency between two fluorophores incorporated into an RNA molecule [1], fluctuations of a spectroscopic observable in the absence of an external field are monitored. Other experiments allow the effect of an external biasing potential on the dynamics to be observed, as in an optical trap [2–4].

To describe the observed dynamics of the system, it is tempting to identify the observable with a reaction coordinate and construct a model in which the dynamics evolves by a diffusion process in an effective potential, such as by overdamped Langevin (also called “Brownian”) dynamics [5],

$$\stackrel{.}{x}\left(t\right)=-\beta \frac{\partial}{\partial x}F\left(x\right)+\sqrt{2D\left(x\right)}R\left(t\right).$$

(1)

Here, *x*(*t*) is the time-dependent motion along the resolved coordinate, *D*(*x*) is the diffusion constant (often assumed to be a constant independent of *x*), *β* (*k _{B}T* )

Many physical systems such as biomolecules exhibit strong metastabilities in the conformational degrees of freedom, resulting in the presence of two or more discrete conformational states in which the system remains for a for long time before transitioning to another metastable state [6]. While it is often easy to find an observable *x* that is a suitable *order parameter* that allows these metastable states to be discriminated to some degree, it is generally difficult to find a good *reaction coordinate* so that dynamics along the resolved coordinate are well-described by Eq. 1.

For data collected in a given single-molecule experiment, how can we determine whether the resolved coordinate provides a good reaction coordinate? Recent work on tests of reaction coordinate suitability in computer simulations has focused on the calculation of the *committor* or *splitting probabilities*, a concept dating back to Onsager [7]. This quantity, now extensively used in simulation studies of protein folding [8], represents the probability that a trajectory first encounters one absorbing boundary placed along the reaction coordinate before another, given an initial microscopic state of the system. For suitable choices of reaction coordinate, the distribution of committor probabilities along an equilibrium ensemble of configurations restricted to a given value of the reaction coordinate will be closely grouped about a characteristic value [8–13]; indeed, ideal reaction coordinates organize committor isosurfaces in an ordered fashion along the reaction coordinate [14].

Unfortunately, tests based on evaluating distributions of committor values along cuts of the putative reaction coordinate are impossible to apply in a physical experiment, since there is no way to prepare the system in precisely the same microscopic configuration to probe the statistics of committor probabilities. Instead, we propose a simple alternative that is readily computable from observed equilibrium trajectories of the resolved coordinate: Comparison of the *average* committor along each value of the reaction coordinate evaluated by Eq. 1 with the *empirical average* committor from the observed trajectory.

Consider the placement of absorbing boundaries at *a* and *b* near the periphery of the observed range of the resolved coordinate *x*. For a diffusion process in one dimension governed by Eq. 1, the probability of first encountering *a* before *b* starting from *x* [*a, b*] can be shown to be [5, 14, 15],

$${p}_{A}\left(x\right)=\frac{{\int}_{x}^{b}d{x}^{\prime}D{\left({x}^{\prime}\right)}^{-1}{e}^{\beta F}\left({x}^{\prime}\right)}{{\int}_{a}^{b}d{x}^{\prime}D{\left({x}^{\prime}\right)}^{-1}{e}^{\beta F}\left({x}^{\prime}\right)}.$$

(2)

The PMF along the resolved coordinate *x*, *F*(*x*), can be estimated from a single-molecule trajectory of sufficient length [4, 16] or from multiple trajectories under different equilibrium [17] or nonequilibrium [18, 19] conditions. The diffusion profile *D*(*x*) can be estimated in a number of ways (such as the Bayesian scheme of Best and Hummer that allows simultaneous computation of both PMF and diffusion constant [15]), though it is commonly assumed to be constant, in which case it cancels from both numerator and denominator.

An *empirical* estimate of the splitting probability * _{A}*(

$${\widehat{p}}_{A}\left(y\right)=\frac{{\int}_{0}^{\mathcal{T}}dt\phantom{\rule{thinmathspace}{0ex}}\delta (y-x\left(t\right)){c}_{A}\left(t\right)}{{\int}_{0}^{\mathcal{T}}dt\phantom{\rule{thinmathspace}{0ex}}\delta (y-x\left(t\right))}$$

(3)

where we have defined the hitting function *c _{A}*(

$${c}_{A}\left(t\right)=\{\begin{array}{cc}1\hfill & \text{if}\phantom{\rule{thickmathspace}{0ex}}{\tau}_{A}\left(t\right)<{\tau}_{B}\left(t\right)\hfill \\ 0\hfill & \text{otherwise}\hfill \end{array}\phantom{\}},$$

(4)

where auxiliary functions *τ _{A}*(

$$\begin{array}{cc}\hfill {\tau}_{A}\left(t\right)=& \mathrm{inf}\{{t}^{\prime}>t:x\left(t\right)<a\}\hfill \\ \hfill {\tau}_{B}\left(t\right)=& \mathrm{inf}\{{t}^{\prime}>t:x\left(t\right)>b\}.\hfill \end{array}$$

(5)

The hitting function *c _{A}*(

Our proposed test is simple: By comparing the splitting probability estimated from the PMF, *p _{A}*(

When the observed coordinate is determined to be a poor reaction coordinate, the consequences of assuming it to be an adequate reaction coordinate depend on the precise nature of the information extracted from the single-molecule data. The consequences could be as simple as underestimating the rate constant for a two-state process or as subtle as inferring an erroneous mechanism for more complex processes. The most obvious consequence is mistaking the location of the transition state—the point where the splitting probability *p _{A}* = 0.5—to be displaced from the free energy barrier in the potential of mean force. For systems like DNA hairpins and proteins, this can have consequences for the interpretation of how “brittle” or “compliant” the conformational states are perceived to be. Notably, similar tests have been found to be useful in validating putative reaction coordinate choices in computer simulations, despite the ability to inspect the atomic coordinates directly [21, 22].

As an illustrative example, we consider the two-dimensional model system previously studied by Rhee and Pande [14],

$$U(x,y)=[1-0.5\phantom{\rule{thickmathspace}{0ex}}\mathrm{tanh}(y-x)]{(x+y-5)}^{2}+0.2[{({(y-x)}^{2}-9)}^{2}+3(y-x)]+15{e}^{-{(x-2.5)}^{2}-{(y-2.5)}^{2}}-20{e}^{-{(x-4)}^{2}-{(y-4)}^{2}},$$

(6)

pictured here in the upper-left panel of Fig. 1. Two stable states are present, located roughly at (*x, y*)-coordinates (4, 1) and (1, 4). At *k _{B}T* = 5, the PMFs along both

A Brownian dynamics trajectory of 10^{6} steps was generated using the discretization of Eq. 1 by Ermak and Yeh [23, 24], with a diffusion constant of *D* = 1 and timestep Δ*t* = 0.1. This trajectory was projected onto either poor choices of reaction coordinate *x* and *y*, or good reaction coordinate *q* (Supplementary Fig. 1). For each projection, the potential of mean force was estimated from an empirical histogram, e.g. *F*(*x*) ≈ –*k _{B}T* ln

The results of this comparison assuming a uniform diffusion constant are shown in Fig. 2. The poor suitability of *x* and *y* as reaction coordinates is easily seen by the large discrepancy between the the splitting probability *p _{A}* computed from the PMF (dashed line) and the empirical splitting probability

To demonstrate the utility of our proposed splitting probability test in a real laboratory measurement, we performed the same analysis on a single-molecule trajectory of a DNA hairpin in a double optical trap, previously reported by Woodside et al. [4]. The hairpin, referred to as 30R50/T4 due to the content of a 30 bp stem-forming sequence, is attached by means of dsDNA handles to two polystyrene beads held in a passive all-optical constant-force clamp [2] at an external force that encourages hopping among closed and open conformations over the course of the experiment. Bead displacements in the trap were recorded with a sampling frequency of 25 kHz [4], and the bead-to-bead extension trajectory was analyzed here.

Fig. 3 shows the observed trajectory of the molecular extension coordinate and corresponding splitting probability analysis for a uniform diffusion constant. From this analysis, it is evident there is poor agreement between *p _{A}*(

Recently, it has been suggested that non-uniformity of the diffusion constant along the resolved coordinate may have important ramifications for single-molecule biophysical experiments [15]. Could strong position-dependence of the diffusion constant *D*(*x*) may be responsible for the observed discrepancy in in Fig. 3? To judge whether non-uniform diffusion significantly impacted our test of reaction coordinate suitability, we used the Bayesian inference scheme proposed by Best and Hummer [15] to simultaneously compute position-dependent diffusion constant *D*(*x*) and potential of mean force *F*(*x*) for the systems considered here (Supplementary Figs. 2 and 3). Notably, the diffusion constant varies markedly with the bead-to-bead extension (Fig. 4, left), and the agreement of the PMF-derived *p _{A}* and empirical

We note that the reaction coordinate test presented here only allows us to test a condition that is *necessary*, but not *sufficient*, for Brownian dynamics to appropriately describe the observed dynamics on a one-dimensional landscape determined by the PMF. This does not rule out the possibility of pathological cases where poor reaction coordinates go unnoticed because the average splitting probability at a particular value of the resolved coordinate matches the PMF-derived model, but the splitting probability distribution is not tightly peaked about its average value. Additionally, if multiple reactive channels exist that are otherwise indistinguishable by this test, differences between the channels will not be resolvable.

Despite this, our test was able to discern good from poor choices of reaction coordinate in a model system, and reject the extension coordinate as a good choice of coordinate for a DNA hairpin unless a strongly position-dependent diffusion constant is permitted. Even then, there are statistically significant discrepancies between the observed splitting probability and the PMF-derived splitting probability that indicate this reaction coordinate choice is not ideal. We note that the presence of ~1 kb dsDNA handles tethering the DNA hairpin to the laser-trapped polystyrene beads is one potential source of the incomplete alignment of the extension coordinate with the reaction coordinate for hairpin unzipping. Shorter dsDNA handles have recently been suggested as a way to improve the signal-to-noise ratio [25], and may also improve the reaction coordinate quality. For proteins, techniques that allow the attachment of tethers at specific attachment points can be exploited to probe for improved reaction coordinate should the experimenter find that the current pulling coordinate under study is unsuitably poor [26]. Finally, we note that though this test is able to test the suitability of the extension coordinate for a polymer under force, we cannot determine from the present analysis whether a good reaction coordinate in the presence of external force would also be a good reaction coordinate in the *absence* of force, or even under different biasing forces; this concern is still the subject of active study [27, 28].

The authors thank Michael Woodside (University of Alberta and National Institute for Nanotechnology, NRC), Phillip Elms and David Chandler (U Berkeley), Gerhard Hummer and Attila Szabo (NIH), Steven Block and Imran Haque (Stanford University), Felix Ritort (University of Barcelona), and the anonymous referees for their helpful feedback on this work. The authors are grateful to Michael Woodside and Steven M. Block (Stanford University) for kindly providing original single-molecule data. JDC acknowledges support through an NSF grant for Cyberinfrastructure (NSF CHE-0535616) and a California Institute of Quantitative Biosciences (QB3) Distinguished Postdoctoral Fellowship. VSP acknowledges support from NIH R01-GM062868, NSF-DMS-0900700, NSF-MCB-0954714, and NSF EF-0623664. Matlab code implementing the analysis procedure described here can be obtained from https://simtk.org/home/splitting.

1. Smith GJ, Lee KT, Qu X, Xie Z, Pesic J, Sosnick TR, Pan T, Scherer NF. J. Mol. Biol. 2008;378:943. [PubMed]

2. Greenleaf WJ, Woodside MT, Abbondanzieri EA, Block SM. Phys. Rev. Lett. 2005;95:208102. [PMC free article] [PubMed]

3. Li PTX, Collin D, Smith SB, Bustamante C, Jr IT. Biophys. J. 2006;90:250. [PubMed]

4. Woodside MT, Anthony PC, Behnke-Parks WM, Larizadeh K, Herschlag D, Block SM. Science. 2006;314:1001. [PMC free article] [PubMed]

5. Gardiner CW. Handbook of Stochastic Methods. third ed. Springer; 2003.

6. Schütte C, Huisinga W. Biomolecular conformations can be identified as metastable sets of molecular dynamics. Computer Simulations in Condensed Matter: Systems: From Materials to Chemical Biology. I

7. Onsager L. Phys. Rev. 1938;54:554.

8. Du R, Pande VS, Grosberg AY, Tanaka T, Shakhnovich ES. J. Chem. Phys. 1998;108:334.

9. Geissler PL, Dellago C, Chandler D. J. Phys. Chem. B. 1999;103:3706.

10. Bolhuis PG, Chandler D, Dellago C, Geissler PL. Ann. Rev. Phys. Chem. 2002;53:291. [PubMed]

11. Ma A, Dinner AR. J. Phys. Chem. B. 2005;109:6769. [PubMed]

12. Peters B. J. Chem. Phys. 2006;125:241101. [PubMed]

13. Peters B. Chem. Phys. Lett. 2010;494:100.

14. Rhee YM, Pande VS. J. Phys. Chem. B. 2005;109:6780. [PubMed]

15. Best RB, Hummer G. Proc. Natl. Acad. Sci. USA. 2010;107:1088. [PubMed]

16. Gebhardt JCM, Bornschlögl T, Rief M. Proc. Nat. Acad. Sci. USA. 2010;107:2013. [PubMed]

17. Shirts MR, Chodera JD. J. Chem. Phys. 2008;129:124105. [PubMed]

18. Minh DDL, Adib AB. Phys. Rev. Lett. 2008;100:180602. [PMC free article] [PubMed]

19. Minh DDL, Chodera JD. J. Chem. Phys. 2009;131:134110. [PubMed]

20. Morrison G, Hyeon C, Hinczewski M, Thirumalai D. Phys. Rev. Lett. 2011;106:138102. [PMC free article] [PubMed]

21. Peters B, Beckham GT, Trout BL. J. Chem. Phys. 2007;127:034109. [PubMed]

22. Lechner W, Rogal J, Juraszek J, Ensing B, Bolhuis PG. J. Chem. Phys. 2010;133:174110. [PubMed]

23. Ermak DL, Yeh Y. Chem. Phys. Lett. 1974;24:243.

24. Ermak DL. J. Chem. Phys. 1975;62:4189.

25. Forns N, de Lorenzo S, Manosas M, Hayashi K, Huguet JM, Ritort F. Biophys. J. 2011;100:1765. [PubMed]

26. Cecconi C, Shank EA, Dahlquist FW, Marqusee S, Bustamante C. Eur. Biophys. J. 2008;37:729. [PMC free article] [PubMed]

27. Nummela J, Andricioaei I. Biophys. J. 2007;93:3373. [PubMed]

28. Best RB, Paci E, Hummer G, Dudko OK. J. Phys. Chem. B. 2008;112:5968. [PubMed]

PubMed Central Canada is a service of the Canadian Institutes of Health Research (CIHR) working in partnership with the National Research Council's national science library in cooperation with the National Center for Biotechnology Information at the U.S. National Library of Medicine(NCBI/NLM). It includes content provided to the PubMed Central International archive by participating publishers. |