# Umbrella sampling

Umbrella sampling is a technique in computational physics and chemistry, used to improve sampling of a system (or different systems) where ergodicity is hindered by the form of the system's energy landscape. It was first suggested by Torrie and Valleau in 1977 . It is a particular physical application of the more general importance sampling in statistics. In an energy landscape with a potential barrier separating two regions of configuration space (bottom sketch), Monte Carlo sampling may be unable to sample the system over a sufficient range of configurations to accurately calculate thermodynamic data, compared to a favourable energy structure (top plot).

Systems in which an energy barrier separates two regions of configuration space may suffer from poor sampling. In Metropolis Monte Carlo runs, the low probability of overcoming the potential barrier can leave inaccessible configurations poorly sampled – or even entirely unsampled – by the simulation. An easily visualised example occurs with a solid at its melting point: considering the state of the system with an order parameter Q, both liquid (low Q) and solid (high Q) phases are low in energy, but are separated by a free energy barrier at intermediate values of Q. This prevents the simulation from adequately sampling both phases.

Umbrella sampling is a means of "bridging the gap" in this situation. The standard Boltzmann weighting for Monte Carlo sampling is replaced by a potential chosen to cancel the influence of the energy barrier present. The Markov chain generated has a distribution given by:

$\pi (\mathbf {r} ^{N})={\frac {w({\textbf {r}}^{N})\exp {\left(-{\frac {U(\mathbf {r} ^{N})}{k_{B}T}}\right)}}{\int {w(\mathbf {r^{\prime }} ^{N})\exp {\left(-{\frac {U(\mathbf {r^{\prime }} ^{N})}{k_{B}T}}\right)}d\mathbf {r^{\prime }} ^{N}}}},$ with U the potential energy, w(rN) a function chosen to promote configurations that would otherwise be inaccessible to a Boltzmann-weighted Monte Carlo run. In the example above, w may be chosen such that w = w(Q), taking high values at intermediate Q and low values at low/high Q, facilitating barrier crossing.

Values for a thermodynamic property A deduced from a sampling run performed in this manner can be transformed into canonical-ensemble values by applying the formula:

$\langle A\rangle ={\frac {\langle A/w\rangle _{\pi }}{\langle 1/w\rangle _{\pi }}},$ with the $\pi$ subscript indicating values from the umbrella-sampled simulation.

The effect of introducing the weighting function w(rN) is equivalent to adding a biasing potential V(rN) to the potential energy of the system.

$V(\mathbf {r} ^{N})=-k_{B}T\ln w(\mathbf {r} ^{N})$ If the biasing potential is strictly a function of a reaction coordinate or order parameter $Q$ , then the (unbiased) free energy profile on the reaction coordinate can be calculated by subtracting the biasing potential from the biased free energy profile.

$F_{0}(Q)=F_{\pi }(Q)-V(Q)$ where $F_{0}(Q)$ is the free energy profile of the unbiased system and $F_{\pi }(Q)$ is the free energy profile calculated for the biased, umbrella-sampled system.

Series of umbrella sampling simulations can be analyzed using the weighted histogram analysis method (WHAM) or its generalization. WHAM can be derived using the Maximum likelihood method.

Subtleties exist in deciding the most computationally efficient way to apply the umbrella sampling method, as described in Frenkel & Smit's book Understanding Molecular Simulation.

Alternatives to umbrella sampling for computing potentials of mean force or reaction rates are free energy perturbation and transition interface sampling. A further alternative which functions in full non-equilibrium is S-PRES.