GMX-WHAM(1) | GROMACS | GMX-WHAM(1) |
gmx-wham - Perform weighted histogram analysis after umbrella sampling
gmx wham [-ix [<.dat>]] [-if [<.dat>]] [-it [<.dat>]] [-is [<.dat>]]
[-iiact [<.dat>]] [-tab [<.dat>]] [-o [<.xvg>]]
[-hist [<.xvg>]] [-oiact [<.xvg>]] [-bsres [<.xvg>]]
[-bsprof [<.xvg>]] [-xvg <enum>] [-min <real>] [-max <real>]
[-[no]auto] [-bins <int>] [-temp <real>] [-tol <real>]
[-[no]v] [-b <real>] [-e <real>] [-dt <real>]
[-[no]histonly] [-[no]boundsonly] [-[no]log] [-unit <enum>]
[-zprof0 <real>] [-[no]cycl] [-[no]sym] [-[no]ac]
[-acsig <real>] [-ac-trestart <real>] [-nBootstrap <int>]
[-bs-method <enum>] [-bs-tau <real>] [-bs-seed <int>]
[-histbs-block <int>] [-[no]vbs]
gmx wham is an analysis program that implements the Weighted Histogram Analysis Method (WHAM). It is intended to analyze output files generated by umbrella sampling simulations to compute a potential of mean force (PMF).
gmx wham is currently not fully up to date. It only supports pull setups where the first pull coordinate(s) is/are umbrella pull coordinates and, if multiple coordinates need to be analyzed, all used the same geometry and dimensions. In most cases this is not an issue.
At present, three input modes are supported.
By default, all pull coordinates found in all pullx/pullf files are used in WHAM. If only some of the pull coordinates should be used, a pull coordinate selection file (option -is) can be provided. The selection file must contain one line for each tpr file in tpr-files.dat. Each of these lines must contain one digit (0 or 1) for each pull coordinate in the tpr file. Here, 1 indicates that the pull coordinate is used in WHAM, and 0 means it is omitted. Example: If you have three tpr files, each containing 4 pull coordinates, but only pull coordinates 1 and 2 should be used, coordsel.dat looks like this:
1 1 0 0 1 1 0 0 1 1 0 0
By default, the output files are:
``-o`` PMF output file ``-hist`` Histograms output file
Always check whether the histograms sufficiently overlap.
The umbrella potential is assumed to be harmonic and the force constants are read from the .tpr files. If a non-harmonic umbrella force was applied a tabulated potential can be provided with -tab.
The data points that are used to compute the profile can be restricted with options -b, -e, and -dt. Adjust -b to ensure sufficient equilibration in each umbrella window.
With -log (default) the profile is written in energy units, otherwise (with -nolog) as probability. The unit can be specified with -unit. With energy output, the energy in the first bin is defined to be zero. If you want the free energy at a different position to be zero, set -zprof0 (useful with bootstrapping, see below).
For cyclic or periodic reaction coordinates (dihedral angle, channel PMF without osmotic gradient), the option -cycl is useful. gmx wham will make use of the periodicity of the system and generate a periodic PMF. The first and the last bin of the reaction coordinate will assumed be be neighbors.
Option -sym symmetrizes the profile around z=0 before output, which may be useful for, e.g. membranes.
If available, the number of OpenMP threads used by gmx wham can be controlled by setting the OMP_NUM_THREADS environment variable.
With -ac, gmx wham estimates the integrated autocorrelation time (IACT) tau for each umbrella window and weights the respective window with 1/[1+2*tau/dt]. The IACTs are written to the file defined with -oiact. In verbose mode, all autocorrelation functions (ACFs) are written to hist_autocorr.xvg. Because the IACTs can be severely underestimated in case of limited sampling, option -acsig allows one to smooth the IACTs along the reaction coordinate with a Gaussian (sigma provided with -acsig, see output in iact.xvg). Note that the IACTs are estimated by simple integration of the ACFs while the ACFs are larger 0.05. If you prefer to compute the IACTs by a more sophisticated (but possibly less robust) method such as fitting to a double exponential, you can compute the IACTs with gmx analyze and provide them to gmx wham with the file iact-in.dat (option -iiact), which should contain one line per input file (pullx/pullf file) and one column per pull coordinate in the respective file.
Statistical errors may be estimated with bootstrap analysis. Use it with care, otherwise the statistical error may be substantially underestimated. More background and examples for the bootstrap technique can be found in Hub, de Groot and Van der Spoel, JCTC (2010) 6: 3713-3720. -nBootstrap defines the number of bootstraps (use, e.g., 100). Four bootstrapping methods are supported and selected with -bs-method.
Bootstrapping output:
With -vbs (verbose bootstrapping), the histograms of each bootstrap are written, and, with bootstrap method traj, the cumulative distribution functions of the histograms.
Options to specify input files:
Options to specify output files:
Other options:
More information about GROMACS is available at <http://www.gromacs.org/>.
2023, GROMACS development team
February 3, 2023 | 2022.5 |