Glossary¶
This page provides definitions for abbreviations and technical terms used throughout the mlmm-toolkit documentation.
ML/MM & ONIOM¶
Term |
Full Name |
Description |
|---|---|---|
ML/MM |
Machine Learning / Molecular Mechanics |
A multi-scale method that couples a machine-learning interatomic potential (for the reactive region) with a classical force field (for the surrounding environment). Analogous to QM/MM but with ML replacing QM. |
ONIOM |
Our own N-layered Integrated molecular Orbital and molecular Mechanics |
A multi-layer energy decomposition scheme. mlmm-toolkit uses an ONIOM-like subtraction: E_total = E_REAL_low + E_MODEL_high - E_MODEL_low. |
QM/MM |
Quantum Mechanics / Molecular Mechanics |
A multi-scale method coupling QM for the reactive region with MM for the environment. ML/MM replaces the QM layer with an MLIP backend. |
Real system |
– |
The full set of atoms (all 3 layers). Evaluated at the MM (low) level in the ONIOM decomposition. Described by the parm7 topology; MM energy computed by hessian_ff. |
Model system |
– |
The ML region (Layer 1). Evaluated at both the MLIP (high) and MM (low) levels in the ONIOM decomposition. |
Link Hydrogen |
– |
A hydrogen atom added to cap severed bonds at the ML/MM boundary during pocket extraction. Forces are redistributed via a Jacobian. |
hessian_ff |
– |
A C++ native extension that evaluates Amber force field energies, forces, and analytical Hessians. Used as the MM engine in mlmm-toolkit. |
3-layer system |
– |
mlmm-toolkit’s B-factor partitioning scheme: ML (B=0.0), Movable-MM (B=10.0), Frozen (B=20.0). |
B-factor encoding |
– |
Convention of storing layer membership in the PDB B-factor (temperature factor) column: 0.0 = ML, 10.0 = Movable-MM, 20.0 = Frozen. Hessian-target MM is controlled by cutoffs/explicit indices. |
Amber & Force Field¶
Term |
Full Name |
Description |
|---|---|---|
parm7 |
Amber Parameter/Topology File |
A file containing atom types, partial charges, bonding connectivity, and force field parameters for an Amber system. Also called .prmtop. |
rst7 |
Amber Restart File |
A file containing atomic coordinates (and optionally velocities and box dimensions) for an Amber system. Also called .inpcrd. |
AmberTools |
– |
A free suite of tools for molecular dynamics preparation, including tleap, antechamber, and parmchk2. Required by |
tleap |
– |
An AmberTools program that builds Amber topology/coordinate files from PDB structures and force field libraries. |
antechamber |
– |
An AmberTools program that assigns GAFF2 atom types and AM1-BCC partial charges to small molecules. |
parmchk2 |
– |
An AmberTools program that checks and supplies missing force field parameters for GAFF2 typing. |
GAFF2 |
General Amber Force Field 2 |
A general-purpose force field for small organic molecules, used to parameterize non-standard residues (substrates, cofactors). |
ff19SB |
– |
An Amber protein force field used for standard amino acid residues. The default in mlmm-toolkit. |
ff14SB |
– |
An Amber protein force field (2014 version). Selectable with |
AM1-BCC |
– |
A charge model that combines AM1 (semi-empirical) Mulliken charges with bond charge corrections (BCC) to approximate HF/6-31G* RESP charges. |
Reaction Path & Optimization¶
Term |
Full Name |
Description |
|---|---|---|
MEP |
Minimum Energy Path |
The lowest-energy pathway connecting reactants to products through a transition state (on a potential energy surface). |
TS |
Transition State |
A first-order saddle point on the potential energy surface, typically the highest-energy point along the reaction coordinate. |
IRC |
Intrinsic Reaction Coordinate |
A mass-weighted steepest-descent path from a TS toward reactants and products. Often used to validate TS connectivity. |
GSM |
Growing String Method |
A string-based method that grows images from endpoints and optimizes them to approximate an MEP. |
DMF |
Direct Max Flux |
A chain-of-states method for optimizing an MEP by maximizing flux along the pathway. In mlmm-toolkit it is selected with |
NEB |
Nudged Elastic Band |
A chain-of-states method that uses spring forces to maintain image spacing along a reaction path. |
HEI |
Highest-Energy Image |
The image along an MEP with maximum energy; often used as a TS guess. |
Image |
– |
A single geometry (one “node”) along a chain-of-states path. |
Segment |
– |
An MEP between two adjacent endpoints (e.g., R -> I1, I1 -> I2,…). |
Optimization Algorithms¶
Term |
Full Name |
Description |
|---|---|---|
L-BFGS |
Limited-memory BFGS |
A quasi-Newton optimization algorithm that approximates the Hessian using a limited history of gradients. Used in |
RFO |
Rational Function Optimization |
A trust-region optimization method that uses explicit Hessian information. Used in |
RS-I-RFO |
Restricted-Step Image-RFO |
A variant of RFO for saddle point (TS) optimization that follows one negative eigenvalue. |
Dimer |
Dimer Method |
A TS optimization method that estimates the lowest curvature mode without computing the full Hessian. Used in |
PHVA |
Partial Hessian Vibrational Analysis |
Computing vibrational frequencies using only the Hessian block for active (non-frozen) atoms. Default in |
Machine Learning & Calculators¶
Term |
Full Name |
Description |
|---|---|---|
MLIP |
Machine Learning Interatomic Potential |
A model (often neural-network-based) that predicts energies and forces from atomic structures, trained on quantum-mechanical data. |
UMA |
Universal Machine-learning potential for Atoms |
Meta’s family of pretrained MLIPs. The default ML backend in mlmm-toolkit ( |
ORB |
ORB Models |
A family of pretrained MLIPs from Orbital Materials. Supported as an alternative ML backend ( |
MACE |
MACE (Message-passing Atomic Cluster Expansion) |
A message-passing equivariant neural network MLIP. Supported as an alternative ML backend ( |
AIMNet2 |
Atoms In Molecules Network 2 |
A neural network potential for organic molecules. Supported as an alternative ML backend ( |
xTB |
Extended Tight-Binding |
A semi-empirical quantum chemistry method. Used for point-charge embedding correction when |
Analytical Hessian |
– |
Computing the exact second derivatives of energy; faster but requires more VRAM. Currently available for the UMA backend only. |
Finite Difference |
– |
Approximating derivatives by small displacements; slower but more memory-efficient. Used by ORB, MACE, and AIMNet2 backends. |
Quantum Chemistry¶
Term |
Full Name |
Description |
|---|---|---|
QM |
Quantum Mechanics |
First-principles electronic structure calculations (DFT, HF, post-HF, etc.). |
DFT |
Density Functional Theory |
A quantum-mechanical method that models electronic structure via electron density functionals. |
Hessian |
– |
The matrix of second derivatives of energy with respect to atomic coordinates; used for vibrational analysis and TS optimization. |
SP |
Single Point |
A calculation at a fixed geometry (no optimization); often used for higher-level energy refinement. |
Spin Multiplicity |
– |
2S+1, where S is total spin. Singlet = 1, doublet = 2, triplet = 3, etc. |
Structural Biology & Pocket Extraction¶
Term |
Full Name |
Description |
|---|---|---|
PDB |
Protein Data Bank |
A file format and database for macromolecular 3D structures. |
XYZ |
– |
A simple text format listing atomic symbols and Cartesian coordinates. |
GJF |
Gaussian Job File |
An input format for Gaussian; mlmm-toolkit reads charge/multiplicity and coordinates from these files. |
Active-site Pocket |
A truncated structure around the substrate(s) used to reduce system size for MEP/TS search. Also called “cluster model”. |
|
Cluster Model |
– |
Synonym for pocket; a computationally tractable subset of the full enzyme-substrate complex. |
Link Hydrogen |
– |
A hydrogen atom added to cap severed bonds when extracting a pocket from a larger structure. |
Backbone |
– |
The main chain of a protein (N-C_alpha-C-O atoms). Can be excluded during pocket extraction with |
B-factor |
Temperature Factor |
The PDB temperature factor column. In mlmm-toolkit, used to encode 3-layer membership (0.0, 10.0, 20.0). |
Thermochemistry¶
Term |
Full Name |
Description |
|---|---|---|
ZPE |
Zero-Point Energy |
The vibrational energy at 0 K; a quantum correction to the electronic energy. |
Gibbs Energy |
Free Energy (G) |
G = H - TS; includes thermal and entropic contributions. |
Enthalpy |
(H) |
H = E + PV; total heat content at constant pressure. |
Entropy |
(S) |
A measure of disorder; contributes -TS to Gibbs energy. |
Units & Constants¶
Term |
Description |
|---|---|
Hartree |
Atomic unit of energy; 1 Hartree ≈ 627.5 kcal/mol ≈ 27.21 eV. |
kcal/mol |
Kilocalories per mole; a common unit for reaction energetics. |
kJ/mol |
Kilojoules per mole; 1 kcal/mol ≈ 4.184 kJ/mol. |
eV |
Electron volt; 1 eV ≈ 23.06 kcal/mol. |
Bohr |
Atomic unit of length; 1 Bohr ≈ 0.529 Angstrom. |
Angstrom |
10^-10 m; standard unit for atomic distances. |
CLI Conventions¶
Term |
Description |
|---|---|
Boolean option |
CLI flags that take |
Residue selector |
A specification like |
Atom selector |
A specification like |
B-factor layer encoding |
The convention of using the PDB B-factor column to encode 3-layer assignments (0.0, 10.0, 20.0). Hessian-target MM atoms are controlled separately. |
See Also¶
Getting Started – installation and a first run
Concepts & Workflow – how pocket extraction, ML/MM layers, MEP search, and post-processing fit together
Common Error Recipes – symptom-first failure routing
Troubleshooting – common errors and fixes
YAML Reference – configuration file format
ML/MM Calculator – machine learning potential details