Concepts & Workflow¶

This page explains the key terms in pdb2reaction—pockets, templates, segments, and images—and how the all command ties together the subcommands.

Workflow at a glance¶

Most workflows follow this flow:

Full system(s) (PDB/XYZ/GJF)
   │
   ├─ (optional) pocket extraction   [extract]     ← requires PDB when you use --center/-c
   │        ↓
   │   Pocket/cluster model(s) (PDB)
   │        │
   │        ├─ (optional) staged scan [scan]       ← single-structure workflows
   │        │        ↓
   │        │   Ordered intermediates
   │        │        ↓
   │        └─ MEP search             [path-search] or [path-opt]
   │                 ↓
   │            MEP trajectory (mep.trj) + energy diagrams
   │                 ↓
   └─ (optional) TS optimization + IRC  [tsopt] → [irc]
             └─ (optional) thermo     [freq]
             └─ (optional) single-point DFT     [dft]

Each stage is available as an individual subcommand. The pdb2reaction all command runs many stages end-to-end.

Important

Transition states: treat HEI / tsopt outputs as TS candidates until validated via freq (a single imaginary mode) and irc (endpoints reach intended minima).

Key objects and terms¶

Full system vs. pocket (cluster model)¶

Full system: your original structure(s). In enzyme use-cases this is typically a protein–ligand complex.
Pocket / cluster model: a truncated structure around the substrate(s) used to reduce system size for MEP/TS search.

Pocket extraction is controlled by:

-c/--center: how to locate the substrate (residue IDs, residue names, or a substrate-only PDB).
-r/--radius, --radius-het2het, --include-H2O, --exclude-backbone, --add-linkH, --selected-resn.

Images and segments¶

Image: a single geometry (one “node”) along a chain-of-states path.
Segment: an MEP between two adjacent endpoints (e.g., R → I1, I1 → I2, …). A multi-structure run is decomposed into segments.

Templates and file conversion (`--convert-files`)¶

pdb2reaction often writes a trajectory (e.g., mep.trj, irc.trj). When you supply topology-aware inputs (PDB templates or Gaussian inputs), it can optionally write companion files:

.pdb companions when a PDB template exists
.gjf companions when a Gaussian template exists

This behavior is controlled globally by --convert-files {True|False} (default: True).

Three common workflow modes¶

1) Multi-structure MEP search (R → … → P)¶

Use this when you already have two or more full structures along a reaction coordinate.

Typical command:

pdb2reaction -i R.pdb P.pdb -c 'SAM,GPP' --ligand-charge 'SAM:1,GPP:-3'

2) Single-structure staged scan → MEP¶

Use this when you only have one structure, but you can define a scan that generates endpoints.

Typical command:

pdb2reaction -i holo.pdb -c '308,309' \
  --scan-lists '[("TYR,285,CA","MMT,309,C10",2.20)]'

3) TSOPT-only mode (pocket TS optimization)¶

Use this when you already have a TS candidate (or want a quick TS optimization on one structure).

Typical command:

pdb2reaction -i ts_guess.pdb -c 'SAM,GPP' --tsopt True

When to use `all` vs individual subcommands¶

Prefer `pdb2reaction all` when…¶

You want an end-to-end run (extract → MEP → TSOPT/IRC → freq/DFT).
You are still exploring the workflow and want a single command to manage outputs.

Prefer subcommands when…¶

You want to debug a specific stage (e.g., only extract, only path-search).
You want to mix-and-match a custom workflow (e.g., your own endpoint preparation).

A few CLI conventions worth knowing¶

Important

Boolean options are passed explicitly as True/False (e.g., --tsopt True).
With multiple PDB inputs, all files should have the same atoms in the same order (only coordinates differ).
For enzyme use-cases, you usually want hydrogens present in the input PDB.

Next steps¶

Getting started¶

Getting Started — installation and first run
Troubleshooting — common errors and fixes

Core subcommands¶

Subcommand	Purpose	Documentation
`all`	End-to-end workflow	all.md
`extract`	Pocket extraction	extract.md
`path-search`	Recursive MEP search	path_search.md
`tsopt`	TS optimization	tsopt.md
`freq`	Vibrational analysis	freq.md
`dft`	Single-point DFT	dft.md

Reference¶

YAML Reference — complete YAML configuration options
Glossary — terminology reference

Concepts & Workflow¶

Workflow at a glance¶

Key objects and terms¶

Full system vs. pocket (cluster model)¶

Images and segments¶

Templates and file conversion (--convert-files)¶

Three common workflow modes¶

1) Multi-structure MEP search (R → … → P)¶

2) Single-structure staged scan → MEP¶

3) TSOPT-only mode (pocket TS optimization)¶

When to use all vs individual subcommands¶

Prefer pdb2reaction all when…¶

Prefer subcommands when…¶

A few CLI conventions worth knowing¶

Next steps¶

Getting started¶

Core subcommands¶

Reference¶

Templates and file conversion (`--convert-files`)¶

When to use `all` vs individual subcommands¶

Prefer `pdb2reaction all` when…¶