Computational Biology on Hunter Heidenreich | Senior AI Research Scientist

Umeyama's Method: Corrected SVD for Point Alignment

Mon, 16 Mar 2026 00:00:00 +0000

Fixing the Reflection Problem in SVD-Based Alignment

This Method paper addresses a specific failure mode in prior SVD-based solutions to the point set registration problem. Both Arun et al. (1987) and Horn, Hilden, and Negahdaripour (1988) presented SVD-based methods for finding the optimal rotation between two point patterns. (Note: this is a different paper from Horn’s 1987 quaternion method, which does not suffer from this issue.) These SVD-based methods can produce a reflection ($\det(R) = -1$) instead of a proper rotation when the data is severely corrupted. Umeyama provides a corrected formulation that always yields a proper rotation matrix.

The Similarity Transformation Problem

Given two point sets ${\mathbf{x}_i}$ and ${\mathbf{y}_i}$ ($i = 1, \ldots, n$) in $m$-dimensional space, find the similarity transformation parameters (rotation $R$, translation $\mathbf{t}$, and scale $c$) minimizing the mean squared error:

$$ e^2(R, \mathbf{t}, c) = \frac{1}{n} \sum_{i=1}^{n} \lVert \mathbf{y}_i - (cR\mathbf{x}_i + \mathbf{t}) \rVert^2 $$

This generalizes the Kabsch problem (rotation only) and the absolute orientation problem (rotation + translation + scale) to arbitrary dimensions $m$.

The Core Lemma: Corrected SVD Rotation

The key contribution is a lemma for finding the rotation $R$ minimizing $\lVert A - RB \rVert^2$. Given the SVD of $AB^T = UDV^T$ (with $d_1 \geq d_2 \geq \cdots \geq d_m \geq 0$), define the correction matrix:

$$ S = \begin{cases} I & \text{if } \det(AB^T) \geq 0 \\ \operatorname{diag}(1, 1, \ldots, 1, -1) & \text{if } \det(AB^T) < 0 \end{cases} $$

The minimum value is:

$$ \min_{R} \lVert A - RB \rVert^2 = \lVert A \rVert^2 + \lVert B \rVert^2 - 2\operatorname{tr}(DS) $$

When $\operatorname{rank}(AB^T) \geq m - 1$, the optimal rotation is uniquely determined as:

$$ R = USV^T $$

The critical insight is that when $\det(AB^T) = 0$ (i.e., $\operatorname{rank}(AB^T) = m - 1$), the matrix $S$ must instead be chosen based on $\det(U)\det(V)$:

$$ S = \begin{cases} I & \text{if } \det(U)\det(V) = 1 \\ \operatorname{diag}(1, 1, \ldots, 1, -1) & \text{if } \det(U)\det(V) = -1 \end{cases} $$

This handles the degenerate case where the sign of $\det(AB^T)$ is unreliable.

Complete Similarity Transformation Solution

Umeyama derives the full solution using centered coordinates and the covariance matrix $\Sigma_{xy} = \frac{1}{n} \sum_i (\mathbf{y}_i - \boldsymbol{\mu}_y)(\mathbf{x}_i - \boldsymbol{\mu}_x)^T$.

Given the SVD $\Sigma_{xy} = UDV^T$:

Rotation:

$$ R = USV^T $$

Scale:

$$ c = \frac{1}{\sigma_x^2} \operatorname{tr}(DS) $$

Translation:

$$ \mathbf{t} = \boldsymbol{\mu}_y - cR\boldsymbol{\mu}_x $$

Minimum error:

$$ \varepsilon^2 = \sigma_y^2 - \frac{\operatorname{tr}(DS)^2}{\sigma_x^2} $$

where $\sigma_x^2$ and $\sigma_y^2$ are the variances of the respective point sets around their centroids.

Why Prior Methods Fail

The methods of Arun et al. and Horn et al. use $R = UV^T$ directly from the SVD. This works when $\det(UV^T) = 1$ (proper rotation). When $\det(UV^T) = -1$, these methods either produce a reflection or apply an ad hoc correction (flipping the sign of the last column of $U$). Umeyama shows that the correct fix depends on $\det(\Sigma_{xy})$:

If $\det(\Sigma_{xy}) \geq 0$: set $S = I$, so $R = UV^T$
If $\det(\Sigma_{xy}) < 0$: set $S = \operatorname{diag}(1, \ldots, 1, -1)$, flipping the last singular value’s contribution

This distinction matters because corrupted data can make $\det(UV^T) = -1$ even when the true transformation is a proper rotation. Simply flipping a column of $U$ does not always yield the correct least-squares solution.

Generality

The formulation works for any dimension $m$, covering both 2D and 3D registration problems. The proof uses Lagrange multipliers with explicit enforcement of both orthogonality ($R^T R = I$) and the proper rotation constraint ($\det(R) = 1$), which prior methods enforced only partially.

Paper Information

Citation: Umeyama, S. (1991). Least-squares estimation of transformation parameters between two point patterns. IEEE Transactions on Pattern Analysis and Machine Intelligence, 13(4), 376-380. https://doi.org/10.1109/34.88573

Publication: IEEE TPAMI, 1991

Additional Resources:

Kabsch Algorithm: NumPy, PyTorch, TensorFlow, and JAX (tutorial with implementations including the Kabsch-Umeyama scaling extension)
Kabsch-Horn Cookbook (a differentiable, gradient-safe implementation of Kabsch, Horn, and Umeyama alignment across NumPy, PyTorch, JAX, TensorFlow, and MLX)

@article{umeyama1991least,
  title={Least-squares estimation of transformation parameters between two point patterns},
  author={Umeyama, Shinji},
  journal={IEEE Transactions on Pattern Analysis and Machine Intelligence},
  volume={13},
  number={4},
  pages={376--380},
  year={1991},
  publisher={IEEE},
  doi={10.1109/34.88573}
}

Horn et al.: Absolute Orientation Using Orthonormal Matrices

Mon, 16 Mar 2026 00:00:00 +0000

A Matrix-Based Companion to the Quaternion Method

This Method paper presents a closed-form solution to the absolute orientation problem using $3 \times 3$ orthonormal matrices directly, complementing Horn’s earlier quaternion-based solution (1987). The authors note that while quaternions are more elegant, orthonormal matrices are more widely used in photogrammetry, graphics, and robotics. The solution relies on the polar decomposition of the cross-covariance matrix via its matrix square root.

The paper also compares two approaches: (1) directly finding the best-fit orthonormal matrix (the main result), and (2) finding an unconstrained best-fit linear transformation and then projecting it onto the nearest orthonormal matrix. These give different results, and only the first approach has the desired symmetry property.

The Rotation via Polar Decomposition

As in the quaternion paper, the problem reduces to finding the orthonormal matrix $R$ maximizing $\operatorname{Tr}(R^T M)$, where $M = \sum_{i=1}^{n} \mathbf{r}’_{r,i} (\mathbf{r}’_{l,i})^T$ is the cross-covariance matrix of the centered point sets.

The key insight is the polar decomposition: any matrix $M$ can be written as:

$$ M = U S $$

where $U$ is orthonormal and $S = (M^T M)^{1/2}$ is positive semidefinite. When $M$ is nonsingular:

$$ U = M (M^T M)^{-1/2} $$

The matrix square root $(M^T M)^{1/2}$ is computed via eigendecomposition. If $M^T M$ has eigenvalues $\lambda_1, \lambda_2, \lambda_3$ and eigenvectors $\hat{\mathbf{u}}_1, \hat{\mathbf{u}}_2, \hat{\mathbf{u}}_3$:

$$ (M^T M)^{1/2} = \sqrt{\lambda_1} , \hat{\mathbf{u}}_1 \hat{\mathbf{u}}_1^T + \sqrt{\lambda_2} , \hat{\mathbf{u}}_2 \hat{\mathbf{u}}_2^T + \sqrt{\lambda_3} , \hat{\mathbf{u}}_3 \hat{\mathbf{u}}_3^T $$

The sign of $\det(U)$ equals the sign of $\det(M)$, so $U$ is a proper rotation when $\det(M) > 0$ and a reflection when $\det(M) < 0$.

Handling the Coplanar Case

When one set of measurements is coplanar, $M$ is singular ($\operatorname{rank}(M) = 2$) and one eigenvalue of $M^T M$ is zero. The matrix square root still exists (positive semidefinite rather than positive definite), but $S$ is no longer invertible.

In this case, $U$ is determined only for two of its three columns. The third column (corresponding to the zero eigenvalue) is fixed by the orthonormality constraint, with a sign ambiguity resolved by requiring $\det(U) = +1$ (proper rotation).

The Nearest Orthonormal Matrix (Alternative Approach)

The paper also derives a closed-form solution for finding the orthonormal matrix nearest to an arbitrary matrix $A$ (minimizing $\lVert A - R \rVert^2$). This uses the same polar decomposition machinery: if $A = U_A S_A$, then $U_A$ is the nearest orthonormal matrix.

This approach (find unconstrained best-fit transform, then project to nearest orthonormal matrix) was used by some earlier methods. Horn et al. show it gives a different result from the direct least-squares solution and lacks the symmetry property: the inverse transformation from right-to-left is generally not the exact inverse of the left-to-right solution.

Relationship to Other Methods

Method	Rotation representation	Core computation
Kabsch (1976)	Orthogonal matrix	Eigendecomposition of $\tilde{R}R$ ($3 \times 3$)
Horn (1987)	Unit quaternion	Eigenvector of $N$ ($4 \times 4$)
Horn et al. (1988)	Orthonormal matrix	Square root of $M^T M$ ($3 \times 3$)
Arun et al. (1987)	Orthonormal matrix	SVD of $H$ ($3 \times 3$)

The polar decomposition approach (this paper) and the SVD approach (Arun et al.) are closely related: the SVD $M = U \Lambda V^T$ gives the polar decomposition as $M = (UV^T)(V \Lambda V^T)$ where $UV^T$ is the orthonormal factor and $V \Lambda V^T$ is the positive semidefinite factor. Both methods can produce reflections under noisy data, which Umeyama (1991) later addressed.

Paper Information

Citation: Horn, B. K. P., Hilden, H. M., & Negahdaripour, S. (1988). Closed-form solution of absolute orientation using orthonormal matrices. Journal of the Optical Society of America A, 5(7), 1127-1135. https://doi.org/10.1364/josaa.5.001127

Publication: Journal of the Optical Society of America A, 1988

Additional Resources:

Kabsch Algorithm: NumPy, PyTorch, TensorFlow, and JAX (tutorial with differentiable implementations)
Kabsch-Horn Cookbook (a differentiable, gradient-safe implementation of Kabsch, Horn, and Umeyama alignment across NumPy, PyTorch, JAX, TensorFlow, and MLX)

@article{horn1988closed,
  title={Closed-form solution of absolute orientation using orthonormal matrices},
  author={Horn, Berthold K. P. and Hilden, Hugh M. and Negahdaripour, Shahriar},
  journal={Journal of the Optical Society of America A},
  volume={5},
  number={7},
  pages={1127--1135},
  year={1988},
  publisher={Optica Publishing Group},
  doi={10.1364/josaa.5.001127}
}

Arun et al.: SVD-Based Least-Squares Fitting of 3D Points

Mon, 16 Mar 2026 00:00:00 +0000

SVD for 3D Point Set Registration

This Method paper presents a concise algorithm for finding the least-squares rotation and translation between two 3D point sets using the singular value decomposition (SVD) of a $3 \times 3$ cross-covariance matrix. The approach is closely related to the earlier Kabsch algorithm (1976), which used eigendecomposition, and was developed independently of Horn’s quaternion method (1987). The paper also identifies a reflection degeneracy that Umeyama later provided a complete fix for.

Problem Formulation

Given two 3D point sets ${p_i}$ and ${p’_i}$ ($i = 1, \ldots, N$) related by:

$$ p’_i = R p_i + T + N_i $$

where $R$ is a rotation matrix, $T$ is a translation vector, and $N_i$ is noise, find $\hat{R}$ and $\hat{T}$ minimizing:

$$ \Sigma^2 = \sum_{i=1}^{N} \lVert p’_i - (R p_i + T) \rVert^2 $$

Decoupling Translation and Rotation

The translation is eliminated by centering both point sets at their centroids $p$ and $p’$. Defining centered coordinates $q_i = p_i - p$ and $q’_i = p’_i - p’$, the problem reduces to:

$$ \Sigma^2 = \sum_{i=1}^{N} \lVert q’_i - R q_i \rVert^2 $$

Once $\hat{R}$ is found, the translation follows as $\hat{T} = p’ - \hat{R} p$.

The SVD Algorithm

The algorithm proceeds in five steps:

Center both point sets by subtracting centroids
Compute the $3 \times 3$ cross-covariance matrix: $H = \sum_{i=1}^{N} q_i q’^t_i$
Compute the SVD: $H = U \Lambda V^t$
Form the candidate rotation: $X = V U^t$
Check $\det(X)$: if $+1$, then $\hat{R} = X$; if $-1$, the result is a reflection

The key insight is that minimizing $\Sigma^2$ is equivalent to maximizing $\operatorname{Trace}(RH)$. Using a lemma based on the Cauchy-Schwarz inequality, Arun et al. show that $X = VU^t$ maximizes this trace over all orthonormal matrices.

The Reflection Problem

When $\det(VU^t) = -1$, the SVD produces a reflection rather than a proper rotation. Arun et al. analyze three cases:

Noiseless, non-coplanar points: The SVD always gives a proper rotation ($\det = +1$). No issue arises.

Coplanar points (including $N = 3$): One singular value of $H$ is zero. Both a rotation and a reflection achieve $\Sigma^2 = 0$. The fix is to flip the sign of the column of $V$ corresponding to the zero singular value:

$$ V’ = [v_1, v_2, -v_3], \quad X’ = V’ U^t $$

Noisy, non-coplanar points with $\det = -1$: The paper acknowledges this case cannot be handled by the algorithm. The reflection genuinely minimizes $\Sigma^2$ over all orthonormal matrices, meaning no rotation achieves a lower error. The authors suggest this only occurs with very large noise and recommend RANSAC-like approaches.

This last case is precisely what Umeyama (1991) later resolved with a corrected formulation using a sign matrix $S$ conditioned on $\det(\Sigma_{xy})$.

Computational Comparison

The paper includes VAX 11/780 benchmarks comparing three methods:

Points	SVD (ms)	Quaternion (ms)	Iterative (ms)
3	54.6	26.6	126.8
11	37.0	41.0	105.2
30	44.2	48.3	111.0

The SVD and quaternion methods have comparable speed, both significantly faster than the iterative approach. SVD becomes faster than quaternion for larger point sets since its core computation operates on a $3 \times 3$ matrix regardless of $N$.

Paper Information

Citation: Arun, K. S., Huang, T. S., & Blostein, S. D. (1987). Least-Squares Fitting of Two 3-D Point Sets. IEEE Transactions on Pattern Analysis and Machine Intelligence, PAMI-9(5), 698-700. https://doi.org/10.1109/TPAMI.1987.4767965

Publication: IEEE TPAMI, 1987

Additional Resources:

Kabsch Algorithm: NumPy, PyTorch, TensorFlow, and JAX (tutorial with differentiable implementations)
Kabsch-Horn Cookbook (a differentiable, gradient-safe implementation of Kabsch, Horn, and Umeyama alignment across NumPy, PyTorch, JAX, TensorFlow, and MLX)

@article{arun1987least,
  title={Least-Squares Fitting of Two 3-D Point Sets},
  author={Arun, K. S. and Huang, T. S. and Blostein, S. D.},
  journal={IEEE Transactions on Pattern Analysis and Machine Intelligence},
  volume={PAMI-9},
  number={5},
  pages={698--700},
  year={1987},
  publisher={IEEE},
  doi={10.1109/TPAMI.1987.4767965}
}

Kabsch Algorithm: Optimal Rotation for Point Set Alignment

Sun, 15 Mar 2026 00:00:00 +0000

A Closed-Form Solution for Optimal Rotation

This short communication presents a Method paper: a direct, analytical solution to a constrained optimization problem. Given two sets of vectors, Kabsch derives the orthogonal matrix (rotation) that best superimposes one set onto the other by minimizing a weighted sum of squared deviations. Prior approaches either solved an unconstrained problem and factorized the result (Diamond, 1976) or used iterative methods (McLachlan, 1972). Kabsch shows that a direct, non-iterative solution exists despite the non-linear nature of the orthogonality constraint.

The Superposition Problem

The core problem arises frequently in crystallography and structural biology: given two sets of corresponding points (e.g., atomic coordinates from a known structure and experimentally measured coordinates), find the rigid rotation that best aligns them. Translations can be removed by centering both point sets at the origin, leaving only the rotational component.

Formally, given vector sets $\mathbf{x}_n$ and $\mathbf{y}_n$ ($n = 1, 2, \ldots, N$) with weights $w_n$, find the orthogonal matrix $\mathsf{U}$ minimizing:

$$ E = \frac{1}{2} \sum_{n} w_n (\mathsf{U} \mathbf{x}_n - \mathbf{y}_n)^2 $$

subject to orthogonality: $\tilde{\mathsf{U}} \mathsf{U} = \mathsf{I}$.

Derivation via Lagrange Multipliers

Kabsch introduces a symmetric matrix $\mathsf{L}$ of Lagrange multipliers to enforce orthogonality, forming the Lagrangian:

$$ G = E + \frac{1}{2} \sum_{i,j} l_{ij} \left( \sum_{k} u_{ki} u_{kj} - \delta_{ij} \right) $$

Setting $\partial G / \partial u_{ij} = 0$ and defining two key matrices:

$$ r_{ij} = \sum_{n} w_n , y_{ni} , x_{nj} \qquad s_{ij} = \sum_{n} w_n , x_{ni} , x_{nj} $$

where $\mathsf{R} = (r_{ij})$ is the weighted cross-covariance matrix and $\mathsf{S} = (s_{ij})$ is the weighted auto-covariance matrix, the stationarity condition becomes:

$$ \mathsf{U} \cdot (\mathsf{S} + \mathsf{L}) = \mathsf{R} $$

Eigendecomposition Solution

The key insight is that multiplying both sides by their transposes eliminates the unknown $\mathsf{U}$:

$$ (\mathsf{S} + \mathsf{L})(\mathsf{S} + \mathsf{L}) = \tilde{\mathsf{R}} \mathsf{R} $$

Since $\tilde{\mathsf{R}} \mathsf{R}$ is symmetric positive definite, it has positive eigenvalues $\mu_k$ and eigenvectors $\mathbf{a}_k$. The matrix $\mathsf{S} + \mathsf{L}$ shares the same eigenvectors with eigenvalues $\sqrt{\mu_k}$.

From the eigenvectors $\mathbf{a}_k$, a second set of unit vectors $\mathbf{b}_k$ is defined:

$$ \mathbf{b}_k = \frac{1}{\sqrt{\mu_k}} \mathsf{R} , \mathbf{a}_k $$

The optimal rotation matrix is then constructed directly:

$$ u_{ij} = \sum_{k} b_{ki} , a_{kj} $$

Handling Degeneracies and Generalizations

Kabsch addresses two extensions:

Planar point sets: When all vectors lie in a plane, one eigenvalue of $\tilde{\mathsf{R}} \mathsf{R}$ is zero. The missing eigenvectors are recovered via cross products: $\mathbf{a}_3 = \mathbf{a}_1 \times \mathbf{a}_2$ and $\mathbf{b}_3 = \mathbf{b}_1 \times \mathbf{b}_2$.
General metric constraints: The orthogonality constraint $\tilde{\mathsf{U}} \mathsf{U} = \mathsf{I}$ can be replaced by $\tilde{\mathsf{U}} \mathsf{U} = \mathsf{M}$ for any symmetric positive definite $\mathsf{M}$. By finding any specific solution $\mathsf{B}$ and transforming the input vectors as $\mathbf{x}’_n = \mathsf{B} \mathbf{x}_n$, the problem reduces back to the standard orthogonal case.

The method generalizes naturally to vector spaces of arbitrary dimension.

Legacy and Impact

This two-page communication became one of the most cited papers in structural biology. The “Kabsch algorithm” (or “Kabsch rotation”) is the standard method for computing the root-mean-square deviation (RMSD) between two molecular structures after optimal superposition. It underpins structure comparison tools across crystallography, NMR spectroscopy, cryo-EM, and computational chemistry.

Paper Information

Citation: Kabsch, W. (1976). A solution for the best rotation to relate two sets of vectors. Acta Crystallographica Section A, 32(5), 922-923. https://doi.org/10.1107/s0567739476001873

Publication: Acta Crystallographica Section A, 1976

Additional Resources:

Kabsch Algorithm: NumPy, PyTorch, TensorFlow, and JAX (tutorial with differentiable implementations)
Kabsch-Horn Cookbook (a differentiable, gradient-safe implementation of Kabsch, Horn, and Umeyama alignment across NumPy, PyTorch, JAX, TensorFlow, and MLX)

@article{kabsch1976solution,
  title={A solution for the best rotation to relate two sets of vectors},
  author={Kabsch, Wolfgang},
  journal={Acta Crystallographica Section A: Crystal Physics, Diffraction, Theoretical and General Crystallography},
  volume={32},
  number={5},
  pages={922--923},
  year={1976},
  publisher={International Union of Crystallography},
  doi={10.1107/s0567739476001873}
}

Horn's Method: Absolute Orientation via Unit Quaternions

Sun, 15 Mar 2026 00:00:00 +0000

A Quaternion Approach to Point Set Registration

This Method paper presents a closed-form solution to the absolute orientation problem: given corresponding points measured in two different coordinate systems, find the optimal rotation, translation, and scale that maps one set onto the other. While the Kabsch algorithm (1976) solved the rotation subproblem via eigendecomposition of $\tilde{\mathsf{R}}\mathsf{R}$, Horn’s approach uses unit quaternions to represent rotation, reducing the problem to finding the eigenvector of a $4 \times 4$ symmetric matrix associated with its largest eigenvalue.

The Absolute Orientation Problem

Given $n$ point pairs ${\mathbf{r}_{l,i}}$ and ${\mathbf{r}_{r,i}}$ measured in “left” and “right” coordinate systems, find the transformation:

$$ \mathbf{r}_r = s , R(\mathbf{r}_l) + \mathbf{r}_0 $$

where $s$ is a scale factor, $R$ is a rotation, and $\mathbf{r}_0$ is a translation, minimizing the sum of squared residual errors:

$$ \sum_{i=1}^{n} \lVert \mathbf{r}_{r,i} - s , R(\mathbf{r}_{l,i}) - \mathbf{r}_0 \rVert^2 $$

Prior methods either used iterative numerical procedures or selectively discarded constraints (e.g., Thompson’s and Schut’s three-point methods). Horn derives a direct solution that uses all available information from all points simultaneously.

Decoupling Translation, Scale, and Rotation

Horn shows that the three components of the transformation can be solved sequentially.

Translation: After centering both point sets at their centroids ($\bar{\mathbf{r}}_l$ and $\bar{\mathbf{r}}_r$), the optimal translation is:

$$ \mathbf{r}_0 = \bar{\mathbf{r}}_r - s , R(\bar{\mathbf{r}}_l) $$

Scale: Horn derives three formulations (asymmetric left, asymmetric right, and symmetric). The symmetric version, which ensures the inverse transformation yields the reciprocal scale, is:

$$ s = \left( \frac{\sum_{i=1}^{n} \lVert \mathbf{r}’_{r,i} \rVert^2}{\sum_{i=1}^{n} \lVert \mathbf{r}’_{l,i} \rVert^2} \right)^{1/2} $$

the ratio of root-mean-square deviations from the respective centroids.

Rotation: After removing translation and scale, the remaining problem is to find the rotation $R$ that maximizes:

$$ \sum_{i=1}^{n} \mathbf{r}’_{r,i} \cdot R(\mathbf{r}’_{l,i}) $$

The Quaternion Eigenvector Solution

Horn represents rotation using unit quaternions $\dot{q} = q_0 + i q_x + j q_y + k q_z$ with $\lVert \dot{q} \rVert = 1$. A rotation acts on a vector (represented as a purely imaginary quaternion $\dot{r}$) via the composite product:

$$ \dot{r}’ = \dot{q} , \dot{r} , \dot{q}^* $$

Using the $4 \times 4$ matrix representations of quaternion products, the objective function becomes a quadratic form:

$$ \dot{q}^T N \dot{q} $$

where $N$ is a real symmetric $4 \times 4$ matrix whose elements are combinations of the sums of products $S_{xx}, S_{xy}, \ldots, S_{zz}$ from the $3 \times 3$ cross-covariance matrix $M = \sum_i \mathbf{r}’_{l,i} \mathbf{r}’^T_{r,i}$:

$$ N = \begin{bmatrix} (S_{xx} + S_{yy} + S_{zz}) & S_{yz} - S_{zy} & S_{zx} - S_{xz} & S_{xy} - S_{yx} \\ S_{yz} - S_{zy} & (S_{xx} - S_{yy} - S_{zz}) & S_{xy} + S_{yx} & S_{zx} + S_{xz} \\ S_{zx} - S_{xz} & S_{xy} + S_{yx} & (-S_{xx} + S_{yy} - S_{zz}) & S_{yz} + S_{zy} \\ S_{xy} - S_{yx} & S_{zx} + S_{xz} & S_{yz} + S_{zy} & (-S_{xx} - S_{yy} + S_{zz}) \end{bmatrix} $$

The trace of $N$ is always zero. The unit quaternion maximizing $\dot{q}^T N \dot{q}$ is the eigenvector corresponding to the most positive eigenvalue of $N$.

The Characteristic Polynomial

The eigenvalues satisfy a quartic $\lambda^4 + c_3 \lambda^3 + c_2 \lambda^2 + c_1 \lambda + c_0 = 0$ where:

$c_3 = 0$ (trace of $N$ is zero, so the four roots sum to zero)
$c_2 = -2 \operatorname{Tr}(M^T M)$ (always negative, guaranteeing both positive and negative roots)
$c_1 = -8 \det(M)$
$c_0 = \det(N)$

When points are coplanar (including the common case of exactly three points), $\det(M) = 0$, so $c_1 = 0$ and the quartic reduces to a biquadratic solvable in closed form.

Coplanar Points and the Three-Point Case

For coplanar measurements, the quartic simplifies to $\lambda^4 + c_2 \lambda^2 + c_0 = 0$, yielding:

$$ \lambda_m = \left[ \frac{1}{2} \left( (c_2^2 - 4c_0)^{1/2} - c_2 \right) \right]^{1/2} $$

Horn also provides a geometric interpretation for the coplanar case: first rotate one plane into the other (about their line of intersection), then solve a 2D least-squares rotation within the shared plane.

Comparison with the Kabsch Algorithm

Both methods solve the same underlying optimization problem but approach it differently:

Aspect	Kabsch (1976)	Horn (1987)
Rotation representation	Orthogonal matrix	Unit quaternion
Core computation	SVD or eigendecomposition of $\tilde{R}R$ ($3 \times 3$)	Eigenvector of $N$ ($4 \times 4$)
Scale estimation	Not addressed	Three formulations (including symmetric)
Constraint enforcement	Lagrange multipliers	Unit quaternion norm
Symmetry guarantee	Not addressed	Proven for symmetric scale
Degenerate cases	Cross-product fallback	Biquadratic closed form

Horn emphasizes a symmetry property: the inverse transformation should yield exactly the inverse parameters. This holds automatically for the quaternion rotation but requires a specific (symmetric) choice of scale formula.

Paper Information

Citation: Horn, B. K. P. (1987). Closed-form solution of absolute orientation using unit quaternions. Journal of the Optical Society of America A, 4(4), 629-642. https://doi.org/10.1364/JOSAA.4.000629

Publication: Journal of the Optical Society of America A, 1987

Additional Resources:

Kabsch Algorithm: NumPy, PyTorch, TensorFlow, and JAX (tutorial with differentiable implementations of the related SVD-based method)
Kabsch-Horn Cookbook (a differentiable, gradient-safe implementation of Kabsch, Horn, and Umeyama alignment across NumPy, PyTorch, JAX, TensorFlow, and MLX)

@article{horn1987closed,
  title={Closed-form solution of absolute orientation using unit quaternions},
  author={Horn, Berthold K. P.},
  journal={Journal of the Optical Society of America A},
  volume={4},
  number={4},
  pages={629--642},
  year={1987},
  publisher={Optica Publishing Group},
  doi={10.1364/josaa.4.000629}
}

DynamicFlow: Integrating Protein Dynamics into Drug Design

Sat, 20 Dec 2025 00:00:00 +0000

What kind of paper is this?

This is primarily a Methodological Paper ($\Psi_{\text{Method}}$) with a strong Resource ($\Psi_{\text{Resource}}$) component.

Method: It proposes DynamicFlow, a novel multiscale architecture combining atom-level SE(3)-equivariant GNNs (SE(3) is the special Euclidean group in 3D: the set of all 3D rotations and translations, and equivariance means predictions transform consistently under those symmetries) and residue-level Transformers within a flow matching framework to model the joint distribution of ligand generation and protein conformational change.
Resource: It curates a significant dataset derived from MISATO, pairing AlphaFold2-predicted apo structures with multiple MD-simulated holo states, specifically filtered for flow matching tasks.

What is the motivation?

Traditional Structure-Based Drug Design (SBDD) methods typically assume the protein target is rigid, which limits their applicability because proteins are dynamic and undergo conformational changes (induced fit) upon ligand binding.

Biological Reality: Proteins exist as ensembles of states; binding often involves transitions from “apo” (unbound) to “holo” (bound) conformational changes, sometimes revealing cryptic pockets.
Computational Bottleneck: Molecular Dynamics (MD) simulates these changes but incurs high computational costs due to energy barriers.
Gap: Existing generative models for SBDD mostly condition on a fixed pocket structure, ignoring the co-adaptation of the protein and ligand.

What is the novelty here?

The core novelty is the simultaneous modeling of ligand generation and protein conformational dynamics using a unified flow matching framework.

DynamicFlow Architecture: A multiscale model that treats the protein as both full-atom (for interaction) and residue-level frames (for large-scale dynamics), utilizing separate flow matching objectives for backbone frames, side-chain torsions, and ligand atoms.
Stochastic Flow (SDE): Introduction of a stochastic variant (DynamicFlow-SDE) that improves robustness and diversity compared to the deterministic ODE flow.
Coupled Generation: The model learns to transport the apo pocket distribution to the holo pocket distribution while simultaneously denoising the ligand, advancing beyond rigid pocket docking methods.

What experiments were performed?

The authors validated the method on a curated dataset of 5,692 protein-ligand complexes.

Baselines: Compared against rigid-pocket SBDD methods: Pocket2Mol, TargetDiff, and IPDiff (adapted as TargetDiff* and IPDiff* for fair comparison of atom numbers). Also compared against conformation sampling baselines (Str2Str).
Metrics:
- Ligand Quality: Vina Score (binding affinity), QED (drug-likeness), SA (synthesizability), Lipinski’s rule of 5.
- Pocket Quality: RMSD between generated and ground-truth holo pockets, Cover Ratio (percentage of holo states successfully retrieved), and Pocket Volume distributions.
- Interaction: Protein-Ligand Interaction Profiler (PLIP) to measure specific non-covalent interactions.
Ablations: Tested the impact of the interaction loss, residue-level Transformer, and SDE vs. ODE formulations.

What outcomes/conclusions?

Improved Affinity: DynamicFlow-SDE achieved the best (lowest) Vina scores ($-7.65$) compared to baselines like TargetDiff ($-5.09$) and Pocket2Mol ($-5.50$). Note that Vina scores are a computational proxy and do not directly predict experimental binding affinity. Moreover, Vina score optimization is gameable: molecules can achieve strong computed binding energies while remaining synthetically inaccessible. QED and SA scores, which assess drug-likeness and synthesizability respectively, were reported but were not primary optimization targets in the paper, which limits the strength of this affinity claim.
Realistic Dynamics: The model successfully generated holo-like pocket conformations with volume distributions and interaction profiles closer to ground-truth MD simulations than the initial apo structures.
Enhancing Rigid Methods: Holo pockets generated by DynamicFlow served as better inputs for rigid-SBDD baselines (e.g., TargetDiff improved from $-5.09$ to $-9.00$ and IPDiff improved from $-7.55$ to $-11.04$ when using “Our Pocket”), suggesting the method can act as a “pocket refiner”.
ODE vs. SDE Trade-off: The deterministic ODE variant achieves better pocket RMSD, while the stochastic SDE variant achieves better Cover Ratio (diversity of holo states captured) and binding affinity. Neither dominates uniformly.
Conformation Sampling Baseline: Str2Str, a dedicated conformation sampling baseline, performed worse than simply perturbing the apo structure with noise. One interpretation is that this highlights the difficulty of the apo-to-holo prediction task; another is that Str2Str was not designed specifically for apo-to-holo prediction, making it a limited test of its capabilities.

Reproducibility Details

Data

The dataset is derived from MISATO, which contains MD trajectories for PDBbind complexes.

Purpose	Dataset	Size	Notes
Training/Test	Curated MISATO	5,692 complexes	Filtered for valid MD (RMSD $< 3\text{\AA}$), clustered to remove redundancy. Contains 46,235 holo-ligand conformations total.
Apo Structures	AlphaFold2	N/A	Apo structures were obtained by mapping PDB IDs to UniProt and retrieving AlphaFold2 predictions, then aligning to MISATO structures.
Splits	Standard	50 test complexes	50 complexes with no overlap with the training set selected for testing. Note: 50 is a small held-out set; results should be interpreted cautiously.

Preprocessing:

Clustering: Holo-ligand conformations clustered with RMSD threshold $1.0\text{\AA}$; top 10 clusters kept per complex.
Pocket Definition: Residues within $7\text{\AA}$ of the ligand.
Alignment: AlphaFold predicted structures (apo) aligned to MISATO holo structures using sequence alignment (Smith-Waterman) to identify pocket residues.

Algorithms

Flow Matching Framework:

Continuous Variables (Pocket translation/rotation/torsions, Ligand positions): Modeled using Conditional Flow Matching (CFM).
- Prior: Apo state for pocket; Normal distribution for ligand positions.
- Target: Holo state from MD; Ground truth ligand.
- Interpolant: Linear interpolation for Euclidean variables; Geodesic for rotations ($SO(3)$, the rotation-only subgroup of SE(3) containing all 3D rotations but not translations); Wrapped linear interpolation for torsions (Torus).
Discrete Variables (Ligand atom/bond types): Modeled using Discrete Flow Matching based on Continuous-Time Markov Chains (CTMC).
- Rate Matrix: Interpolates between mask token and data distribution.
Loss Function: Weighted sum of 7 losses:
1. Translation CFM (Eq 5)
2. Rotation CFM (Eq 7)
3. Torsion CFM (Eq 11)
4. Ligand Position CFM
5. Ligand Atom Type CTMC (Eq 14)
6. Ligand Bond Type CTMC
7. Interaction Loss (Eq 18): Explicitly penalizes deviations in pairwise distances between protein and ligand atoms for pairs $\leq 3.5\text{\AA}$.

Models

Architecture: DynamicFlow is a multiscale model with 15.9M parameters.

Atom-Level SE(3)-Equivariant GNN:
- Input: Complex graph (k-NN) and Ligand graph (fully connected).
- Layers: 6 EGNN blocks modified to maintain node and edge hidden states.
- Function: Updates ligand positions and predicts ligand atom/bond types.
Residue-Level Transformer:
- Input: Aggregated atom features from the GNN + Residue frames/torsions.
- Layers: 4 Transformer blocks with Invariant Point Attention (IPA).
- Function: Updates protein residue frames (translation/rotation) and predicts side-chain torsions.

Evaluation

Metrics:

Vina Score: vina_minimize mode used for binding affinity.
RMSD: Minimum RMSD between generated pocket and ground-truth holo conformations.
Cover Ratio: % of ground-truth holo conformations covered by at least one generated sample (threshold $1.42\text{\AA}$).
POVME 3: For pocket volume calculation.

Hardware

Inference Benchmark: 1x Tesla V100-SXM2-32GB.
Speed: Generates 10 ligands in ~35-36 seconds (100 NFE), significantly faster than diffusion baselines like Pocket2Mol (980s) or TargetDiff (156s).

Paper Information

Citation: Zhou, X., Xiao, Y., Lin, H., He, X., Guan, J., Wang, Y., Liu, Q., Zhou, F., Wang, L., & Ma, J. (2025). Integrating Protein Dynamics into Structure-Based Drug Design via Full-Atom Stochastic Flows. International Conference on Learning Representations (ICLR). https://arxiv.org/abs/2503.03989

Publication: ICLR 2025

@inproceedings{zhouIntegratingProteinDynamics2025,
  title = {Integrating Protein Dynamics into Structure-Based Drug Design via Full-Atom Stochastic Flows},
  author = {Zhou, Xiangxin and Xiao, Yi and Lin, Haowei and He, Xinheng and Guan, Jiaqi and Wang, Yang and Liu, Qiang and Zhou, Feng and Wang, Liang and Ma, Jianzhu},
  booktitle = {International Conference on Learning Representations},
  year = {2025},
  url = {https://arxiv.org/abs/2503.03989}
}

Additional Resources:

arXiv Page
Code: no public repository available at time of writing

How to Fold Graciously: Levinthal's Paradox (1969)

Mon, 08 Sep 2025 00:00:00 +0000

What kind of paper is this?

This is technically a transcription of a conference talk, not a paper Levinthal wrote himself. The proceedings page credits “Notes by: A. Rawitch, Retranscribed: B. Krantz”, meaning what we have is a third-party record of an oral presentation Levinthal gave at the 1969 Mössbauer Spectroscopy in Biological Systems meeting at Allerton House, Illinois. This explains the informal, conversational register and the attached Q&A discussion.

In terms of contribution type, it functions as a Position paper (with Theory and Discovery elements):

Position: Defines a “Grand Challenge” and argues for a conceptual shift in how we view biomolecular assembly
Theory: Uses formal combinatorial arguments to establish the bounds of the search space ($10^{300}$ configurations)
Discovery: Uses experimental data on alkaline phosphatase to validate the kinetic hypothesis

What is the motivation?

The Central Question: How does a protein choose one unique structure out of a hyper-astronomical number of possibilities in a biological timeframe (seconds)?

Levinthal provides a “back-of-the-envelope” derivation to define the problem scope:

Degrees of Freedom: A generic, unrestricted protein with 2,000 atoms would possess ~6,000 degrees of freedom. However, physical constraints (specifically the planar peptide bond) reduce this significantly. For a 150-amino acid protein, these constraints lower the complexity to ~450 degrees of freedom (300 rotations, 150 bond angles).
The Combinatorial Explosion: Even with conservative estimates, this results in $10^{300}$ possible conformations.
The Time Constraint: Since proteins fold in seconds, Levinthal argues they can sample at most $10^8$ conformations (“postulating a minimum time from one conformation to another”) before stabilizing. Against $10^{300}$ possibilities, this search effectively covers 0% of the space, proving the impossibility of random search.

The Insight: The existence of folded proteins proves the impossibility of random global search. The system must be guided.

What is the novelty here?

Core Contribution: Levinthal reframes folding from a thermodynamic problem (seeking the absolute global minimum) to a Kinetic Control problem. He argues the native state is a “metastable” energy well found quickly by a specific pathway, which can differ from the system’s lowest possible energy state.

The Pathway Dependence Hypothesis

The key insights of kinetic control:

Nucleation: The process is “speeded and guided by the rapid formation of local interactions”
Pathway Constraints: Local amino acid sequences form stable interactions and serve as nucleation points in the folding process, restricting the conformational search space
The “Metastable” State: The final structure represents a “metastable state” in a sufficiently deep energy well that is kinetically accessible via the folding pathway, independent of the global energy minimum. Think of a ball that rolls into a valley on the side of a hill and stays there: it is not in the lowest valley on the entire landscape, but it is stable enough that it never escapes.

The Energy Landscape Funnel: The modern resolution to Levinthal’s Paradox. While Levinthal envisioned a single guided pathway, the ‘funnel’ model (Wolynes, Dill) shows that many different pathways can lead to the same native state basin. The roughness of the funnel surface represents local energy minima (kinetic traps) that can slow folding.

What experiments were performed?

To support the pathway hypothesis, Levinthal cites work on Alkaline Phosphatase (MW ~40,000), utilizing its property as a dimer of two identical subunits:

Renaturation Window: The wild-type enzyme refolds optimally at 37°C. However, mutants were isolated that only produce active enzyme (and renature) at temperatures below 37°C.
Stability vs. Formation: Crucially, once folded, both the wild-type and mutant enzymes are stable up to 90°C.
The Rate-Limiting Step: Levinthal notes that the rate-limiting step for activity is the formation of the dimer from monomers. This proves that the order of assembly (kinetic pathway) dictates the final structure, distinct from the final structure’s thermodynamic stability.

The talk concluded with a short motion picture Levinthal showed live, illustrating polypeptide synthesis and “the process of then forming a desired interaction via the most favored energy path as displayed on the computer controlled oscilloscope.”

The Q&A discussion following the talk includes one exchange directly relevant to the folding argument: when asked whether a protein is ever truly unfolded (devoid of all secondary and tertiary structure), Levinthal answered that both physical measurements and synthetic polypeptide work suggest yes. The other exchanges concerned the tangent formula for x-ray crystallographic phase refinement and whether computed structures had been tested for thermal perturbations.

What outcomes/conclusions?

Key Finding

The mutant experiments serve as the “smoking gun”: a protein seeking a global thermodynamic minimum would fold spontaneously at any temperature where the final state is stable (up to 90°C). The fact that mutants require specific lower temperatures for formation (while remaining stable at high temperatures once formed) proves that the kinetic pathway determines the outcome alongside the thermodynamic endpoint.

Broader Implications

Levinthal explicitly asks: “Is a unique folding necessary for any random 150-amino acid sequence?” and answers “Probably not.” He supports this by noting the difficulty many researchers face in attempting to crystallize proteins, suggesting that not all sequences produce stably folded structures.

He concludes by connecting these computational models to Mössbauer spectroscopy, suggesting that these computational studies may help in understanding how small perturbations of polypeptide structures affect the Mössbauer nucleus (a reminder of the specific conference context where this perspective was delivered).

Connection to Modern Work

Levinthal’s arguments remain relevant context for modern computational protein folding:

Early computational visualization: Levinthal used computer-controlled oscilloscopes and vector matrix multiplications to build and display 3D polypeptide structures, and showed a motion picture of forming a desired interaction via the most favored energy path. This was an early instance of computational molecular visualization.
Local interactions and folding pathways: The hypothesis that “local interactions” serve as nucleation points that guide folding remains central to how modern structure prediction methods (e.g., AlphaFold) model residue-residue interactions.
The paradox’s lasting influence: The impossibility of random conformational search that Levinthal articulated continues to motivate approaches that exploit the structure of the energy landscape rather than exhaustive enumeration.
Sequence-structure relationship: Levinthal’s suggestion that not every random amino acid sequence would fold uniquely foreshadows the modern challenge of inverse folding (protein design), where the goal is to find sequences within the subset that does fold to a target structure.

Paper Information

Citation: Levinthal, C. (1969). How to Fold Graciously. In Mössbauer Spectroscopy in Biological Systems: Proceedings of a meeting held at Allerton House, Monticello, Illinois (pp. 22-24). University of Illinois Press.

Publication: Mössbauer Spectroscopy in Biological Systems Proceedings, 1969

@inproceedings{levinthal1969fold,
  title={How to fold graciously},
  author={Levinthal, Cyrus},
  booktitle={M{\"o}ssbauer spectroscopy in biological systems},
  pages={22--24},
  year={1969},
  publisher={University of Illinois Press},
  url={https://faculty.cc.gatech.edu/~turk/bio_sim/articles/proteins_levinthal_1969.pdf}
}

Additional Resources:

Levinthal’s Paradox (Wikipedia)