Linear Triangulation

Triangulation reconstructs a 3D point from its projections in two or more calibrated views. It is the inverse of the projection operation: given pixel observations and camera poses, recover the world point.

Problem Statement

Given: $n \geq 2$ camera projection matrices ${P_{i}}$ (each $3 \times 4$ ) and corresponding pixel observations ${p_{i} = (u_{i}, v_{i})}$ .

Find: 3D point $X = [X, Y, Z]^{T}$ such that $p_{i} \sim P_{i} [X, 1]^{T}$ .

Assumptions:

Camera poses and intrinsics are known
Correspondences are correct
The point is visible from all views (not occluded)

Derivation (DLT)

For each view $i$ , the projection constraint $p_{i} \sim P_{i} X_{h}$ gives (by cross-multiplication):

$u_{i} (p_{3}^{(i) T} X_{h}) - p_{1}^{(i) T} X_{h} = 0$ $v_{i} (p_{3}^{(i) T} X_{h}) - p_{2}^{(i) T} X_{h} = 0$

where $p_{k}^{(i) T}$ is the $k$ -th row of $P_{i}$ and $X_{h} = [X, Y, Z, 1]^{T}$ .

Stacking all views gives a $2 n \times 4$ system:

$A X_{h} = 0$

Solve via SVD: $X_{h}$ is the right singular vector corresponding to the smallest singular value. Dehomogenize: $X = [X_{h} / w, Y_{h} / w, Z_{h} / w]^{T}$ .

Limitations

No uncertainty modeling: The DLT minimizes algebraic error, not geometric (reprojection) error
Baseline sensitivity: For small baselines (nearly parallel views), the triangulation is poorly conditioned — small pixel errors lead to large depth errors
Outlier sensitivity: A single incorrect correspondence corrupts the result; no robustness mechanism

For high-accuracy 3D reconstruction, triangulation should be followed by bundle adjustment that jointly optimizes points and cameras to minimize reprojection error.

API

#![allow(unused)]
fn main() {
let X = triangulate_point_linear(&projection_matrices, &pixel_points)?;
}

Where each projection matrix $P_{i} = K_{i} [R_{i} ∣ t_{i}]$ is precomputed from the camera intrinsics and pose.

vision-calibration Book

Linear Triangulation

Problem Statement

Derivation (DLT)

Limitations

API