I have three images of the same scene with a set of triple correspondences. I can calculate the trifocal tensor and recover pairwise fundamental matrices, rotation matrices and translation vectors (up to a scale between first and second translations). This scale can be easily calculated in a noiseless case (simulation).

However, as soon as even small noise is added (Gaussian, with sigma=0.1 pixel), all estimations go seriously wrong. All triples are guaranteed inliers, but even optimization techniques fail to find correct (known in simulation) scale.

Does anyone know where the catch might be?

Similar questions and discussions