cvg/pixel-perfect-sfm
A structure-from-motion and visual localization framework that refines keypoints, camera poses, and 3D points using deep feature alignment.

This research project provides a Python package (pixsfm) that integrates with COLMAP and hloc for improving 3D reconstruction accuracy. It refines structure-from-motion results through two-step featuremetric optimization: first adjusting 2D keypoints jointly before SfM, then refining 3D points and camera poses via bundle adjustment. The refinement uses pre-trained CNN features extracted from images to minimize consistency across multiple views.