Abstract:To address the limitations of traditional loop closure detection in terms of localization accuracy and computational efficiency, as well as the high computational cost of Transformer-based methods, this paper proposes an efficient loop detection approach utilizing stereo vision and frequency-domain features. A lightweight feature extraction module is employed to extract key features from stereo images and construct global descriptors, enhancing the efficiency of candidate keyframes selection. The Fast Fourier Transform is introduced to build frequency-domain descriptors, providing a spectral representation of local features. Frequency-domain feature matching is then applied to refine the ranking of candidate keyframes. Meanwhile, a tailored low-level computational strategy accelerates the image matching process. Experimental results show that our method achieves the highest recall on the New College dataset, outperforms 80% of existing algorithms on the Oxford and KITTI datasets, and accelerates image matching by a factor of 1-2. It ensures high localization accuracy, reduces computational costs, and significantly improves the robustness and efficiency of loop detection.