Abstract:To address issues such as detail loss and suboptimal dehazing performance in traditional dehazing networks, this paper proposes an image dehazing method based on a diffusion model and a shifted window strategy. First, a compressed encoder is employed to jointly extract features from hazy images and their corresponding haze-free images. The extracted features are then optimized using a diffusion models (DMs). Subsequently, the optimized features, along with the original hazy image, are fed into a dehazing network based on the U-Network (U-Net) structure. This network partitions the feature map into shifted windows and applies channel attention and spatial attention in parallel within each partition. This approach enables the model to more accurately capture correlations within the image and enhance focus on critical features. The classic encoder-decoder structure effectively restores image details and edge information. Experimental results demonstrate that the proposed method improves the PSNR by 0.47 to 2.62 dB across multiple public datasets.