高级检索+

深度图像压缩模型盲点: JPEG2000仍在高熵高频区域获胜

The blind spot of deep image compression: Why JPEG2000 still wins in high-entropy high-frequency regions

  • 摘要: 近年来,虽然基于深度学习的图像压缩方法在多个基准测试中超越传统算法。但应用于包含日常场景、遥感、无人机和含噪视觉数据等多源数据集时,在部分图像中传统的JPEG2000仍然优于深度学习模型,尤其是在具有高频能量和结构复杂的区域。本文系统地分析了8万多幅图像,通过统计验证表明,JPEG2000在等效码率下偶尔能够取得优异的峰值信噪比(PSNR),故提出了深度压缩模型的"高熵高频脆弱性"假说。基于率失真理论,量化了脆弱高频区域特征潜在变量的熵分布,为了克服这一局限性,开发了一个自适应频域分类器,用于动态压缩策略选择。实验证明,与深度压缩模型相比,选择器在日常场景、航拍和噪声数据集上分别实现了0.43%、1.19%和5.87%的相对PSNR提升。提升幅度(2.06 dB)达到人类视觉可察觉差异阈值(JND)的2.1~4.2倍。

     

    Abstract: In recent years, deep learning-based image compression methods have surpassed traditional algorithms in multiple benchmarking studies. However, when applied to multi-source datasets containing daily scenes, remote sensing imagery, UAV captures, and noisy visual data, traditional JPEG2000 still outperforms deep learning models, particularly in regions with high-frequency spectral energy or structurally complex features. This paper systematically analyzes over 80 000 images, with statistical verification demonstrating that JPEG2000 achieves superior peak signal-to-noise ratio (PSNR) at equivalent bitrates in some cases. This observation motivates our proposed High -Entropy High-Frequency Vulnerability Hypothesis for deep learning-based compression models. Based on rate-distortion theory, we quantitatively characterize the entropy distribution patterns of latent variables specifically within vulnerable high-frequency spectral bands. To mitigate this limitation, we develop an adaptive frequency-domain classifier for dynamic compression method selection. Experimental results demonstrate that our selector delivers relative PSNR improvements of 0. 43%, 1. 19%, and 5. 87% compared to end-to-end deep compression models on daily scenes, aerial imagery, and noisy image datasets, respectively. The improvement (2. 06 dB) is 2. 1-4. 2 times the just-noticeable difference (JND) of human visual system's threshold.

     

/

返回文章
返回