Abstract
Deep learning based compressive sensing (CS) methods typically learn sampling operators using convolutional or block wise fully connected layers, which limit receptive fields and scale poorly for high dimensional data. We propose MTSCSNet, a CS framework based on Multiscale Tensor Summation (MTS) factorization, a structured operator for efficient multidimensional signal processing. MTS performs mode-wise linear transformations with multiscale summation, enabling large receptive fields and effective modeling of cross-dimensional correlations. In MTSCSNet, MTS is first used as a learnable CS operator that performs linear dimensionality reduction in tensor space, with its adjoint defining the initial back-projection, and is then applied in the reconstruction stage to directly refine this estimate. This results in a simple feed-forward architecture without iterative or proximal optimization, while remaining parameter and computation efficient. Experiments on standard CS benchmarks show that MTSCSNet achieves state-of-the-art reconstruction performance on RGB images, with notable PSNR gains and faster inference, even compared to recent diffusion-based CS methods, while using a significantly more compact feed-forward architecture.
Abstract (translated)
基于深度学习的压缩感知(CS)方法通常使用卷积层或分块全连接层来学习采样算子,这些方法会限制感受野并导致在高维数据上的性能不佳。我们提出了MTSCSNet框架,该框架基于多尺度张量求和(MTS)分解,这是一种用于高效处理多维信号的结构化操作符。MTS通过模式级线性变换与多尺度求和实现大范围的感受野,并能有效建模跨维度的相关性。 在MTSCSNet中,首先使用MTS作为可学习的CS算子,在张量空间执行线性降维处理,其伴随算子定义了初始反投影。然后,在重建阶段应用MTS直接精炼该估计值。这一过程构建了一个简单的前馈架构,无需迭代或近似优化,并且保持参数和计算效率。 实验结果表明,基于标准CS基准测试的MTSCSNet在RGB图像重构性能上取得了当前最佳的结果,展示了显著的PSNR增益和更快的推理速度,即使与最近基于扩散的CS方法相比也表现出色。同时,它使用了一个更为紧凑的前馈架构。
URL
https://arxiv.org/abs/2602.07056