Paper Reading AI Learner

Two-layer Near-lossless HDR Coding with Backward Compatibility to JPEG

2019-05-09 09:12:41
Hiroyuki Kobayashi, Osamu Watanabe, Hitoshi Kiya

Abstract

We propose an efficient two-layer near-lossless coding method using an extended histogram packing technique with backward compatibility to the legacy JPEG standard. The JPEG XT, which is the international standard to compress HDR images, adopts a two-layer coding method for backward compatibility to the legacy JPEG standard. However, there are two problems with this two-layer coding method. One is that it does not exhibit better near-lossless performance than other methods for HDR image compression with single-layer structure. The other problem is that the determining the appropriate values of the coding parameters may be required for each input image to achieve good compression performance of near-lossless compression with the two-layer coding method of the JPEG XT. To solve these problems, we focus on a histogram-packing technique that takes into account the histogram sparseness of HDR images. We used zero-skip quantization, which is an extension of the histogram-packing technique proposed for lossless coding, for implementing the proposed near-lossless coding method. The experimental results indicate that the proposed method exhibits not only a better near-lossless compression performance than that of the two-layer coding method of the JPEG XT, but also there are no issue regarding the combination of parameter values without losing backward compatibility to the JPEG standard.

Abstract (translated)

我们提出了一种高效的两层近无损编码方法,该方法使用了一种扩展的柱状图压缩技术,与传统的jpeg标准具有向后兼容性。jpeg-xt是HDR图像压缩的国际标准,它采用了与传统jpeg标准向后兼容的两层编码方法。然而,这种双层编码方法存在两个问题。其一,对于单层结构的HDR图像压缩,其近无损性能不如其它方法好。另一个问题是,用jpeg-xt的两层编码方法,可能需要确定每个输入图像的编码参数的适当值,以获得接近无损压缩的良好压缩性能。为了解决这些问题,我们重点研究了一种考虑HDR图像直方图稀疏性的直方图填充技术。我们使用零跳量化(零跳量化)来实现所提出的近无损编码方法,这是对无损编码的直方图填充技术的扩展。实验结果表明,该方法不仅具有比jpeg-xt两层编码方法更好的近无损压缩性能,而且在不丢失对jpeg标准向后兼容性的情况下,参数值的组合也没有问题。

URL

https://arxiv.org/abs/1905.04129

PDF

https://arxiv.org/pdf/1905.04129.pdf


Tags
3D Action Action_Localization Action_Recognition Activity Adversarial Agent Attention Autonomous Bert Boundary_Detection Caption Chat Classification CNN Compressive_Sensing Contour Contrastive_Learning Deep_Learning Denoising Detection Dialog Diffusion Drone Dynamic_Memory_Network Edge_Detection Embedding Embodied Emotion Enhancement Face Face_Detection Face_Recognition Facial_Landmark Few-Shot Gait_Recognition GAN Gaze_Estimation Gesture Gradient_Descent Handwriting Human_Parsing Image_Caption Image_Classification Image_Compression Image_Enhancement Image_Generation Image_Matting Image_Retrieval Inference Inpainting Intelligent_Chip Knowledge Knowledge_Graph Language_Model Matching Medical Memory_Networks Multi_Modal Multi_Task NAS NMT Object_Detection Object_Tracking OCR Ontology Optical_Character Optical_Flow Optimization Person_Re-identification Point_Cloud Portrait_Generation Pose Pose_Estimation Prediction QA Quantitative Quantitative_Finance Quantization Re-identification Recognition Recommendation Reconstruction Regularization Reinforcement_Learning Relation Relation_Extraction Represenation Represenation_Learning Restoration Review RNN Salient Scene_Classification Scene_Generation Scene_Parsing Scene_Text Segmentation Self-Supervised Semantic_Instance_Segmentation Semantic_Segmentation Semi_Global Semi_Supervised Sence_graph Sentiment Sentiment_Classification Sketch SLAM Sparse Speech Speech_Recognition Style_Transfer Summarization Super_Resolution Surveillance Survey Text_Classification Text_Generation Tracking Transfer_Learning Transformer Unsupervised Video_Caption Video_Classification Video_Indexing Video_Prediction Video_Retrieval Visual_Relation VQA Weakly_Supervised Zero-Shot