Skip to main navigation Skip to search Skip to main content

On the Rate-Distortion-Complexity Trade-Offs of Neural Video Coding

Yi Hsin Chen, Kuan Wei Ho, Martin Benjak, Jorn Ostermann, Wen Hsiao Peng

Research output: Chapter in book/report/conference proceedingConference contributionResearchpeer review

Abstract

This paper aims to delve into the rate-distortion-complexity trade-offs of modern neural video coding. Recent years have witnessed much research effort being focused on exploring the full potential of neural video coding. Conditional auto encoders have emerged as the mainstream approach to efficient neural video coding. The central theme of conditional auto encoders is to leverage both spatial and temporal information for better conditional coding. However, a recent study indicates that conditional coding may suffer from information bottlenecks, potentially performing worse than traditional residual coding. To address this issue, recent conditional coding methods incorporate a large number of high-resolution features as the condition signal, leading to a considerable increase in the number of multiply-accumulate operations, memory footprint, and model size. Taking DCVC as the common code base, we investigate how the newly proposed conditional residual coding, an emerging new school of thought, and its variants may strike a better balance among rate, distortion, and complexity.

Original languageEnglish
Title of host publication2024 IEEE 26th International Workshop on Multimedia Signal Processing
Subtitle of host publicationMMSP 2024
PublisherInstitute of Electrical and Electronics Engineers Inc.
Number of pages6
ISBN (Electronic)9798350387254
ISBN (Print)979-8-3503-8726-1
DOIs
Publication statusPublished - 2 Oct 2024
Event26th IEEE International Workshop on Multimedia Signal Processing, MMSP 2024 - West Lafayette, United States
Duration: 2 Oct 20244 Oct 2024

Conference

Conference26th IEEE International Workshop on Multimedia Signal Processing, MMSP 2024
Country/TerritoryUnited States
CityWest Lafayette
Period2 Oct 20244 Oct 2024

Keywords

  • conditional coding
  • conditional residual coding
  • Learned video compression

ASJC Scopus subject areas

  • Signal Processing
  • Media Technology
  • Artificial Intelligence
  • Computer Networks and Communications
  • Computer Vision and Pattern Recognition

Cite this