Abstract
A wide variety of computer vision applications rely on superpixel or supervoxel algorithms as a preprocessing step. This underlines the overall importance that these approaches have gained in recent years. However, most methods show a lack of temporal consistency or fail in producing temporally stable superpixels. In this paper, we present an approach to generate temporally consistent superpixels for video content. Our method is formulated as a contour-evolving expectation-maximization framework, which utilizes an efficient label propagation scheme to encourage the preservation of superpixel shapes and their relative positioning over time. By explicitly detecting the occlusion of superpixels and the disocclusion of new image regions, our framework is able to terminate and create superpixels whose corresponding image region becomes hidden or newly appears. Additionally, the occluded parts of superpixels are incorporated in the further optimization. This increases the compliance of the superpixel flow with the optical flow present in the scene. Using established benchmark suites, we show that our approach produces highly competitive results in comparison to state-of-the-art streaming-capable supervoxel and superpixel algorithms for video content. This is further shown by comparing the streaming-capable approaches as basis for the task of interactive video segmentation where the proposed approach provides the lowest overall misclassification rate.
| Originalsprache | Englisch |
|---|---|
| Aufsatznummer | 8353729 |
| Seiten (von - bis) | 1441-1454 |
| Seitenumfang | 14 |
| Fachzeitschrift | IEEE Transactions on Pattern Analysis and Machine Intelligence |
| Jahrgang | 41 |
| Ausgabenummer | 6 |
| DOIs | |
| Publikationsstatus | Veröffentlicht - 1 Juni 2019 |
ASJC Scopus Sachgebiete
- Software
- Maschinelles Sehen und Mustererkennung
- Theoretische Informatik und Mathematik
- Artificial intelligence
- Angewandte Mathematik
Dieses zitieren
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver