Graph Moving Object Segmentation

Jhony H. Giraldo, Sajid Javed, Thierry Bouwmans

Research output: Contribution to journalArticlepeer-review

58 Scopus citations

Abstract

Moving Object Segmentation (MOS) is a fundamental task in computer vision. Due to undesirable variations in the background scene, MOS becomes very challenging for static and moving camera sequences. Several deep learning methods have been proposed for MOS with impressive performance. However, these methods show performance degradation in the presence of unseen videos; and usually, deep learning models require large amounts of data to avoid overfitting. Recently, graph learning has attracted significant attention in many computer vision applications since they provide tools to exploit the geometrical structure of data. In this work, concepts of graph signal processing are introduced for MOS. First, we propose a new algorithm that is composed of segmentation, background initialization, graph construction, unseen sampling, and a semi-supervised learning method inspired by the theory of recovery of graph signals. Second, theoretical developments are introduced, showing one bound for the sample complexity in semi-supervised learning, and two bounds for the condition number of the Sobolev norm. Our algorithm has the advantage of requiring less labeled data than deep learning methods while having competitive results on both static and moving camera videos. Our algorithm is also adapted for Video Object Segmentation (VOS) tasks and is evaluated on six publicly available datasets outperforming several state-of-the-art methods in challenging conditions.

Original languageBritish English
Pages (from-to)2485-2503
Number of pages19
JournalIEEE Transactions on Pattern Analysis and Machine Intelligence
Volume44
Issue number5
DOIs
StatePublished - 1 May 2022

Keywords

  • graph signal processing
  • Moving object segmentation
  • semi-supervised learning
  • unseen videos
  • video object segmentation

Fingerprint

Dive into the research topics of 'Graph Moving Object Segmentation'. Together they form a unique fingerprint.

Cite this