Class Probability-based Visual and Contextual Feature Integration for Image Parsing

Basim Azam, Ranju Mandal, Ligang Zhang, Brijesh Verma

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

3 Citations (Scopus)

Abstract

Deep learning networks have become one of the most promising architectures for image parsing tasks. Although existing deep networks consider global and local contextual information of the images to learn coarse features individually, they lack automatic adaptation to the contextual properties of scenes. In this work, we present a visual and contextual feature-based deep network for image parsing. The main novelty is in the 3-layer architecture which considers contextual information and each layer is independently trained and integrated. The network explores the contextual features along with the visual features for class label prediction with class-specific classifiers. The contextual features consider the prior information learned by calculating the co-occurrence of object labels both within a whole scene and between neighboring superpixels. The class-specific classifier deals with an imbalance of data for various object categories and learns the coarse features for every category individually. A series of weak classifiers in combination with boosting algorithms are investigated as classifiers along with the aggregated contextual features. The experiments were conducted on the benchmark Stanford background dataset which showed that the proposed architecture produced the highest average accuracy and comparable global accuracy.

Original languageEnglish
Title of host publication2020 35th International Conference on Image and Vision Computing New Zealand, IVCNZ 2020
PublisherIEEE Computer Society
ISBN (Electronic)9781728185798
DOIs
Publication statusPublished - 25 Nov 2020
Event35th International Conference on Image and Vision Computing New Zealand, IVCNZ 2020 - Wellington, New Zealand
Duration: 25 Nov 202027 Nov 2020

Publication series

NameInternational Conference Image and Vision Computing New Zealand
Volume2020-November
ISSN (Print)2151-2191
ISSN (Electronic)2151-2205

Conference

Conference35th International Conference on Image and Vision Computing New Zealand, IVCNZ 2020
Country/TerritoryNew Zealand
CityWellington
Period25/11/2027/11/20

Keywords

  • deep learning
  • Image parsing
  • object detection
  • scene understanding
  • semantic segmentation

Fingerprint

Dive into the research topics of 'Class Probability-based Visual and Contextual Feature Integration for Image Parsing'. Together they form a unique fingerprint.

Cite this