Abstract
Deep learning networks have become one of the most promising architectures for image parsing tasks. Although existing deep networks consider global and local contextual information of the images to learn coarse features individually, they lack automatic adaptation to the contextual properties of scenes. In this work, we present a visual and contextual feature-based deep network for image parsing. The main novelty is in the 3-layer architecture which considers contextual information and each layer is independently trained and integrated. The network explores the contextual features along with the visual features for class label prediction with class-specific classifiers. The contextual features consider the prior information learned by calculating the co-occurrence of object labels both within a whole scene and between neighboring superpixels. The class-specific classifier deals with an imbalance of data for various object categories and learns the coarse features for every category individually. A series of weak classifiers in combination with boosting algorithms are investigated as classifiers along with the aggregated contextual features. The experiments were conducted on the benchmark Stanford background dataset which showed that the proposed architecture produced the highest average accuracy and comparable global accuracy.
| Original language | English |
|---|---|
| Title of host publication | 2020 35th International Conference on Image and Vision Computing New Zealand, IVCNZ 2020 |
| Publisher | IEEE Computer Society |
| ISBN (Electronic) | 9781728185798 |
| DOIs | |
| Publication status | Published - 25 Nov 2020 |
| Externally published | Yes |
| Event | 35th International Conference on Image and Vision Computing New Zealand, IVCNZ 2020 - Wellington, New Zealand Duration: 25 Nov 2020 → 27 Nov 2020 |
Publication series
| Name | International Conference Image and Vision Computing New Zealand |
|---|---|
| Volume | 2020-November |
| ISSN (Print) | 2151-2191 |
| ISSN (Electronic) | 2151-2205 |
Conference
| Conference | 35th International Conference on Image and Vision Computing New Zealand, IVCNZ 2020 |
|---|---|
| Country/Territory | New Zealand |
| City | Wellington |
| Period | 25/11/20 → 27/11/20 |
UN SDGs
This output contributes to the following UN Sustainable Development Goals (SDGs)
-
SDG 9 Industry, Innovation, and Infrastructure
-
SDG 14 Life Below Water
Keywords
- deep learning
- Image parsing
- object detection
- scene understanding
- semantic segmentation
Fingerprint
Dive into the research topics of 'Class Probability-based Visual and Contextual Feature Integration for Image Parsing'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver