Tampere University of Technology

TUTCRIS Research Portal

Cross-Granularity Attention Network for Semantic Segmentation

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review


Original languageEnglish
Title of host publication2019 IEEE/CVF International Conference on Computer Vision Workshop (ICCVW)
ISBN (Electronic)9781728150239
Publication statusPublished - 2019
Publication typeA4 Article in a conference publication
EventIEEE/CVF International Conference on Computer Vision Workshop - Seoul, Korea, Republic of
Duration: 27 Oct 201928 Oct 2019

Publication series

NameIEEE International Conference on Computer Vision workshops
ISSN (Electronic)2473-9944


ConferenceIEEE/CVF International Conference on Computer Vision Workshop
CountryKorea, Republic of


Despite the remarkable progress of semantic segmentation in recent years, much remains to be addressed in order to achieve better semantic coherence and boundary delineation. In this paper, we propose a novel convolutional neural network (CNN) architecture for semantic segmentation which explicitly addresses these two issues. Specifically, we propose a categorical attention mechanism to propagate consistent category-oriented information across multi-granularity contextual interpretations to close the semantic gap residing in CNN feature hierarchy. This novel design alleviates the semantic information loss during the feature combination and transformation process in decoder network. We further integrate a contour branch in our architecture to enhance the boundary awareness of the semantic feature derived in the form of a novel element-wise contour attention at each level of feature hierarchy. Additionally, we introduce a cross-granularity contour enhancement mechanism to propagate rich boundary cues from early layers to deep layers. We perform extensive quantitative evaluations in close proximity to object boundaries which confirms its superior effectiveness in boundary delineation. These novel mechanisms which boost the essentials in segmentation, i.e., region-wise semantic coherence and accurate object contour localization, allow our architecture 'MeshNet' to obtain state-of-the-art performance on two challenging datasets, i.e., PASCAL VOC 2012 and Cityscapes.

Publication forum classification

Field of science, Statistics Finland