Attention-Based Multi-Kernelized and Boundary-Aware Network for image semantic segmentation

  • Xuanchen Zhou
  • , Gengshen Wu
  • , Xin Sun
  • , Pengpeng Hu
  • , Yi Liu

Research output: Contribution to journalArticlepeer-review

40 Downloads (Pure)

Abstract

Recent studies have witnessed the prosperity of image semantic segmentation brought by deep learning technology, including the integration of diverse network structures, attention mechanisms, and even boundary perception. Essentially, the main challenge of semantic segmentation lies in how to incorporate and align more information like contexts and details in the feature representation enabling the network to better distinguish and recognize objects in an image for improved semantic understanding. To this end, we propose a novel end-to-end deep framework termed Attention-Based Multi-Kernelized and Boundary-Aware Network to enhance the semantic segmentation performance. Specifically, the proposed network enables thoroughly exploring and engaging the multi-kernelized contextual information with high-frequency boundary information by applying novel modules of Multi-Kernelized Spatial Attention and Boundary-Aware Hybrid Attention, aiming to enhance the segmentation result with more comprehensive feature representation capability. Extensive experiments further consolidate the superiority of the proposed network with improved segmentation results with the mIoU of 51.1% and 82.9% on ADE20K and Cityscapes datasets, as well as classification results with Top-1 accuracy of 84.0% on ImageNet-1K dataset against competitive baselines.
Original languageEnglish
Article number127988
Number of pages12
JournalNeurocomputing
Volume597
Early online date3 Jun 2024
DOIs
Publication statusPublished - 7 Sept 2024

Bibliographical note

© 2024, Elsevier. Licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International http://creativecommons.org/licenses/by-nc-nd/4.0/

Copyright © and Moral Rights are retained by the author(s) and/ or other copyright owners. A copy can be downloaded for personal non-commercial research or study, without prior permission or charge. This item cannot be reproduced or quoted extensively from without first obtaining permission in writing from the copyright holder(s). The content must not be changed in any way or sold commercially in any format or medium without the formal permission of the copyright holders.

This document is the author’s post-print version, incorporating any revisions agreed during the peer-review process. Some differences between the published version and this version may remain and you are advised to consult the published version if you wish to cite from it

Funder

This work was supported by the Science and Technology Development Fund, Macao SAR [grant number 0004/2023/ITP1].

Funding

This work was supported by the Science and Technology Development Fund, Macao SAR [grant number 0004/2023/ITP1].

FundersFunder number
Fundo para o Desenvolvimento das Ciências e da Tecnologia0004/2023/ITP1

    Keywords

    • Boundary perception
    • Convolutional neural network
    • Image semantic segmentation
    • Spatial attention

    ASJC Scopus subject areas

    • Artificial Intelligence
    • Cognitive Neuroscience
    • Computer Science Applications

    Fingerprint

    Dive into the research topics of 'Attention-Based Multi-Kernelized and Boundary-Aware Network for image semantic segmentation'. Together they form a unique fingerprint.

    Cite this