国产欧美精品一区二区,中文字幕专区在线亚洲,国产精品美女网站在线观看,艾秋果冻传媒2021精品,在线免费一区二区,久久久久久青草大香综合精品,日韩美aaa特级毛片,欧美成人精品午夜免费影视

基于自適應注意力機制的輕量化語(yǔ)義分割網(wǎng)絡(luò )
DOI:
CSTR:
作者:
作者單位:

北京工商大學(xué) 計算機與人工智能學(xué)院

作者簡(jiǎn)介:

通訊作者:

中圖分類(lèi)號:

TP183

基金項目:

重慶自然科學(xué)基金(CSTB2022NSCO-MSX1415)


Lightweight Semantic Segmentation Network Based On Adaptive Attention Mechanism
Author:
Affiliation:

Fund Project:

  • 摘要
  • |
  • 圖/表
  • |
  • 訪(fǎng)問(wèn)統計
  • |
  • 參考文獻
  • |
  • 相似文獻
  • |
  • 引證文獻
  • |
  • 資源附件
  • |
  • 文章評論
    摘要:

    針對語(yǔ)義SLAM(simultaneous localization and mapping)中語(yǔ)義分割速度較慢,實(shí)時(shí)性較低、占用資源過(guò)多等問(wèn)題,提出一種含有自適應通道注意力機制的輕量級Mask R-CNN網(wǎng)絡(luò ),由于原有的語(yǔ)義分割網(wǎng)絡(luò )里的殘差網(wǎng)絡(luò )復雜,且應用環(huán)境在室內,環(huán)境較為簡(jiǎn)單,故該輕量級網(wǎng)絡(luò )將原有復雜的主干網(wǎng)絡(luò )中的ResNet-50利用深度可分離卷積與分組卷積改進(jìn)為更加輕量的ResNet-DS-tiny(ResNet with depthwise separable convolutions),并加入自適應通道注意力機制。在自適應通道注意力模塊中,利用加權方式對輸入的RGB-D圖像從空間和通道賦予不同的權重,增強了特征的表達能力。此外,為了輕量化特征金字塔,使用使用不同空洞率的空洞卷積來(lái)提取不同大小感受野的特征信息,有效地獲取了多尺度的特征。相較于傳統的特征金字塔,空洞卷積減少了參數量。在更充分獲取 RGB 信息特征的同時(shí),提升了語(yǔ)義分割系統的實(shí)時(shí)性并減少了資源占用。

    Abstract:

    To address the issues of slow semantic segmentation speed, low real-time performance, and high resource consumption in semantic SLAM (simultaneous localization and mapping), a lightweight Mask R-CNN network with an adaptive channel attention mechanism is proposed. Given the complexity of the residual networks in existing semantic segmentation networks and the relatively simple indoor application environments, this lightweight network replaces the original complex backbone ResNet-50 with a more lightweight ResNet-DS-tiny (ResNet with depthwise separable convolutions) by incorporating depthwise separable convolutions and grouped convolutions. An adaptive channel attention mechanism is also introduced. In the adaptive channel attention module, a weighted approach is used to assign different weights to the input RGB-D images from both spatial and channel dimensions, thereby enhancing the feature representation capability. Additionally, to lighten the feature pyramid, dilated convolutions are employed to expand the receptive field, effectively aggregating multi-scale features with different dilation rates. Compared to traditional feature pyramids, the use of dilated convolutions reduces the number of parameters. This approach not only more effectively captures RGB information features but also improves the real-time performance of the semantic segmentation system while reducing resource consumption.

    參考文獻
    相似文獻
    引證文獻
引用本文

王艷莉,連曉峰,康毛毛.基于自適應注意力機制的輕量化語(yǔ)義分割網(wǎng)絡(luò )計算機測量與控制[J].,2024,32(12):223-228.

復制
分享
文章指標
  • 點(diǎn)擊次數:
  • 下載次數:
  • HTML閱讀次數:
  • 引用次數:
歷史
  • 收稿日期:2024-06-07
  • 最后修改日期:2024-07-19
  • 錄用日期:2024-07-19
  • 在線(xiàn)發(fā)布日期: 2024-12-24
  • 出版日期:
文章二維碼
湄潭县| 英吉沙县| 马公市| 调兵山市| 鄱阳县| 永济市| 鹿泉市| 六盘水市| 安图县| 盐津县| 东辽县| 岳阳县| 安福县| 嘉黎县| 遂昌县| 凤山县| 定日县| 南岸区| 渭南市| 繁峙县| 乌鲁木齐县| 武陟县| 新和县| 贵定县| 库尔勒市| 綦江县| 南乐县| 富阳市| 孝感市| 布尔津县| 龙南县| 河北区| 革吉县| 延长县| 霸州市| 蓬溪县| 洛宁县| 枣强县| 溧阳市| 慈溪市| 万山特区|