版权说明 操作指南
首页 > 成果 > 详情

Research on Scattering Transform of Urban Sound Events Detection Based on Self-Attention Mechanism

认领
导出
Link by DOI
反馈
分享
QQ微信 微博
成果类型:
期刊论文
作者:
Song, Shen;Zhang, Cong;Wei, Zhihui
通讯作者:
Zhang, C.
作者机构:
[Song, Shen; Wei, Zhihui] Wuhan Polytech Univ, Sch Math & Comp, Wuhan 430048, Peoples R China.
[Zhang, Cong] Wuhan Polytech Univ, Sch Elect & Elect Engn, Wuhan 430048, Peoples R China.
通讯机构:
[Zhang, C.] W
Wuhan Polytechnic University, China
语种:
英文
关键词:
feature granularity consistency;focal loss;Preload information;scattering transform;self-attention mechanism
期刊:
IEEE ACCESS
ISSN:
2169-3536
年:
2022
卷:
10
页码:
120804-120822
基金类别:
National Natural Science Foundation of China [61272278]; Natural Science Foundation of Hubei Province [2015CFA061, 2020CFB761, 2018CFB408]; Hubei Provincial Major Science and Technology Special Projects [2018ABA099]; Hubei Provincial Department of Education Research [D 20201601]; National Natural Science Foundation of China [61272278]; Natural Science Foundation of Hubei Province [2015CFA061, 2020CFB761, 2018CFB408]; Hubei Provincial Major Science and Technology Special Projects [2018ABA099]; Hubei Provincial Department of Education Research [D 20201601]
机构署名:
本校为第一机构
院系归属:
电气与电子工程学院
数学与计算机学院
摘要:
Urban sound event detection can automatically preload relevant information for a robot to ensure that it can be applied to various scene-activity tasks. To address the limitations of timbre similarity and scene recognition by audio collection devices, a fusion model based on the self-attention mechanism is proposed in this paper. The model consists of scattering transform and self-attention model. The scattering transform computes modulation spectrum coefficients of multiple orders through cascades of wavelet convolutions and modulus operators. It is learnable compared with Mel-scale Frequency...

反馈

验证码:
看不清楚,换一个
确定
取消

成果认领

标题:
用户 作者 通讯作者
请选择
请选择
确定
取消

提示

该栏目需要登录且有访问权限才可以访问

如果您有访问权限,请直接 登录访问

如果您没有访问权限,请联系管理员申请开通

管理员联系邮箱:yun@hnwdkj.com