期刊文献+
共找到281篇文章
< 1 2 15 >
每页显示 20 50 100
Coal/Gangue Volume Estimation with Convolutional Neural Network and Separation Based on Predicted Volume and Weight
1
作者 Zenglun Guan Murad S.Alfarzaeai +2 位作者 Eryi Hu Taqiaden Alshmeri Wang Peng 《Computers, Materials & Continua》 SCIE EI 2024年第4期279-306,共28页
In the coal mining industry,the gangue separation phase imposes a key challenge due to the high visual similaritybetween coal and gangue.Recently,separation methods have become more intelligent and efficient,using new... In the coal mining industry,the gangue separation phase imposes a key challenge due to the high visual similaritybetween coal and gangue.Recently,separation methods have become more intelligent and efficient,using newtechnologies and applying different features for recognition.One such method exploits the difference in substancedensity,leading to excellent coal/gangue recognition.Therefore,this study uses density differences to distinguishcoal from gangue by performing volume prediction on the samples.Our training samples maintain a record of3-side images as input,volume,and weight as the ground truth for the classification.The prediction process relieson a Convolutional neural network(CGVP-CNN)model that receives an input of a 3-side image and then extractsthe needed features to estimate an approximation for the volume.The classification was comparatively performedvia ten different classifiers,namely,K-Nearest Neighbors(KNN),Linear Support Vector Machines(Linear SVM),Radial Basis Function(RBF)SVM,Gaussian Process,Decision Tree,Random Forest,Multi-Layer Perceptron(MLP),Adaptive Boosting(AdaBosst),Naive Bayes,and Quadratic Discriminant Analysis(QDA).After severalexperiments on testing and training data,results yield a classification accuracy of 100%,92%,95%,96%,100%,100%,100%,96%,81%,and 92%,respectively.The test reveals the best timing with KNN,which maintained anaccuracy level of 100%.Assessing themodel generalization capability to newdata is essential to ensure the efficiencyof the model,so by applying a cross-validation experiment,the model generalization was measured.The useddataset was isolated based on the volume values to ensure the model generalization not only on new images of thesame volume but with a volume outside the trained range.Then,the predicted volume values were passed to theclassifiers group,where classification reported accuracy was found to be(100%,100%,100%,98%,88%,87%,100%,87%,97%,100%),respectively.Although obtaining a classification with high accuracy is the main motive,this workhas a remarkable reduction in the data preprocessing time compared to related works.The CGVP-CNN modelmanaged to reduce the data preprocessing time of previous works to 0.017 s while maintaining high classificationaccuracy using the estimated volume value. 展开更多
关键词 COAL coal gangue convolutional neural network CNN object classification volume estimation separation system
在线阅读 下载PDF
A Lightweight Convolutional Neural Network with Hierarchical Multi-Scale Feature Fusion for Image Classification 被引量:1
2
作者 Adama Dembele Ronald Waweru Mwangi Ananda Omutokoh Kube 《Journal of Computer and Communications》 2024年第2期173-200,共28页
Convolutional neural networks (CNNs) are widely used in image classification tasks, but their increasing model size and computation make them challenging to implement on embedded systems with constrained hardware reso... Convolutional neural networks (CNNs) are widely used in image classification tasks, but their increasing model size and computation make them challenging to implement on embedded systems with constrained hardware resources. To address this issue, the MobileNetV1 network was developed, which employs depthwise convolution to reduce network complexity. MobileNetV1 employs a stride of 2 in several convolutional layers to decrease the spatial resolution of feature maps, thereby lowering computational costs. However, this stride setting can lead to a loss of spatial information, particularly affecting the detection and representation of smaller objects or finer details in images. To maintain the trade-off between complexity and model performance, a lightweight convolutional neural network with hierarchical multi-scale feature fusion based on the MobileNetV1 network is proposed. The network consists of two main subnetworks. The first subnetwork uses a depthwise dilated separable convolution (DDSC) layer to learn imaging features with fewer parameters, which results in a lightweight and computationally inexpensive network. Furthermore, depthwise dilated convolution in DDSC layer effectively expands the field of view of filters, allowing them to incorporate a larger context. The second subnetwork is a hierarchical multi-scale feature fusion (HMFF) module that uses parallel multi-resolution branches architecture to process the input feature map in order to extract the multi-scale feature information of the input image. Experimental results on the CIFAR-10, Malaria, and KvasirV1 datasets demonstrate that the proposed method is efficient, reducing the network parameters and computational cost by 65.02% and 39.78%, respectively, while maintaining the network performance compared to the MobileNetV1 baseline. 展开更多
关键词 MobileNet Image Classification Lightweight convolutional neural network depthwise Dilated separable convolution Hierarchical Multi-Scale Feature Fusion
在线阅读 下载PDF
MSSTNet:Multi-scale facial videos pulse extraction network based on separable spatiotemporal convolution and dimension separable attention
3
作者 Changchen ZHAO Hongsheng WANG Yuanjing FENG 《Virtual Reality & Intelligent Hardware》 2023年第2期124-141,共18页
Background The use of remote photoplethysmography(rPPG)to estimate blood volume pulse in a noncontact manner has been an active research topic in recent years.Existing methods are primarily based on a singlescale regi... Background The use of remote photoplethysmography(rPPG)to estimate blood volume pulse in a noncontact manner has been an active research topic in recent years.Existing methods are primarily based on a singlescale region of interest(ROI).However,some noise signals that are not easily separated in a single-scale space can be easily separated in a multi-scale space.Also,existing spatiotemporal networks mainly focus on local spatiotemporal information and do not emphasize temporal information,which is crucial in pulse extraction problems,resulting in insufficient spatiotemporal feature modelling.Methods Here,we propose a multi-scale facial video pulse extraction network based on separable spatiotemporal convolution(SSTC)and dimension separable attention(DSAT).First,to solve the problem of a single-scale ROI,we constructed a multi-scale feature space for initial signal separation.Second,SSTC and DSAT were designed for efficient spatiotemporal correlation modeling,which increased the information interaction between the long-span time and space dimensions;this placed more emphasis on temporal features.Results The signal-to-noise ratio(SNR)of the proposed network reached 9.58dB on the PURE dataset and 6.77dB on the UBFC-rPPG dataset,outperforming state-of-the-art algorithms.Conclusions The results showed that fusing multi-scale signals yielded better results than methods based on only single-scale signals.The proposed SSTC and dimension-separable attention mechanism will contribute to more accurate pulse signal extraction. 展开更多
关键词 Remote photoplethysmography Heart rate separable spatiotemporal convolution Dimension separable attention MULTI-SCALE neural network
在线阅读 下载PDF
Automatic modulation recognition of radiation source signals based on two-dimensional data matrix and improved residual neural network
4
作者 Guanghua Yi Xinhong Hao +3 位作者 Xiaopeng Yan Jian Dai Yangtian Liu Yanwen Han 《Defence Technology(防务技术)》 SCIE EI CAS CSCD 2024年第3期364-373,共10页
Automatic modulation recognition(AMR)of radiation source signals is a research focus in the field of cognitive radio.However,the AMR of radiation source signals at low SNRs still faces a great challenge.Therefore,the ... Automatic modulation recognition(AMR)of radiation source signals is a research focus in the field of cognitive radio.However,the AMR of radiation source signals at low SNRs still faces a great challenge.Therefore,the AMR method of radiation source signals based on two-dimensional data matrix and improved residual neural network is proposed in this paper.First,the time series of the radiation source signals are reconstructed into two-dimensional data matrix,which greatly simplifies the signal preprocessing process.Second,the depthwise convolution and large-size convolutional kernels based residual neural network(DLRNet)is proposed to improve the feature extraction capability of the AMR model.Finally,the model performs feature extraction and classification on the two-dimensional data matrix to obtain the recognition vector that represents the signal modulation type.Theoretical analysis and simulation results show that the AMR method based on two-dimensional data matrix and improved residual network can significantly improve the accuracy of the AMR method.The recognition accuracy of the proposed method maintains a high level greater than 90% even at -14 dB SNR. 展开更多
关键词 Automatic modulation recognition Radiation source signals Two-dimensional data matrix Residual neural network depthwise convolution
在线阅读 下载PDF
Probability-Based Channel Pruning for Depthwise Separable Convolutional Networks 被引量:1
5
作者 Han-Li Zhao Kai-Jie Shi +4 位作者 Xiao-Gang Jin Ming-Liang Xu Hui Huang Wang-Long Lu Ying Liu 《Journal of Computer Science & Technology》 SCIE EI CSCD 2022年第3期584-600,共17页
Channel pruning can reduce memory consumption and running time with least performance damage,and is one of the most important techniques in network compression.However,existing channel pruning methods mainly focus on ... Channel pruning can reduce memory consumption and running time with least performance damage,and is one of the most important techniques in network compression.However,existing channel pruning methods mainly focus on the pruning of standard convolutional networks,and they rely intensively on time-consuming fine-tuning to achieve the performance improvement.To this end,we present a novel efficient probability-based channel pruning method for depthwise separable convolutional networks.Our method leverages a new simple yet effective probability-based channel pruning criterion by taking the scaling and shifting factors of batch normalization layers into consideration.A novel shifting factor fusion technique is further developed to improve the performance of the pruned networks without requiring extra time-consuming fine-tuning.We apply the proposed method to five representative deep learning networks,namely MobileNetV1,MobileNetV2,ShuffleNetV1,ShuffleNetV2,and GhostNet,to demonstrate the efficiency of our pruning method.Extensive experimental results and comparisons on publicly available CIFAR10,CIFAR100,and ImageNet datasets validate the feasibility of the proposed method. 展开更多
关键词 network compression channel pruning depthwise separable convolution batch normalization
原文传递
Lightweight and highly robust memristor-based hybrid neural networks for electroencephalogram signal processing
6
作者 童霈文 徐晖 +5 位作者 孙毅 汪泳州 彭杰 廖岑 王伟 李清江 《Chinese Physics B》 SCIE EI CAS CSCD 2023年第7期582-590,共9页
Memristor-based neuromorphic computing shows great potential for high-speed and high-throughput signal processing applications,such as electroencephalogram(EEG)signal processing.Nonetheless,the size of one-transistor ... Memristor-based neuromorphic computing shows great potential for high-speed and high-throughput signal processing applications,such as electroencephalogram(EEG)signal processing.Nonetheless,the size of one-transistor one-resistor(1T1R)memristor arrays is limited by the non-ideality of the devices,which prevents the hardware implementation of large and complex networks.In this work,we propose the depthwise separable convolution and bidirectional gate recurrent unit(DSC-BiGRU)network,a lightweight and highly robust hybrid neural network based on 1T1R arrays that enables efficient processing of EEG signals in the temporal,frequency and spatial domains by hybridizing DSC and BiGRU blocks.The network size is reduced and the network robustness is improved while ensuring the network classification accuracy.In the simulation,the measured non-idealities of the 1T1R array are brought into the network through statistical analysis.Compared with traditional convolutional networks,the network parameters are reduced by 95%and the network classification accuracy is improved by 21%at a 95%array yield rate and 5%tolerable error.This work demonstrates that lightweight and highly robust networks based on memristor arrays hold great promise for applications that rely on low consumption and high efficiency. 展开更多
关键词 MEMRISTOR LIGHTWEIGHT ROBUST hybrid neural networks depthwise separable convolution bidirectional gate recurrent unit(BiGRU) one-transistor one-resistor(1T1R)arrays
在线阅读 下载PDF
A Framework of Lightweight Deep Cross-Connected Convolution Kernel Mapping Support Vector Machines
7
作者 Qi Wang Zhaoying Liu +3 位作者 Ting Zhang Shanshan Tu Yujian Li Muhammad Waqas 《Journal on Artificial Intelligence》 2022年第1期37-48,共12页
Deep kernel mapping support vector machines have achieved good results in numerous tasks by mapping features from a low-dimensional space to a high-dimensional space and then using support vector machines for classifi... Deep kernel mapping support vector machines have achieved good results in numerous tasks by mapping features from a low-dimensional space to a high-dimensional space and then using support vector machines for classification.However,the depth kernel mapping support vector machine does not take into account the connection of different dimensional spaces and increases the model parameters.To further improve the recognition capability of deep kernel mapping support vector machines while reducing the number of model parameters,this paper proposes a framework of Lightweight Deep Convolutional Cross-Connected Kernel Mapping Support Vector Machines(LC-CKMSVM).The framework consists of a feature extraction module and a classification module.The feature extraction module first maps the data from low-dimensional to high-dimensional space by fusing the representations of different dimensional spaces through cross-connections;then,it uses depthwise separable convolution to replace part of the original convolution to reduce the number of parameters in the module;The classification module uses a soft margin support vector machine for classification.The results on 6 different visual datasets show that LC-CKMSVM obtains better classification accuracies on most cases than the other five models. 展开更多
关键词 convolutional neural network cross-connected lightweight framework depthwise separable convolution
在线阅读 下载PDF
基于GADF和CWT并行输入模型的滚动轴承智能诊断研究
8
作者 张小丽 和飞翔 +2 位作者 梁旺 李敏 王保建 《湖南大学学报(自然科学版)》 北大核心 2025年第2期98-108,共11页
滚动轴承运行工况的变化与噪声干扰等随机不确定性因素会导致网络特征提取不完整,从而无法捕捉故障突变等局部奇异信息.针对上述问题,提出一种并行二维深度可分离残差神经网络(parallel two-dimensional depthwise separable residual n... 滚动轴承运行工况的变化与噪声干扰等随机不确定性因素会导致网络特征提取不完整,从而无法捕捉故障突变等局部奇异信息.针对上述问题,提出一种并行二维深度可分离残差神经网络(parallel two-dimensional depthwise separable residual neural network,P2DDSResNet)模型,通过格拉姆角分场(Gramian angular difference field,GADF)和连续小波变换(continuous wavelet transform,CWT)将振动信号转变为二维时频图像,保留了完整的时频域信息.采用深度可分离卷积替代残差模块中的普通卷积,增强特征学习能力,从而使模型具有更强的特征提取能力,以解决在高噪声和变工况环境中故障诊断效果不佳的问题.采用滚动轴承故障模拟试验台获取的数据对其进行试验分析并与其他卷积神经网络方法对比,结果表明,优化后的算法模型具有良好的泛化性和准确率. 展开更多
关键词 故障诊断 深度可分离卷积 滚动轴承 残差神经网络 特征提取
在线阅读 下载PDF
基于自注意力机制的高分遥感影像语义分割
9
作者 杨军 张金影 康玥 《哈尔滨工程大学学报》 北大核心 2025年第2期344-354,共11页
针对遥感影像多尺度特征提取困难、上下文信息利用不足的问题,本文结合自注意力机制和深度可分离卷积提出一种线性多头自注意力网络模型,适用于高分辨率遥感影像语义分割。在自注意力模块之前引入深度可分离卷积,减少计算量的同时有助... 针对遥感影像多尺度特征提取困难、上下文信息利用不足的问题,本文结合自注意力机制和深度可分离卷积提出一种线性多头自注意力网络模型,适用于高分辨率遥感影像语义分割。在自注意力模块之前引入深度可分离卷积,减少计算量的同时有助于捕获局部特征;在编码器分支中提出线性的多头自注意力模块以降低模型的计算复杂度;设计一个解码器来恢复特征图分辨率,通过级联操作整合各层级的特征并生成高分辨率的语义分割结果。所提算法在ISPRS Vaihingen和Potsdam数据集上的分割结果的mF1分别达到了90.77%和92.36%,与目前主流算法相比,不透水表面、建筑、低矮植物、树木类的分割准确率及总体分割准确率均有提高。本文算法构建的线性多头自注意力网络是一种高效的高分辨率遥感影像语义分割模型。 展开更多
关键词 高分辨率遥感影像 多头自注意力 深度可分离卷积 语义分割 特征提取 卷积神经网络 编码器 解码器
在线阅读 下载PDF
基于多尺度融合神经网络的同频同调制单通道盲源分离算法
10
作者 付卫红 张鑫钰 刘乃安 《系统工程与电子技术》 北大核心 2025年第2期641-649,共9页
针对单通道条件下同频同调制混合信号分离时存在的计算复杂度高、分离效果差等问题,提出一种基于时域卷积的多尺度融合递归卷积神经网络(recursive convolutional neural network, RCNN),采用编码、分离、解码结构实现单通道盲源分离。... 针对单通道条件下同频同调制混合信号分离时存在的计算复杂度高、分离效果差等问题,提出一种基于时域卷积的多尺度融合递归卷积神经网络(recursive convolutional neural network, RCNN),采用编码、分离、解码结构实现单通道盲源分离。首先,编码模块提取出混合通信信号的编码特征;然后,分离模块采用不同尺度大小的卷积块以进一步提取信号的特征信息,再利用1×1卷积块捕获信号的局部和全局信息,估计出每个源信号的掩码;最后,解码模块利用掩码与混合信号的编码特征恢复源信号波形。仿真结果表明,所提多尺度融合RCNN不仅可以分离出仅有少量参数区别的混合通信信号,而且相较于U型网络(U-Net)降低了约62%的参数量和41%的计算量,同时网络也具有较强的泛化能力,可以高效面对复杂通信环境的挑战。 展开更多
关键词 单通道盲源分离 深度学习 同频同调制信号分离 多尺度融合递归卷积神经网络 通信信号处理
在线阅读 下载PDF
面向通信设备信号异常识别的深度学习算法
11
作者 王锦毅 茆政吉 《计算机仿真》 2025年第1期215-218,228,共5页
通信设备信号可能受到多种干扰,例如电磁干扰、电源噪声等,会对信号进行扭曲和干扰,影响异常识别的准确性。现提出面向通信设备信号异常识别的深度学习算法。采用基于相似性矩阵的信号盲源分离方法将通信设备原始信号中的有用信号从背... 通信设备信号可能受到多种干扰,例如电磁干扰、电源噪声等,会对信号进行扭曲和干扰,影响异常识别的准确性。现提出面向通信设备信号异常识别的深度学习算法。采用基于相似性矩阵的信号盲源分离方法将通信设备原始信号中的有用信号从背景噪声中分离出来,完成信号的去噪处理;通过自适应噪声补偿聚合经验模态分解算法分解通信设备信号,结合综合评价指标选取有效IMF分量作为信号特征;将信号特征输入卷积神经网络中,通过深度学习信号特征实现通信设备信号异常识别。通过测试发现,所提算法可在噪声背景下有效分离出有用信号,识别精度高、识别效率高。 展开更多
关键词 通信设备信号 信号盲源分离 经验模态分解 卷积神经网络 深度学习
在线阅读 下载PDF
基于2D-3D卷积神经网络的情绪识别模型
12
作者 杨朋辉 杨长青 +1 位作者 刘静 崔冬 《燕山大学学报》 北大核心 2025年第1期66-73,共8页
基于脑电信号的情绪识别是人机交互的重要部分,本文将二维卷积神经网络、三维卷积神经网络、深度可分离卷积进行结合,提出一种基于2D-3D卷积神经网络(2-3DCNN)模型,从时间、空间、频率三个方面进行特征提取。在网络中引入SE-ResNet网络... 基于脑电信号的情绪识别是人机交互的重要部分,本文将二维卷积神经网络、三维卷积神经网络、深度可分离卷积进行结合,提出一种基于2D-3D卷积神经网络(2-3DCNN)模型,从时间、空间、频率三个方面进行特征提取。在网络中引入SE-ResNet网络、深度残差收缩网络和Xception网络,挖掘脑电信号中更能显著反映情感变化的空间、时间和频率信息。本文在DEAP公共情感数据集上做性能测试,结果表明,2-3DCNN在唤醒度和效价的两个分类任务上的识别准确率分别达到了97.59%和97.21%,比目前最先进的模型分别高出2.36%和1.34%。 展开更多
关键词 情绪识别 脑电信号 卷积神经网络 深度残差收缩网络 深度可分离卷积
在线阅读 下载PDF
基于深度学习的时空特征融合网络入侵检测模型研究
13
作者 李聪聪 袁子龙 滕桂法 《信息安全研究》 北大核心 2025年第2期122-129,共8页
随着网络攻击日益增多,网络入侵检测系统在维护网络安全方面也越来越重要.目前多数研究采用深度学习的方法进行网络入侵检测,但未充分从多个角度利用流量的特征,同时存在实验数据集过于陈旧的问题.提出了一种并行结构的DSC-Inception-Bi... 随着网络攻击日益增多,网络入侵检测系统在维护网络安全方面也越来越重要.目前多数研究采用深度学习的方法进行网络入侵检测,但未充分从多个角度利用流量的特征,同时存在实验数据集过于陈旧的问题.提出了一种并行结构的DSC-Inception-BiLSTM网络,使用最新的数据集评估所设计的网络模型.该模型包括网络流量图像和文本异常流量检测2个分支,分别通过改进的卷积神经网络和循环神经网络提取流量的空间特征和时序特征.最后通过融合时空特征实现网络入侵检测.实验结果表明,在CIC-IDS2017,CSE-CIC-IDS2018,CIC-DDoS2019这3个数据集上,该模型分别达到了99.96%,99.19%,99.95%的准确率,能够对异常流量进行高精度分类,满足入侵检测系统的要求. 展开更多
关键词 网络入侵检测 深度学习 特征融合 深度可分离卷积 INCEPTION
在线阅读 下载PDF
基于三维深度分离网络的PET双示踪剂混合图像分离方法
14
作者 唐大洋 胡德斌 +8 位作者 齐宏亮 孙浩 韩彦江 李翰威 张新明 潘智林 喻文杰 路利军 陈宏文 《中国医学物理学杂志》 2025年第2期160-166,共7页
目的:提出一种基于三维深度分离网络方法用于^(18)F-FDG和^(18)F-FAPIPET双示踪剂混合图像分离成像。方法:收集120例同一患者在不同时间单独扫描的^(18)F-FDG和^(18)F-FAPIPET图像,本研究采用模拟的形式生成PET双示踪剂混合图像,首先对... 目的:提出一种基于三维深度分离网络方法用于^(18)F-FDG和^(18)F-FAPIPET双示踪剂混合图像分离成像。方法:收集120例同一患者在不同时间单独扫描的^(18)F-FDG和^(18)F-FAPIPET图像,本研究采用模拟的形式生成PET双示踪剂混合图像,首先对同一患者两种PET示踪剂图像进行配准保证空间位置匹配,然后对配准的PET图像进行前向投影生成弦图数据,将两种弦图数据累加得到混合弦图数据,随后采用最大似然期望法重建得到PET双示踪剂混合图像,输入到基于3DDSN架构的网络进行分离成像,从而得到两种单示踪剂的PET图像。结果:本文提出的方法相较于3DCNN方法,分离得到的^(18)F-FDG图像与真实^(18)F-FDG图像的结构相似性指数(SSIM)提升0.87%,峰值信噪比(PSNR)提升11.8%,归一化均方根误差(NRMSE)减小52%。分离得到的^(18)F-FAPI图像与真实^(18)F-FAPI图像的SSIM提升1.1%,PSNR提升17.0%,NRMSE减小51%。结论:本文方法可以很好地应用在PET双示踪剂同时成像上,减少患者的扫描次数、时间和金钱成本,为临床医生提供更精准和更丰富的诊断信息。 展开更多
关键词 正电子发射断层成像 双示踪剂成像 图像配准 深度分离网络 深度学习
在线阅读 下载PDF
不平衡数据下面向包粒度应用层负载的轻量化入侵检测模型
15
作者 杨毅铭 陈世平 《小型微型计算机系统》 北大核心 2025年第2期465-473,共9页
网络入侵检测是一种重要的网络安全方案.目前网络入侵检测模型都有较高精确度,但是模型复杂,参数量和计算量较大.针对该问题,设计了一种新的基于包粒度应用层负载的网络入侵检测一维卷积轻量模型.本文首先对UNSWNB15数据集的原始流量文... 网络入侵检测是一种重要的网络安全方案.目前网络入侵检测模型都有较高精确度,但是模型复杂,参数量和计算量较大.针对该问题,设计了一种新的基于包粒度应用层负载的网络入侵检测一维卷积轻量模型.本文首先对UNSWNB15数据集的原始流量文件进行包粒度应用层负载数据提取,构造一维灰度特征向量.在此基础上,本文提出一种由新的一维深度可分离卷积残差模块组成,融入了全局上下文注意力机制(Global Context Attention Module)的一维卷积轻量模型Fast Payload,并进行了针对性的模型优化和可行性论证.Fast Payload模型在UNSWNB15数据集上的9分类任务中宏平均准确率达到82.433%,加权平均精确率达到90.820%,均高于对比模型;同时,该模型计算量和参数量均低于对比模型.其次本文提出了二阶段类别平衡损失函数GHM2StageLoss,有效解决了数据集的类别不平衡问题,相比其他类别平衡损失函数,效果更好.为方便后续研究的复现,本研究开源部分源代码,网址为https://github.com/sadantange/FastPayload. 展开更多
关键词 入侵检测 一维卷积神经网络 深度可分离卷积 全局上下文注意力机制 类别平衡
在线阅读 下载PDF
基于解耦注意力与幻影卷积的轻量级人体姿态估计
16
作者 陈俊颖 郭士杰 陈玲玲 《计算机应用》 北大核心 2025年第1期223-233,共11页
随着轻量级网络的发展,人体姿态估计任务得以在计算资源有限的设备上执行,然而,提升精度变得更具有挑战性。这些挑战主要源于网络复杂度与计算资源的矛盾,导致模型在简化时牺牲了表示能力。针对上述问题,提出一种基于解耦注意力和幻影... 随着轻量级网络的发展,人体姿态估计任务得以在计算资源有限的设备上执行,然而,提升精度变得更具有挑战性。这些挑战主要源于网络复杂度与计算资源的矛盾,导致模型在简化时牺牲了表示能力。针对上述问题,提出一种基于解耦注意力和幻影卷积的轻量级人体姿态估计网络(DGLNet)。具体来说,DGLNet以小型高分辨率网络(Small HRNet)模型为基础架构,通过引入解耦注意力机制构建DFDbottleneck模块;采用shuffleblock的结构对基础模块进行重新设计,即用轻量级幻影卷积替代计算量大的点卷积,并利用解耦注意力机制增强模块性能,从而构建DGBblock模块;此外,用幻影卷积和解耦注意力重新构建的深度可分离卷积模块来替代原过渡层模块,从而构建GSCtransition模块,进一步减少计算量并增强特征交互性和提高性能。在COCO验证集上的实验结果显示,DGLNet优于轻量级高分辨率网络(Lite-HRNet),在计算量和参数量不增加的情况下,最高精度达到了71.9%;与常见的轻量级姿态估计网络MobileNetV2和ShuffleNetV2相比,DGLNet在仅使用21.2%和25.0%的计算量情况下分别实现了4.6和8.3个百分点的精度提升;在AP^(50)的评价标准上,DGLNet超过了大型高分辨率网络(HRNet)的同时计算量和参数量远小于HRNet。 展开更多
关键词 人体姿态估计 轻量级网络 注意力机制 幻影卷积 深度可分离卷积模块
在线阅读 下载PDF
融合多尺度注意力神经网络的港口起重装备故障时序数据预测方法
17
作者 雷鹏 谢敬玲 +4 位作者 许洪祖 焦锋 魏立明 张忠岩 吕成兴 《机电工程》 北大核心 2025年第2期277-286,共10页
近年来,深度神经网络在轴承时序预测领域得到了广泛应用。为了进一步提升港口起重装备滚动轴承时序模型预测的准确度,以青岛港门机为例对港口起重装备关键部位的滚动轴承时序预测进行了建模,提出了一种融合改进变分模态分解的多尺度注... 近年来,深度神经网络在轴承时序预测领域得到了广泛应用。为了进一步提升港口起重装备滚动轴承时序模型预测的准确度,以青岛港门机为例对港口起重装备关键部位的滚动轴承时序预测进行了建模,提出了一种融合改进变分模态分解的多尺度注意力机制港口装备故障时序数据预测方法。首先,采用了融合非线性策略与混沌映射的改进灰狼优化算法(IGWO),自适应地确定了变分模态分解(VMD)的模态数与惩罚因子;然后,将变分模态分解得到的本征模态函数进一步作为融合多尺度注意力神经网络(FMANN)模型的时序输入,进行了多尺度通道特征融合;最后,对各个本征模态函数的预测结果进行了融合,得到了最终预测结果。研究结果表明:FMANN模型在回转机构数据集上的均方根误差(RMSE)为0.001 12,平均绝对百分比误差(MAPE)为6.396 3%,决定系数为0.999 8;相比于其他预测模型,FMANN预测效果更加拟合实际数据。FMANN模型能够准确地预测设备轴承的时序振动,有望为未来实际工业生产提供一条新思路。 展开更多
关键词 滚动轴承 故障诊断 变分模态分解 注意力机制 灰狼优化算法 融合多尺度注意力神经网络 深度可分离卷积
在线阅读 下载PDF
Microphone Array Speech Separation Algorithm Based on TC-ResNet
18
作者 Lin Zhou Yue Xu +2 位作者 Tianyi Wang Kun Feng Jingang Shi 《Computers, Materials & Continua》 SCIE EI 2021年第11期2705-2716,共12页
Traditional separation methods have limited ability to handle the speech separation problem in high reverberant and low signal-to-noise ratio(SNR)environments,and thus achieve unsatisfactory results.In this study,a co... Traditional separation methods have limited ability to handle the speech separation problem in high reverberant and low signal-to-noise ratio(SNR)environments,and thus achieve unsatisfactory results.In this study,a convolutional neural network with temporal convolution and residual network(TC-ResNet)is proposed to realize speech separation in a complex acoustic environment.A simplified steered-response power phase transform,denoted as GSRP-PHAT,is employed to reduce the computational cost.The extracted features are reshaped to a special tensor as the system inputs and implements temporal convolution,which not only enlarges the receptive field of the convolution layer but also significantly reduces the network computational cost.Residual blocks are used to combine multiresolution features and accelerate the training procedure.A modified ideal ratio mask is applied as the training target.Simulation results demonstrate that the proposed microphone array speech separation algorithm based on TC-ResNet achieves a better performance in terms of distortion ratio,source-to-interference ratio,and short-time objective intelligibility in low SNR and high reverberant environments,particularly in untrained situations.This indicates that the proposed method has generalization to untrained conditions. 展开更多
关键词 Residual networks temporal convolution neural networks speech separation
在线阅读 下载PDF
煤矿工业物联网设备识别模型 被引量:1
19
作者 郝秦霞 李慧敏 《工矿自动化》 CSCD 北大核心 2024年第3期99-107,共9页
煤矿工业物联网(IIoT)设备计算与存储资源受限,易遭受非法网络入侵,造成敏感数据泄露或恶意篡改,威胁煤矿生产安全。精准识别煤矿IIoT设备可实现有效管理并维护设备正常运转,提高设备安全防护能力,然而现有设备识别算法存在特征构造复... 煤矿工业物联网(IIoT)设备计算与存储资源受限,易遭受非法网络入侵,造成敏感数据泄露或恶意篡改,威胁煤矿生产安全。精准识别煤矿IIoT设备可实现有效管理并维护设备正常运转,提高设备安全防护能力,然而现有设备识别算法存在特征构造复杂、内存与计算需求较高导致难以部署在资源受限的煤矿IIoT设备中等问题。针对上述问题,提出了一种煤矿IIoT设备识别模型。首先,对支持TCP/IP协议传输的流量数据进行流量切分、无关字段去除、去重、定长字段截取操作后转换为IDX格式存储;其次,使用卷积块注意力模块(CBAM)优化深度可分离卷积(DSC),从而搭建轻量级DSC−CBAM模型来过滤Non−IIoT设备;然后,利用带有阶段惩罚的Wasserstein生成对抗网络(WGAN−GP)扩充流量较少的煤矿IIoT设备数据,达到平衡偏移流量数据的目的;最后,在DSC−CBAM基础上引入多尺度特征融合(MFF)技术捕获浅层全局特征信息,并增加Mish激活函数提高模型训练稳定性,建立优化混合模态识别(MDCM)模型,实现煤矿IIoT设备精准识别。实验结果表明,该模型收敛速度快,准确率、召回率、精确率与F1−score指标均高达99.98%,且参数量小,能精准、高效识别煤矿IIoT设备。 展开更多
关键词 煤矿工业物联网 设备识别 深度可分离卷积 注意力机制 生成对抗网络
在线阅读 下载PDF
基于可分离卷积与小波变换融合的道路裂缝检测
20
作者 刘云清 吴越 +2 位作者 张琼 颜飞 陈姗姗 《计算机科学》 CSCD 北大核心 2024年第S02期304-312,共9页
针对目前对细小裂缝检测能力不强、分割精度低等问题,提出了一种改进的U-Net模型来检测路面裂缝,提高检测能力和分割精度。中文设计了新的模块MSDWBlock(Multi-Scale Depthwise Separable Convolutional Block),应用在编码器和解码器部... 针对目前对细小裂缝检测能力不强、分割精度低等问题,提出了一种改进的U-Net模型来检测路面裂缝,提高检测能力和分割精度。中文设计了新的模块MSDWBlock(Multi-Scale Depthwise Separable Convolutional Block),应用在编码器和解码器部分,通过深度可分离卷积增强模型的能力,扩大模型感受野,在跳跃连接部分引入了C2G注意力机制模块,提升模型对裂缝特征的感知能力;并引入了ASPP(Atrous Spatial Pyramid Pooling)和DWT(Discrete Wavelet Transformation)。ASPP通过在多个尺度上进行操作,有助于捕捉到裂缝的特征,而DWT能够减少卷积池化过程中的裂缝空间信息损失,保留裂缝边缘信息。这种结构设计使得网络更专注于裂缝的特征,从而提升了裂缝检测的准确性。通过实验证明所提模型显示出优于U-Net,Segnet,U2net等先进模型的精确性。在CFD数据集上mIoU,F1分别达到78.51%,0.868。这些成果表明,所提方法能有效提升道路裂缝检测的性能。 展开更多
关键词 裂缝检测 U-Net神经网络 深度可分离卷积 注意力机制 空间金字塔 小波变换
在线阅读 下载PDF
上一页 1 2 15 下一页 到第
使用帮助 返回顶部