期刊文献+
共找到3,751篇文章
< 1 2 188 >
每页显示 20 50 100
Experiments on image data augmentation techniques for geological rock type classification with convolutional neural networks
1
作者 Afshin Tatar Manouchehr Haghighi Abbas Zeinijahromi 《Journal of Rock Mechanics and Geotechnical Engineering》 2025年第1期106-125,共20页
The integration of image analysis through deep learning(DL)into rock classification represents a significant leap forward in geological research.While traditional methods remain invaluable for their expertise and hist... The integration of image analysis through deep learning(DL)into rock classification represents a significant leap forward in geological research.While traditional methods remain invaluable for their expertise and historical context,DL offers a powerful complement by enhancing the speed,objectivity,and precision of the classification process.This research explores the significance of image data augmentation techniques in optimizing the performance of convolutional neural networks(CNNs)for geological image analysis,particularly in the classification of igneous,metamorphic,and sedimentary rock types from rock thin section(RTS)images.This study primarily focuses on classic image augmentation techniques and evaluates their impact on model accuracy and precision.Results demonstrate that augmentation techniques like Equalize significantly enhance the model's classification capabilities,achieving an F1-Score of 0.9869 for igneous rocks,0.9884 for metamorphic rocks,and 0.9929 for sedimentary rocks,representing improvements compared to the baseline original results.Moreover,the weighted average F1-Score across all classes and techniques is 0.9886,indicating an enhancement.Conversely,methods like Distort lead to decreased accuracy and F1-Score,with an F1-Score of 0.949 for igneous rocks,0.954 for metamorphic rocks,and 0.9416 for sedimentary rocks,exacerbating the performance compared to the baseline.The study underscores the practicality of image data augmentation in geological image classification and advocates for the adoption of DL methods in this domain for automation and improved results.The findings of this study can benefit various fields,including remote sensing,mineral exploration,and environmental monitoring,by enhancing the accuracy of geological image analysis both for scientific research and industrial applications. 展开更多
关键词 Deep learning(DL) Image analysis Image data augmentation convolutional neural networks(cnns) Geological image analysis Rock classification Rock thin section(RTS)images
在线阅读 下载PDF
Pluggable multitask diffractive neural networks based on cascaded metasurfaces 被引量:7
2
作者 Cong He Dan Zhao +8 位作者 Fei Fan Hongqiang Zhou Xin Li Yao Li Junjie Li Fei Dong Yin-Xiao Miao Yongtian Wang Lingling Huang 《Opto-Electronic Advances》 SCIE EI CAS CSCD 2024年第2期23-31,共9页
Optical neural networks have significant advantages in terms of power consumption,parallelism,and high computing speed,which has intrigued extensive attention in both academic and engineering communities.It has been c... Optical neural networks have significant advantages in terms of power consumption,parallelism,and high computing speed,which has intrigued extensive attention in both academic and engineering communities.It has been considered as one of the powerful tools in promoting the fields of imaging processing and object recognition.However,the existing optical system architecture cannot be reconstructed to the realization of multi-functional artificial intelligence systems simultaneously.To push the development of this issue,we propose the pluggable diffractive neural networks(P-DNN),a general paradigm resorting to the cascaded metasurfaces,which can be applied to recognize various tasks by switching internal plug-ins.As the proof-of-principle,the recognition functions of six types of handwritten digits and six types of fashions are numerical simulated and experimental demonstrated at near-infrared regimes.Encouragingly,the proposed paradigm not only improves the flexibility of the optical neural networks but paves the new route for achieving high-speed,low-power and versatile artificial intelligence systems. 展开更多
关键词 optical neural networks diffractive deep neural networks cascaded metasurfaces
在线阅读 下载PDF
Big Model Strategy for Bridge Structural Health Monitoring Based on Data-Driven, Adaptive Method and Convolutional Neural Network (CNN) Group
3
作者 Yadong Xu Weixing Hong +3 位作者 Mohammad Noori Wael A.Altabey Ahmed Silik Nabeel S.D.Farhan 《Structural Durability & Health Monitoring》 EI 2024年第6期763-783,共21页
This study introduces an innovative“Big Model”strategy to enhance Bridge Structural Health Monitoring(SHM)using a Convolutional Neural Network(CNN),time-frequency analysis,and fine element analysis.Leveraging ensemb... This study introduces an innovative“Big Model”strategy to enhance Bridge Structural Health Monitoring(SHM)using a Convolutional Neural Network(CNN),time-frequency analysis,and fine element analysis.Leveraging ensemble methods,collaborative learning,and distributed computing,the approach effectively manages the complexity and scale of large-scale bridge data.The CNN employs transfer learning,fine-tuning,and continuous monitoring to optimize models for adaptive and accurate structural health assessments,focusing on extracting meaningful features through time-frequency analysis.By integrating Finite Element Analysis,time-frequency analysis,and CNNs,the strategy provides a comprehensive understanding of bridge health.Utilizing diverse sensor data,sophisticated feature extraction,and advanced CNN architecture,the model is optimized through rigorous preprocessing and hyperparameter tuning.This approach significantly enhances the ability to make accurate predictions,monitor structural health,and support proactive maintenance practices,thereby ensuring the safety and longevity of critical infrastructure. 展开更多
关键词 Structural Health Monitoring(SHM) BRIDGES big model convolutional neural network(cnn) Finite Element Method(FEM)
在线阅读 下载PDF
Development of a convolutional neural network based geomechanical upscaling technique for heterogeneous geological reservoir 被引量:1
4
作者 Zhiwei Ma Xiaoyan Ou Bo Zhang 《Journal of Rock Mechanics and Geotechnical Engineering》 SCIE CSCD 2024年第6期2111-2125,共15页
Geomechanical assessment using coupled reservoir-geomechanical simulation is becoming increasingly important for analyzing the potential geomechanical risks in subsurface geological developments.However,a robust and e... Geomechanical assessment using coupled reservoir-geomechanical simulation is becoming increasingly important for analyzing the potential geomechanical risks in subsurface geological developments.However,a robust and efficient geomechanical upscaling technique for heterogeneous geological reservoirs is lacking to advance the applications of three-dimensional(3D)reservoir-scale geomechanical simulation considering detailed geological heterogeneities.Here,we develop convolutional neural network(CNN)proxies that reproduce the anisotropic nonlinear geomechanical response caused by lithological heterogeneity,and compute upscaled geomechanical properties from CNN proxies.The CNN proxies are trained using a large dataset of randomly generated spatially correlated sand-shale realizations as inputs and simulation results of their macroscopic geomechanical response as outputs.The trained CNN models can provide the upscaled shear strength(R^(2)>0.949),stress-strain behavior(R^(2)>0.925),and volumetric strain changes(R^(2)>0.958)that highly agree with the numerical simulation results while saving over two orders of magnitude of computational time.This is a major advantage in computing the upscaled geomechanical properties directly from geological realizations without the need to perform local numerical simulations to obtain the geomechanical response.The proposed CNN proxybased upscaling technique has the ability to(1)bridge the gap between the fine-scale geocellular models considering geological uncertainties and computationally efficient geomechanical models used to assess the geomechanical risks of large-scale subsurface development,and(2)improve the efficiency of numerical upscaling techniques that rely on local numerical simulations,leading to significantly increased computational time for uncertainty quantification using numerous geological realizations. 展开更多
关键词 Upscaling Lithological heterogeneity convolutional neural network(cnn) Anisotropic shear strength Nonlinear stressestrain behavior
在线阅读 下载PDF
Integrating Bayesian and Convolution Neural Network for Uncertainty Estimation of Cataract from Fundus Images
5
作者 Anandhavalli Muniasamy Ashwag Alasmari 《Computer Modeling in Engineering & Sciences》 2025年第4期569-592,共24页
The effective and timely diagnosis and treatment of ocular diseases are key to the rapid recovery of patients.Today,the mass disease that needs attention in this context is cataracts.Although deep learning has signifi... The effective and timely diagnosis and treatment of ocular diseases are key to the rapid recovery of patients.Today,the mass disease that needs attention in this context is cataracts.Although deep learning has significantly advanced the analysis of ocular disease images,there is a need for a probabilistic model to generate the distributions of potential outcomes and thusmake decisions related to uncertainty quantification.Therefore,this study implements a Bayesian Convolutional Neural Networks(BCNN)model for predicting cataracts by assigning probability values to the predictions.It prepares convolutional neural network(CNN)and BCNN models.The proposed BCNN model is CNN-based in which reparameterization is in the first and last layers of the CNN model.This study then trains them on a dataset of cataract images filtered from the ocular disease fundus images fromKaggle.The deep CNN model has an accuracy of 95%,while the BCNN model has an accuracy of 93.75% along with information on uncertainty estimation of cataracts and normal eye conditions.When compared with other methods,the proposed work reveals that it can be a promising solution for cataract prediction with uncertainty estimation. 展开更多
关键词 Bayesian neural networks(BNNs) convolution neural networks(cnn) Bayesian convolution neural networks(Bcnns) predictive modeling precision medicine uncertainty quantification
在线阅读 下载PDF
Downscaling Seasonal Precipitation Forecasts over East Africa with Deep Convolutional Neural Networks
6
作者 Temesgen Gebremariam ASFAW Jing-Jia LUO 《Advances in Atmospheric Sciences》 SCIE CAS CSCD 2024年第3期449-464,共16页
This study assesses the suitability of convolutional neural networks(CNNs) for downscaling precipitation over East Africa in the context of seasonal forecasting. To achieve this, we design a set of experiments that co... This study assesses the suitability of convolutional neural networks(CNNs) for downscaling precipitation over East Africa in the context of seasonal forecasting. To achieve this, we design a set of experiments that compare different CNN configurations and deployed the best-performing architecture to downscale one-month lead seasonal forecasts of June–July–August–September(JJAS) precipitation from the Nanjing University of Information Science and Technology Climate Forecast System version 1.0(NUIST-CFS1.0) for 1982–2020. We also perform hyper-parameter optimization and introduce predictors over a larger area to include information about the main large-scale circulations that drive precipitation over the East Africa region, which improves the downscaling results. Finally, we validate the raw model and downscaled forecasts in terms of both deterministic and probabilistic verification metrics, as well as their ability to reproduce the observed precipitation extreme and spell indicator indices. The results show that the CNN-based downscaling consistently improves the raw model forecasts, with lower bias and more accurate representations of the observed mean and extreme precipitation spatial patterns. Besides, CNN-based downscaling yields a much more accurate forecast of extreme and spell indicators and reduces the significant relative biases exhibited by the raw model predictions. Moreover, our results show that CNN-based downscaling yields better skill scores than the raw model forecasts over most portions of East Africa. The results demonstrate the potential usefulness of CNN in downscaling seasonal precipitation predictions over East Africa,particularly in providing improved forecast products which are essential for end users. 展开更多
关键词 East Africa seasonal precipitation forecasting DOWNSCALING deep learning convolutional neural networks(cnns)
在线阅读 下载PDF
Coal/Gangue Volume Estimation with Convolutional Neural Network and Separation Based on Predicted Volume and Weight
7
作者 Zenglun Guan Murad S.Alfarzaeai +2 位作者 Eryi Hu Taqiaden Alshmeri Wang Peng 《Computers, Materials & Continua》 SCIE EI 2024年第4期279-306,共28页
In the coal mining industry,the gangue separation phase imposes a key challenge due to the high visual similaritybetween coal and gangue.Recently,separation methods have become more intelligent and efficient,using new... In the coal mining industry,the gangue separation phase imposes a key challenge due to the high visual similaritybetween coal and gangue.Recently,separation methods have become more intelligent and efficient,using newtechnologies and applying different features for recognition.One such method exploits the difference in substancedensity,leading to excellent coal/gangue recognition.Therefore,this study uses density differences to distinguishcoal from gangue by performing volume prediction on the samples.Our training samples maintain a record of3-side images as input,volume,and weight as the ground truth for the classification.The prediction process relieson a Convolutional neural network(CGVP-CNN)model that receives an input of a 3-side image and then extractsthe needed features to estimate an approximation for the volume.The classification was comparatively performedvia ten different classifiers,namely,K-Nearest Neighbors(KNN),Linear Support Vector Machines(Linear SVM),Radial Basis Function(RBF)SVM,Gaussian Process,Decision Tree,Random Forest,Multi-Layer Perceptron(MLP),Adaptive Boosting(AdaBosst),Naive Bayes,and Quadratic Discriminant Analysis(QDA).After severalexperiments on testing and training data,results yield a classification accuracy of 100%,92%,95%,96%,100%,100%,100%,96%,81%,and 92%,respectively.The test reveals the best timing with KNN,which maintained anaccuracy level of 100%.Assessing themodel generalization capability to newdata is essential to ensure the efficiencyof the model,so by applying a cross-validation experiment,the model generalization was measured.The useddataset was isolated based on the volume values to ensure the model generalization not only on new images of thesame volume but with a volume outside the trained range.Then,the predicted volume values were passed to theclassifiers group,where classification reported accuracy was found to be(100%,100%,100%,98%,88%,87%,100%,87%,97%,100%),respectively.Although obtaining a classification with high accuracy is the main motive,this workhas a remarkable reduction in the data preprocessing time compared to related works.The CGVP-CNN modelmanaged to reduce the data preprocessing time of previous works to 0.017 s while maintaining high classificationaccuracy using the estimated volume value. 展开更多
关键词 COAL coal gangue convolutional neural network cnn object classification volume estimation separation system
在线阅读 下载PDF
Prediction of constrained modulus for granular soil using 3D discrete element method and convolutional neural networks
8
作者 Tongwei Zhang Shuang Li +1 位作者 Huanzhi Yang Fanyu Zhang 《Journal of Rock Mechanics and Geotechnical Engineering》 SCIE CSCD 2024年第11期4769-4781,共13页
To efficiently predict the mechanical parameters of granular soil based on its random micro-structure,this study proposed a novel approach combining numerical simulation and machine learning algorithms.Initially,3500 ... To efficiently predict the mechanical parameters of granular soil based on its random micro-structure,this study proposed a novel approach combining numerical simulation and machine learning algorithms.Initially,3500 simulations of one-dimensional compression tests on coarse-grained sand using the three-dimensional(3D)discrete element method(DEM)were conducted to construct a database.In this process,the positions of the particles were randomly altered,and the particle assemblages changed.Interestingly,besides confirming the influence of particle size distribution parameters,the stress-strain curves differed despite an identical gradation size statistic when the particle position varied.Subsequently,the obtained data were partitioned into training,validation,and testing datasets at a 7:2:1 ratio.To convert the DEM model into a multi-dimensional matrix that computers can recognize,the 3D DEM models were first sliced to extract multi-layer two-dimensional(2D)cross-sectional data.Redundant information was then eliminated via gray processing,and the data were stacked to form a new 3D matrix representing the granular soil’s fabric.Subsequently,utilizing the Python language and Pytorch framework,a 3D convolutional neural networks(CNNs)model was developed to establish the relationship between the constrained modulus obtained from DEM simulations and the soil’s fabric.The mean squared error(MSE)function was utilized to assess the loss value during the training process.When the learning rate(LR)fell within the range of 10-5e10-1,and the batch sizes(BSs)were 4,8,16,32,and 64,the loss value stabilized after 100 training epochs in the training and validation dataset.For BS?32 and LR?10-3,the loss reached a minimum.In the testing set,a comparative evaluation of the predicted constrained modulus from the 3D CNNs versus the simulated modulus obtained via DEM reveals a minimum mean absolute percentage error(MAPE)of 4.43%under the optimized condition,demonstrating the accuracy of this approach.Thus,by combining DEM and CNNs,the variation of soil’s mechanical characteristics related to its random fabric would be efficiently evaluated by directly tracking the particle assemblages. 展开更多
关键词 Soil structure Constrained modulus Discrete element model(DEM) convolutional neural networks(cnns) Evaluation of error
在线阅读 下载PDF
Review of Artificial Intelligence for Oil and Gas Exploration: Convolutional Neural Network Approaches and the U-Net 3D Model
9
作者 Weiyan Liu 《Open Journal of Geology》 CAS 2024年第4期578-593,共16页
Deep learning, especially through convolutional neural networks (CNN) such as the U-Net 3D model, has revolutionized fault identification from seismic data, representing a significant leap over traditional methods. Ou... Deep learning, especially through convolutional neural networks (CNN) such as the U-Net 3D model, has revolutionized fault identification from seismic data, representing a significant leap over traditional methods. Our review traces the evolution of CNN, emphasizing the adaptation and capabilities of the U-Net 3D model in automating seismic fault delineation with unprecedented accuracy. We find: 1) The transition from basic neural networks to sophisticated CNN has enabled remarkable advancements in image recognition, which are directly applicable to analyzing seismic data. The U-Net 3D model, with its innovative architecture, exemplifies this progress by providing a method for detailed and accurate fault detection with reduced manual interpretation bias. 2) The U-Net 3D model has demonstrated its superiority over traditional fault identification methods in several key areas: it has enhanced interpretation accuracy, increased operational efficiency, and reduced the subjectivity of manual methods. 3) Despite these achievements, challenges such as the need for effective data preprocessing, acquisition of high-quality annotated datasets, and achieving model generalization across different geological conditions remain. Future research should therefore focus on developing more complex network architectures and innovative training strategies to refine fault identification performance further. Our findings confirm the transformative potential of deep learning, particularly CNN like the U-Net 3D model, in geosciences, advocating for its broader integration to revolutionize geological exploration and seismic analysis. 展开更多
关键词 Deep Learning convolutional neural networks (cnn) Seismic Fault Identification U-Net 3D Model Geological Exploration
在线阅读 下载PDF
Audiovisual speech recognition based on a deep convolutional neural network
10
作者 Shashidhar Rudregowda Sudarshan Patilkulkarni +2 位作者 Vinayakumar Ravi Gururaj H.L. Moez Krichen 《Data Science and Management》 2024年第1期25-34,共10页
Audiovisual speech recognition is an emerging research topic.Lipreading is the recognition of what someone is saying using visual information,primarily lip movements.In this study,we created a custom dataset for India... Audiovisual speech recognition is an emerging research topic.Lipreading is the recognition of what someone is saying using visual information,primarily lip movements.In this study,we created a custom dataset for Indian English linguistics and categorized it into three main categories:(1)audio recognition,(2)visual feature extraction,and(3)combined audio and visual recognition.Audio features were extracted using the mel-frequency cepstral coefficient,and classification was performed using a one-dimension convolutional neural network.Visual feature extraction uses Dlib and then classifies visual speech using a long short-term memory type of recurrent neural networks.Finally,integration was performed using a deep convolutional network.The audio speech of Indian English was successfully recognized with accuracies of 93.67%and 91.53%,respectively,using testing data from 200 epochs.The training accuracy for visual speech recognition using the Indian English dataset was 77.48%and the test accuracy was 76.19%using 60 epochs.After integration,the accuracies of audiovisual speech recognition using the Indian English dataset for training and testing were 94.67%and 91.75%,respectively. 展开更多
关键词 Audiovisual speech recognition Custom dataset 1D Convolution neural network(cnn) Deep cnn(Dcnn) Long short-term memory(LSTM) LIPREADING Dlib Mel-frequency cepstral coefficient(MFCC)
在线阅读 下载PDF
Convolutional Neural Network-Based Deep Q-Network (CNN-DQN) Resource Management in Cloud Radio Access Network 被引量:2
11
作者 Amjad Iqbal Mau-Luen Tham Yoong Choon Chang 《China Communications》 SCIE CSCD 2022年第10期129-142,共14页
The recent surge of mobile subscribers and user data traffic has accelerated the telecommunication sector towards the adoption of the fifth-generation (5G) mobile networks. Cloud radio access network (CRAN) is a promi... The recent surge of mobile subscribers and user data traffic has accelerated the telecommunication sector towards the adoption of the fifth-generation (5G) mobile networks. Cloud radio access network (CRAN) is a prominent framework in the 5G mobile network to meet the above requirements by deploying low-cost and intelligent multiple distributed antennas known as remote radio heads (RRHs). However, achieving the optimal resource allocation (RA) in CRAN using the traditional approach is still challenging due to the complex structure. In this paper, we introduce the convolutional neural network-based deep Q-network (CNN-DQN) to balance the energy consumption and guarantee the user quality of service (QoS) demand in downlink CRAN. We first formulate the Markov decision process (MDP) for energy efficiency (EE) and build up a 3-layer CNN to capture the environment feature as an input state space. We then use DQN to turn on/off the RRHs dynamically based on the user QoS demand and energy consumption in the CRAN. Finally, we solve the RA problem based on the user constraint and transmit power to guarantee the user QoS demand and maximize the EE with a minimum number of active RRHs. In the end, we conduct the simulation to compare our proposed scheme with nature DQN and the traditional approach. 展开更多
关键词 energy efficiency(EE) markov decision process(MDP) convolutional neural network(cnn) cloud RAN deep Q-network(DQN)
在线阅读 下载PDF
A CASCADED MODEL OF NEURAL NETWORK FOR PATTERN RECOGNITION
12
作者 张延忻 高成群 +2 位作者 黄五群 沈琴婉 陈天伦 《Journal of Electronics(China)》 1992年第4期367-375,共9页
A cascaded model of neural network and its learning algorithm suitable for opticalimplementation are proposed.Computer simulations have shown that this model may successfullybe applied to an error-tolerance pattern re... A cascaded model of neural network and its learning algorithm suitable for opticalimplementation are proposed.Computer simulations have shown that this model may successfullybe applied to an error-tolerance pattern recognitions of multiple 3-D targets with arbitrary spatialorientations. 展开更多
关键词 neural network PATTERN RECOGNITION cascaded model LEARNING algorithm Optical implementation
在线阅读 下载PDF
3D laser scanning strategy based on cascaded deep neural network
13
作者 Xiao-bin Xu Ming-hui Zhao +4 位作者 Jian Yang Yi-yang Xiong Feng-lin Pang Zhi-ying Tan Min-zhou Luo 《Defence Technology(防务技术)》 SCIE EI CAS CSCD 2022年第9期1727-1739,共13页
A 3D laser scanning strategy based on cascaded deep neural network is proposed for the scanning system converted from 2D Lidar with a pitching motion device. The strategy is aimed at moving target detection and monito... A 3D laser scanning strategy based on cascaded deep neural network is proposed for the scanning system converted from 2D Lidar with a pitching motion device. The strategy is aimed at moving target detection and monitoring. Combining the device characteristics, the strategy first proposes a cascaded deep neural network, which inputs 2D point cloud, color image and pitching angle. The outputs are target distance and speed classification. And the cross-entropy loss function of network is modified by using focal loss and uniform distribution to improve the recognition accuracy. Then a pitching range and speed model are proposed to determine pitching motion parameters. Finally, the adaptive scanning is realized by integral separate speed PID. The experimental results show that the accuracies of the improved network target detection box, distance and speed classification are 90.17%, 96.87% and 96.97%, respectively. The average speed error of the improved PID is 0.4239°/s, and the average strategy execution time is 0.1521 s.The range and speed model can effectively reduce the collection of useless information and the deformation of the target point cloud. Conclusively, the experimental of overall scanning strategy show that it can improve target point cloud integrity and density while ensuring the capture of target. 展开更多
关键词 Scanning strategy cascaded deep neural network Improved cross entropy loss function Pitching range and speed model Integral separate speed PID
在线阅读 下载PDF
基于CNN和Transformer双流融合的人体姿态估计
14
作者 李鑫 张丹 +2 位作者 郭新 汪松 陈恩庆 《计算机工程与应用》 北大核心 2025年第5期187-199,共13页
卷积神经网络(CNN)和Transformer模型在人体姿态估计中有着广泛应用,然而Transformer更注重捕获图像的全局特征,忽视了局部特征对于人体姿态细节的重要性,而CNN则缺乏Transformer的全局建模能力。为了充分利用CNN处理局部信息和Transfor... 卷积神经网络(CNN)和Transformer模型在人体姿态估计中有着广泛应用,然而Transformer更注重捕获图像的全局特征,忽视了局部特征对于人体姿态细节的重要性,而CNN则缺乏Transformer的全局建模能力。为了充分利用CNN处理局部信息和Transformer处理全局信息的优势,构建一种CNN-Transformer双流的并行网络架构来聚合丰富的特征信息。由于传统Transformer的输入需要将图片展平为多个patch,不利于提取对位置敏感的人体结构信息,因此将其多头注意力结构进行改进,使模型输入能够保持原始2D特征图的结构;同时提出特征耦合模块融合两个分支不同分辨率下的特征,最大限度地保留局部特征与全局特征;最后引入改进后的坐标注意力模块(coordinate attention),进一步提升网络的特征提取能力。在COCO和MPII数据集上的实验结果表明所提模型相对目前主流模型具有更高的检测精度,从而说明所提模型能够充分捕获并融合人体姿态中的局部和全局特征。 展开更多
关键词 卷积神经网络 TRANSFORMER 局部特征 全局特征 2D特征图 特征耦合
在线阅读 下载PDF
融合CNN与Transformer的遥感影像道路信息提取
15
作者 曲海成 王莹 +1 位作者 刘腊梅 郝明 《自然资源遥感》 北大核心 2025年第1期38-45,共8页
利用高分辨率遥感影像进行道路信息提取时,深度神经网络很难同时学习影像全局上下文信息和边缘细节信息,为此,该文提出了一种同时学习全局语义信息和局部空间细节的级联神经网络。首先将输入的特征图分别送入到双分支编码器卷积神经网络... 利用高分辨率遥感影像进行道路信息提取时,深度神经网络很难同时学习影像全局上下文信息和边缘细节信息,为此,该文提出了一种同时学习全局语义信息和局部空间细节的级联神经网络。首先将输入的特征图分别送入到双分支编码器卷积神经网络(convolutional neural networks,CNN)和Transformer中,然后,采用了双分支融合模块(shuffle attention dual branch fusion block,SA-DBF)来有效地结合这2个分支学习到的特征,从而实现全局信息与局部信息的融合。其中,双分支融合模块通过细粒度交互对这2个分支的特征进行建模,同时利用多重注意力机制充分提取特征图的通道和空间信息,并抑制掉无效的噪声信息。在公共数据集Massachusetts道路数据集上对模型进行测试,准确率(overall accuracy,OA)、交并比(intersection over union,IoU)和F 1等评价指标分别达到98.04%,88.03%和65.13%;与主流方法U-Net和TransRoadNet等进行比较,IoU分别提升了2.01个百分点和1.42个百分点,实验结果表明所提出的方法优于其他的比较方法,能够有效提高道路分割的精确度。 展开更多
关键词 级联神经网络 TRANSFORMER 特征融合 注意力机制
在线阅读 下载PDF
基于CNN模型的地震数据噪声压制性能对比研究
16
作者 张光德 张怀榜 +3 位作者 赵金泉 尤加春 魏俊廷 杨德宽 《石油物探》 北大核心 2025年第2期232-246,共15页
地震噪声的压制是地震勘探中地震数据处理的重要研究内容之一。准确地压制地震噪声和提取地震信号中的有效信息是地震勘探和地震监测的一项关键步骤。传统的地震噪声压制方法存在一些不足之处,如灵活性不足、难以处理复杂噪声、有效信... 地震噪声的压制是地震勘探中地震数据处理的重要研究内容之一。准确地压制地震噪声和提取地震信号中的有效信息是地震勘探和地震监测的一项关键步骤。传统的地震噪声压制方法存在一些不足之处,如灵活性不足、难以处理复杂噪声、有效信息损失以及依赖人工提取特征等局限性。为克服传统方法的不足,采用时频域变换并结合深度学习方法进行地震噪声压制,并验证其应用效果。通过构建5个神经网络模型(FCN、Unet、CBDNet、SwinUnet以及TransUnet)对经过时频变换的地震信号进行噪声压制。为了定量评估实验方法的去噪性能,引入了峰值信噪比(PSNR)、结构相似性指数(SSIM)和均方根误差(RMSE)3个指标,比较不同方法的噪声压制性能。数值实验结果表明,基于时频变换的卷积神经网络(CNN)方法对常见的地震噪声类型(包括随机噪声、海洋涌浪噪声、陆地面波噪声)具有较好的噪声压制效果,能够提高地震数据的信噪比。而Transformer模块的引入可进一步提高对上述3种常见地震数据噪声类型的压制效果,进一步提升CNN模型的去噪性能。尽管该方法在数值实验中取得了较好的应用效果,但仍有进一步优化的空间可供探索,比如改进网络结构以适应更复杂的地震信号,并探索与其他先进技术结合,以提升地震噪声压制性能。 展开更多
关键词 地震噪声压制 深度学习 卷积神经网络(cnn) 时频变换 TRANSFORMER
在线阅读 下载PDF
基于VMD-1DCNN-GRU的轴承故障诊断
17
作者 宋金波 刘锦玲 +2 位作者 闫荣喜 王鹏 路敬祎 《吉林大学学报(信息科学版)》 2025年第1期34-42,共9页
针对滚动轴承信号含噪声导致诊断模型训练困难的问题,提出了一种基于变分模态分解(VMD:Variational Mode Decomposition)和深度学习相结合的轴承故障诊断模型。首先,该方法通过VMD对轴承信号进行模态分解,并且通过豪斯多夫距离(HD:Hausd... 针对滚动轴承信号含噪声导致诊断模型训练困难的问题,提出了一种基于变分模态分解(VMD:Variational Mode Decomposition)和深度学习相结合的轴承故障诊断模型。首先,该方法通过VMD对轴承信号进行模态分解,并且通过豪斯多夫距离(HD:Hausdorff Distance)完成去噪,尽可能保留原始信号的特征。其次,将选择的有效信号输入一维卷积神经网络(1DCNN:1D Convolutional Neural Networks)和门控循环单元(GRU:Gate Recurrent Unit)相结合的网络结构(1DCNN-GRU)中完成数据的分类,实现轴承的故障诊断。通过与常见的轴承故障诊断方法比较,所提VMD-1DCNN-GRU模型具有最高的准确性。实验结果验证了该模型对轴承故障有效分类的可行性,具有一定的研究意义。 展开更多
关键词 故障诊断 深度学习 变分模态分解 一维卷积神经网络 门控循环单元
在线阅读 下载PDF
基于CNN-Swin Transformer Network的LPI雷达信号识别 被引量:1
18
作者 苏琮智 杨承志 +2 位作者 邴雨晨 吴宏超 邓力洪 《现代雷达》 CSCD 北大核心 2024年第3期59-65,共7页
针对在低信噪比(SNR)条件下,低截获概率雷达信号调制方式识别准确率低的问题,提出一种基于Transformer和卷积神经网络(CNN)的雷达信号识别方法。首先,引入Swin Transformer模型并在模型前端设计CNN特征提取层构建了CNN+Swin Transforme... 针对在低信噪比(SNR)条件下,低截获概率雷达信号调制方式识别准确率低的问题,提出一种基于Transformer和卷积神经网络(CNN)的雷达信号识别方法。首先,引入Swin Transformer模型并在模型前端设计CNN特征提取层构建了CNN+Swin Transformer网络(CSTN),然后利用时频分析获取雷达信号的时频特征,对图像进行预处理后输入CSTN模型进行训练,由网络的底部到顶部不断提取图像更丰富的语义信息,最后通过Softmax分类器对六类不同调制方式信号进行分类识别。仿真实验表明:在SNR为-18 dB时,该方法对六类典型雷达信号的平均识别率达到了94.26%,证明了所提方法的可行性。 展开更多
关键词 低截获概率雷达 信号调制方式识别 Swin Transformer网络 卷积神经网络 时频分析
在线阅读 下载PDF
基于VMD-CNN-BiTCN滚动轴承故障诊断
19
作者 徐志祥 玄永伟 +1 位作者 王洪洋 王壬杰 《微特电机》 2025年第2期68-73,共6页
针对滚动轴承故障诊断中,传统卷积神经网络(CNN)特征提取感受野受限、无法有效提取数据时序特征的问题,提出了一种CNN结合双向时间卷积网络(BiTCN)的模型,该模型能够扩展感受野并有效捕获数据的时序特征。将原始振动信号通过变分模态(V... 针对滚动轴承故障诊断中,传统卷积神经网络(CNN)特征提取感受野受限、无法有效提取数据时序特征的问题,提出了一种CNN结合双向时间卷积网络(BiTCN)的模型,该模型能够扩展感受野并有效捕获数据的时序特征。将原始振动信号通过变分模态(VMD)分解为K个本征模函数(IMF);将分解后的信号输入到CNN层中进行特征提取和信号压缩;将该信号送入BiTCN中,提取正反两个方向的时序特征,使用膨胀卷积最大化感受野;通过池化层和全连接层实现滚动轴承故障诊断。实验结果显示,该模型在特征提取能力和时序特征感知具有显著优势,能够在多个数据集中表现出良好的故障诊断性能和泛化能力。 展开更多
关键词 滚动轴承 故障诊断 卷积神经网络 双向时间卷积网络 变分模态分解
在线阅读 下载PDF
基于卷积神经网络CNN模型的课堂情绪识别系统设计
20
作者 谭方勇 白晨宇 吉彩云 《苏州市职业大学学报》 2025年第1期42-47,共6页
针对目前传统的课堂情绪管理系统中存在的识别和储存问题,提出了一种基于CNN的面部情绪识别系统。该系统通过课堂图像采集、人脸定位与身份识别、人脸情绪识别、课堂情绪数据统计等方法,并配合人脸情绪识别技术来实现课堂教学效果的评... 针对目前传统的课堂情绪管理系统中存在的识别和储存问题,提出了一种基于CNN的面部情绪识别系统。该系统通过课堂图像采集、人脸定位与身份识别、人脸情绪识别、课堂情绪数据统计等方法,并配合人脸情绪识别技术来实现课堂教学效果的评估。经测试验证,该系统能够有效地满足课堂教学效果反馈评估的需求。 展开更多
关键词 卷积神经网络(cnn) 教学监控 人脸识别 情绪识别 教学评估
在线阅读 下载PDF
上一页 1 2 188 下一页 到第
使用帮助 返回顶部