87 articles found
State-of-health estimation for fast-charging lithium-ion batteries based on a short charge curve using graph convolutional and long short-term memory networks
1
Authors: Yvxin He, Zhongwei Deng, Jue Chen, Weihan Li, Jingjing Zhou, Fei Xiang, Xiaosong Hu. Journal of Energy Chemistry (SCIE, EI, CAS, CSCD), 2024, No. 11, pp. 1-11 (11 pages)
A fast-charging policy is widely employed to alleviate the inconvenience caused by the extended charging time of electric vehicles. However, fast charging exacerbates battery degradation and shortens battery lifespan. In addition, there is still a lack of tailored health estimations for fast-charging batteries; most existing methods are applicable at lower charging rates. This paper proposes a novel method for estimating the health of lithium-ion batteries, which is tailored for multi-stage constant current-constant voltage fast-charging policies. Initially, short charging segments are extracted by monitoring current switches, followed by deriving voltage sequences using interpolation techniques. Subsequently, a graph generation layer is used to transform the voltage sequence into graphical data. Furthermore, the integration of a graph convolution network with a long short-term memory network enables the extraction of information related to inter-node message transmission, capturing the key local and temporal features during the battery degradation process. Finally, this method is confirmed by utilizing aging data from 185 cells and 81 distinct fast-charging policies. The 4-minute charging duration achieves a balance between high accuracy in estimating battery state of health and low data requirements, with mean absolute errors and root mean square errors of 0.34% and 0.66%, respectively.
Keywords: Lithium-ion battery; State of health estimation; Feature extraction; Graph convolutional network; Long short-term memory network
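The segment-extraction step described in this abstract (detect a current switch, then resample the voltage over a short window) can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the function names, the 0.5 A switch threshold, and the choice of linear interpolation are all hypothetical, and the graph generation and GCN-LSTM stages are omitted.

```python
from bisect import bisect_right


def find_current_switches(current, threshold=0.5):
    """Return indices where the current jumps by more than `threshold` (assumed, in A)."""
    return [i for i in range(1, len(current))
            if abs(current[i] - current[i - 1]) > threshold]


def interpolate_voltage(times, voltages, n_points):
    """Linearly resample a voltage segment onto `n_points` (>= 2) evenly spaced times.

    `times` must be strictly increasing.
    """
    t0, t1 = times[0], times[-1]
    grid = [t0 + (t1 - t0) * k / (n_points - 1) for k in range(n_points)]
    resampled = []
    for t in grid:
        # Index of the right-hand sample of the interval that contains t.
        j = min(max(bisect_right(times, t), 1), len(times) - 1)
        ta, tb = times[j - 1], times[j]
        va, vb = voltages[j - 1], voltages[j]
        resampled.append(va + (vb - va) * (t - ta) / (tb - ta))
    return resampled


def extract_segment(times, current, voltages, duration, n_points):
    """Resample the window of length `duration` that starts at the first current switch.

    Returns None when no switch is found or the window holds fewer than two samples.
    """
    switches = find_current_switches(current)
    if not switches:
        return None
    start = switches[0]
    t_end = times[start] + duration
    idx = [i for i in range(start, len(times)) if times[i] <= t_end]
    if len(idx) < 2:
        return None
    seg_t = [times[i] for i in idx]
    seg_v = [voltages[i] for i in idx]
    return interpolate_voltage(seg_t, seg_v, n_points)
```

The fixed-length output is what makes the segment usable as a model input regardless of the sampling rate of the raw data; the window length (4 minutes in the abstract) would be passed as `duration` in whatever time unit `times` uses.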
Audiovisual speech recognition based on a deep convolutional neural network
2
Authors: Shashidhar Rudregowda, Sudarshan Patilkulkarni, Vinayakumar Ravi, Gururaj H.L., Moez Krichen. Data Science and Management, 2024, No. 1, pp. 25-34 (10 pages)
Audiovisual speech recognition is an emerging research topic. Lipreading is the recognition of what someone is saying using visual information, primarily lip movements. In this study, we created a custom dataset for Indian English linguistics and categorized it into three main categories: (1) audio recognition, (2) visual feature extraction, and (3) combined audio and visual recognition. Audio features were extracted using the mel-frequency cepstral coefficient, and classification was performed using a one-dimensional convolutional neural network. Visual feature extraction uses Dlib, and visual speech is then classified using a long short-term memory type of recurrent neural network. Finally, integration was performed using a deep convolutional network. The audio speech of Indian English was successfully recognized with accuracies of 93.67% and 91.53%, respectively, using testing data from 200 epochs. The training accuracy for visual speech recognition using the Indian English dataset was 77.48% and the test accuracy was 76.19% using 60 epochs. After integration, the accuracies of audiovisual speech recognition using the Indian English dataset for training and testing were 94.67% and 91.75%, respectively.
Keywords: Audiovisual speech recognition; Custom dataset; 1D convolution neural network (CNN); Deep CNN (DCNN); Long short-term memory (LSTM); Lipreading; Dlib; Mel-frequency cepstral coefficient (MFCC)
Research on Short-Term Electric Load Forecasting Using IWOA CNN-BiLSTM-TPA Model
3
Authors: MEI Tong-da, SI Zhan-jun, ZHANG Ying-xue. 《印刷与数字媒体技术研究》 (PKU Core), 2025, No. 1, pp. 179-187 (9 pages)
Load forecasting is of great significance to the development of new power systems. With the advancement of smart grids, the integration and distribution of distributed renewable energy sources and power electronics devices have made power load data increasingly complex and volatile. This places higher demands on the prediction and analysis of power loads. To improve the prediction accuracy of short-term power load, a CNN-BiLSTM-TPA short-term power prediction model based on the Improved Whale Optimization Algorithm (IWOA) with mixed strategies was proposed. First, the model combined the Convolutional Neural Network (CNN) with the Bidirectional Long Short-Term Memory Network (BiLSTM) to fully extract the spatio-temporal characteristics of the load data itself. Then, the Temporal Pattern Attention (TPA) mechanism was introduced into the CNN-BiLSTM model to automatically assign corresponding weights to the hidden states of the BiLSTM. This allowed the model to differentiate the importance of load sequences at different time intervals. At the same time, to address the difficulty of selecting the parameters of the temporal model and the poor global search ability of the whale algorithm, which easily falls into local optima, the whale algorithm was improved (IWOA) using a hybrid strategy of Tent chaos mapping and Levy flight, so as to better search the parameters of the model. In this experiment, the real load data of a region in Zhejiang was analyzed as an example, and the prediction accuracy (R²) of the proposed method reached 98.83%. Compared with prediction models such as BP, WOA-CNN-BiLSTM, SSA-CNN-BiLSTM, and CNN-BiGRU-Attention, the experimental results showed that the model proposed in this study has a higher prediction accuracy.
Keywords: Whale Optimization Algorithm; Convolutional Neural Network; Long short-term memory; Temporal Pattern Attention; Power load forecasting
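The Tent chaos mapping mentioned in this abstract is commonly used to spread an optimizer's initial population more evenly than uniform random draws. The sketch below is only an illustration under assumptions: it uses the skew tent map with a breakpoint `a = 0.7` (the paper's exact variant and parameters are not given here), the function names and the example bounds are hypothetical, and the Levy flight step and the whale update rules are not shown.

```python
def tent_sequence(x0, n, a=0.7):
    """Generate n values in [0, 1] by iterating the skew tent map from x0 in (0, 1)."""
    values, x = [], x0
    for _ in range(n):
        # Rising branch below the breakpoint a, falling branch above it.
        x = x / a if x < a else (1.0 - x) / (1.0 - a)
        values.append(x)
    return values


def init_population(n_agents, bounds, x0=0.37):
    """Scale one chaotic sequence onto the search box given by `bounds`.

    `bounds` is a list of (low, high) pairs, one per hyperparameter.
    """
    dim = len(bounds)
    chaos = tent_sequence(x0, n_agents * dim)
    population = []
    for i in range(n_agents):
        agent = [low + chaos[i * dim + d] * (high - low)
                 for d, (low, high) in enumerate(bounds)]
        population.append(agent)
    return population
```

For example, `init_population(20, [(0.0001, 0.1), (16, 256)])` would produce 20 candidate pairs, which could stand for a learning rate and a hidden-unit count; those two hyperparameters are an assumed example, not taken from the paper.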
Recurrent Convolutional Neural Network MSER-Based Approach for Payable Document Processing (Cited by: 1)
4
Authors: Suliman Aladhadh, Hidayat Ur Rehman, Ali Mustafa Qamar, Rehan Ullah Khan. Computers, Materials & Continua (SCIE, EI), 2021, No. 12, pp. 3399-3411 (13 pages)
A tremendous amount of vendor invoices is generated in the corporate sector. To automate the manual data entry in payable documents, highly accurate Optical Character Recognition (OCR) is required. This paper proposes an end-to-end OCR system that does both localization and recognition and serves as a single unit to automate payable document processing such as cheques and cash disbursement. For text localization, the maximally stable extremal region is used, which extracts a word or digit chunk from an invoice. This chunk is later passed to the deep learning model, which performs text recognition. The deep learning model utilizes both convolution neural networks and long short-term memory (LSTM). The convolution layer is used for extracting features, which are fed to the LSTM. The model integrates feature extraction, sequence modeling, and transcription into a unified network. It handles sequences of unconstrained lengths, independent of character segmentation or horizontal scale normalization. Furthermore, it applies to both lexicon-free and lexicon-based text recognition, and finally, it produces a comparatively smaller model, which can be implemented in practical applications. The overall superior performance in the experimental evaluation demonstrates the usefulness of the proposed model. The model is thus generic and can be used for other similar recognition scenarios.
Keywords: Character recognition; text spotting; long short-term memory; recurrent convolutional neural networks
Classification of Arrhythmia Based on Convolutional Neural Networks and Encoder-Decoder Model
5
Authors: Jian Liu, Xiaodong Xia, Chunyang Han, Jiao Hui, Jim Feng. Computers, Materials & Continua (SCIE, EI), 2022, No. 10, pp. 265-278 (14 pages)
As a common and high-risk type of disease, heart disease seriously threatens people's health. At the same time, in the era of the Internet of Things (IoT), smart medical devices have strong practical significance for medical workers and patients because of their ability to assist in the diagnosis of diseases. Therefore, research on real-time diagnosis and classification algorithms for arrhythmia can help to improve diagnostic efficiency. In this paper, we design an automatic arrhythmia classification model based on a Convolutional Neural Network (CNN) and an Encoder-Decoder model. The model uses Long Short-Term Memory (LSTM) to consider the influence of time series features on classification results. It is trained and tested on the MIT-BIH arrhythmia database. In addition, a Generative Adversarial Network (GAN) is adopted as a data equalization method to solve the data imbalance problem. The simulation results show that for inter-patient arrhythmia classification, the hybrid model combining the CNN and the Encoder-Decoder model has the best classification accuracy, reaching 94.05%. In particular, it performs better on supraventricular ectopic beats (class S) and fusion beats (class F).
Keywords: Electroencephalography; convolutional neural network; long short-term memory; encoder-decoder model; generative adversarial network
Use of Local Region Maps on Convolutional LSTM for Single-Image HDR Reconstruction
6
Authors: Seungwook Oh, GyeongIk Shin, Hyunki Hong. Computers, Materials & Continua (SCIE, EI), 2022, No. 6, pp. 4555-4572 (18 pages)
Low dynamic range (LDR) images captured by consumer cameras have a limited luminance range. As the conventional method for generating high dynamic range (HDR) images involves merging multiple-exposure LDR images of the same scene (assuming a stationary scene), we introduce a learning-based model for single-image HDR reconstruction. An input LDR image is sequentially segmented into local region maps based on the cumulative histogram of the input brightness distribution. Using the local region maps, SParam-Net estimates the parameters of an inverse tone mapping function to generate a pseudo-HDR image. We process the segmented region maps as input sequences on long short-term memory. Finally, a fast super-resolution convolutional neural network is used for HDR image reconstruction. The proposed method was trained and tested on datasets including HDR-Real, LDR-HDR-pair, and HDR-Eye. The experimental results revealed that HDR images can be generated more reliably than using contemporary end-to-end approaches.
Keywords: Low dynamic range; high dynamic range; deep learning; convolutional long short-term memory; inverse tone mapping function
Real-Time Speech Enhancement Based on Convolutional Recurrent Neural Network
7
Authors: S. Girirajan, A. Pandian. Intelligent Automation & Soft Computing (SCIE), 2023, No. 2, pp. 1987-2001 (15 pages)
Speech enhancement is the task of taking a noisy speech input and producing an enhanced speech output. In recent years, the need for speech enhancement has increased due to challenges in various applications such as hearing aids, Automatic Speech Recognition (ASR), and mobile speech communication systems. Most speech enhancement research has been carried out for English, Chinese, and other European languages. Only a few research works involve speech enhancement in Indian regional languages. In this paper, we propose a two-fold architecture to perform speech enhancement for Tamil speech signals based on a convolutional recurrent neural network (CRN) that addresses speech enhancement in a real-time single channel or track of sound created by the speaker. In the first stage, a mask-based long short-term memory (LSTM) network is used for noise suppression along with a loss function, and in the second stage, a Convolutional Encoder-Decoder (CED) is used for speech restoration. The proposed model is evaluated on various speakers and noisy environments such as babble noise, car noise, and white Gaussian noise. The proposed CRN model improves speech quality by 0.1 points compared with the LSTM base model, and the CRN also requires fewer parameters for training. The performance of the proposed model is outstanding even at a low Signal to Noise Ratio (SNR).
Keywords: Speech enhancement; convolutional encoder-decoder; long short-term memory; noise suppression; speech restoration
Study of A Hybrid Deep Learning Method for Forecasting the Short-Term Motion Responses of A Semi-Submersible (Cited by: 1)
8
Authors: XU Sheng, JI Chun-yan. China Ocean Engineering (CSCD), 2024, No. 6, pp. 917-931 (15 pages)
Accurately predicting motion responses is a crucial component of the design process for floating offshore structures. This study introduces a hybrid model that integrates a convolutional neural network (CNN), a bidirectional long short-term memory (BiLSTM) neural network, and an attention mechanism for forecasting the short-term motion responses of a semi-submersible. First, the motions are processed through the CNN for feature extraction. The extracted features are subsequently utilized by the BiLSTM network to forecast future motions. To enhance the predictive capability of the neural networks, an attention mechanism is integrated. In addition to the hybrid model, the BiLSTM is independently employed to forecast the motion responses of the semi-submersible, serving as benchmark results for comparison. Furthermore, both 1D and 2D convolutions are conducted to check the influence of the convolutional dimensionality on the predicted results. The results demonstrate that the hybrid 1D CNN-BiLSTM network with an attention mechanism outperforms all other models in accurately predicting motion responses.
Keywords: short-term motion responses; convolutional neural network; bidirectional long short-term memory neural network; attention mechanism; hybrid model; multi-step prediction; semi-submersible
Short-term train arrival delay prediction:a data-driven approach
9
Authors: Qingyun Fu, Shuxin Ding, Tao Zhang, Rongsheng Wang, Ping Hu, Cunlai Pu. Railway Sciences, 2024, No. 4, pp. 514-529 (16 pages)
Purpose: To optimize train operations, dispatchers currently rely on experience for quick adjustments when delays occur. However, delay predictions often involve imprecise shifts based on known delay times. Real-time and accurate train delay predictions, facilitated by data-driven neural network models, can significantly reduce dispatcher stress and improve adjustment plans. Leveraging current train operation data, these models enable swift and precise predictions, addressing challenges posed by train delays in high-speed rail networks during unforeseen events. Design/methodology/approach: This paper proposes CBLA-net, a neural network architecture for predicting late arrival times. It combines CNN, Bi-LSTM, and attention mechanisms to extract features, handle time series data, and enhance information utilization. Trained on operational data from the Beijing-Tianjin line, it predicts the late arrival time of a target train at the next station using multidimensional input data from the target and preceding trains. Findings: This study evaluates the model's predictive performance using two data approaches: one considering full data and another focusing only on late arrivals. Results show precise and rapid predictions. Training with full data achieves an MAE of approximately 0.54 minutes and an RMSE of 0.65 minutes, surpassing the model trained solely on delay data (MAE of about 1.02 min, RMSE of about 1.52 min). Despite superior overall performance with full data, the model excels at predicting delays exceeding 15 minutes when trained exclusively on late arrivals. For enhanced adaptability to real-world train operations, training with full data is recommended. Originality/value: This paper introduces a novel neural network model, CBLA-net, for predicting train delay times. It compares and analyzes the model's performance using both full data and delay data formats. Additionally, the evaluation of the network's predictive capabilities considers different scenarios, providing a comprehensive demonstration of the model's predictive performance.
Keywords: Train delay prediction; Intelligent dispatching command; Deep learning; Convolutional neural network; Long short-term memory; Attention mechanism
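The two error measures quoted in this abstract, MAE and RMSE, follow standard definitions and can be computed as in the sketch below. The helper `subset_above` is a hypothetical addition that shows one way to restrict the evaluation to long delays (for instance those above 15 minutes); it is not taken from the paper, and the paper's exact evaluation protocol may differ.

```python
import math


def mae(y_true, y_pred):
    """Mean absolute error, in the same unit as the inputs (minutes here)."""
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)


def rmse(y_true, y_pred):
    """Root mean square error, in the same unit as the inputs."""
    return math.sqrt(sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true))


def subset_above(y_true, y_pred, threshold):
    """Keep only the samples whose true delay exceeds `threshold` minutes."""
    pairs = [(t, p) for t, p in zip(y_true, y_pred) if t > threshold]
    return [t for t, _ in pairs], [p for _, p in pairs]
```

Because RMSE squares each error before averaging, it is never smaller than MAE on the same data, and a large gap between the two indicates a few large misses rather than uniformly small ones.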
ConvLSTM-Based Energy Consumption Model for Mobile Edge Computing Servers
10
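Authors: 李小龙, 李曦, 杨凌峰, 黄华. 《应用科学学报》 (CAS, CSCD, PKU Core), 2024, No. 1, pp. 53-66 (14 pages)
To address the low sensitivity and low accuracy of existing energy consumption models under dynamic workload fluctuations, this paper proposes an intelligence server energy consumption model (IECM) for mobile edge computing, based on a convolutional long short-term memory (ConvLSTM) neural network, to predict and optimize server energy consumption. Server runtime parameters are collected, and the entropy method is used to screen and retain the parameters that significantly affect server energy consumption. Based on the selected parameters, a ConvLSTM neural network is used to train the deep network of the server energy consumption model. Compared with existing energy consumption models, IECM adapts to dynamic changes in server workload on CPU-intensive, I/O-intensive, memory-intensive, and mixed tasks, and achieves better accuracy in energy consumption prediction.
Keywords: convolutional long short-term memory; energy consumption prediction; intelligent power model; power modeling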
Intelligent Forecasting and Evaluation of Waves along the Southeast Coast of China Based on ConvLSTM
11
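Authors: 金阳, 韩磊, 金梅兵, 董昌明. 《海洋学研究》 (CSCD, PKU Core), 2024, No. 3, pp. 88-98 (11 pages)
Compared with semi-theoretical, semi-analytical, and numerical-model wave forecasting methods, intelligent wave forecasting offers high accuracy and low computational resource requirements. Based on the convolutional long short-term memory network (ConvLSTM) algorithm, this paper builds a two-dimensional forecasting model for significant wave height (SWH), trains it on ERA5 data for the southeast coast of China from 2014 to 2022, optimizes the model configuration through sensitivity experiments, and evaluates its prediction performance for SWH along the southeast coast of China in 2023 at four forecast lead times (6 h, 12 h, 18 h, and 24 h). The sensitivity experiments show that with an input time series length of N = 4 (that is, SWH values at -18 h, -12 h, -6 h, and 0 h), the model is more accurate at all four lead times than with other time series lengths. With SWH, mean wave direction, and the 10 m sea surface wind vector as the combination of input physical variables, the model is more accurate at the 12 h, 18 h, and 24 h lead times than with other combinations. With careful tuning of ConvLSTM model training and configuration, two-dimensional, high-accuracy intelligent forecasting of SWH along the southeast coast of China can be achieved.
Keywords: China's coastal seas; convolutional long short-term memory network; data-driven; ocean waves; significant wave height; two-dimensional forecasting model; short-term forecasting; artificial intelligence; deep learning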
Visualization-based prediction of dendritic copper growth in electrochemical cells using convolutional long short-term memory (Cited by: 1)
12
Authors: Roshan Kumar, Trina Dhara, Han Hu, Monojit Chakraborty. Energy and AI, 2022, No. 4, pp. 149-160 (12 pages)
Electrodeposition in electrochemical cells is one of the leading causes of their performance deterioration. The prediction of electrodeposition growth demands a good understanding of the complex physics involved, which can lead to the fabrication of a probabilistic mathematical model. As an alternative, an image analysis approach based on a convolutional long short-term memory architecture is presented herein. This technique can predict the electrodeposition growth of the electrolytes without prior detailed knowledge of the system. The captured images of the electrodeposition from the experiments are used to train and test the model. A pixel-level comparison between the expected output image and the predicted image, the percentage mean squared error, the absolute percentage error, and the pattern density of the electrodeposit are investigated to assess the model accuracy. The randomness of the electrodeposition growth is outlined by investigating the fractal dimension and the interfacial length of the electrodeposits. The trained model predictions show significant promise, with good agreement between the experimentally obtained parameters and the predicted ones. It is expected that this deep learning-based approach for predicting random electrodeposition growth will be of immense help for designing and optimizing the relevant experimental scheme in the near future without performing multiple experiments.
Keywords: Electrodeposition; Electrochemical cell; Deep learning; Data-driven modelling; Convolutional long short-term memory
Non-Line-of-Sight Multipath Classification Method for BDS Using Convolutional Sparse Autoencoder with LSTM
13
Authors: Yahang Qin, Zhenni Li, Shengli Xie, Bo Li, Ming Liu, Victor Kuzin. Tsinghua Science and Technology, 2025, No. 1, pp. 68-86 (19 pages)
Multipath signal recognition is crucial to the ability of the BeiDou Navigation Satellite System (BDS) to provide high-precision absolute-position services. However, most existing approaches to this issue involve supervised machine learning (ML) methods, and it is difficult to move to unsupervised multipath signal recognition because of the limitations in signal labeling. Inspired by an autoencoder with powerful unsupervised feature extraction, we propose a new deep learning (DL) model for BDS signal recognition that places a long short-term memory (LSTM) module in series with a convolutional sparse autoencoder to create a new autoencoder structure. First, we propose to capture the temporal correlations in long-duration BeiDou satellite time-series signals by using the LSTM module to mine the temporal change patterns in the time series. Second, we develop a convolutional sparse autoencoder method that learns a compressed representation of the input data, which then enables downscaled and unsupervised feature extraction from long-duration BeiDou satellite series signals. Finally, we add an l_(1/2) regularizer to the objective function of our DL model to remove redundant neurons from the neural network while ensuring recognition accuracy. We tested our proposed approach on a real urban canyon dataset, and the results demonstrated that our algorithm could achieve better classification performance than two ML-based methods (e.g., 11% better than a support vector machine) and two existing DL-based methods (e.g., 7.26% better than convolutional neural networks).
Keywords: convolutional sparse autoencoder; BeiDou Navigation Satellite System (BDS); long short-term memory (LSTM); multipath classification
Cloud Computing Load Prediction Method Based on a CEEMDAN-ConvLSTM Combined Model (Cited by: 2)
14
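Authors: 赵鹏, 周建涛, 赵大明. 《计算机科学》 (CSCD, PKU Core), 2023, No. S01, pp. 642-650 (9 pages)
With the rapid development of cloud computing, more and more users are choosing cloud services. The mismatch between load requests and resource supply is increasingly prominent: user requests cannot be answered in time, which severely degrades cloud service quality. Predicting load requests in real time helps provision resources promptly. To address the poor performance of load prediction methods in cloud computing environments, a cloud computing load prediction method is proposed based on a combined model of complete ensemble empirical mode decomposition with adaptive noise and a convolutional long short-term memory neural network (CEEMDAN-ConvLSTM). First, CEEMDAN is used to decompose the data sequence into several subsequences that are easier to analyze and model. Then, a ConvLSTM prediction model is used to model and predict each subsequence, and multi-process parallel computing is used to predict multiple sequences in parallel and to tune parameters by Bayesian optimization. Finally, the predicted values are summed to obtain the output of the whole model, achieving high-accuracy prediction of the original complex sequence. Experiments on the Google cluster workload dataset show that the CEEMDAN-ConvLSTM combined model predicts well: compared with the autoregressive integrated moving average model (ARIMA), the long short-term memory network (LSTM), and ConvLSTM, the root mean square error (RMSE) of the proposed model improves by 30.9%, 30.1%, and 22.5%, respectively.
Keywords: cloud computing; load prediction; convolutional long short-term memory neural network (ConvLSTM); mode decomposition; Bayesian optimization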
Pedestrian Attribute Recognition Based on CNN-ATT-ConvLSTM (Cited by: 2)
15
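Authors: 李洋, 许华虎, 卞敏捷. 《计算机应用与软件》 (PKU Core), 2021, No. 4, pp. 152-158 (7 pages)
Existing pedestrian attribute recognition methods ignore the correlations among pedestrian attributes and spatial information, which lowers recognition performance. To address this, the task is treated as a spatiotemporal-sequence multi-label image classification problem, and a model is proposed that is based on a convolutional neural network (CNN) and a convolutional long short-term memory network (ConvLSTM) and incorporates a channel attention mechanism. The CNN and channel attention extract salient and correlated visual features of pedestrian attributes; the ConvLSTM further extracts the spatial information of the visual features and the correlations among attributes; pedestrian attributes are then predicted in an optimized sequence order. Extensive experiments on two widely used pedestrian attribute datasets, PETA and RAP, achieve the best performance, demonstrating the superiority and effectiveness of the method.
Keywords: pedestrian attribute recognition; convolutional neural network; convolutional long short-term memory network; attention mechanism; multi-label classification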
Dynamic Hand Gesture Recognition Based on Short-Term Sampling Neural Networks (Cited by: 12)
16
Authors: Wenjin Zhang, Jiacun Wang, Fangping Lan. IEEE/CAA Journal of Automatica Sinica (SCIE, EI, CSCD), 2021, No. 1, pp. 110-120 (11 pages)
Hand gestures are a natural way for human-robot interaction. Vision-based dynamic hand gesture recognition has become a hot research topic due to its various applications. This paper presents a novel deep learning network for hand gesture recognition. The network integrates several well-proved modules together to learn both short-term and long-term features from video inputs while avoiding intensive computation. To learn short-term features, each video input is segmented into a fixed number of frame groups. A frame is randomly selected from each group and represented as an RGB image as well as an optical flow snapshot. These two entities are fused and fed into a convolutional neural network (ConvNet) for feature extraction. The ConvNets for all groups share parameters. To learn long-term features, outputs from all ConvNets are fed into a long short-term memory (LSTM) network, by which a final classification result is predicted. The new model has been tested with two popular hand gesture datasets, namely the Jester dataset and the Nvidia dataset. Compared with other models, our model produced very competitive results. The robustness of the new model has also been proved with an augmented dataset with enhanced diversity of hand gestures.
Keywords: convolutional neural network (ConvNet); hand gesture recognition; long short-term memory (LSTM) network; short-term sampling; transfer learning
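The short-term sampling step described in this abstract (split the video into a fixed number of frame groups, then pick one random frame per group) can be sketched as follows. The function name, its signature, and the rule used to place group boundaries are assumptions made for illustration; the RGB and optical-flow fusion, the shared ConvNet, and the LSTM stage are not shown.

```python
import random


def sample_frames(n_frames, n_groups, rng=None):
    """Pick one random frame index from each of `n_groups` consecutive groups.

    Groups are contiguous and near-equal in size (an assumed partitioning rule).
    """
    if n_groups <= 0 or n_frames < n_groups:
        raise ValueError("need at least one frame per group")
    rng = rng or random.Random()
    # bounds[g] is the first frame index of group g; bounds[-1] equals n_frames.
    bounds = [n_frames * g // n_groups for g in range(n_groups + 1)]
    return [rng.randrange(bounds[g], bounds[g + 1]) for g in range(n_groups)]
```

Sampling one frame per group keeps the number of ConvNet passes fixed regardless of video length, which is consistent with the abstract's goal of avoiding intensive computation; passing a seeded `random.Random` makes the selection reproducible.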
Video Scene Classification of Criminal Events Based on C3D and CBAM-ConvLSTM (Cited by: 2)
17
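Authors: 李燕, 何敏. 《刑事技术》, 2022, No. 5, pp. 448-457 (10 pages)
As the Safe City project advances, most cities in China have achieved full surveillance coverage and generate massive amounts of surveillance video every day. Automating the processing of surveillance video with artificial intelligence remains an open problem. To address it, this paper proposes a video scene classification algorithm based on C3D and CBAM-ConvLSTM (convolutional block attention module-convolutional long short-term memory network) to classify criminal events in surveillance footage effectively. First, a C3D network and an attention mechanism extract local spatial and local temporal features from the surveillance video. Then, the extracted video feature sequence is fed into the CBAM-ConvLSTM to extract global spatial and global temporal features. Finally, a classifier uses the global features to classify the criminal event in the input video. Experiments on two datasets, the self-built criminal event dataset Crimes-mini and the public violence dataset Hockey, give an accuracy of up to 92.19% and an F1 score of up to 90.40% for criminal event classification, and an accuracy of up to 99.5% and an F1 score of up to 99.5% for violence classification. The results show that the proposed method classifies criminal events and violent behavior in surveillance video fairly effectively.
Keywords: video classification; three-dimensional convolutional neural network; attention mechanism; convolutional long short-term memory network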
Deep Learning Network for Energy Storage Scheduling in Power Market Environment Short-Term Load Forecasting Model
18
Authors: Yunlei Zhang, Ruifeng Cao, Danhuang Dong, Sha Peng, Ruoyun Du, Xiaomin Xu. Energy Engineering (EI), 2022, No. 5, pp. 1829-1841 (13 pages)
In the electricity market, fluctuations in real-time prices are unstable, and changes in short-term load are determined by many factors. By studying the timing of charging and discharging, as well as the economic benefits of energy storage in the process of participating in the power market, this paper takes energy storage scheduling as merely one factor affecting short-term power load, which affects short-term load time series along with time-of-use price, holidays, and temperature. A deep learning network is used to predict the short-term load, a convolutional neural network (CNN) is used to extract the features, and a long short-term memory (LSTM) network is used to learn the temporal characteristics of the load value, which can effectively improve prediction accuracy. Taking the load data of a certain region as an example, the CNN-LSTM prediction model is compared with the single LSTM prediction model. The experimental results show that the CNN-LSTM deep learning network with the participation of energy storage in dispatching can have high prediction accuracy for short-term power load forecasting.
Keywords: Energy storage scheduling; short-term load forecasting; deep learning network; convolutional neural network (CNN); long and short term memory network (LSTM)
Hybrid Model for Short-Term Passenger Flow Prediction in Rail Transit
19
Authors: Yinghua Song, Hairong Lyu, Wei Zhang. Journal on Big Data, 2023, No. 1, pp. 19-40 (22 pages)
A precise and timely forecast of short-term rail transit passenger flow provides data support for traffic management and operation, assisting rail operators in efficiently allocating resources and promptly relieving pressure on passenger safety and operation. First, the passenger flow sequence models in the study are broken down using VMD for noise reduction. The objective environment features are then added to the characteristic factors that affect the passenger flow. The target station serves as an additional spatial feature and is mined concurrently using the KNN algorithm. Reference experiments with BP, CNN, and LSTM show that the hybrid model VMD-CLSMT has a higher prediction accuracy. All models' second-order prediction effects are superior to their first-order effects, showing that the residual network can significantly raise model prediction accuracy. Additionally, the results confirm the efficacy of the supplementary and objective environmental features.
Keywords: short-term passenger flow forecast; variational mode decomposition; long and short-term memory; convolutional neural network; residual network
Multi-Station Urban PM2.5 Concentration Prediction Integrating Spatiotemporal Features (Cited by: 1)
20
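Authors: 黄琨, 吴学群, 成飞飞, 韩啸. 《传感器与微系统》 (CSCD, PKU Core), 2024, No. 5, pp. 149-152, 157 (5 pages)
This paper proposes a multi-station urban PM2.5 prediction method that integrates spatiotemporal features and captures the temporal and spatial correlations of PM2.5. PM2.5 data from multiple stations in a region are converted into a series of static images and fed into a convolutional long short-term memory (ConvLSTM) model, which is trained end to end to predict future PM2.5 concentrations at multiple stations over multiple time periods. The method is validated on PM2.5 data from multiple stations in Beijing. The results show that the ConvLSTM method, which accounts for spatiotemporal features, outperforms four other time-series methods in short-term prediction, and that it offers a new approach to PM2.5 prediction.
Keywords: spatiotemporal features; convolutional long short-term memory; multi-station; PM2.5 concentration prediction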