The ability to accurately predict urban traffic flows is crucial for optimising city operations.Consequently,various methods for forecasting urban traffic have been developed,focusing on analysing historical data to u...The ability to accurately predict urban traffic flows is crucial for optimising city operations.Consequently,various methods for forecasting urban traffic have been developed,focusing on analysing historical data to understand complex mobility patterns.Deep learning techniques,such as graph neural networks(GNNs),are popular for their ability to capture spatio-temporal dependencies.However,these models often become overly complex due to the large number of hyper-parameters involved.In this study,we introduce Dynamic Multi-Graph Spatial-Temporal Graph Neural Ordinary Differential Equation Networks(DMST-GNODE),a framework based on ordinary differential equations(ODEs)that autonomously discovers effective spatial-temporal graph neural network(STGNN)architectures for traffic prediction tasks.The comparative analysis of DMST-GNODE and baseline models indicates that DMST-GNODE model demonstrates superior performance across multiple datasets,consistently achieving the lowest Root Mean Square Error(RMSE)and Mean Absolute Error(MAE)values,alongside the highest accuracy.On the BKK(Bangkok)dataset,it outperformed other models with an RMSE of 3.3165 and an accuracy of 0.9367 for a 20-min interval,maintaining this trend across 40 and 60 min.Similarly,on the PeMS08 dataset,DMST-GNODE achieved the best performance with an RMSE of 19.4863 and an accuracy of 0.9377 at 20 min,demonstrating its effectiveness over longer periods.The Los_Loop dataset results further emphasise this model’s advantage,with an RMSE of 3.3422 and an accuracy of 0.7643 at 20 min,consistently maintaining superiority across all time intervals.These numerical highlights indicate that DMST-GNODE not only outperforms baseline models but also achieves higher accuracy and lower errors across different time intervals and datasets.展开更多
Pneumonia is a serious disease that can be fatal,particularly among children and the elderly.The accuracy of pneumonia diagnosis can be improved by combining artificial-intelligence technology with X-ray imaging.This ...Pneumonia is a serious disease that can be fatal,particularly among children and the elderly.The accuracy of pneumonia diagnosis can be improved by combining artificial-intelligence technology with X-ray imaging.This study proposes X-ODFCANet,which addresses the issues of low accuracy and excessive parameters in existing deep-learningbased pneumonia-classification methods.This network incorporates a feature coordination attention module and an omni-dimensional dynamic convolution(ODConv)module,leveraging the residual module for feature extraction from X-ray images.The feature coordination attention module utilizes two one-dimensional feature encoding processes to aggregate feature information from different spatial directions.Additionally,the ODConv module extracts and fuses feature information in four dimensions:the spatial dimension of the convolution kernel,input and output channel quantities,and convolution kernel quantity.The experimental results demonstrate that the proposed method can effectively improve the accuracy of pneumonia classification,which is 3.77%higher than that of ResNet18.The model parameters are 4.45M,which was reduced by approximately 2.5 times.The code is available at https://github.com/limuni/X ODFCA NET.展开更多
BACKGROUND The accurate classification of focal liver lesions(FLLs)is essential to properly guide treatment options and predict prognosis.Dynamic contrast-enhanced computed tomography(DCE-CT)is still the cornerstone i...BACKGROUND The accurate classification of focal liver lesions(FLLs)is essential to properly guide treatment options and predict prognosis.Dynamic contrast-enhanced computed tomography(DCE-CT)is still the cornerstone in the exact classification of FLLs due to its noninvasive nature,high scanning speed,and high-density resolution.Since their recent development,convolutional neural network-based deep learning techniques has been recognized to have high potential for image recognition tasks.AIM To develop and evaluate an automated multiphase convolutional dense network(MP-CDN)to classify FLLs on multiphase CT.METHODS A total of 517 FLLs scanned on a 320-detector CT scanner using a four-phase DCECT imaging protocol(including precontrast phase,arterial phase,portal venous phase,and delayed phase)from 2012 to 2017 were retrospectively enrolled.FLLs were classified into four categories:Category A,hepatocellular carcinoma(HCC);category B,liver metastases;category C,benign non-inflammatory FLLs including hemangiomas,focal nodular hyperplasias and adenomas;and category D,hepatic abscesses.Each category was split into a training set and test set in an approximate 8:2 ratio.An MP-CDN classifier with a sequential input of the fourphase CT images was developed to automatically classify FLLs.The classification performance of the model was evaluated on the test set;the accuracy and specificity were calculated from the confusion matrix,and the area under the receiver operating characteristic curve(AUC)was calculated from the SoftMax probability outputted from the last layer of the MP-CDN.RESULTS A total of 410 FLLs were used for training and 107 FLLs were used for testing.The mean classification accuracy of the test set was 81.3%(87/107).The accuracy/specificity of distinguishing each category from the others were 0.916/0.964,0.925/0.905,0.860/0.918,and 0.925/0.963 for HCC,metastases,benign non-inflammatory FLLs,and abscesses on the test set,respectively.The AUC(95%confidence interval)for differentiating each category from the others was 0.92(0.837-0.992),0.99(0.967-1.00),0.88(0.795-0.955)and 0.96(0.914-0.996)for HCC,metastases,benign non-inflammatory FLLs,and abscesses on the test set,respectively.CONCLUSION MP-CDN accurately classified FLLs detected on four-phase CT as HCC,metastases,benign non-inflammatory FLLs and hepatic abscesses and may assist radiologists in identifying the different types of FLLs.展开更多
Deep neural network-based relational extraction research has made significant progress in recent years,andit provides data support for many natural language processing downstream tasks such as building knowledgegraph,...Deep neural network-based relational extraction research has made significant progress in recent years,andit provides data support for many natural language processing downstream tasks such as building knowledgegraph,sentiment analysis and question-answering systems.However,previous studies ignored much unusedstructural information in sentences that could enhance the performance of the relation extraction task.Moreover,most existing dependency-based models utilize self-attention to distinguish the importance of context,whichhardly deals withmultiple-structure information.To efficiently leverage multiple structure information,this paperproposes a dynamic structure attention mechanism model based on textual structure information,which deeplyintegrates word embedding,named entity recognition labels,part of speech,dependency tree and dependency typeinto a graph convolutional network.Specifically,our model extracts text features of different structures from theinput sentence.Textual Structure information Graph Convolutional Networks employs the dynamic structureattention mechanism to learn multi-structure attention,effectively distinguishing important contextual features invarious structural information.In addition,multi-structure weights are carefully designed as amergingmechanismin the different structure attention to dynamically adjust the final attention.This paper combines these featuresand trains a graph convolutional network for relation extraction.We experiment on supervised relation extractiondatasets including SemEval 2010 Task 8,TACRED,TACREV,and Re-TACED,the result significantly outperformsthe previous.展开更多
Classic Graph Convolutional Networks (GCNs) often learn node representation holistically, which ignores the distinct impacts from different neighbors when aggregating their features to update a node’s representation....Classic Graph Convolutional Networks (GCNs) often learn node representation holistically, which ignores the distinct impacts from different neighbors when aggregating their features to update a node’s representation. Disentangled GCNs have been proposed to divide each node’s representation into several feature units. However, current disentangling methods do not try to figure out how many inherent factors the model should assign to help extract the best representation of each node. This paper then proposes D^(2)-GCN to provide dynamic disentanglement in GCNs and present the most appropriate factorization of each node’s mixed features. The convergence of the proposed method is proved both theoretically and experimentally. Experiments on real-world datasets show that D^(2)-GCN outperforms the baseline models concerning node classification results in both single- and multi-label tasks.展开更多
Impact dynamics of flexible solids is important in engineering practice. Obtaining its dynamic response is a challenging task and usually achieved by numerical methods. The objectives of the study are twofold. Firstly...Impact dynamics of flexible solids is important in engineering practice. Obtaining its dynamic response is a challenging task and usually achieved by numerical methods. The objectives of the study are twofold. Firstly, the discrete singular convolution (DSC) is used for the first time to analyze the impact dynamics. Secondly, the efficiency of various numerical methods for dynamic analysis is explored via an example of a flexible rod hit by a rigid ball. Three numerical methods, including the conventional finite element (FE) method, the DSC algorithm, and the spectral finite element (SFE) method, and one proposed modeling strategy, the improved spectral finite element (ISFE) method, are involved. Numerical results are compared with the known analytical solutions to show their efficiency. It is demonstrated that the proposed ISFE modeling strategy with a proper length of con- ventional FE yields the most accurate contact stress among the four investigated models. It is also found that the DSC algorithm is an alternative method for collision problems.展开更多
Venanico-Filho et al. developed an elegant matrix formulation for dynamic analysis by frequency domain (FD), but the convergence, causality and extended period need further refining. In the present paper, it was arg...Venanico-Filho et al. developed an elegant matrix formulation for dynamic analysis by frequency domain (FD), but the convergence, causality and extended period need further refining. In the present paper, it was argued that: (1) under reasonable assumptions (approximating the frequency response function by the discrete Fourier transform of the discretized unitary impulse response function), the matrix formulation by FD is equivalent to a circular convolution; (2) to avoid the wraparound interference, the excitation vector and impulse response must be padded with enough zeros; (3) provided that the zero padding requirement satisfied, the convergence and accuracy of direct time domain analysis, which is equivalent to that by FD, are guaranteed by the numerical integration scheme; (4) the imaginary part of the computational response approaching zero is due to the continuity of the impulse response functions.展开更多
Latent information is difficult to get from the text in speech synthesis.Studies show that features from speech can get more information to help text encoding.In the field of speech encoding,a lot of work has been con...Latent information is difficult to get from the text in speech synthesis.Studies show that features from speech can get more information to help text encoding.In the field of speech encoding,a lot of work has been conducted on two aspects.The first aspect is to encode speech frame by frame.The second aspect is to encode the whole speech to a vector.But the scale in these aspects is fixed.So,encoding speech with an adjustable scale for more latent information is worthy of investigation.But current alignment approaches only support frame-by-frame encoding and speech-to-vector encoding.It remains a challenge to propose a new alignment approach to support adjustable scale speech encoding.This paper presents the dynamic speech encoder with a new alignment approach in conjunction with frame-by-frame encoding and speech-to-vector encoding.The speech feature fromourmodel achieves three functions.First,the speech feature can reconstruct the origin speech while the length of the speech feature is equal to the text length.Second,our model can get text embedding fromspeech,and the encoded speech feature is similar to the text embedding result.Finally,it can transfer the style of synthesis speech and make it more similar to the given reference speech.展开更多
Low dynamic range(LDR)images captured by consumer cameras have a limited luminance range.As the conventional method for generating high dynamic range(HDR)images involves merging multiple-exposure LDR images of the sam...Low dynamic range(LDR)images captured by consumer cameras have a limited luminance range.As the conventional method for generating high dynamic range(HDR)images involves merging multiple-exposure LDR images of the same scene(assuming a stationary scene),we introduce a learning-based model for single-image HDR reconstruction.An input LDR image is sequentially segmented into the local region maps based on the cumulative histogram of the input brightness distribution.Using the local region maps,SParam-Net estimates the parameters of an inverse tone mapping function to generate a pseudo-HDR image.We process the segmented region maps as the input sequences on long short-term memory.Finally,a fast super-resolution convolutional neural network is used for HDR image reconstruction.The proposed method was trained and tested on datasets including HDR-Real,LDR-HDR-pair,and HDR-Eye.The experimental results revealed that HDR images can be generated more reliably than using contemporary end-to-end approaches.展开更多
Much attention has been given to the Internet of Things (IoT) by citizens, industries, governments, and universities for applications like smart buildings, environmental monitoring, health care and so on. With IoT, ne...Much attention has been given to the Internet of Things (IoT) by citizens, industries, governments, and universities for applications like smart buildings, environmental monitoring, health care and so on. With IoT, networkconnectivity is facilitated between smart devices from anyplace and anytime.IoT-based health monitoring systems are gaining popularity and acceptance forcontinuous monitoring and detect health abnormalities from the data collected.Electrocardiographic (ECG) signals are widely used for heart diseases detection.A novel method has been proposed in this work for ECG monitoring using IoTtechniques. In this work, a two-stage approach is employed. In the first stage, arouting protocol based on Dynamic Source Routing (DSR) and Routing byEnergy and Link quality (REL) for IoT healthcare platform is proposed for effi-cient data collection, and in the second stage, classification of ECG for Arrhythmia. Furthermore, this work has evaluated Support Vector Machine (SVM),Artificial Neural Network (ANN), and Convolution Neural Networks (CNNs)-based approach for ECG signals classification. Deep-ECG will use a deep CNNto extract critical features and then compare through evaluation of simple and fastdistance functions in order to obtain an efficient classification of heart abnormalities. For the identification of abnormal data, this work has proposed techniquesfor the classification of ECG data, which has been obtained from mobile watchusers. For experimental verification of the proposed methods, the Beth Israel Hospital (MIT/BIH) Arrhythmia and Massachusetts Institute of Technology (MIT)Database was used for evaluation. Results confirm the presented method’s superior performance with regards to the accuracy of classification. The CNN achievedan accuracy of 91.92% and has a higher accuracy of 4.98% for the SVM and2.68% for the ANN.展开更多
Accurately predicting the chiller coefficient of performance(COP)is essential for improving the energy efficiency of heating,ventilation,and air conditioning(HVAC)systems,significantly contributing to energy conservat...Accurately predicting the chiller coefficient of performance(COP)is essential for improving the energy efficiency of heating,ventilation,and air conditioning(HVAC)systems,significantly contributing to energy conservation in buildings.Traditional performance prediction methods often overlook the dynamic interaction among sensor variables and face challenges in using extensive historical data efficiently,which impedes accurate predictions.To overcome these challenges,this paper proposes an innovative on-site chiller performance prediction method employing a dynamic graph convolutional network(GCN)enhanced by association rules.The distinctive feature of this method is constructing an association graph bank containing static graphs in each operating mode by mining the association rules between various sensor variables in historical operating data.A real-time graph is created by analyzing the correlation between various sensor variables in the current operating data.This graph is fused online with the static graph in the current operating mode to obtain a dynamic graph used for feature extraction and training of GCN.The effectiveness of this method has been empirically confirmed through the operational data of an actual building chiller system.Comparative analysis with state-of-the-art methods highlights the superior performance of the proposed method.展开更多
车辆检测是智能交通系统和自动驾驶的重要组成部分。然而,实际交通场景中存在许多不确定因素,导致车辆检测模型的准确率低实时性差。为了解决这个问题,提出了一种快速准确的车辆检测算法——YOLOv8-DEL。使用DGCST(dynamic group convol...车辆检测是智能交通系统和自动驾驶的重要组成部分。然而,实际交通场景中存在许多不确定因素,导致车辆检测模型的准确率低实时性差。为了解决这个问题,提出了一种快速准确的车辆检测算法——YOLOv8-DEL。使用DGCST(dynamic group convolution shuffle transformer)模块代替C2f模块来重构主干网络,以增强特征提取能力并使网络更轻量;添加的P2检测层能使模型更敏锐地定位和检测小目标,同时采用Efficient RepGFPN进行多尺度特征融合,以丰富特征信息并提高模型的特征表达能力;通过结合GroupNorm和共享卷积的优点,设计了一种轻量型共享卷积检测头,在保持精度的前提下,有效减少参数量并提升检测速度。与YOLOv8相比,提出的YOLOv8-DEL在BDD100K数据集和KITTI数据集上,mAP@0.5分别提高了4.8个百分点和1.2个百分点,具有实时检测速度(208.6 FPS和216.4 FPS),在检测精度和速度方面实现了更有利的折中。展开更多
文摘The ability to accurately predict urban traffic flows is crucial for optimising city operations.Consequently,various methods for forecasting urban traffic have been developed,focusing on analysing historical data to understand complex mobility patterns.Deep learning techniques,such as graph neural networks(GNNs),are popular for their ability to capture spatio-temporal dependencies.However,these models often become overly complex due to the large number of hyper-parameters involved.In this study,we introduce Dynamic Multi-Graph Spatial-Temporal Graph Neural Ordinary Differential Equation Networks(DMST-GNODE),a framework based on ordinary differential equations(ODEs)that autonomously discovers effective spatial-temporal graph neural network(STGNN)architectures for traffic prediction tasks.The comparative analysis of DMST-GNODE and baseline models indicates that DMST-GNODE model demonstrates superior performance across multiple datasets,consistently achieving the lowest Root Mean Square Error(RMSE)and Mean Absolute Error(MAE)values,alongside the highest accuracy.On the BKK(Bangkok)dataset,it outperformed other models with an RMSE of 3.3165 and an accuracy of 0.9367 for a 20-min interval,maintaining this trend across 40 and 60 min.Similarly,on the PeMS08 dataset,DMST-GNODE achieved the best performance with an RMSE of 19.4863 and an accuracy of 0.9377 at 20 min,demonstrating its effectiveness over longer periods.The Los_Loop dataset results further emphasise this model’s advantage,with an RMSE of 3.3422 and an accuracy of 0.7643 at 20 min,consistently maintaining superiority across all time intervals.These numerical highlights indicate that DMST-GNODE not only outperforms baseline models but also achieves higher accuracy and lower errors across different time intervals and datasets.
基金supported in part by the Key Research and Development Program of Shaanxi Province of China,No.2024GX-YBXM-149in part by the National Natural Science Foundation of China,No.62071381.
文摘Pneumonia is a serious disease that can be fatal,particularly among children and the elderly.The accuracy of pneumonia diagnosis can be improved by combining artificial-intelligence technology with X-ray imaging.This study proposes X-ODFCANet,which addresses the issues of low accuracy and excessive parameters in existing deep-learningbased pneumonia-classification methods.This network incorporates a feature coordination attention module and an omni-dimensional dynamic convolution(ODConv)module,leveraging the residual module for feature extraction from X-ray images.The feature coordination attention module utilizes two one-dimensional feature encoding processes to aggregate feature information from different spatial directions.Additionally,the ODConv module extracts and fuses feature information in four dimensions:the spatial dimension of the convolution kernel,input and output channel quantities,and convolution kernel quantity.The experimental results demonstrate that the proposed method can effectively improve the accuracy of pneumonia classification,which is 3.77%higher than that of ResNet18.The model parameters are 4.45M,which was reduced by approximately 2.5 times.The code is available at https://github.com/limuni/X ODFCA NET.
基金Supported by National Natural Science Foundation of China,No.91959118Science and Technology Program of Guangzhou,China,No.201704020016+1 种基金SKY Radiology Department International Medical Research Foundation of China,No.Z-2014-07-1912-15Clinical Research Foundation of the 3rd Affiliated Hospital of Sun Yat-Sen University,No.YHJH201901.
文摘BACKGROUND The accurate classification of focal liver lesions(FLLs)is essential to properly guide treatment options and predict prognosis.Dynamic contrast-enhanced computed tomography(DCE-CT)is still the cornerstone in the exact classification of FLLs due to its noninvasive nature,high scanning speed,and high-density resolution.Since their recent development,convolutional neural network-based deep learning techniques has been recognized to have high potential for image recognition tasks.AIM To develop and evaluate an automated multiphase convolutional dense network(MP-CDN)to classify FLLs on multiphase CT.METHODS A total of 517 FLLs scanned on a 320-detector CT scanner using a four-phase DCECT imaging protocol(including precontrast phase,arterial phase,portal venous phase,and delayed phase)from 2012 to 2017 were retrospectively enrolled.FLLs were classified into four categories:Category A,hepatocellular carcinoma(HCC);category B,liver metastases;category C,benign non-inflammatory FLLs including hemangiomas,focal nodular hyperplasias and adenomas;and category D,hepatic abscesses.Each category was split into a training set and test set in an approximate 8:2 ratio.An MP-CDN classifier with a sequential input of the fourphase CT images was developed to automatically classify FLLs.The classification performance of the model was evaluated on the test set;the accuracy and specificity were calculated from the confusion matrix,and the area under the receiver operating characteristic curve(AUC)was calculated from the SoftMax probability outputted from the last layer of the MP-CDN.RESULTS A total of 410 FLLs were used for training and 107 FLLs were used for testing.The mean classification accuracy of the test set was 81.3%(87/107).The accuracy/specificity of distinguishing each category from the others were 0.916/0.964,0.925/0.905,0.860/0.918,and 0.925/0.963 for HCC,metastases,benign non-inflammatory FLLs,and abscesses on the test set,respectively.The AUC(95%confidence interval)for differentiating each category from the others was 0.92(0.837-0.992),0.99(0.967-1.00),0.88(0.795-0.955)and 0.96(0.914-0.996)for HCC,metastases,benign non-inflammatory FLLs,and abscesses on the test set,respectively.CONCLUSION MP-CDN accurately classified FLLs detected on four-phase CT as HCC,metastases,benign non-inflammatory FLLs and hepatic abscesses and may assist radiologists in identifying the different types of FLLs.
文摘Deep neural network-based relational extraction research has made significant progress in recent years,andit provides data support for many natural language processing downstream tasks such as building knowledgegraph,sentiment analysis and question-answering systems.However,previous studies ignored much unusedstructural information in sentences that could enhance the performance of the relation extraction task.Moreover,most existing dependency-based models utilize self-attention to distinguish the importance of context,whichhardly deals withmultiple-structure information.To efficiently leverage multiple structure information,this paperproposes a dynamic structure attention mechanism model based on textual structure information,which deeplyintegrates word embedding,named entity recognition labels,part of speech,dependency tree and dependency typeinto a graph convolutional network.Specifically,our model extracts text features of different structures from theinput sentence.Textual Structure information Graph Convolutional Networks employs the dynamic structureattention mechanism to learn multi-structure attention,effectively distinguishing important contextual features invarious structural information.In addition,multi-structure weights are carefully designed as amergingmechanismin the different structure attention to dynamically adjust the final attention.This paper combines these featuresand trains a graph convolutional network for relation extraction.We experiment on supervised relation extractiondatasets including SemEval 2010 Task 8,TACRED,TACREV,and Re-TACED,the result significantly outperformsthe previous.
基金supported by the National Natural Science Foundation of China(Grant Nos.62141214 and 62272171).
文摘Classic Graph Convolutional Networks (GCNs) often learn node representation holistically, which ignores the distinct impacts from different neighbors when aggregating their features to update a node’s representation. Disentangled GCNs have been proposed to divide each node’s representation into several feature units. However, current disentangling methods do not try to figure out how many inherent factors the model should assign to help extract the best representation of each node. This paper then proposes D^(2)-GCN to provide dynamic disentanglement in GCNs and present the most appropriate factorization of each node’s mixed features. The convergence of the proposed method is proved both theoretically and experimentally. Experiments on real-world datasets show that D^(2)-GCN outperforms the baseline models concerning node classification results in both single- and multi-label tasks.
基金Supported by the National Natural Science Foundation of China(50830201)the Priority Academic Program Development of Jiangsu Higher Education Institutions~~
文摘Impact dynamics of flexible solids is important in engineering practice. Obtaining its dynamic response is a challenging task and usually achieved by numerical methods. The objectives of the study are twofold. Firstly, the discrete singular convolution (DSC) is used for the first time to analyze the impact dynamics. Secondly, the efficiency of various numerical methods for dynamic analysis is explored via an example of a flexible rod hit by a rigid ball. Three numerical methods, including the conventional finite element (FE) method, the DSC algorithm, and the spectral finite element (SFE) method, and one proposed modeling strategy, the improved spectral finite element (ISFE) method, are involved. Numerical results are compared with the known analytical solutions to show their efficiency. It is demonstrated that the proposed ISFE modeling strategy with a proper length of con- ventional FE yields the most accurate contact stress among the four investigated models. It is also found that the DSC algorithm is an alternative method for collision problems.
文摘Venanico-Filho et al. developed an elegant matrix formulation for dynamic analysis by frequency domain (FD), but the convergence, causality and extended period need further refining. In the present paper, it was argued that: (1) under reasonable assumptions (approximating the frequency response function by the discrete Fourier transform of the discretized unitary impulse response function), the matrix formulation by FD is equivalent to a circular convolution; (2) to avoid the wraparound interference, the excitation vector and impulse response must be padded with enough zeros; (3) provided that the zero padding requirement satisfied, the convergence and accuracy of direct time domain analysis, which is equivalent to that by FD, are guaranteed by the numerical integration scheme; (4) the imaginary part of the computational response approaching zero is due to the continuity of the impulse response functions.
基金supported by National Key R&D Program of China (2020AAA0107901).
文摘Latent information is difficult to get from the text in speech synthesis.Studies show that features from speech can get more information to help text encoding.In the field of speech encoding,a lot of work has been conducted on two aspects.The first aspect is to encode speech frame by frame.The second aspect is to encode the whole speech to a vector.But the scale in these aspects is fixed.So,encoding speech with an adjustable scale for more latent information is worthy of investigation.But current alignment approaches only support frame-by-frame encoding and speech-to-vector encoding.It remains a challenge to propose a new alignment approach to support adjustable scale speech encoding.This paper presents the dynamic speech encoder with a new alignment approach in conjunction with frame-by-frame encoding and speech-to-vector encoding.The speech feature fromourmodel achieves three functions.First,the speech feature can reconstruct the origin speech while the length of the speech feature is equal to the text length.Second,our model can get text embedding fromspeech,and the encoded speech feature is similar to the text embedding result.Finally,it can transfer the style of synthesis speech and make it more similar to the given reference speech.
基金This study was supported by the Basic Science Research Program through the National Research Foundation of Korea(NRF)funded by the Ministry of Education(NRF-2018R1D1A1B07049932).
文摘Low dynamic range(LDR)images captured by consumer cameras have a limited luminance range.As the conventional method for generating high dynamic range(HDR)images involves merging multiple-exposure LDR images of the same scene(assuming a stationary scene),we introduce a learning-based model for single-image HDR reconstruction.An input LDR image is sequentially segmented into the local region maps based on the cumulative histogram of the input brightness distribution.Using the local region maps,SParam-Net estimates the parameters of an inverse tone mapping function to generate a pseudo-HDR image.We process the segmented region maps as the input sequences on long short-term memory.Finally,a fast super-resolution convolutional neural network is used for HDR image reconstruction.The proposed method was trained and tested on datasets including HDR-Real,LDR-HDR-pair,and HDR-Eye.The experimental results revealed that HDR images can be generated more reliably than using contemporary end-to-end approaches.
文摘Much attention has been given to the Internet of Things (IoT) by citizens, industries, governments, and universities for applications like smart buildings, environmental monitoring, health care and so on. With IoT, networkconnectivity is facilitated between smart devices from anyplace and anytime.IoT-based health monitoring systems are gaining popularity and acceptance forcontinuous monitoring and detect health abnormalities from the data collected.Electrocardiographic (ECG) signals are widely used for heart diseases detection.A novel method has been proposed in this work for ECG monitoring using IoTtechniques. In this work, a two-stage approach is employed. In the first stage, arouting protocol based on Dynamic Source Routing (DSR) and Routing byEnergy and Link quality (REL) for IoT healthcare platform is proposed for effi-cient data collection, and in the second stage, classification of ECG for Arrhythmia. Furthermore, this work has evaluated Support Vector Machine (SVM),Artificial Neural Network (ANN), and Convolution Neural Networks (CNNs)-based approach for ECG signals classification. Deep-ECG will use a deep CNNto extract critical features and then compare through evaluation of simple and fastdistance functions in order to obtain an efficient classification of heart abnormalities. For the identification of abnormal data, this work has proposed techniquesfor the classification of ECG data, which has been obtained from mobile watchusers. For experimental verification of the proposed methods, the Beth Israel Hospital (MIT/BIH) Arrhythmia and Massachusetts Institute of Technology (MIT)Database was used for evaluation. Results confirm the presented method’s superior performance with regards to the accuracy of classification. The CNN achievedan accuracy of 91.92% and has a higher accuracy of 4.98% for the SVM and2.68% for the ANN.
基金supported in part by the Science and Technology Innovation Program of Hunan Province(No.2022RC1090)in part by the National Natural Science Foundation of China(No.62173349)+2 种基金in part by the Natural Science Foundation of Hunan Province(No.2022J20076)in part by the Innovation Driven Projection of Central South University(No.2023CXQD073)in part by the Major Program of Xiangjiang Laboratory(No.22XJ01005).
文摘Accurately predicting the chiller coefficient of performance(COP)is essential for improving the energy efficiency of heating,ventilation,and air conditioning(HVAC)systems,significantly contributing to energy conservation in buildings.Traditional performance prediction methods often overlook the dynamic interaction among sensor variables and face challenges in using extensive historical data efficiently,which impedes accurate predictions.To overcome these challenges,this paper proposes an innovative on-site chiller performance prediction method employing a dynamic graph convolutional network(GCN)enhanced by association rules.The distinctive feature of this method is constructing an association graph bank containing static graphs in each operating mode by mining the association rules between various sensor variables in historical operating data.A real-time graph is created by analyzing the correlation between various sensor variables in the current operating data.This graph is fused online with the static graph in the current operating mode to obtain a dynamic graph used for feature extraction and training of GCN.The effectiveness of this method has been empirically confirmed through the operational data of an actual building chiller system.Comparative analysis with state-of-the-art methods highlights the superior performance of the proposed method.