In the article “A Lightweight Approach for Skin Lesion Detection through Optimal Features Fusion” by Khadija Manzoor, Fiaz Majeed, Ansar Siddique, Talha Meraj, Hafiz Tayyab Rauf, Mohammed A. El-Meligy, Mohamed Sharaf, and Abd Elatty E. Abd Elgawad (Computers, Materials & Continua, 2022, Vol. 70, No. 1, pp. 1617–1630; DOI: 10.32604/cmc.2022.018621; URL: https://www.techscience.com/cmc/v70n1/44361), there was an error regarding the affiliation of the author Hafiz Tayyab Rauf. Instead of “Centre for Smart Systems, AI and Cybersecurity, Staffordshire University, Stoke-on-Trent, UK”, the affiliation should be “Independent Researcher, Bradford, BD80HS, UK”.
Solar flare prediction is an important subject in the field of space weather. Deep learning technology has greatly promoted the development of this subject. In this study, we propose a novel solar flare forecasting model integrating a Deep Residual Network (ResNet) and a Support Vector Machine (SVM) for both ≥C-class (C, M, and X classes) and ≥M-class (M and X classes) flares. We collected samples of magnetograms from May 1, 2010 to September 13, 2018 from Space-weather Helioseismic and Magnetic Imager (HMI) Active Region Patches and then used a cross-validation method to obtain seven independent data sets. We then utilized five metrics to evaluate our fusion model, which is based on the intermediate output extracted by ResNet and an SVM using the Gaussian kernel function. Our results show that the primary metric, the true skill statistic (TSS), achieves a value of 0.708±0.027 for ≥C-class prediction and 0.758±0.042 for ≥M-class prediction; these values indicate that our approach performs significantly better than those of previous studies. The metrics of our fusion model’s performance on the seven data sets indicate that the model is quite stable and robust, suggesting that fusion models that integrate an excellent baseline network with SVM can achieve improved performance in solar flare prediction. We also discuss the performance impact of architectural innovation in our fusion model.
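The solar flare abstract above reports the true skill statistic (TSS) as its primary metric. As a minimal sketch of how that score is commonly computed from a binary confusion matrix (the function name and the example counts are illustrative assumptions, not taken from the study):

```python
def true_skill_statistic(tp, fn, fp, tn):
    """Return TSS = TP/(TP+FN) - FP/(FP+TN) for a binary forecast.

    tp, fn, fp, tn are confusion-matrix counts. The score ranges from
    -1 to 1 and does not depend on the ratio of flaring to non-flaring
    samples, which is why it is popular for imbalanced event forecasts.
    """
    if tp + fn == 0 or fp + tn == 0:
        raise ValueError("need at least one positive and one negative sample")
    recall = tp / (tp + fn)            # fraction of events that were predicted
    false_alarm_rate = fp / (fp + tn)  # fraction of non-events flagged as events
    return recall - false_alarm_rate


# Hypothetical counts, for illustration only.
score = true_skill_statistic(tp=80, fn=20, fp=10, tn=90)
```

A forecast no better than chance scores 0 under this definition, and a perfect forecast scores 1.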
Solar cell defect detection is crucial for quality inspection in photovoltaic power generation modules. In the production process, defect samples occur infrequently and exhibit random shapes and sizes, which makes it challenging to collect defective samples. Additionally, the complex surface background of polysilicon cell wafers complicates the accurate identification and localization of defective regions. This paper proposes a novel Lightweight Multiscale Feature Fusion network (LMFF) to address these challenges. The network comprises a feature extraction network, a multi-scale feature fusion module (MFF), and a segmentation network. Specifically, a feature extraction network is proposed to obtain multi-scale feature outputs, and the MFF module is used to fuse multi-scale feature information effectively. In order to capture finer-grained multi-scale information from the fusion features, we propose a multi-scale attention module (MSA) in the segmentation network to enhance the network’s ability for small target detection. Moreover, depthwise separable convolutions are introduced to construct depthwise separable residual blocks (DSR) to reduce the model’s parameter count. Finally, to validate the proposed method’s defect segmentation and localization performance, we constructed three solar cell defect detection datasets: SolarCells, SolarCells-S, and PVEL-S. SolarCells and SolarCells-S are monocrystalline silicon datasets, and PVEL-S is a polycrystalline silicon dataset. Experimental results show that the IoU of our method on these three datasets can reach 68.5%, 51.0%, and 92.7%, respectively, and the F1-Score can reach 81.3%, 67.5%, and 96.2%, respectively, which surpasses other commonly used methods and verifies the effectiveness of our LMFF network.
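The solar cell abstract above reports IoU and F1-Score for defect segmentation. The following is a minimal sketch of how both can be computed from flattened binary masks; the function name, the mask encoding, and the convention for two empty masks are assumptions made for illustration, and the paper's own evaluation code may differ.

```python
def iou_and_f1(pred, truth):
    """Compute IoU and F1 for two equal-length binary masks (0/1 values)."""
    if len(pred) != len(truth):
        raise ValueError("masks must have the same length")
    tp = sum(1 for p, t in zip(pred, truth) if p and t)      # defect in both
    fp = sum(1 for p, t in zip(pred, truth) if p and not t)  # predicted only
    fn = sum(1 for p, t in zip(pred, truth) if t and not p)  # missed defect
    union = tp + fp + fn
    if union == 0:
        # Assumed convention: two empty masks count as a perfect match.
        return 1.0, 1.0
    iou = tp / union
    f1 = 2 * tp / (2 * tp + fp + fn)
    return iou, f1


# Toy masks, for illustration only.
iou, f1 = iou_and_f1([1, 1, 0, 0], [1, 0, 1, 0])
```

When both scores are computed from the same counts, they satisfy F1 = 2·IoU / (1 + IoU). The reported pairs (68.5% and 81.3%, 51.0% and 67.5%, 92.7% and 96.2%) agree with this identity to within rounding.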
BACKGROUND: Ependymoma with lipomatous differentiation is a rare type of ependymoma. The ZFTA fusion-positive supratentorial ependymoma is a novel tumor type in the 2021 World Health Organization classification of central nervous system tumors. ZFTA fusion-positive lipomatous ependymoma has not been reported to date. CASE SUMMARY: We report the case of a 15-year-old Chinese male who had a sudden convulsion lasting approximately six minutes. Magnetic resonance imaging showed a round cystic shadow of approximately 1.9 cm × 1.5 cm × 1.9 cm under the right parieto-occipital cortex. Microscopic examination showed characteristic perivascular pseudorosettes and adipose differentiation in the cytoplasm. Immunohistochemical staining showed that the tumor cells were negative for cytokeratin, NeuN, Syn and p53, but positive for GFAP, vimentin and S-100 protein. Significant punctate intracytoplasmic EMA immunoreactivity was observed. The Ki-67 level was about 5%. Genetic analysis revealed ZFTA:RELA fusion. A craniotomy with total excision of the tumor was performed. During 36 months of follow-up, no evidence of disease recurrence was found on magnetic resonance imaging. CONCLUSION: Based on these findings, the patient was diagnosed with an ependymoma with ZFTA fusion and lipomatous differentiation. This case report provides information on the microscopic morphological features of ependymoma with ZFTA fusion and lipomatous differentiation, which can help pathologists to make a definitive diagnosis of this tumor.
BACKGROUND: The classification of uterine sarcomas is based on distinctive morphological and immunophenotypic characteristics, increasingly supported by molecular genetic diagnostics. Data on neurotrophic tyrosine receptor kinase (NTRK) gene fusion-positive uterine sarcoma, which is potentially aggressive and morphologically similar to fibrosarcoma, are limited due to its recent recognition. Pan-TRK immunohistochemistry (IHC) analysis serves as an effective screening tool with high sensitivity and specificity for NTRK-fusion malignancies. CASE SUMMARY: We report a case of a malignant mesenchymal tumor originating from the uterine cervix, which was pan-TRK IHC-positive but lacked NTRK gene fusions, accompanied by a brief literature review. A 55-year-old woman presented to the emergency department with abdominal pain and distension, exhibiting significant ascites and multiple solid pelvic masses. Pelvic examination revealed a tumor encompassing the uterine cervix and extending to the vagina and uterine corpus. A punch biopsy of the cervix indicated NTRK sarcoma with a positive immunohistochemical pan-TRK stain. However, subsequent next-generation sequencing revealed no NTRK gene fusion, leading to a diagnosis of poorly differentiated, advanced-stage sarcoma. CONCLUSION: The clinical significance of NTRK gene fusion lies in potential treatment with TRK inhibitors for positive sarcomas. Identifying such rare tumors is crucial due to the potential applicability of tropomyosin receptor kinase inhibitor treatment.
Successful polyethylene glycol fusion (PEG-fusion) of severed axons following peripheral nerve injuries has been reported to: (1) rapidly restore electrophysiological continuity; (2) prevent distal Wallerian degeneration and maintain myelin sheaths; (3) promote primarily motor, voluntary behavioral recoveries as assessed by the Sciatic Functional Index; and (4) rapidly produce correct and incorrect connections in many possible combinations that produce rapid and extensive recovery of functional peripheral nervous system/central nervous system connections and reflex (e.g., toe twitch) or voluntary behaviors. The preceding companion paper describes sensory terminal field reorganization following PEG-fusion repair of sciatic nerve transections or ablations; however, sensory behavioral recovery has not been explicitly explored following PEG-fusion repair. In the current study, we confirmed the success of PEG-fusion surgeries according to criteria (1)-(3) above and more extensively investigated whether PEG-fusion enhanced mechanical nociceptive recovery following sciatic transection in male and female outbred Sprague-Dawley and inbred Lewis rats. Mechanical nociceptive responses were assessed by measuring withdrawal thresholds using von Frey filaments on the dorsal and midplantar regions of the hindpaws. Dorsal von Frey filament tests were a more reliable method than plantar von Frey filament tests to assess mechanical nociceptive sensitivity following sciatic nerve transections. Baseline withdrawal thresholds of the sciatic-mediated lateral dorsal region differed significantly across strain but not sex. Withdrawal thresholds did not change significantly from baseline in chronic Unoperated and Sham-operated rats. Following sciatic transection, all rats exhibited severe hyposensitivity to stimuli at the lateral dorsal region of the hindpaw ipsilateral to the injury. However, PEG-fused rats exhibited a significantly earlier return to baseline withdrawal thresholds than Negative Control rats. Furthermore, PEG-fused rats with significantly improved Sciatic Functional Index scores at or after 4 weeks postoperatively exhibited yet-earlier von Frey filament recovery compared with those without Sciatic Functional Index recovery, suggesting a correlation between successful PEG-fusion and both motor-dominant and sensory-dominant behavioral recoveries. This correlation was independent of the sex or strain of the rat. Furthermore, our data showed that the acceleration of von Frey filament sensory recovery to baseline was solely due to the PEG-fused sciatic nerve and not saphenous nerve collateral outgrowths. No chronic hypersensitivity developed in any rat up to 12 weeks. All these data suggest that PEG-fusion repair of transection peripheral nerve injuries could have important clinical benefits.
Video classification is an important task in video understanding and plays a pivotal role in intelligent monitoring of information content. Most existing methods do not consider the multimodal nature of video, and the modality fusion approach tends to be too simple, often neglecting modality alignment before fusion. This research introduces a novel dual-stream multimodal alignment and fusion network named DMAFNet for classifying short videos. The network uses two unimodal encoder modules to extract features within modalities and exploits a multimodal encoder module to learn interactions between modalities. To solve the modality alignment problem, contrastive learning is introduced between the two unimodal encoder modules. Additionally, masked language modeling (MLM) and video text matching (VTM) auxiliary tasks are introduced to improve the interaction between video frames and text modalities through backpropagation of loss functions. Diverse experiments demonstrate the efficiency of DMAFNet in multimodal video classification tasks. Compared with two other mainstream baselines, DMAFNet achieves the best results on the 2022 WeChat Big Data Challenge dataset.
Multi-label image classification is a challenging task due to the diverse sizes and complex backgrounds of objects in images. Obtaining class-specific precise representations at different scales is a key aspect of feature representation. However, existing methods often rely on a single-scale deep feature, neglecting shallow and deeper layer features, which poses challenges when predicting objects of varying scales within the same image. Although some studies have explored multi-scale features, they rarely address the flow of information between scales or efficiently obtain class-specific precise representations for features at different scales. To address these issues, we propose a two-stage, three-branch Transformer-based framework. The first stage incorporates multi-scale image feature extraction and hierarchical scale attention. This design enables the model to consider objects at various scales while enhancing the flow of information across different feature scales, improving the model’s generalization to diverse object scales. The second stage includes a global feature enhancement module and a region selection module. The global feature enhancement module strengthens interconnections between different image regions, mitigating the issue of incomplete representations, while the region selection module models the cross-modal relationships between image features and labels. Together, these components enable the efficient acquisition of class-specific precise feature representations. Extensive experiments on public datasets, including COCO2014, VOC2007, and VOC2012, demonstrate the effectiveness of our proposed method. Our approach achieves consistent performance gains of 0.3%, 0.4%, and 0.2% over state-of-the-art methods on the three datasets, respectively. These results validate the reliability and superiority of our approach for multi-label image classification.
In this editorial, we comment on the article by Bokov et al published in a recent issue of World Journal of Orthopedics. We provide a general overview of oblique lumbar interbody fusion (OLIF) and lateral lumbar interbody fusion (LLIF), including their indications and complications, as increasingly popular minimally invasive techniques to address several lumbar pathologies. This editorial discusses and reviews the literature regarding factors affecting outcomes of indirect decompression achieved through OLIF and LLIF procedures. Several parameters play a critical role in patient outcomes, including restoration of disc height, foraminal height, central canal squared, and foraminal area. Indirect decompression allows for unbuckling of the ligamentum flavum, which can significantly decompress the neural elements as well as aid in reduction of spondylolisthesis. However, we further highlight the limitations of indirect decompression and factors that may predict unsuccessful outcomes, including bony foraminal stenosis, severe central canal stenosis, and osteoporosis. Failure of indirect decompression can lead to persistent pain, radiculopathy and unsatisfied patients. Spinal surgeons may then need to reimage patients and consider additional procedures with direct decompression.
With the rise of encrypted traffic, traditional network analysis methods have become less effective, leading to a shift towards deep learning-based approaches. Among these, multimodal learning-based classification methods have gained attention due to their ability to leverage diverse feature sets from encrypted traffic, improving classification accuracy. However, existing research predominantly relies on late fusion techniques, which hinder the full utilization of deep features within the data. To address this limitation, we propose a novel multimodal encrypted traffic classification model that synchronizes modality fusion with multiscale feature extraction. Specifically, our approach performs real-time fusion of modalities at each stage of feature extraction, enhancing feature representation at each level and preserving inter-level correlations for more effective learning. This continuous fusion strategy improves the model’s ability to detect subtle variations in encrypted traffic, while boosting its robustness and adaptability to evolving network conditions. Experimental results on two real-world encrypted traffic datasets demonstrate that our method achieves classification accuracies of 98.23% and 97.63%, outperforming existing multimodal learning-based methods.
Based on the Skyrme energy density functional and reaction Q-value, this study proposed an effective nucleus-nucleus potential for describing the capture barrier in heavy-ion fusion processes. The 443 extracted barrier heights were well reproduced with a root-mean-square (RMS) error of 1.53 MeV, and the RMS deviations with respect to 144 time-dependent Hartree-Fock capture barrier heights were only 1.05 MeV. Coupled with the Siwek-Wilczyński formula, wherein three parameters were determined by the proposed effective potentials, the measured capture cross sections at energies around the barriers were reasonably well reproduced for several fusion reactions induced by nearly spherical nuclei as well as by nuclei with large deformations, such as $^{154}$Sm and $^{238}$U. The shallow capture pockets and small values of the average barrier radii resulted in the reduction of the capture cross sections for $^{52,54}$Cr- and $^{64}$Ni-induced reactions, which were related to the synthesis of new super-heavy nuclei.
Current spatio-temporal action detection methods lack sufficient capabilities in extracting and comprehending spatio-temporal information. This paper introduces an end-to-end Adaptive Cross-Scale Fusion Encoder-Decoder (ACSF-ED) network to predict the action and locate the object efficiently. In the Adaptive Cross-Scale Fusion Spatio-Temporal Encoder (ACSF ST-Encoder), the Asymptotic Cross-scale Feature-fusion Module (ACCFM) is designed to address the issue of information degradation caused by the propagation of high-level semantic information, thereby extracting high-quality multi-scale features for subsequent spatio-temporal information modeling. Within the Shared-Head Decoder structure, a shared classification and regression detection head is constructed. A multi-constraint loss function composed of one-to-one, one-to-many, and contrastive denoising losses is designed to address the problem of insufficient constraint force in predicting results with traditional methods. This loss function enhances the accuracy of model classification predictions and improves the proximity of regression position predictions to ground truth objects. The proposed model is evaluated on the popular datasets UCF101-24 and JHMDB-21. Experimental results demonstrate that the proposed method achieves an accuracy of 81.52% on the Frame-mAP metric, surpassing existing methods.
This study proposes a learner profile framework based on multi-feature fusion, aiming to enhance the precision of personalized learning recommendations by integrating learners’ static attributes (e.g., demographic data and historical academic performance) with dynamic behavioral patterns (e.g., real-time interactions and evolving interests over time). The research employs Term Frequency-Inverse Document Frequency (TF-IDF) for semantic feature extraction, integrates the Analytic Hierarchy Process (AHP) for feature weighting, and introduces a time decay function inspired by Newton’s law of cooling to dynamically model changes in learners’ interests. Empirical results demonstrate that this framework effectively captures the dynamic evolution of learners’ behaviors and provides context-aware learning resource recommendations. The study introduces a novel paradigm for learner modeling in educational technology, combining methodological innovation with a scalable technical architecture, thereby laying a foundation for the development of adaptive learning systems.
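The learner profile abstract above mentions a time decay function inspired by Newton’s law of cooling. The abstract does not give its exact form; a common reading is exponential decay of an interest weight with elapsed time, sketched below. The function name, the unit of time, and the cooling rate value are hypothetical.

```python
import math


def decayed_interest(initial_weight, elapsed_days, cooling_rate=0.1):
    """Exponentially decay an interest weight over time.

    Newton's law of cooling gives w(t) = w0 * exp(-k * t), where k is the
    cooling rate. Here t is measured in days and k = 0.1 is an assumed
    default, not a value from the study.
    """
    if elapsed_days < 0:
        raise ValueError("elapsed_days must be non-negative")
    return initial_weight * math.exp(-cooling_rate * elapsed_days)


# With rate k, the weight halves every ln(2)/k days.
half_life_days = math.log(2) / 0.1
weight_after_half_life = decayed_interest(1.0, half_life_days, cooling_rate=0.1)
```

Under this form, a recent interaction keeps most of its weight while older interactions fade smoothly, so the profile tracks interests that change over time.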
In the age of information explosion and artificial intelligence, sentiment analysis tailored for the tobacco industry has emerged as a pivotal avenue for cigarette manufacturers to enhance their tobacco products. Existing solutions have primarily focused on intrinsic features within consumer reviews and achieved significant progress through deep feature extraction models. However, they still face two key limitations: (1) neglecting the influence of fundamental tobacco information on analyzing the sentiment inclination of consumer reviews, resulting in a lack of consistent sentiment assessment criteria across thousands of tobacco brands; (2) overlooking the syntactic dependencies between Chinese word phrases and the underlying impact of sentiment scores between word phrases on sentiment inclination determination. To tackle these challenges, we propose the External Knowledge-enhanced Cross-Attention Fusion model, CITSA. Specifically, in the Cross Infusion Layer, we fuse consumer comment information and tobacco fundamental information through interactive attention mechanisms. In the Textual Attention Enhancement Layer, we introduce an emotion-oriented syntactic dependency graph and incorporate sentiment-syntactic relationships into consumer comments through a graph convolution network module. Subsequently, the Textual Attention Layer is introduced to combine these two feature representations. Additionally, we compile a Chinese-oriented tobacco sentiment analysis dataset, comprising 55,096 consumer reviews and 2,074 tobacco fundamental information entries. Experimental results on our self-constructed datasets consistently demonstrate that our proposed model outperforms state-of-the-art methods in terms of accuracy, precision, recall, and F1-score.
With the rapid growth of social media, the spread of fake news has become a growing problem, misleading the public and causing significant harm. As social media content is often composed of both images and text, the use of multimodal approaches for fake news detection has gained significant attention. To solve the problems existing in previous multimodal fake news detection algorithms, such as insufficient feature extraction and insufficient use of semantic relations between modalities, this paper proposes the MFFFND-Co (Multimodal Feature Fusion Fake News Detection with Co-Attention Block) model. First, the model deeply explores the textual content, image content, and frequency domain features. Then, it employs a Co-Attention mechanism for cross-modal fusion. Additionally, a semantic consistency detection module is designed to quantify semantic deviations, thereby enhancing the performance of fake news detection. Experimentally verified on two commonly used datasets, Twitter and Weibo, the model achieved F1 scores of 90.0% and 94.0%, respectively, significantly outperforming the pre-modified MFFFND (Multimodal Feature Fusion Fake News Detection with Attention Block) model and surpassing other baseline models. This improves the accuracy of detecting fake information in artificial intelligence detection and engineering software detection.
Addressing the current challenges in transforming pixel displacement into physical displacement in visual monitoring technologies, as well as the inability to achieve precise full-field monitoring, this paper proposes a method for identifying the structural dynamic characteristics of wind turbines based on visual monitoring data fusion. Firstly, the Lucas-Kanade-Tomasi (LKT) optical flow method and a multi-region of interest (ROI) monitoring structure are employed to track pixel displacements, which are subsequently subjected to band-pass filtering and resampling operations. Secondly, the actual displacement time history is derived through double integration of the acquired acceleration data and subsequent band-pass filtering. The scale factor is obtained by applying the least squares method to compare the visual displacement with the displacement derived from double integration of the acceleration data. Based on this, the multi-point displacement time histories under physical coordinates are obtained using the vision data and the scale factor. Subsequently, when visual monitoring of displacements becomes impossible due to issues such as image blurring or lens occlusion, the structural vibration equation and boundary condition constraints, among other key parameters, are employed to predict the displacements at unknown monitoring points, thereby enabling full-field displacement monitoring and dynamic characteristic testing of the structure. Finally, a small-scale shaking table test was conducted on a simulated wind turbine structure undergoing shutdown to validate the dynamic characteristics identified by the proposed method. The research results indicate that the proposed method achieves a time-domain error within the submillimeter range and a frequency-domain accuracy of over 99%, effectively monitoring the full-field structural dynamic characteristics of wind turbines and providing a basis for the condition assessment of wind turbine structures.
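The wind turbine abstract above obtains a pixel-to-physical scale factor by least squares. A minimal sketch of one way to do this follows, assuming a purely proportional model (physical displacement ≈ scale × pixel displacement, with no offset term) and two already filtered, time-aligned series. The function name and sample values are hypothetical, and the paper's actual fit may include additional terms.

```python
def fit_scale_factor(pixel_disp, physical_disp):
    """Least-squares scale factor s minimizing sum((d_i - s * p_i) ** 2).

    pixel_disp:    displacements tracked in the image, in pixels
    physical_disp: displacements from double-integrated acceleration,
                   in physical units, sampled at the same instants
    The closed-form solution is s = sum(p_i * d_i) / sum(p_i ** 2).
    """
    if len(pixel_disp) != len(physical_disp):
        raise ValueError("series must have the same length")
    numerator = sum(p * d for p, d in zip(pixel_disp, physical_disp))
    denominator = sum(p * p for p in pixel_disp)
    if denominator == 0:
        raise ValueError("pixel displacements are all zero")
    return numerator / denominator


# Illustrative values only: 0.5 physical units per pixel.
scale = fit_scale_factor([1.0, 2.0, 3.0], [0.5, 1.0, 1.5])
physical_from_vision = [scale * p for p in [4.0, -2.0]]
```

Once the factor is known at a point instrumented with an accelerometer, the same multiplication converts the pixel displacements tracked in the other regions of interest, provided they lie at a comparable distance from the camera.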
The EHL-2 spherical torus (ST) is one of the key steps of p-$^{11}$B (proton-boron or hydrogen-boron) fusion energy research at ENN. The fusion-produced energy is carried mainly by alpha particles with an average energy of 3 MeV, which ideally can be converted to electricity with high efficiency (>80%). However, there are serious difficulties in realizing such conversion in a fusion device, due to the high energy density and high voltage required. To comprehensively describe the progress of the EHL-2 physics design, this work presents preliminary considerations of approaches for achieving energy conversion, highlighting critical issues for further investigation. Specifically, we provide an initial simulation of alpha particle extraction in the EHL-2 ST configuration as a starting point for p-$^{11}$B fusion energy conversion.
Detecting abnormal cervical cells is crucial for early identification and timely treatment of cervical cancer. However, this task is challenging due to the morphological similarities between abnormal and normal cells and the significant variations in cell size. Pathologists often refer to surrounding cells to identify abnormalities. To emulate this slide examination behavior, this study proposes a Multi-Scale Feature Fusion Network (MSFF-Net) for detecting cervical abnormal cells. MSFF-Net employs a Cross-Scale Pooling Model (CSPM) to effectively capture diverse features and contextual information, ranging from local details to the overall structure. Additionally, a Multi-Scale Fusion Attention (MSFA) module is introduced to mitigate the impact of cell size variations by adaptively fusing local and global information at different scales. To handle the complex environment of cervical cell images, such as cell adhesion and overlapping, the Inner-CIoU loss function is utilized to more precisely measure the overlap between bounding boxes, thereby improving detection accuracy in such scenarios. Experimental results on the Comparison detector dataset demonstrate that MSFF-Net achieves a mean average precision (mAP) of 63.2%, outperforming state-of-the-art methods while maintaining a relatively small number of parameters (26.8 M). This study highlights the effectiveness of multi-scale feature fusion in enhancing the detection of cervical abnormal cells, contributing to more accurate and efficient cervical cancer screening.
Thunderstorm wind gusts are small in scale, typically occurring within a range of a few kilometers. It is extremely challenging to monitor and forecast thunderstorm wind gusts using only automatic weather stations. Therefore, it is necessary to establish thunderstorm wind gust identification techniques based on multisource high-resolution observations. This paper introduces a new algorithm, called the thunderstorm wind gust identification network (TGNet). It leverages multimodal feature fusion to fuse the temporal and spatial features of thunderstorm wind gust events. The shapelet transform is first used to extract the temporal features of wind speeds from automatic weather stations, with the aim of distinguishing thunderstorm wind gusts from those caused by synoptic-scale systems or typhoons. Then, the encoder, structured upon the U-shaped network (U-Net) and incorporating recurrent residual convolutional blocks (R2U-Net), is employed to extract the corresponding spatial convective characteristics of satellite, radar, and lightning observations. Finally, by using the multimodal deep fusion module based on multi-head cross-attention, the temporal features of wind speed at each automatic weather station are incorporated into the spatial features to obtain a classification of thunderstorm wind gusts every 10 minutes. TGNet products have high accuracy, with a critical success index reaching 0.77. Compared with those of U-Net and R2U-Net, the false alarm rate of TGNet products decreases by 31.28% and 24.15%, respectively. The new algorithm provides grid products of thunderstorm wind gusts with a spatial resolution of 0.01°, updated every 10 minutes. The results are finer and more accurate, thereby helping to improve the accuracy of operational warnings for thunderstorm wind gusts.
Abstract: In the article “A Lightweight Approach for Skin Lesion Detection through Optimal Features Fusion” by Khadija Manzoor, Fiaz Majeed, Ansar Siddique, Talha Meraj, Hafiz Tayyab Rauf, Mohammed A. El-Meligy, Mohamed Sharaf, and Abd Elatty E. Abd Elgawad (Computers, Materials & Continua, 2022, Vol. 70, No. 1, pp. 1617–1630, DOI: 10.32604/cmc.2022.018621, URL: https://www.techscience.com/cmc/v70n1/44361), there was an error regarding the affiliation for the author Hafiz Tayyab Rauf. Instead of “Centre for Smart Systems, AI and Cybersecurity, Staffordshire University, Stoke-on-Trent, UK”, the affiliation should be “Independent Researcher, Bradford, BD80HS, UK”.
Funding: Supported by the National Key R&D Program of China (Grant No. 2022YFF0503700) and the National Natural Science Foundation of China (42074196, 41925018).
Abstract: Solar flare prediction is an important subject in the field of space weather. Deep learning technology has greatly promoted the development of this subject. In this study, we propose a novel solar flare forecasting model integrating Deep Residual Network (ResNet) and Support Vector Machine (SVM) for both ≥C-class (C, M, and X classes) and ≥M-class (M and X classes) flares. We collected samples of magnetograms from May 1, 2010 to September 13, 2018 from Space-weather Helioseismic and Magnetic Imager (HMI) Active Region Patches and then used a cross-validation method to obtain seven independent data sets. We then utilized five metrics to evaluate our fusion model, based on intermediate output extracted by ResNet and SVM using the Gaussian kernel function. Our results show that the primary metric, the true skill statistic (TSS), achieves a value of 0.708±0.027 for ≥C-class prediction and 0.758±0.042 for ≥M-class prediction; these values indicate that our approach performs significantly better than those of previous studies. The metrics of our fusion model's performance on the seven datasets indicate that the model is quite stable and robust, suggesting that fusion models that integrate an excellent baseline network with SVM can achieve improved performance in solar flare prediction. We also discuss the performance impact of architectural innovation in our fusion model.
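For readers unfamiliar with the primary metric, the sketch below computes the true skill statistic from a binary confusion matrix using its standard definition, TSS = TP/(TP+FN) − FP/(FP+TN). The function name and the counts are hypothetical and are not taken from the paper.

```python
def true_skill_statistic(tp, fn, fp, tn):
    """Return TSS = recall - false-positive rate for a binary forecast."""
    if tp + fn == 0 or fp + tn == 0:
        raise ValueError("both flare and no-flare samples must be present")
    recall = tp / (tp + fn)                 # fraction of flares that were predicted
    false_positive_rate = fp / (fp + tn)    # fraction of quiet cases flagged as flares
    return recall - false_positive_rate


# Hypothetical counts, chosen only to illustrate the arithmetic.
tss = true_skill_statistic(tp=80, fn=20, fp=10, tn=90)
print(round(tss, 3))
```

TSS ranges from -1 to 1 and does not depend on the ratio of flaring to non-flaring samples, which is one common reason for preferring it on imbalanced data.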
Funding: Supported in part by the National Natural Science Foundation of China under Grants 62463002, 62062021, and 62473033; in part by the Guiyang Scientific Plan Project [2023]48–11; and in part by the QKHZYD[2023]010 Guizhou Province Science and Technology Innovation Base Construction Project “Key Laboratory Construction of Intelligent Mountain Agricultural Equipment”.
Abstract: Solar cell defect detection is crucial for quality inspection in photovoltaic power generation modules. In the production process, defect samples occur infrequently and exhibit random shapes and sizes, which makes it challenging to collect defective samples. Additionally, the complex surface background of polysilicon cell wafers complicates the accurate identification and localization of defective regions. This paper proposes a novel Lightweight Multiscale Feature Fusion network (LMFF) to address these challenges. The network comprises a feature extraction network, a multi-scale feature fusion module (MFF), and a segmentation network. Specifically, the feature extraction network produces multi-scale feature outputs, and the MFF module fuses this multi-scale feature information effectively. To capture finer-grained multi-scale information from the fused features, we propose a multi-scale attention module (MSA) in the segmentation network to enhance the network's ability to detect small targets. Moreover, depthwise separable convolutions are used to construct depthwise separable residual blocks (DSR) that reduce the number of model parameters. Finally, to validate the defect segmentation and localization performance of the proposed method, we constructed three solar cell defect detection datasets: SolarCells, SolarCells-S, and PVEL-S. SolarCells and SolarCells-S are monocrystalline silicon datasets, and PVEL-S is a polycrystalline silicon dataset. Experimental results show that the IoU of our method on these three datasets reaches 68.5%, 51.0%, and 92.7%, respectively, and the F1-score reaches 81.3%, 67.5%, and 96.2%, respectively, which surpasses other commonly used methods and verifies the effectiveness of our LMFF network.
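The two reported segmentation metrics can be illustrated with a minimal sketch, assuming binary masks flattened to lists of 0/1 values. The function name and the toy masks are hypothetical; the paper's own evaluation code is not shown in the abstract.

```python
def iou_and_f1(pred, target):
    """Return (IoU, F1) for two equally sized binary masks given as flat sequences."""
    if len(pred) != len(target):
        raise ValueError("masks must have the same number of pixels")
    tp = sum(1 for p, t in zip(pred, target) if p and t)        # defect predicted and present
    fp = sum(1 for p, t in zip(pred, target) if p and not t)    # defect predicted, not present
    fn = sum(1 for p, t in zip(pred, target) if not p and t)    # defect missed
    union = tp + fp + fn
    if union == 0:
        return 1.0, 1.0   # both masks empty: treated here as perfect agreement
    return tp / union, 2 * tp / (2 * tp + fp + fn)


# Toy 5-pixel masks, for illustration only.
iou, f1 = iou_and_f1([1, 1, 0, 0, 1], [1, 0, 0, 1, 1])
print(iou, round(f1, 4))
```

For a single pair of masks the two metrics are tied by F1 = 2·IoU/(1 + IoU), so F1 is never lower than IoU.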
Abstract: BACKGROUND: Ependymoma with lipomatous differentiation is a rare type of ependymoma. The ZFTA fusion-positive supratentorial ependymoma is a novel tumor type in the 2021 World Health Organization classification of central nervous system tumors. ZFTA fusion-positive lipomatous ependymoma has not been reported to date. CASE SUMMARY: We report the case of a 15-year-old Chinese male who had a sudden convulsion lasting approximately six minutes. Magnetic resonance imaging showed a round cystic shadow of approximately 1.9 cm × 1.5 cm × 1.9 cm under the right parieto-occipital cortex. Microscopic examination showed characteristic perivascular pseudorosettes and adipose differentiation in the cytoplasm. Immunohistochemical staining showed that the tumor cells were negative for cytokeratin, NeuN, Syn, and p53, but positive for GFAP, vimentin, and S-100 protein. Significant punctate intracytoplasmic EMA immunoreactivity was observed. The Ki-67 level was about 5%. Genetic analysis revealed ZFTA:RELA fusion. A craniotomy with total excision of the tumor was performed. During 36 months of follow-up, no evidence of disease recurrence was found on magnetic resonance imaging. CONCLUSION: Based on these findings, the patient was diagnosed with an ependymoma with ZFTA fusion and lipomatous differentiation. This case report provides information on the microscopic morphological features of ependymoma with ZFTA fusion and lipomatous differentiation, which can help pathologists make a definitive diagnosis of this tumor.
Funding: Supported by a grant of the Korea Health Technology R&D Project through the Korea Health Industry Development Institute, funded by the Ministry of Health & Welfare, Republic of Korea, No. RS-2022-KH129889.
Abstract: BACKGROUND: The classification of uterine sarcomas is based on distinctive morphological and immunophenotypic characteristics, increasingly supported by molecular genetic diagnostics. Data on neurotrophic tyrosine receptor kinase (NTRK) gene fusion-positive uterine sarcoma, which is potentially aggressive and morphologically similar to fibrosarcoma, are limited because it was recognized only recently. Pan-TRK immunohistochemistry (IHC) analysis serves as an effective screening tool with high sensitivity and specificity for NTRK-fusion malignancies. CASE SUMMARY: We report a case of a malignant mesenchymal tumor originating from the uterine cervix, which was pan-TRK IHC-positive but lacked NTRK gene fusions, accompanied by a brief literature review. A 55-year-old woman presented to the emergency department with abdominal pain and distension, exhibiting significant ascites and multiple solid pelvic masses. Pelvic examination revealed a tumor encompassing the uterine cervix and extending to the vagina and uterine corpus. A punch biopsy of the cervix indicated NTRK sarcoma with a positive immunochemical pan-TRK stain. However, subsequent next-generation sequencing revealed no NTRK gene fusion, leading to a diagnosis of poorly differentiated, advanced-stage sarcoma. CONCLUSION: The clinical significance of NTRK gene fusion lies in the potential to treat fusion-positive sarcomas with tropomyosin receptor kinase (TRK) inhibitors, which makes identifying such rare tumors crucial.
Funding: Supported by DOD AFIRM III W81XWH-20-2-0029 subcontract; UT POC19-1774-13; Neuraptive Therapeutics Inc. 26-7724-56; NIH R01-NS128086 grants; and a Lone Star Paralysis gift (to GDB).
Abstract: Successful polyethylene glycol fusion (PEG-fusion) of severed axons following peripheral nerve injuries has been reported to: (1) rapidly restore electrophysiological continuity; (2) prevent distal Wallerian degeneration and maintain myelin sheaths; (3) promote primarily motor, voluntary behavioral recoveries as assessed by the Sciatic Functional Index; and (4) rapidly produce correct and incorrect connections in many possible combinations that produce rapid and extensive recovery of functional peripheral nervous system/central nervous system connections and reflex (e.g., toe twitch) or voluntary behaviors. The preceding companion paper describes sensory terminal field reorganization following PEG-fusion repair of sciatic nerve transections or ablations; however, sensory behavioral recovery has not been explicitly explored following PEG-fusion repair. In the current study, we confirmed the success of PEG-fusion surgeries according to criteria (1)-(3) above and more extensively investigated whether PEG-fusion enhanced mechanical nociceptive recovery following sciatic transection in male and female outbred Sprague-Dawley and inbred Lewis rats. Mechanical nociceptive responses were assessed by measuring withdrawal thresholds using von Frey filaments on the dorsal and midplantar regions of the hindpaws. Dorsal von Frey filament tests were a more reliable method than plantar von Frey filament tests for assessing mechanical nociceptive sensitivity following sciatic nerve transections. Baseline withdrawal thresholds of the sciatic-mediated lateral dorsal region differed significantly across strain but not sex. Withdrawal thresholds did not change significantly from baseline in chronic Unoperated and Sham-operated rats. Following sciatic transection, all rats exhibited severe hyposensitivity to stimuli at the lateral dorsal region of the hindpaw ipsilateral to the injury. However, PEG-fused rats exhibited a significantly earlier return to baseline withdrawal thresholds than Negative Control rats. Furthermore, PEG-fused rats with significantly improved Sciatic Functional Index scores at or after 4 weeks postoperatively exhibited yet-earlier von Frey filament recovery compared with those without Sciatic Functional Index recovery, suggesting a correlation between successful PEG-fusion and both motor-dominant and sensory-dominant behavioral recoveries. This correlation was independent of the sex or strain of the rat. Our data also showed that the acceleration of von Frey filament sensory recovery to baseline was solely due to the PEG-fused sciatic nerve and not saphenous nerve collateral outgrowths. No chronic hypersensitivity developed in any rat up to 12 weeks. All these data suggest that PEG-fusion repair of transection peripheral nerve injuries could have important clinical benefits.
Funding: Fundamental Research Funds for the Central Universities, China (No. 2232021A-10); National Natural Science Foundation of China (No. 61903078); Shanghai Sailing Program, China (No. 22YF1401300); Natural Science Foundation of Shanghai, China (No. 20ZR1400400).
Abstract: Video classification is an important task in video understanding and plays a pivotal role in intelligent monitoring of information content. Most existing methods do not consider the multimodal nature of video, and the modality fusion approach tends to be too simple, often neglecting modality alignment before fusion. This research introduces a novel dual-stream multimodal alignment and fusion network named DMAFNet for classifying short videos. The network uses two unimodal encoder modules to extract features within modalities and a multimodal encoder module to learn interactions between modalities. To solve the modality alignment problem, contrastive learning is introduced between the two unimodal encoder modules. Additionally, masked language modeling (MLM) and video text matching (VTM) auxiliary tasks are introduced to improve the interaction between video frames and text modalities through backpropagation of loss functions. Diverse experiments demonstrate the efficiency of DMAFNet in multimodal video classification tasks. Compared with two other mainstream baselines, DMAFNet achieves the best results on the 2022 WeChat Big Data Challenge dataset.
Funding: Supported by the National Natural Science Foundation of China (62302167, 62477013); the Natural Science Foundation of Shanghai (No. 24ZR1456100); the Science and Technology Commission of Shanghai Municipality (No. 24DZ2305900); and the Shanghai Municipal Special Fund for Promoting High-Quality Development of Industries (2211106).
Abstract: Multi-label image classification is a challenging task due to the diverse sizes and complex backgrounds of objects in images. Obtaining class-specific precise representations at different scales is a key aspect of feature representation. However, existing methods often rely on a single-scale deep feature, neglecting shallow and deeper layer features, which poses challenges when predicting objects of varying scales within the same image. Although some studies have explored multi-scale features, they rarely address the flow of information between scales or efficiently obtain class-specific precise representations for features at different scales. To address these issues, we propose a two-stage, three-branch Transformer-based framework. The first stage incorporates multi-scale image feature extraction and hierarchical scale attention. This design enables the model to consider objects at various scales while enhancing the flow of information across different feature scales, improving the model's generalization to diverse object scales. The second stage includes a global feature enhancement module and a region selection module. The global feature enhancement module strengthens interconnections between different image regions, mitigating the issue of incomplete representations, while the region selection module models the cross-modal relationships between image features and labels. Together, these components enable the efficient acquisition of class-specific precise feature representations. Extensive experiments on public datasets, including COCO2014, VOC2007, and VOC2012, demonstrate the effectiveness of our proposed method. Our approach achieves consistent performance gains of 0.3%, 0.4%, and 0.2% over state-of-the-art methods on the three datasets, respectively. These results validate the reliability and superiority of our approach for multi-label image classification.
Abstract: In this editorial, we comment on the article by Bokov et al published in a recent issue of World Journal of Orthopedics. We give a general overview of oblique lumbar interbody fusion (OLIF) and lateral lumbar interbody fusion (LLIF), an increasingly popular minimally invasive approach to several lumbar pathologies, together with their indications and complications. The editorial reviews the literature on factors affecting outcomes of indirect decompression achieved through OLIF and LLIF procedures. Several parameters play a critical role in patient outcomes, including restoration of disc height, foraminal height, central canal squared, and foraminal area. Indirect decompression allows unbuckling of the ligamentum flavum, which can significantly decompress the neural elements and aid in the reduction of spondylolisthesis. However, we also highlight the limitations of indirect decompression and the factors that may predict unsuccessful outcomes, including bony foraminal stenosis, severe central canal stenosis, and osteoporosis. Failure of indirect decompression can lead to persistent pain, radiculopathy, and unsatisfied patients. Spinal surgeons may then need to reimage patients and consider additional procedures with direct decompression.
Funding: Supported by the National Key Research and Development Program of China (No. 2023YFB2705000).
Abstract: With the rise of encrypted traffic, traditional network analysis methods have become less effective, leading to a shift towards deep learning-based approaches. Among these, multimodal learning-based classification methods have gained attention due to their ability to leverage diverse feature sets from encrypted traffic, improving classification accuracy. However, existing research predominantly relies on late fusion techniques, which hinder the full utilization of deep features within the data. To address this limitation, we propose a novel multimodal encrypted traffic classification model that synchronizes modality fusion with multiscale feature extraction. Specifically, our approach performs real-time fusion of modalities at each stage of feature extraction, enhancing feature representation at each level and preserving inter-level correlations for more effective learning. This continuous fusion strategy improves the model's ability to detect subtle variations in encrypted traffic, while boosting its robustness and adaptability to evolving network conditions. Experimental results on two real-world encrypted traffic datasets demonstrate that our method achieves classification accuracies of 98.23% and 97.63%, outperforming existing multimodal learning-based methods.
Funding: Supported by the National Natural Science Foundation of China (Nos. 12265006, 12375129, U1867212), the Innovation Project of Guangxi Graduate Education (No. YCSWYCSW2022176), and the Guangxi Natural Science Foundation (2017GXNSFGA198001).
Abstract: Based on the Skyrme energy density functional and reaction Q-value, this study proposed an effective nucleus-nucleus potential for describing the capture barrier in heavy-ion fusion processes. The 443 extracted barrier heights were well reproduced with a root-mean-square (RMS) error of 1.53 MeV, and the RMS deviations with respect to 144 time-dependent Hartree-Fock capture barrier heights were only 1.05 MeV. Coupled with the Siwek-Wilczyński formula, wherein three parameters were determined by the proposed effective potentials, the measured capture cross sections at energies around the barriers were reasonably well reproduced for several fusion reactions induced by nearly spherical nuclei as well as by nuclei with large deformations, such as ^(154)Sm and ^(238)U. The shallow capture pockets and small values of the average barrier radii resulted in the reduction of the capture cross sections for ^(52,54)Cr- and ^(64)Ni-induced reactions, which were related to the synthesis of new super-heavy nuclei.
Funding: This work was supported by the Key Lab of Intelligent and Green Flexographic Printing under Grant ZBKT202301.
Abstract: Current spatio-temporal action detection methods lack sufficient capabilities in extracting and comprehending spatio-temporal information. This paper introduces an end-to-end Adaptive Cross-Scale Fusion Encoder-Decoder (ACSF-ED) network to predict the action and locate the object efficiently. In the Adaptive Cross-Scale Fusion Spatio-Temporal Encoder (ACSF ST-Encoder), the Asymptotic Cross-scale Feature-fusion Module (ACCFM) is designed to address the information degradation caused by the propagation of high-level semantic information, thereby extracting high-quality multi-scale features for subsequent spatio-temporal information modeling. Within the Shared-Head Decoder structure, a shared classification and regression detection head is constructed. A multi-constraint loss function composed of one-to-one, one-to-many, and contrastive denoising losses is designed to address the insufficient constraint that traditional methods place on predicted results. This loss function enhances the accuracy of classification predictions and brings regressed positions closer to the ground-truth objects. The proposed model is evaluated on the popular UCF101-24 and JHMDB-21 datasets. Experimental results demonstrate that the proposed method achieves an accuracy of 81.52% on the Frame-mAP metric, surpassing existing methods.
Funding: This work is supported by the Ministry of Education of Humanities and Social Science projects in China (No. 20YJCZH124) and by Guangdong Province Education and Teaching Reform Project No. 640: Research on the Teaching Practice and Application of Online Peer Assessment Methods in the Context of Artificial Intelligence.
Abstract: This study proposes a learner profile framework based on multi-feature fusion, aiming to enhance the precision of personalized learning recommendations by integrating learners' static attributes (e.g., demographic data and historical academic performance) with dynamic behavioral patterns (e.g., real-time interactions and evolving interests over time). The research employs Term Frequency-Inverse Document Frequency (TF-IDF) for semantic feature extraction, integrates the Analytic Hierarchy Process (AHP) for feature weighting, and introduces a time decay function inspired by Newton's law of cooling to dynamically model changes in learners' interests. Empirical results demonstrate that this framework effectively captures the dynamic evolution of learners' behaviors and provides context-aware learning resource recommendations. The study introduces a novel paradigm for learner modeling in educational technology, combining methodological innovation with a scalable technical architecture, thereby laying a foundation for the development of adaptive learning systems.
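The abstract does not give the exact form of the time decay function. A minimal sketch, assuming the usual exponential form that follows from Newton's law of cooling, weights each past interaction by exp(-decay_rate × age). The function name, the decay rate, and the event data below are hypothetical.

```python
import math


def decayed_interest(events, now, decay_rate):
    """Sum event weights, each discounted by exp(-decay_rate * age)."""
    total = 0.0
    for timestamp, weight in events:
        age = now - timestamp
        if age < 0:
            raise ValueError("event timestamp lies in the future")
        total += weight * math.exp(-decay_rate * age)
    return total


# Hypothetical interactions with one topic: (day of interaction, initial weight).
events = [(0.0, 1.0), (9.0, 1.0)]
score = decayed_interest(events, now=10.0, decay_rate=0.1)
print(round(score, 4))
```

With this form an interaction loses half of its weight every ln(2)/decay_rate time units, so the decay rate directly sets how quickly old interests fade from the profile.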
Funding: Supported by the Global Research and Innovation Platform Fund for Scientific Big Data Transmission (Grant No. 241711KYSB20180002) and the National Key Research and Development Project of China (Grant No. 2019YFB1405801).
Abstract: In the age of information explosion and artificial intelligence, sentiment analysis tailored for the tobacco industry has emerged as a pivotal avenue for cigarette manufacturers to enhance their tobacco products. Existing solutions have primarily focused on intrinsic features within consumer reviews and achieved significant progress through deep feature extraction models. However, they still face two key limitations: (1) neglecting the influence of fundamental tobacco information on analyzing the sentiment inclination of consumer reviews, resulting in a lack of consistent sentiment assessment criteria across thousands of tobacco brands; (2) overlooking the syntactic dependencies between Chinese word phrases and the underlying impact of sentiment scores between word phrases on sentiment inclination determination. To tackle these challenges, we propose the External Knowledge-enhanced Cross-Attention Fusion model, CITSA. Specifically, in the Cross Infusion Layer, we fuse consumer comment information and tobacco fundamental information through interactive attention mechanisms. In the Textual Attention Enhancement Layer, we introduce an emotion-oriented syntactic dependency graph and incorporate sentiment-syntactic relationships into consumer comments through a graph convolution network module. Subsequently, the Textual Attention Layer is introduced to combine these two feature representations. Additionally, we compile a Chinese-oriented tobacco sentiment analysis dataset, comprising 55,096 consumer reviews and 2,074 tobacco fundamental information entries. Experimental results on our self-constructed datasets consistently demonstrate that our proposed model outperforms state-of-the-art methods in terms of accuracy, precision, recall, and F1-score.
Funding: Supported by Communication University of China (HG23035) and partly supported by the Fundamental Research Funds for the Central Universities (CUC230A013).
Abstract: With the rapid growth of social media, the spread of fake news has become a growing problem, misleading the public and causing significant harm. As social media content is often composed of both images and text, the use of multimodal approaches for fake news detection has gained significant attention. To solve the problems of previous multimodal fake news detection algorithms, such as insufficient feature extraction and insufficient use of semantic relations between modalities, this paper proposes the MFFFND-Co (Multimodal Feature Fusion Fake News Detection with Co-Attention Block) model. First, the model deeply explores the textual content, image content, and frequency domain features. Then, it employs a Co-Attention mechanism for cross-modal fusion. Additionally, a semantic consistency detection module is designed to quantify semantic deviations, thereby enhancing the performance of fake news detection. In experiments on two commonly used datasets, Twitter and Weibo, the model achieved F1 scores of 90.0% and 94.0%, respectively, significantly outperforming the pre-modified MFFFND (Multimodal Feature Fusion Fake News Detection with Attention Block) model and surpassing other baseline models. This improves the accuracy of detecting fake information in artificial intelligence detection and engineering software detection.
Funding: Supported by the National Science Foundation of China (Grant Nos. 52068049 and 51908266), the Science Fund for Distinguished Young Scholars of Gansu Province (No. 21JR7RA267), and the Hongliu Outstanding Young Talents Program of Lanzhou University of Technology.
Abstract: To address the current challenges in transforming pixel displacement into physical displacement in visual monitoring technologies, as well as the inability to achieve precise full-field monitoring, this paper proposes a method for identifying the structural dynamic characteristics of wind turbines based on visual monitoring data fusion. Firstly, the Lucas-Kanade Tomasi (LKT) optical flow method and a multi-region of interest (ROI) monitoring structure are employed to track pixel displacements, which are subsequently band-pass filtered and resampled. Secondly, the actual displacement time history is derived through double integration of the acquired acceleration data and subsequent band-pass filtering. The scale factor is obtained by applying the least squares method to compare the visual displacement with the displacement derived from double integration of the acceleration data. Based on this, the multi-point displacement time histories in physical coordinates are obtained from the vision data and the scale factor. Subsequently, when visual monitoring of displacements becomes impossible due to issues such as image blurring or lens occlusion, the structural vibration equation and boundary condition constraints, among other key parameters, are employed to predict the displacements at unknown monitoring points, thereby enabling full-field displacement monitoring and dynamic characteristic testing of the structure. Finally, a small-scale shaking table test was conducted on a simulated wind turbine structure undergoing shutdown to validate the proposed method. The results indicate that the proposed method achieves a time-domain error within the submillimeter range and a frequency-domain accuracy of over 99%, effectively monitoring the full-field structural dynamic characteristics of wind turbines and providing a basis for the condition assessment of wind turbine structures.
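The scale-factor step can be sketched as a one-parameter least-squares fit. This assumes that both signals are already filtered, zero-mean, and sampled at the same instants, and that the model is physical ≈ scale × pixel with no offset; the abstract does not state the exact formulation. The function name and the sample values are hypothetical.

```python
def least_squares_scale(pixel_disp, physical_disp):
    """Return the scale s minimizing sum((physical - s * pixel)^2)."""
    if len(pixel_disp) != len(physical_disp):
        raise ValueError("signals must be sampled at the same instants")
    numerator = sum(p * d for p, d in zip(pixel_disp, physical_disp))
    denominator = sum(p * p for p in pixel_disp)
    if denominator == 0:
        raise ValueError("pixel displacement is identically zero")
    return numerator / denominator


# Hypothetical samples: tracked pixel displacement and the displacement (in mm)
# obtained by double integration of acceleration at the same point.
pixels = [-2.0, -1.0, 0.0, 1.0, 2.0]
millimetres = [-1.0, -0.5, 0.0, 0.5, 1.0]

scale = least_squares_scale(pixels, millimetres)   # mm per pixel
physical = [scale * p for p in pixels]             # pixel track converted to mm
print(scale)
```

Once the scale is estimated at a point that carries an accelerometer, the same factor can convert the pixel tracks of other ROIs, provided they lie at a similar distance from the camera.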
Abstract: The EHL-2 spherical torus (ST) is one of the key steps in p-^(11)B (proton-boron or hydrogen-boron) fusion energy research at ENN. The energy produced by fusion is carried mainly by alpha particles with an average energy of 3 MeV, which ideally can be converted to electricity with high efficiency (>80%). However, there are serious difficulties in realizing such conversion in a fusion device, owing to the high energy density and high voltage required. To comprehensively describe the progress of the EHL-2 physics design, this work presents preliminary considerations of approaches for achieving energy conversion, highlighting critical issues for further investigation. Specifically, we provide an initial simulation of alpha particle extraction in the EHL-2 ST configuration as a starting point for p-^(11)B fusion energy conversion.
Funding: Funded by the China Chongqing Municipal Science and Technology Bureau, grant numbers 2024TIAD-CYKJCXX0121 and 2024NSCQ-LZX0135; the Chongqing Municipal Commission of Housing and Urban-Rural Development, grant number CKZ2024-87; the Chongqing University of Technology graduate education high-quality development project, grant number gzlsz202401; the Chongqing University of Technology-Chongqing LINGLUE Technology Co., Ltd. Electronic Information (Artificial Intelligence) graduate joint training base; the Postgraduate Education and Teaching Reform Research Project in Chongqing, grant number yjg213116; and the Chongqing University of Technology-CISDI Chongqing Information Technology Co., Ltd. Computer Technology graduate joint training base.
Abstract: Detecting abnormal cervical cells is crucial for early identification and timely treatment of cervical cancer. However, this task is challenging due to the morphological similarities between abnormal and normal cells and the significant variations in cell size. Pathologists often refer to surrounding cells to identify abnormalities. To emulate this slide examination behavior, this study proposes a Multi-Scale Feature Fusion Network (MSFF-Net) for detecting abnormal cervical cells. MSFF-Net employs a Cross-Scale Pooling Model (CSPM) to effectively capture diverse features and contextual information, ranging from local details to the overall structure. Additionally, a Multi-Scale Fusion Attention (MSFA) module is introduced to mitigate the impact of cell size variations by adaptively fusing local and global information at different scales. To handle the complex environment of cervical cell images, such as cell adhesion and overlapping, the Inner-CIoU loss function is utilized to more precisely measure the overlap between bounding boxes, thereby improving detection accuracy in such scenarios. Experimental results on the Comparison detector dataset demonstrate that MSFF-Net achieves a mean average precision (mAP) of 63.2%, outperforming state-of-the-art methods while maintaining a relatively small number of parameters (26.8 M). This study highlights the effectiveness of multi-scale feature fusion in enhancing the detection of abnormal cervical cells, contributing to more accurate and efficient cervical cancer screening.
Funding: Supported by the National Key Research and Development Program of China (Grant No. 2022YFC3004104); the National Natural Science Foundation of China (Grant No. U2342204); the Innovation and Development Program of the China Meteorological Administration (Grant No. CXFZ2024J001); the Open Research Project of the Key Open Laboratory of Hydrology and Meteorology of the China Meteorological Administration (Grant No. 23SWQXZ010); the Science and Technology Plan Project of Zhejiang Province (Grant No. 2022C03150); the Open Research Fund Project of Anyang National Climate Observatory (Grant No. AYNCOF202401); and the Open Bidding for Selecting the Best Candidates Program (Grant No. CMAJBGS202318).
Abstract: Thunderstorm wind gusts are small in scale, typically occurring within a range of a few kilometers. It is extremely challenging to monitor and forecast thunderstorm wind gusts using only automatic weather stations. Therefore, it is necessary to establish thunderstorm wind gust identification techniques based on multisource high-resolution observations. This paper introduces a new algorithm, called the thunderstorm wind gust identification network (TGNet). It leverages multimodal feature fusion to fuse the temporal and spatial features of thunderstorm wind gust events. The shapelet transform is first used to extract the temporal features of wind speeds from automatic weather stations, with the aim of distinguishing thunderstorm wind gusts from those caused by synoptic-scale systems or typhoons. Then, the encoder, structured upon the U-shaped network (U-Net) and incorporating recurrent residual convolutional blocks (R2U-Net), is employed to extract the corresponding spatial convective characteristics of satellite, radar, and lightning observations. Finally, by using the multimodal deep fusion module based on multi-head cross-attention, the temporal features of wind speed at each automatic weather station are incorporated into the spatial features to obtain a classification of thunderstorm wind gusts every 10 minutes. TGNet products have high accuracy, with a critical success index reaching 0.77. Compared with those of U-Net and R2U-Net, the false alarm rate of TGNet products decreases by 31.28% and 24.15%, respectively. The new algorithm provides grid products of thunderstorm wind gusts with a spatial resolution of 0.01°, updated every 10 minutes. The results are finer and more accurate, thereby helping to improve the accuracy of operational warnings for thunderstorm wind gusts.
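The two verification scores quoted above can be sketched from a contingency table of hits, misses, and false alarms. The abstract does not define its false alarm rate; the sketch assumes the false alarm ratio common in forecast verification, false alarms / (hits + false alarms). The function name and the counts are hypothetical.

```python
def csi_and_far(hits, misses, false_alarms):
    """Return (critical success index, false alarm ratio) from event counts."""
    forecast_or_observed = hits + misses + false_alarms
    forecast_yes = hits + false_alarms
    if forecast_or_observed == 0 or forecast_yes == 0:
        raise ValueError("scores are undefined without any forecast events")
    csi = hits / forecast_or_observed     # correct negatives are ignored
    far = false_alarms / forecast_yes     # share of warnings that did not verify
    return csi, far


# Hypothetical counts, chosen only to illustrate the arithmetic.
csi, far = csi_and_far(hits=60, misses=20, false_alarms=20)
print(csi, far)
```

Because CSI ignores correct negatives, it suits rare events such as thunderstorm gusts, where quiet grid cells would otherwise dominate an accuracy score.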