期刊文献+
共找到35,251篇文章
< 1 2 250 >
每页显示 20 50 100
A Hybrid Approach for Pavement Crack Detection Using Mask R-CNN and Vision Transformer Model 被引量:1
1
作者 Shorouq Alshawabkeh Li Wu +2 位作者 Daojun Dong Yao Cheng Liping Li 《Computers, Materials & Continua》 SCIE EI 2025年第1期561-577,共17页
Detecting pavement cracks is critical for road safety and infrastructure management.Traditional methods,relying on manual inspection and basic image processing,are time-consuming and prone to errors.Recent deep-learni... Detecting pavement cracks is critical for road safety and infrastructure management.Traditional methods,relying on manual inspection and basic image processing,are time-consuming and prone to errors.Recent deep-learning(DL)methods automate crack detection,but many still struggle with variable crack patterns and environmental conditions.This study aims to address these limitations by introducing the Masker Transformer,a novel hybrid deep learning model that integrates the precise localization capabilities of Mask Region-based Convolutional Neural Network(Mask R-CNN)with the global contextual awareness of Vision Transformer(ViT).The research focuses on leveraging the strengths of both architectures to enhance segmentation accuracy and adaptability across different pavement conditions.We evaluated the performance of theMaskerTransformer against other state-of-theartmodels such asU-Net,TransformerU-Net(TransUNet),U-NetTransformer(UNETr),SwinU-NetTransformer(Swin-UNETr),You Only Look Once version 8(YoloV8),and Mask R-CNN using two benchmark datasets:Crack500 and DeepCrack.The findings reveal that the MaskerTransformer significantly outperforms the existing models,achieving the highest Dice SimilarityCoefficient(DSC),precision,recall,and F1-Score across both datasets.Specifically,the model attained a DSC of 80.04%on Crack500 and 91.37%on DeepCrack,demonstrating superior segmentation accuracy and reliability.The high precision and recall rates further substantiate its effectiveness in real-world applications,suggesting that the Masker Transformer can serve as a robust tool for automated pavement crack detection,potentially replacing more traditional methods. 展开更多
关键词 Pavement crack segmentation TRANSPORTATION deep learning vision transformer Mask R-CNN image segmentation
在线阅读 下载PDF
Adaptive optoelectronic transistor for intelligent vision system 被引量:1
2
作者 Yiru Wang Shanshuo Liu +5 位作者 Hongxin Zhang Yuchen Cao Zitong Mu Mingdong Yi Linghai Xie Haifeng Ling 《Journal of Semiconductors》 2025年第2期53-70,共18页
Recently,for developing neuromorphic visual systems,adaptive optoelectronic devices become one of the main research directions and attract extensive focus to achieve optoelectronic transistors with high performances a... Recently,for developing neuromorphic visual systems,adaptive optoelectronic devices become one of the main research directions and attract extensive focus to achieve optoelectronic transistors with high performances and flexible func-tionalities.In this review,based on a description of the biological adaptive functions that are favorable for dynamically perceiv-ing,filtering,and processing information in the varying environment,we summarize the representative strategies for achiev-ing these adaptabilities in optoelectronic transistors,including the adaptation for detecting information,adaptive synaptic weight change,and history-dependent plasticity.Moreover,the key points of the corresponding strategies are comprehen-sively discussed.And the applications of these adaptive optoelectronic transistors,including the adaptive color detection,sig-nal filtering,extending the response range of light intensity,and improve learning efficiency,are also illustrated separately.Lastly,the challenges faced in developing adaptive optoelectronic transistor for artificial vision system are discussed.The descrip-tion of biological adaptive functions and the corresponding inspired neuromorphic devices are expected to provide insights for the design and application of next-generation artificial visual systems. 展开更多
关键词 adaptive optoelectronic transistor neuromorphic computing artificial vision
在线阅读 下载PDF
Steel Surface Defect Detection Using Learnable Memory Vision Transformer
3
作者 Syed Tasnimul Karim Ayon Farhan Md.Siraj Jia Uddin 《Computers, Materials & Continua》 SCIE EI 2025年第1期499-520,共22页
This study investigates the application of Learnable Memory Vision Transformers(LMViT)for detecting metal surface flaws,comparing their performance with traditional CNNs,specifically ResNet18 and ResNet50,as well as o... This study investigates the application of Learnable Memory Vision Transformers(LMViT)for detecting metal surface flaws,comparing their performance with traditional CNNs,specifically ResNet18 and ResNet50,as well as other transformer-based models including Token to Token ViT,ViT withoutmemory,and Parallel ViT.Leveraging awidely-used steel surface defect dataset,the research applies data augmentation and t-distributed stochastic neighbor embedding(t-SNE)to enhance feature extraction and understanding.These techniques mitigated overfitting,stabilized training,and improved generalization capabilities.The LMViT model achieved a test accuracy of 97.22%,significantly outperforming ResNet18(88.89%)and ResNet50(88.90%),aswell as the Token to TokenViT(88.46%),ViT without memory(87.18),and Parallel ViT(91.03%).Furthermore,LMViT exhibited superior training and validation performance,attaining a validation accuracy of 98.2%compared to 91.0%for ResNet 18,96.0%for ResNet50,and 89.12%,87.51%,and 91.21%for Token to Token ViT,ViT without memory,and Parallel ViT,respectively.The findings highlight the LMViT’s ability to capture long-range dependencies in images,an areawhere CNNs struggle due to their reliance on local receptive fields and hierarchical feature extraction.The additional transformer-based models also demonstrate improved performance in capturing complex features over CNNs,with LMViT excelling particularly at detecting subtle and complex defects,which is critical for maintaining product quality and operational efficiency in industrial applications.For instance,the LMViT model successfully identified fine scratches and minor surface irregularities that CNNs often misclassify.This study not only demonstrates LMViT’s potential for real-world defect detection but also underscores the promise of other transformer-based architectures like Token to Token ViT,ViT without memory,and Parallel ViT in industrial scenarios where complex spatial relationships are key.Future research may focus on enhancing LMViT’s computational efficiency for deployment in real-time quality control systems. 展开更多
关键词 Learnable Memory vision Transformer(LMViT) Convolutional Neural Networks(CNN) metal surface defect detection deep learning computer vision image classification learnable memory gradient clipping label smoothing t-SNE visualization
在线阅读 下载PDF
Artificial self-powered and self-healable neuromorphic vision skin utilizing silver nanoparticle-doped ionogel photosynaptic heterostructure
4
作者 Xinkai Qian Fa Zhang +7 位作者 Xiujuan Li Junyue Li Hongchao Sun Qiye Wang Chaoran Huang Zhenyu Zhang Zhe Zhou Juqing Liu 《Journal of Semiconductors》 2025年第1期205-213,共9页
Artificial skin should embody a softly functional film that is capable of self-powering,healing and sensing with neuromorphic processing.However,the pursuit of a bionic skin that combines high flexibility,self-healabi... Artificial skin should embody a softly functional film that is capable of self-powering,healing and sensing with neuromorphic processing.However,the pursuit of a bionic skin that combines high flexibility,self-healability,and zero-powered photosynaptic functionality remains elusive.In this study,we report a self-powered and self-healable neuromorphic vision skin,featuring silver nanoparticle-doped ionogel heterostructure as photoacceptor.The localized surface plasmon resonance induced by light in the nanoparticles triggers temperature fluctuations within the heterojunction,facilitating ion migration for visual sensing with synaptic behaviors.The abundant reversible hydrogen bonds in the ionogel endow the skin with remarkable mechanical flexibility and self-healing properties.We assembled a neuromorphic visual skin equipped with a 5×5 photosynapse array,capable of sensing and memorizing diverse light patterns. 展开更多
关键词 neuromorphic vision skin ionogel heterojuction LSPR photosynapse
在线阅读 下载PDF
The Application of Machine Vision in Defect Detection Systems
5
作者 Peihang Zhong Jiawei Lin Muling Wang 《Journal of Electronic Research and Application》 2025年第2期191-196,共6页
With the rapid development of computer vision technology,artificial intelligence algorithms,and high-performance computing platforms,machine vision technology has gradually shown its great potential in automated produ... With the rapid development of computer vision technology,artificial intelligence algorithms,and high-performance computing platforms,machine vision technology has gradually shown its great potential in automated production lines,especially in defect detection.Machine vision technology can be applied in many industries such as semiconductor,automobile manufacturing,aerospace,food,and drugs,which can significantly improve detection efficiency and accuracy,reduce labor costs,improve product quality,enhance market competitiveness,and provide strong support for the arrival of Industry 4.0 era.In this article,the concept,advantages,and disadvantages of machine vision and the algorithm framework of machine vision in the defect detection system are briefly described,aiming to promote the rapid development of industry and strengthen China’s industry. 展开更多
关键词 Machine vision Defect detection system Image preprocessing
在线阅读 下载PDF
Vision Transformer for Extracting Tropical Cyclone Intensity from Satellite Images
6
作者 Ye TIAN Wen ZHOU +1 位作者 Paxson KYCHEUNG Zhenchen LIU 《Advances in Atmospheric Sciences》 2025年第1期79-93,共15页
Tropical cyclone(TC)intensity estimation is a fundamental aspect of TC monitoring and forecasting.Deep learning models have recently been employed to estimate TC intensity from satellite images and yield precise resul... Tropical cyclone(TC)intensity estimation is a fundamental aspect of TC monitoring and forecasting.Deep learning models have recently been employed to estimate TC intensity from satellite images and yield precise results.This work proposes the ViT-TC model based on the Vision Transformer(ViT)architecture.Satellite images of TCs,including infrared(IR),water vapor(WV),and passive microwave(PMW),are used as inputs for intensity estimation.Experiments indicate that combining IR,WV,and PMW as inputs yields more accurate estimations than other channel combinations.The ensemble mean technique is applied to enhance the model's estimations,reducing the root-mean-square error to 9.32 kt(knots,1 kt≈0.51 m s^(-1))and the mean absolute error to 6.49 kt,which outperforms traditional methods and is comparable to existing deep learning models.The model assigns high attention weights to areas with high PMW,indicating that PMW magnitude is essential information for the model's estimation.The model also allocates significance to the cloud-cover region,suggesting that the model utilizes the whole TC cloud structure and TC eye to determine TC intensity. 展开更多
关键词 vision Transformer tropical cyclones intensity estimation deep learning
在线阅读 下载PDF
达索系统携手Apple Vision Pro开启产品设计与制造新维度
7
作者 《设计》 2025年第6期150-150,共1页
虚拟孪生领域的全球领导者达索系统近日宣布,其基于3DEXPERIENCE平台的3D UNIV+RSES将利用空间计算推出适用于visionOS系统的全新应用“3DLive”,开启虚拟孪生的新维度。该应用预计将于今年夏季发布。为了实现这一愿景,达索系统与苹果... 虚拟孪生领域的全球领导者达索系统近日宣布,其基于3DEXPERIENCE平台的3D UNIV+RSES将利用空间计算推出适用于visionOS系统的全新应用“3DLive”,开启虚拟孪生的新维度。该应用预计将于今年夏季发布。为了实现这一愿景,达索系统与苹果公司合作,将Apple Vision Pro集成到新一代3DEXPERIENCE平台中。双方通过工程设计层面的深入合作,融合两大平台的优势,共同打造这一革命性体验。借助3DLive,在达索系统的3DEXPERIENCE平台上创建的虚拟孪生将能够跃出屏幕,融入用户的物理空间,在逼真的环境中实现实时可视化和团队协作。Apple Vision Pro先进的摄像头、传感器和追踪技术,还能让虚拟孪生在3D UNIV+RSES中与周围的物理环境进行科学级精准的交互。 展开更多
关键词 Apple vision Pro 达索系统
在线阅读 下载PDF
Binocular vision disorders and refractive errors on university students’quality of life
8
作者 Raheleh Moravej Alireza Jamali +6 位作者 Navidreza Zamani Fatemeh Azad Shahraki Abbas Ali Yekta Hadi Ostadimoghaddam Nasim Vaghefi Hamidreza Ghadimi Mehdi Khabazkhoob 《International Journal of Ophthalmology(English edition)》 2025年第4期707-715,共9页
AIM:To evaluate the effects of refractive errors and binocular vision anomalies on the quality of life(QOL)of university students.METHODS:This cross-sectional analytical study was conducted on university students usin... AIM:To evaluate the effects of refractive errors and binocular vision anomalies on the quality of life(QOL)of university students.METHODS:This cross-sectional analytical study was conducted on university students using simple random sampling.Objective refraction,ocular alignment,vergence and accommodative performance were measured and assessed in all participants.Data on QOL were collected using the College of Optometrists in Vision Development-Quality of Life(COVD-QOL)Questionnaire.The effect of mentioned parameters on the QOL were evaluated.RESULTS:Totally 726 students with mean age of 21.35±1.88y were evaluated in this study,51.5%of whom were female.Esophoria was caused significantly lower QOL in the domains of somatic symptoms and occupationalphysical symptoms(P<0.05);Besides,esotropia decreased QOL in domains of somatic symptoms P=0.002 and psychological factors(P=0.023).Students with accommodation insufficiency experienced more symptoms in all domains(P<0.05)except for psychological factors(P=0.07).Increasing in the near point of convergence and accommodation and decreases QOL and increasing accommodative facility increases QOL(all P<0.05).Myopia and astigmatism cause decrease in QOL(both P<0.05),but hyperopic students had better QOL in comparison with others(P<0.05).CONCLUSION:Screening programs and treatment of refractive errors and binocular vision anomalies,especially phoria and accommodative insufficiency,positively impact the QOL and academic achievements of university students. 展开更多
关键词 quality of life binocular vision disorders refractive errors ACCOMMODATION CONVERGENCE university students
在线阅读 下载PDF
Multi-Scale Vision Transformer with Dynamic Multi-Loss Function for Medical Image Retrieval and Classification
9
作者 Omar Alqahtani Mohamed Ghouse +2 位作者 Asfia Sabahath Omer Bin Hussain Arshiya Begum 《Computers, Materials & Continua》 2025年第5期2221-2244,共24页
This paper introduces a novel method for medical image retrieval and classification by integrating a multi-scale encoding mechanism with Vision Transformer(ViT)architectures and a dynamic multi-loss function.The multi... This paper introduces a novel method for medical image retrieval and classification by integrating a multi-scale encoding mechanism with Vision Transformer(ViT)architectures and a dynamic multi-loss function.The multi-scale encoding significantly enhances the model’s ability to capture both fine-grained and global features,while the dynamic loss function adapts during training to optimize classification accuracy and retrieval performance.Our approach was evaluated on the ISIC-2018 and ChestX-ray14 datasets,yielding notable improvements.Specifically,on the ISIC-2018 dataset,our method achieves an F1-Score improvement of+4.84% compared to the standard ViT,with a precision increase of+5.46% for melanoma(MEL).On the ChestX-ray14 dataset,the method delivers an F1-Score improvement of 5.3%over the conventional ViT,with precision gains of+5.0% for pneumonia(PNEU)and+5.4%for fibrosis(FIB).Experimental results demonstrate that our approach outperforms traditional CNN-based models and existing ViT variants,particularly in retrieving relevant medical cases and enhancing diagnostic accuracy.These findings highlight the potential of the proposedmethod for large-scalemedical image analysis,offering improved tools for clinical decision-making through superior classification and case comparison. 展开更多
关键词 Medical image retrieval vision transformer multi-scale encoding multi-loss function ISIC-2018 ChestX-ray14
在线阅读 下载PDF
基于增强式Vision Transformer的大坝表面裂缝分割研究
10
作者 马海明 《水利科技与经济》 2025年第4期104-107,共4页
大坝出现裂缝会导致坝渗漏,影响水库发电等效益。通过对大坝表面裂缝进行精确分割,研究开发一种基于增强式Vision Transformer的方法,结合Vision Transformer的全局信息处理能力和多层卷积块以及卷积注意力模块(CBAM),旨在提升对裂缝特... 大坝出现裂缝会导致坝渗漏,影响水库发电等效益。通过对大坝表面裂缝进行精确分割,研究开发一种基于增强式Vision Transformer的方法,结合Vision Transformer的全局信息处理能力和多层卷积块以及卷积注意力模块(CBAM),旨在提升对裂缝特征的敏感性和辨识度。结果表明,该模型展示出在分割大坝表面裂缝方面的优越性能,验证了其在大坝安全监控系统中的应用潜力。 展开更多
关键词 大坝表面裂缝分割 深度学习 vision Transformer 卷积注意力模块
在线阅读 下载PDF
基于改进Vision Transformer的局部光照一致性估计
11
作者 王杨 宋世佳 +3 位作者 王鹤琴 袁振羽 赵立军 吴其林 《计算机工程》 北大核心 2025年第2期312-321,共10页
光照一致性是增强现实(AR)系统中实现虚实有机融合的关键因素之一。由于拍摄视角的局限性和场景光照的复杂性,开发者在估计全景照明信息时通常忽略局部光照一致性,从而影响最终的渲染效果。为解决这一问题,提出一种基于改进视觉Transfor... 光照一致性是增强现实(AR)系统中实现虚实有机融合的关键因素之一。由于拍摄视角的局限性和场景光照的复杂性,开发者在估计全景照明信息时通常忽略局部光照一致性,从而影响最终的渲染效果。为解决这一问题,提出一种基于改进视觉Transformer(ViT)结构的局部光照一致性估计框架(ViTLight)。首先利用ViT编码器提取特征向量并计算回归球面谐波(SH)系数,进而恢复光照信息;其次改进ViT编码器结构,引入多头自注意力交互机制,采用卷积运算引导注意力头之间相互联系,在此基础上增加局部感知模块,扫描每个图像分块并对局部像素进行加权求和,捕捉区域内的特定特征,有助于平衡全局上下文特征和局部光照信息,提高光照估计的精度。在公开数据集上对比主流特征提取网络和4种经典光照估计框架,实验和分析结果表明,ViTLight在图像渲染准确率方面高于现有框架,其均方根误差(RMSE)和结构相异性(DSSIM)指标分别为0.1296和0.0426,验证了该框架的有效性与正确性。 展开更多
关键词 增强现实 光照估计 球面谐波系数 视觉Transformer 多头自注意力
在线阅读 下载PDF
Dual-Path Vision Transformer用于急性缺血性脑卒中辅助诊断 被引量:1
12
作者 张桃红 郭学强 +4 位作者 郑瀚 罗继昌 王韬 焦力群 唐安莹 《电子科技大学学报》 EI CAS CSCD 北大核心 2024年第2期307-314,共8页
急性缺血性脑卒中是由于脑组织血液供应障碍导致的脑功能障碍,数字减影脑血管造影(DSA)是诊断脑血管疾病的金标准。基于患者的正面和侧面DSA图像,对急性缺血性脑卒中的治疗效果进行分级评估,构建基于Vision Transformer的双路径图像分... 急性缺血性脑卒中是由于脑组织血液供应障碍导致的脑功能障碍,数字减影脑血管造影(DSA)是诊断脑血管疾病的金标准。基于患者的正面和侧面DSA图像,对急性缺血性脑卒中的治疗效果进行分级评估,构建基于Vision Transformer的双路径图像分类智能模型DPVF。为了提高辅助诊断速度,基于EdgeViT的轻量化设计思想进行了模型的构建;为了使模型保持轻量化的同时具有较高的精度,提出空间-通道自注意力模块,促进Transformer模型捕获更全面的特征信息,提高模型的表达能力;此外,对于DPVF的两分支的特征融合,构建交叉注意力模块对两分支输出进行交叉融合,促使模型提取更丰富的特征,从而提高模型表现。实验结果显示DPVF在测试集上的准确率达98.5%,满足实际需求。 展开更多
关键词 急性缺血性脑卒中 视觉Transformer 双分支网络 特征融合
在线阅读 下载PDF
基于Vision Transformer的虹膜——人脸多特征融合识别研究
13
作者 马滔 陈睿 张博 《中国新技术新产品》 2024年第18期8-10,共3页
为了提高生物特征识别系统的准确性和鲁棒性,本文研究基于计算机视觉的虹膜—人脸多特征融合识别方法。本文对面部图像中虹膜区域进行提取以及预处理,采用对比度增强和归一化操作,加强了特征提取的一致性,提升了图像质量。为了获取丰富... 为了提高生物特征识别系统的准确性和鲁棒性,本文研究基于计算机视觉的虹膜—人脸多特征融合识别方法。本文对面部图像中虹膜区域进行提取以及预处理,采用对比度增强和归一化操作,加强了特征提取的一致性,提升了图像质量。为了获取丰富的深度特征,本文使用Vision Transformer模型对预处理后的虹膜和面部图像进行特征提取。利用多头注意力机制将虹膜和面部的多模态特征信息进行融合,再利用全连接层进行分类识别。试验结果表明,该方法识别性能优秀,识别准确性显著提升。 展开更多
关键词 计算机视觉 vision Transformer 多特征融合 虹膜识别 人脸识别
在线阅读 下载PDF
细粒度图像分类上Vision Transformer的发展综述 被引量:4
14
作者 孙露露 刘建平 +3 位作者 王健 邢嘉璐 张越 王晨阳 《计算机工程与应用》 CSCD 北大核心 2024年第10期30-46,共17页
细粒度图像分类(fine-grained image classification,FGIC)一直是计算机视觉领域中的重要问题。与传统图像分类任务相比,FGIC的挑战在于类间对象极其相似,使任务难度进一步增加。随着深度学习的发展,Vision Transformer(ViT)模型在视觉... 细粒度图像分类(fine-grained image classification,FGIC)一直是计算机视觉领域中的重要问题。与传统图像分类任务相比,FGIC的挑战在于类间对象极其相似,使任务难度进一步增加。随着深度学习的发展,Vision Transformer(ViT)模型在视觉领域掀起热潮,并被引入到FGIC任务中。介绍了FGIC任务所面临的挑战,分析了ViT模型及其特性。主要根据模型结构全面综述了基于ViT的FGIC算法,包括特征提取、特征关系构建、特征注意和特征增强四方面内容,对每种算法进行了总结,并分析了它们的优缺点。通过对不同ViT模型在相同公用数据集上进行模型性能比较,以验证它们在FGIC任务上的有效性。最后指出了目前研究的不足,并提出未来研究方向,以进一步探索ViT在FGIC中的潜力。 展开更多
关键词 细粒度图像分类 vision Transformer 特征提取 特征关系构建 特征注意 特征增强
在线阅读 下载PDF
基于Vision Transformer的小麦病害图像识别算法
15
作者 白玉鹏 冯毅琨 +3 位作者 李国厚 赵明富 周浩宇 侯志松 《中国农机化学报》 北大核心 2024年第2期267-274,共8页
小麦白粉病、赤霉病和锈病是危害小麦产量的三大病害。为提高小麦病害图像的识别准确率,构建一种基于Vision Transformer的小麦病害图像识别算法。首先,通过田间拍摄的方式收集包含小麦白粉病、赤霉病和锈病3种病害在内的小麦病害图像,... 小麦白粉病、赤霉病和锈病是危害小麦产量的三大病害。为提高小麦病害图像的识别准确率,构建一种基于Vision Transformer的小麦病害图像识别算法。首先,通过田间拍摄的方式收集包含小麦白粉病、赤霉病和锈病3种病害在内的小麦病害图像,并对原始图像进行预处理,建立小麦病害图像识别数据集;然后,基于改进的Vision Transformer构建小麦病害图像识别算法,分析不同迁移学习方式和数据增强对模型识别效果的影响。试验可知,全参数迁移学习和数据增强能明显提高Vision Transformer模型的收敛速度和识别精度。最后,在相同时间条件下,对比Vision Transformer、AlexNet和VGG16算法在相同数据集上的表现。试验结果表明,Vision Transformer模型对3种小麦病害图像的平均识别准确率为96.81%,相较于AlexNet和VGG16模型识别准确率分别提高6.68%和4.94%。 展开更多
关键词 小麦病害 vision Transformer 迁移学习 图像识别 数据增强
在线阅读 下载PDF
基于Vision Transformer与迁移学习的裤装廓形识别与分类
16
作者 应欣 张宁 申思 《丝绸》 CAS CSCD 北大核心 2024年第11期77-83,共7页
针对裤装廓形识别与分类模型的分类不准确问题,文章采用带有自注意力机制的Vision Transformer模型实现裤装廓形图像的分类,对于图片背景等无关信息对廓形识别的干扰,添加自注意力机制,增强有用特征通道。为防止因裤型样本数据集较少产... 针对裤装廓形识别与分类模型的分类不准确问题,文章采用带有自注意力机制的Vision Transformer模型实现裤装廓形图像的分类,对于图片背景等无关信息对廓形识别的干扰,添加自注意力机制,增强有用特征通道。为防止因裤型样本数据集较少产生过拟合问题,可通过迁移学习方法对阔腿裤、喇叭裤、紧身裤、哈伦裤4种裤装廓形进行训练和验证,将改进的Vision Transformer模型与传统CNN模型进行对比实验,验证模型效果。实验结果表明:使用Vision Transformer模型在4种裤装廓形分类上的分类准确率达到97.72%,与ResNet-50和MobileNetV2模型相比均有提升,可为服装廓形的图像分类识别提供有力支撑,在实际服装领域中有较高的使用价值。 展开更多
关键词 裤装廓形 自注意力机制 vision transformer 迁移学习 图像分类 廓形识别
在线阅读 下载PDF
Artificial hawk-eye camera for foveated, tetrachromatic, and dynamic vision 被引量:1
17
作者 Wenhao Ran Zhuoran Wang Guozhen Shen 《Journal of Semiconductors》 EI CAS CSCD 2024年第9期1-3,共3页
With the rapid development of drones and autonomous vehicles, miniaturized and lightweight vision sensors that can track targets are of great interests. Limited by the flat structure, conventional image sensors apply ... With the rapid development of drones and autonomous vehicles, miniaturized and lightweight vision sensors that can track targets are of great interests. Limited by the flat structure, conventional image sensors apply a large number of lenses to achieve corresponding functions, increasing the overall volume and weight of the system. 展开更多
关键词 AWK vision system.
在线阅读 下载PDF
Collaborative positioning for swarms:A brief survey of vision,LiDAR and wireless sensors based methods 被引量:1
18
作者 Zeyu Li Changhui Jiang +3 位作者 Xiaobo Gu Ying Xu Feng zhou Jianhui Cui 《Defence Technology(防务技术)》 SCIE EI CAS CSCD 2024年第3期475-493,共19页
As positioning sensors,edge computation power,and communication technologies continue to develop,a moving agent can now sense its surroundings and communicate with other agents.By receiving spatial information from bo... As positioning sensors,edge computation power,and communication technologies continue to develop,a moving agent can now sense its surroundings and communicate with other agents.By receiving spatial information from both its environment and other agents,an agent can use various methods and sensor types to localize itself.With its high flexibility and robustness,collaborative positioning has become a widely used method in both military and civilian applications.This paper introduces the basic fundamental concepts and applications of collaborative positioning,and reviews recent progress in the field based on camera,LiDAR(Light Detection and Ranging),wireless sensor,and their integration.The paper compares the current methods with respect to their sensor type,summarizes their main paradigms,and analyzes their evaluation experiments.Finally,the paper discusses the main challenges and open issues that require further research. 展开更多
关键词 Collaborative positioning vision LIDAR Wireless sensors Sensor fusion
在线阅读 下载PDF
基于Vision Transformer的阿尔茨海默病分类研究
19
作者 许曙博 郑英豪 +3 位作者 秦方博 周超 周劲 陈嘉燕 《微型电脑应用》 2024年第8期4-7,共4页
为了有效地提升对阿尔茨海默病(AD)的磁共振成像(MRI)图像分类准确率,提出一种LC(Layer-Cut)-ViT方法。该方法通过引入Vision Transformer(ViT)的自注意力机制对MRI图像进行层切分,使模型能更好地理解图像的全局信息,同时突出切片间的... 为了有效地提升对阿尔茨海默病(AD)的磁共振成像(MRI)图像分类准确率,提出一种LC(Layer-Cut)-ViT方法。该方法通过引入Vision Transformer(ViT)的自注意力机制对MRI图像进行层切分,使模型能更好地理解图像的全局信息,同时突出切片间的特征关系。此外,通过配准、颅骨分离算法提取MRI图像的脑部组织部分,进一步提升模型的性能。实验结果显示,所提方法对阿尔茨海默病的MRI图像具有较好的分类能力。 展开更多
关键词 阿尔茨海默病 MRI图像分类 vision Transformer LC-ViT
在线阅读 下载PDF
Frequency and associated factors of accommodation and non-strabismic binocular vision dysfunction among medical university students 被引量:1
20
作者 Jie Cai Wen-Wen Fan +5 位作者 Yun-Hui Zhong Cai-Lan Wen Xiao-Dan Wei Wan-Chen Wei Wan-Yan Xiang Jin-Mao Chen 《International Journal of Ophthalmology(English edition)》 SCIE CAS 2024年第2期374-379,共6页
AIM:To investigate the frequency and associated factors of accommodation and non-strabismic binocular vision dysfunction among medical university students.METHODS:Totally 158 student volunteers underwent routine visio... AIM:To investigate the frequency and associated factors of accommodation and non-strabismic binocular vision dysfunction among medical university students.METHODS:Totally 158 student volunteers underwent routine vision examination in the optometry clinic of Guangxi Medical University.Their data were used to identify the different types of accommodation and nonstrabismic binocular vision dysfunction and to determine their frequency.Correlation analysis and logistic regression were used to examine the factors associated with these abnormalities.RESULTS:The results showed that 36.71%of the subjects had accommodation and non-strabismic binocular vision issues,with 8.86%being attributed to accommodation dysfunction and 27.85%to binocular abnormalities.Convergence insufficiency(CI)was the most common abnormality,accounting for 13.29%.Those with these abnormalities experienced higher levels of eyestrain(χ2=69.518,P<0.001).The linear correlations were observed between the difference of binocular spherical equivalent(SE)and the index of horizontal esotropia at a distance(r=0.231,P=0.004)and the asthenopia survey scale(ASS)score(r=0.346,P<0.001).Furthermore,the right eye's SE was inversely correlated with the convergence of positive and negative fusion images at close range(r=-0.321,P<0.001),the convergence of negative fusion images at close range(r=-0.294,P<0.001),the vergence facility(VF;r=-0.234,P=0.003),and the set of negative fusion images at far range(r=-0.237,P=0.003).Logistic regression analysis indicated that gender,age,and the difference in right and binocular SE did not influence the emergence of these abnormalities.CONCLUSION:Binocular vision abnormalities are more prevalent than accommodation dysfunction,with CI being the most frequent type.Greater binocular refractive disparity leads to more severe eyestrain symptoms. 展开更多
关键词 optometry clinic non-strabismic binocular vision dysfunction college students convergence insufficiency
在线阅读 下载PDF
上一页 1 2 250 下一页 到第
使用帮助 返回顶部