Defect Detection for Deep Learning Frameworks Based on Meta Operators (基于元算子的深度学习框架缺陷检测方法)


Abstract: Deep learning (DL) has been widely adopted in fields such as image recognition, machine translation, and autonomous driving. To better support deep learning tasks and promote the application of DL, more and more platforms and frameworks have emerged, such as TensorFlow, PyTorch, and Keras. These platforms are known as deep learning frameworks. Using the programming interfaces they provide, developers can easily design, train, and test deep learning models. Deep learning frameworks usually take the "operator" as the unit of calculation, and different operators define different types of numerical calculation. The correct calculation of operators is therefore critical to the correctness of deep learning models. Calculation errors in operators could affect the accuracy of a model's predictions, or even result in serious consequences such as traffic accidents in autonomous driving. In recent years, attention has been paid to the testing and diagnosis of deep learning frameworks, but existing defect detection methods have great limitations. On the one hand, they can only find operators whose results differ greatly between deep learning frameworks, and they do so through comparison and speculation. On the other hand, they can diagnose calculation errors only in the inference process, not in the training process. To address this issue, we aim to automatically detect errors in deep learning models that are caused by defects of deep learning frameworks, during either training or inference, and to verify the accuracy of the detection results. Implementing such a defect detection method poses several challenges. First, a deep learning model usually has a complex network structure, so for any given input instance it is very difficult to determine the correct output. Second, a deep learning model usually consists of a large number of operators whose relationships are very complex, which makes locating defective operators difficult. In addition, verifying the correctness of a defect location in a large and complicated deep learning model is challenging. In response to these challenges, we design and implement a defect detection method for deep learning frameworks based on meta operators. We abstract the common computing logic of operators in different deep learning frameworks, such as forward computation and gradient computation, as "meta operators". The specific implementation of a meta operator can be bound without changing the code of the deep learning model, so users can make fine-grained replacements of operators and compare, operator by operator, the results of the same model under different deep learning frameworks. Through fine-grained operator replacement, calculation errors of deep learning frameworks can be found in both the inference process and the training process, and the localization of these errors can be verified. We verify the accuracy of the meta operator calculation and evaluate its performance. We collect operators in deep learning frameworks that are known to have calculation errors and apply the defect detection method to deep learning models containing these operators, showing the effectiveness of the method.
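The binding-and-comparison idea described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes two hypothetical "frameworks" represented as plain dictionaries, a toy two-operator model, and a deliberately injected bug in one softmax. All names (`FRAMEWORK_A`, `FRAMEWORK_B`, `first_divergence`, `verify_location`) are invented for illustration, and only forward computation is covered; gradient computation is left out.

```python
import math

# Concrete operator implementations. In a real setting these would come from
# different deep learning frameworks; here they are plain-Python stand-ins.
def relu(xs):
    return [max(0.0, x) for x in xs]

def softmax(xs):
    m = max(xs)
    es = [math.exp(x - m) for x in xs]
    s = sum(es)
    return [e / s for e in es]

def buggy_softmax(xs):
    # Injected defect (assumption for the demo): normalizes by the largest
    # exponential instead of by the sum of exponentials.
    m = max(xs)
    es = [math.exp(x - m) for x in xs]
    return [e / max(es) for e in es]

# A binding maps each meta operator name to one concrete implementation.
FRAMEWORK_A = {"relu": relu, "softmax": softmax}
FRAMEWORK_B = {"relu": relu, "softmax": buggy_softmax}

# The model is written once, against meta operator names only.
MODEL = ["relu", "softmax"]

def run(model, binding, xs):
    """Run the model under a binding and record every operator's output."""
    trace = []
    for name in model:
        xs = binding[name](xs)
        trace.append((name, xs))
    return trace

def first_divergence(model, bind_a, bind_b, xs, tol=1e-6):
    """Return the first meta operator whose outputs differ by more than tol."""
    trace_a = run(model, bind_a, xs)
    trace_b = run(model, bind_b, xs)
    for (name, out_a), (_, out_b) in zip(trace_a, trace_b):
        if max(abs(a - b) for a, b in zip(out_a, out_b)) > tol:
            return name
    return None

def verify_location(model, bind_ref, bind_test, suspect, xs, tol=1e-6):
    """Rebind only the suspect operator to the reference implementation and
    check that the divergence disappears."""
    patched = dict(bind_test)
    patched[suspect] = bind_ref[suspect]
    return first_divergence(model, bind_ref, patched, xs, tol) is None

x = [-1.0, 0.5, 2.0]
suspect = first_divergence(MODEL, FRAMEWORK_A, FRAMEWORK_B, x)
print(suspect)
print(verify_location(MODEL, FRAMEWORK_A, FRAMEWORK_B, suspect, x))
```

In this sketch the model definition (`MODEL`) never changes. Only the binding does, which is what allows a single suspect operator to be swapped while everything else stays fixed. Running it reports `softmax` as the first diverging operator and confirms the location, because rebinding that one operator removes the difference.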

Authors: GU Dian-Dian (谷典典); SHI Yi-Ning (石屹宁); LIU Xuan-Zhe (刘譞哲); WU Ge (吴格); JIANG Hai-Ou (姜海鸥); ZHAO Yao-Shuai (赵耀帅); MA Yun (马郓)

Affiliations: Key Laboratory of High Confidence Software Technologies of Ministry of Education (Peking University), Beijing 100871; TravelSky Technology Limited, Beijing 101318; Key Laboratory of Intelligent Passenger Service of Civil Aviation, CAAC, Beijing 101318; Peking University Information Technology Institute (Tianjin Binhai), Tianjin 300452; Institute for Artificial Intelligence, Peking University, Beijing 100871

Source: Chinese Journal of Computers (《计算机学报》), 2022, No. 2, pp. 240-255 (16 pages). Indexed in EI, CAS, CSCD, and the Peking University Core Journals list.

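Funding: Supported by the National Key Research and Development Program of China, project "High-Timeliness, Scalable Big Data Computing Models, Optimization Techniques, and Systems" (2018YFB1004400), and by the Beijing Outstanding Young Scientist Program, project "Software-Defined Human-Machine-Thing Integrated Computing Technologies and Systems" (BJJWZYJH01201910001004).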

Keywords: deep learning frameworks; meta operator; defect detection; deep learning; software testing

Classification: TP391 [Automation and Computer Technology: Computer Application Technology]

