Abstract
To address the problem that the sparsification process of broad learning systems ignores changes in the importance of different weights and is therefore prone to mistaken pruning, this paper proposes a broad learning system based on dynamic sparse training. A regularization term is introduced into the objective function of the standard broad learning system to constrain the output-weight thresholds, and the optimal network parameters and a sparse network structure are found through joint training of the output weights and their thresholds. A threshold is introduced for each output weight, and an output-weight mask controlling the model structure is generated according to changes in the importance of the output weights. Through dynamic training, the system seeks the optimal balance between network structure and network accuracy, improving overall model performance. To verify the effectiveness of the proposed method, simulation experiments were conducted on several UCI public datasets. The experimental results show that the proposed method can sparsify the model dynamically without degrading its performance.
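The abstract does not reproduce the objective function itself; a plausible form, consistent with its description of an exponential threshold regularizer added to the standard broad learning system loss (the symbols below are assumptions following the dynamic sparse training literature: $A$ for the concatenated feature-and-enhancement-node matrix, $W$ for the output weights, $T = \{t_i\}$ for the per-weight thresholds, $M$ for the mask, $Y$ for the targets), is:

```latex
\min_{W,\,T}\;\bigl\|A\,(W \odot M) - Y\bigr\|_2^2
  \;+\; \lambda\,\|W\|_2^2
  \;+\; \alpha \sum_i e^{-t_i},
\qquad M_i = S\!\left(|w_i| - t_i\right),
```

where $S(\cdot)$ is the unit step function: a weight is masked out whenever its magnitude falls below its trainable threshold, and is restored automatically once it grows back above it, while the exponential term $e^{-t_i}$ penalizes thresholds that collapse to zero (which would disable sparsification).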
[Objective] Sparse broad learning systems overlook the changing importance of output weights: some weights appear unimportant in the early stages of training but become important later, and once pruned they are difficult to recover. Inspired by dynamic sparse training in neural networks, this paper proposes a broad learning system based on dynamic sparse training that compensates for pruning errors during training and improves overall model performance while maintaining model sparsity. [Methods] The system introduces an exponential regularization term into the loss function of the standard broad learning system to constrain the output-weight thresholds, and adds a weight mask to the error term of the loss function. Optimal network parameters and a sparse network structure are sought through joint training of the output weights and their thresholds. A threshold is introduced for each output weight, and an output-weight mask controlling the model structure is generated jointly from the weights and their thresholds; the thresholds are adjusted dynamically during training so that output weights are pruned and restored as their importance changes. Because the mask sparsifies the model indirectly while the underlying output weights are retained, mistakenly pruned weights can be recovered, and dynamic training reaches an optimal balance between network structure and accuracy. The objective function is optimized with the alternating direction method of multipliers (ADMM). [Results] To verify the effectiveness of the broad learning system based on dynamic sparse training (BLSDST), simulation experiments were conducted on six UCI public datasets, and its performance was compared with those of the standard broad learning system (BLS) and the lasso broad learning system (L1BLS). The results indicate that BLSDST achieves a balance between model accuracy and sparsity by constraining the weight-threshold regularization term, and that it reduces model complexity without sacrificing accuracy, compensating for the impact of pruning on model performance. The largest accuracy improvement, approximately 30.12%, is obtained on the 'BUTCSP' dataset. [Conclusions] The experimental results show that the proposed system achieves dynamic model sparsity without reducing model performance, and in some cases even improves it.
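As a minimal illustration of the masking mechanism described in the Methods section (the weight and threshold values below are hypothetical, and the actual system trains the thresholds jointly with the weights via ADMM rather than fixing them), a weight contributes to the model only while its magnitude exceeds its per-weight threshold, so a pruned weight can re-enter the model later:

```python
import numpy as np

def generate_mask(W, T):
    """Binary output-weight mask: keep w only where |w| exceeds its threshold t."""
    return (np.abs(W) > T).astype(W.dtype)

# Hypothetical output-weight matrix and per-weight thresholds.
W = np.array([[0.8, 0.05],
              [-0.3, 0.6]])
T = np.array([[0.1, 0.1],
              [0.4, 0.1]])

M = generate_mask(W, T)
print(M)  # w[0,1] and w[1,0] fall below their thresholds and are masked out

# The pruned weights themselves are retained, not deleted. If later training
# updates make a weight important again (|w| > t), the mask restores it.
W[1, 0] = -0.5
M = generate_mask(W, T)
print(M)  # w[1,0] is recovered
```

This indirect sparsification (masking rather than deleting) is what allows the system to compensate for mistaken pruning, in contrast to one-shot pruning schemes where a removed weight is gone for good.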
Authors
LI Haigang, SUN Juan, CAO Yiwan, CHU Fei, YU Miao, ZHANG Yong (School of Information and Control Engineering, China University of Mining and Technology, Xuzhou 221116, China)
Source
Experimental Technology and Management (《实验技术与管理》)
CAS
Peking University Core Journal (北大核心)
2024, Issue 12, pp. 53-60 (8 pages)
Funding
National Natural Science Foundation of China (62273348, 61973304)
Ministry of Education Industry-University Cooperative Education Project (2022030014)
Jiangsu Province College Students' Innovation Training Program (202410290143Y)
Teaching Research Project of China University of Mining and Technology (2021YB20, 2022KCSZ03).
Keywords
broad learning system
incremental learning
dynamic sparsity
weight threshold