
DPAL-BERT: A Faster and Lighter Question Answering Model

Abstract: Recent advancements in natural language processing have given rise to numerous pre-trained language models for question-answering systems. However, with the constant evolution of algorithms, data, and computing power, the increasing size and complexity of these models have led to higher training costs and reduced efficiency. This study aims to minimize the inference time of such models while maintaining computational performance. It proposes a novel distillation model for PAL-BERT (DPAL-BERT); specifically, knowledge distillation is employed, with the PAL-BERT model as the teacher, to train two student models: DPAL-BERT-Bi and DPAL-BERT-C. This research enhances the dataset through techniques such as masking, replacement, and n-gram sampling to optimize knowledge transfer. The experimental results show that the distilled models greatly outperform models trained from scratch. In addition, although the distilled models exhibit a slight decrease in performance compared to PAL-BERT, they significantly reduce inference time to just 0.25% of the original. This demonstrates the effectiveness of the proposed approach in balancing model performance and efficiency.
Source: Computer Modeling in Engineering & Sciences (SCIE, EI), 2024, No. 10, pp. 771-786 (16 pages).
Funding: Supported by the Sichuan Science and Technology Program (2023YFSY0026, 2023YFH0004).
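
As a point of reference for the teacher-student setup described in the abstract, the sketch below shows a conventional knowledge-distillation loss in PyTorch. The function, temperature, and loss weighting are illustrative assumptions for a minimal example, not details reported in the paper, and the teacher/student models stand in for PAL-BERT and its distilled variants.

```python
import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels,
                      temperature=2.0, alpha=0.5):
    """Combine soft-target KL loss (teacher guidance) with hard-label
    cross-entropy. Temperature and alpha are illustrative defaults,
    not values taken from the paper."""
    # Soften both distributions with the temperature before comparing.
    soft_teacher = F.softmax(teacher_logits / temperature, dim=-1)
    soft_student = F.log_softmax(student_logits / temperature, dim=-1)
    kd = F.kl_div(soft_student, soft_teacher,
                  reduction="batchmean") * temperature ** 2
    # Standard supervised loss on the ground-truth labels.
    ce = F.cross_entropy(student_logits, labels)
    return alpha * kd + (1 - alpha) * ce

# Usage sketch: the teacher runs in eval mode without gradients,
# and the smaller student is updated on the combined loss.
# with torch.no_grad():
#     teacher_logits = teacher(input_ids, attention_mask).logits
# loss = distillation_loss(student(input_ids, attention_mask).logits,
#                          teacher_logits, labels)
```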