摘要
广西非遗资源丰富绚烂,数据海量,目前的广西非遗资源虽然已有部份结构化数据,但仍存在着大量的非结构化数据,这对于知识图谱的构建来说是极大的挑战,所以引入了BERT模型对广西非遗资料进行知识图谱的构建。对于结构化数据进行直接构建,对于非结构化数据引入BERT模型先进行知识抽取,再进行知识图谱构建。知识图谱是广西非遗数字化保护新形式,对广西优秀文化传承有着积极推动作用。
Guangxi’s intangible cultural heritage resources are rich and colorful,and the data is bright.Although there are some structured data in the current intangible cultural heritage resources in Guangxi,there are still a lot of unstructured data,which is a great challenge for the construction of the knowledge graph.Therefore,we introduced the BERT model to construct the knowledge graph of Guangxi’s intangible cultural heritage materials.The structured data is directly constructed,and the unstructured data is introduced into the BERT model for knowledge extraction,and then the knowledge graph is constructed.Knowledge graph is a new form of digital protection of intangible cultural heritage in Guangxi,which plays a positive role in promoting the inheritance of Guangxi’s excellent culture.
作者
李宏杰
黄薇
王奔
Li Hongjie;Huang Wei;Wang Ben(College of Artificial Intelligence,Guangxi Minzu University,Nanning 530006,China;College of Electronic Information,Guangxi Minzu University,Nanning 530006,China)
出处
《现代计算机》
2023年第21期56-60,共5页
Modern Computer
关键词
自然语言处理
知识图谱
命名实体识别
广西非物质文化遗产
natural language processing(NLP)
knowledge graph(KG)
named entity recognition(NER)
Guangxi intangible cultural heritage(GxICH)