期刊文献+

协同过滤在中文维基百科类别推荐上的应用

Application of cooperative filtering in categories recommendation of Chinese Wikipedia
在线阅读 下载PDF
导出
摘要 针对传统人工编辑导致大量类别信息重复和不规范的问题,提出了应用协同过滤技术为中文维基百科文章自动推荐类别。利用中文维基百科中的四个重要语义特征即链入、链出、链入的类别和链出的类别来表示维基百科文章,得到与目标文章相似的前若干篇文章的所有类别后,通过查询返回的相似度值计算各个类别的权重,选择前面的若干个类别作为推荐结果返回给目标文章。实验结果表明了这四个语义特征能较好地表征一篇维基百科文章,同时也验证了协同过滤方法在中文维基百科自动推荐类别中的有效性。 Collaborative filtering was applied to automatically recommend categories for a Chinese Wikipedia article. Four typical semantic features namely incoming link, outgoing link, incoming link categories and outgoing link categories, were adopted to represent articles. Among all the categories of articles similar to target article, several most similar categories were chosen as the recommendation results to the target article, via calculating the similarity value between them. The experimental results show that the four semantic features have efficient performance in Wikipedia article representation. And the collaborative filtering method is also proved to be effective in recommending proper categories for Chinese Wikipedia articles.
出处 《计算机应用》 CSCD 北大核心 2013年第3期838-840,844,共4页 journal of Computer Applications
基金 国家自然科学基金资助项目(90920005 61003192) 国家语委"十二五"重点项目(ZDI125-1) 国家"十二五"科技支撑计划项目(2012BAK24B01) 教育部/国家外国专家局高等学校学科创新引智计划项目(B07042) 湖北省自然科学基金资助项目(2011CDA034) 华中师范大学中央高校基本科研业务费专项资金资助项目(CCNU10A02009 CCNU10C01005)
关键词 协同过滤 中文维基百科 类别推荐 语义特征 collaborative filtering Chinese Wikipedia category recommendation semantic feature
  • 相关文献

参考文献6

二级参考文献42

  • 1陈文亮,朱靖波,朱慕华,姚天顺.基于领域词典的文本特征表示[J].计算机研究与发展,2005,42(12):2155-2160. 被引量:23
  • 2赵佳鹤,王秀坤,刘亚欣.基于语义分析的主题信息采集系统的设计与实现[J].计算机应用,2007,27(2):406-408. 被引量:15
  • 3Leacock C, Chodorow wordnet similarity for Fellbaum C. Wordnet Princeton: MIT Press, M. Combining local context and word sense identification [C] // An Electronic Lexical Database. 1998:265 -283.
  • 4Remy M. Wikipedia: the free encyclopedia, online information review[J]. Emerald Group Publishing Limited, 1999, 26(6): 434-435.
  • 5Ponzetto S P, Strube M. Deriving a large scale taxonomy from Wikipedia [ C ]//Proceedings of the 22nd National Conference on Artificial Intelligence. Vancouver: AAAI Press, 2007: 1440-1445.
  • 6Zesch T, Gurevych I. Analysis of the Wikipedia category graph for NLP applications[C]//Proceedings of the Text Graphs-2 Workshop (NAACL-HLT 2007). New York Omnipress Inc, 2007: 1-8.
  • 7Wang Yang, Wang Haofen, Zhu Haiping, et al. Exploit semantic information for category annotation recommendation in Wikipedia [ C]// Natural Language Processing and Information Systems. Berlin: Springer, 2007: 48- 60.
  • 8Banerjee S, Pedersen T. Extended gloss overlap as a measure of semantic relatedness [ C]//Proceedings of the Eighteenth International Joint Conference on Artificial Intelligence. Acapulco. Mexico: Morgan Kaufmann Publishers Inc, 2003: 805-810.
  • 9Lesk M. Automatic sense disambiguation using machine readable dictionaries: how to tell a pine cone from an ice cream cone[C]//Proceedings of the 5th Annual Conference on Systems Documentation. New York: ACM, 1986 : 24-26.
  • 10朱慕华,朱靖波,陈文亮.面向文本分类的多类别SVM组合方式的比较[c]//全国第八届计算语言学联合学术会议,2005:435-441.

共引文献73

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部