期刊文献+

基于混合方法的中文微博自动摘要技术研究 被引量:5

Automatic summarization of Chinese micro-blog based on a hybrid method
在线阅读 下载PDF
导出
摘要 针对微博内容驳杂、信息稀疏的问题,深入研究传统自动摘要技术,结合微博数据特点,在微博事件提取的基础上提出一种基于统计和理解的混合摘要方法。首先根据词频、句子位置等文本特征得到基于统计的初始摘要;然后通过语义词典,计算句子相似度、确定事件主体进行基于语义理解的可读性加工,使最终摘要更具可读性;最后采用合理的摘要评价方法评价所得摘要。实验结果表明,该方法在不同压缩比例下均能获得质量稳定且可读性良好的摘要。 Micro-blog features complex contents and sparse information. In order to solve these prob- lems, on the basis of in-depth study on traditional automatic abstract techniques, combing with the data of micro-blog features, we propose a hybrid automatic summarization method based on statistics and comprehension for micro-blog event extraction. Firstly, we obtain the initial abstract based on the statistics according to word frequency and the location of sentences. Then we calculate sentence similarity through the semantic dictionary, determine the event subject, process the semantic understanding based readability, and make the final abstract more readable. Finally, a reasonable abstract evaluation method is adopted to evaluate the obtained abstract. Experimental results show that the proposed method can obtain a good summary of stable quality and readability under different compression ratios.
出处 《计算机工程与科学》 CSCD 北大核心 2016年第6期1257-1261,共5页 Computer Engineering & Science
基金 国家自然科学基金(61163025) 内蒙古自治区自然科学基金(2015MS0621)
关键词 微博事件 事件价值 可读性 自动摘要 micro-blog event event value readablity automatic summarization
  • 相关文献

参考文献12

  • 1Xu W,Grishman R,Meyers A,et al.A preliminary study of tweet summarization using information extraction[C]∥Proc of the Workshop on Language in Social Media(LASM 2013),2013:20-29.
  • 2Sharifi B P,Inouye D I,Kalita J K.Summarization of twitter microblogs[J].The Computer Journal,2014,57(3):378-402.
  • 3Inouye D. Multiple post microblog summarization[R].GA:University of Colorado Springs,2010.
  • 4Chakrabarti D,Punera K.Event summarization using tweets[C]∥Proc of ICWSM,2011:124-135.
  • 5Nichols J,Mahmud J,Drews C.Summarizing sporting events using twitter[C]∥Proc of the 2012 ACM International Conference on Intelligent User Interfaces,2012:189-198.
  • 6Long Rui. Exploring microblog data for event detection,tracking and summarization[D].Shanghai:Shanghai Jiaotong University,2012.
  • 7Gao Yong-bing, Nie Zhi-mi,Zhou Huan-yu,et al.Research based on similarity measurement of JS composite personal weibo sequential events classified[J].Computer Applications and Software,2015,32(7):98-104.
  • 8Guo Yu-qing,Wan Min.Automatic abstracting in domain-independent chinese documents[J].Journal of Tsinghua University(Science & Technology),2002,42(1):139-142.
  • 9刘端阳,王良芳.结合语义扩展度和词汇链的关键词提取算法[J].计算机科学,2013,40(12):264-269. 被引量:19
  • 10刘宗田,黄美丽,周文,仲兆满,付剑锋,单建芳,智慧来.面向事件的本体研究[J].计算机科学,2009,36(11):189-192. 被引量:101

二级参考文献52

共引文献128

同被引文献71

引证文献5

二级引证文献27

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部