期刊文献+

中文事件抽取技术研究 被引量:106

Research on Chinese Event Extraction
在线阅读 下载PDF
导出
摘要 事件抽取是信息抽取领域一个重要的研究方向,本文对事件抽取的两项关键技术——事件类别识别以及事件元素识别进行了深入研究。在事件类别识别阶段,本文采用了一种基于触发词扩展和二元分类相结合的方法;在事件元素识别阶段,本文采用了基于最大熵的多元分类的方法。这些方法很好的解决了事件抽取中训练实例正反例不平衡以及数据稀疏问题,取得了较好的系统性能。 Event Extraction is an important research point in the area of Information Extraction. This paper makes an intensive study of the two stages of Chinese event extraction, namely event type recognition and event argument recognition. A novel method combining event trigger expansion and a binary classifier is presented in the step of event type recognition while in the step of argument recognition, one with multi class classification based on maximum entropy is introduced. The above methods solved the data unbalanced problem in training model and the data sparseness problem brought by the small set of training data effectively, and finally our event extraction system achieved a better performance.
出处 《中文信息学报》 CSCD 北大核心 2008年第1期3-8,共6页 Journal of Chinese Information Processing
基金 国家自然科学基金资助项目(60575042 60675034) 国家863资助项目(2006AA01Z145)
关键词 计算机应用 中文信息处理 事件抽取 事件类别识别 事件元素识别 computer application Chinese information processing event extraction event type recognition eventargument recognition
  • 相关文献

参考文献9

  • 1Naomi Daniel,Dragomir Radev and Timothy Allison.Sub-event based Multi-document Summarization[A].In:Proceedings of the HLT-NAACL Workshop on Text Summarization[C].2003.9-16.
  • 2Elena Filatova and Vasileios Hatzivassiloglou.Event-based Extractive summarization[A].In:Proceedings of ACL Workshop on Summarization[C]].2004.104-111.
  • 3Wenjie Li,Mingli Wu and Qin Lu.Extractive Summarization using Inter-and Intra-Event Relevance[A].In:Proceedings of the 44th Annual Meeting of the Association for Computational Liguistics[C].2006.369-376.
  • 4David Ahn.The stages of event extraction[A].In:Proceedings of the Workshop on Annotations and Reasoning about Time and Events[C].2006.1-8.
  • 5ACE (Automatic Content Extraction) Chinese Annotation Guidelines for Events.National Institute of Standards and Technology[R].2005.
  • 6Mihai Surdeanu,Sanda Harabagiu,John Williams,et al.Using Predicate-Argument Structures for Information Extraction[A].In:Proceedings of ACL[C].2003.8-15.
  • 7Mihai Surdeanu and Sanda Harabagiu.Infrastructure for Open-Domain Information Extraction[A].In:Proceedings of the Human Language Technology Conference[C].2002.325-330.
  • 8Hai Leong Chieu,Hwee Tou Ng.A Maximum Entropy Approach to Information Extraction from SemiStructured and Free Text[A].In:Proceedings of the 18th National Conference on Artificial Intelligence[C].2002.786-791.
  • 9来自ACE标准标注结果,分别对应着ACE的三项标注任务:实体识别、时间表达式识别和属性词识别.

同被引文献827

引证文献106

二级引证文献731

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部