期刊文献+

基于时空分析的线索性事件的抽取与集成系统研究 被引量:21

Research on Extraction and Integration of Developing Event Based on Analysis of Space-time Information
在线阅读 下载PDF
导出
摘要 信息抽取技术能够提供高质量的检索服务。本文面向网络新闻事件,对人们感兴趣的事件关键信息进行了抽取和集成。系统中采用了如下的方法、策略:(1)利用句型模板构造抽取规则,然后直接从经过时间短语和空间短语识别和规范化处理的文本中抽取事件信息,从而跳过了深层句法分析,降低了实现系统的难度;(2)利用事件的规范化的时空信息关联不同文档中的同一事件,进行事件合并;(3)文档发生事件转移时对文档进行事件切分,从而解决了文档内不同事件信息的归并问题。初步实验结果表明:本文采用的方法和策略是有效的。 Technology of information extraction (IE) can provide high-quality service for retrieval. Targeting at events in web news,this paper conducts a system that can extract and integrate key information of event that interests people. Methodologies and strategies of the system are as follows: (1) Extraction rules are built in tenus of sentence patterns, then event informarion is directly extracted from the text in which temporal phrases (TP) and space phrases (SP) are recognized and normalized . The extraction system can thus be easily implemented owing to skipping complex syntax parsing. (2) The same event in different documents is linked by normalized TP and SP of event, and the information associated with an event is merged. (3) When new event appears in a text, the text is segmented. So isolative information for an event in same segment can be merged into its owner. Preliminary experiments show that methodologies and strategies in this paper are feasible.
出处 《中文信息学报》 CSCD 北大核心 2006年第1期21-28,共8页 Journal of Chinese Information Processing
基金 国家863项目资助(2001AA114040)
关键词 计算机应用 中文信息处理 信息抽取 句型模板 线索性事件 时空信息 事件合并 computer application Chinese information processing information extraction sentence patlem developing event space-time information event merge
  • 相关文献

参考文献8

  • 1陈群秀.信息处理用信息现代汉语句型系统初步研究[A]..Advances in Computation of Oriental Lauguages[C].北京:清华大学出版社,2003年8月.205-212.
  • 2朱靖波,姚天顺.中文信息自动抽取[J].东北大学学报(自然科学版),1998,19(1):52-54. 被引量:24
  • 3李保利,陈玉忠,俞士汶.信息抽取研究综述[J].计算机工程与应用,2003,39(10):1-5. 被引量:178
  • 4Ralph Grishman.Information Extraction: Techniques and Challenges[M]. In: Maria Teresa Pazienza, editor, Information Extraction. Springer-Verlag, Lecture Notes in Artificial Intelligence, Rome, 1997.
  • 5C. Aone, L. Halverson, T. Hampton, M. Ramos-Santacruz. SRA: Description of the IE2 System Used for MUC-7[A]. MUC-7. Fairfax, Virginia. 1998.
  • 6Yu S., Bai S., and Wu P. Description of the Kent Ridge Digital Labs System Used for MUC - 7[A]. In: Proceedings of the Seventh Message Understanding Conference[C]. 1998.
  • 7Chen H. , Ding Y., Tsai S., et al. Description of the NTU System Used for MEq2[A]. In: Proceedings of the Seventh Message Understanding Conference[C]. 1998.
  • 8吴平博,陈群秀,马亮.基于事件框架的事件相关文档的智能检索研究[J].中文信息学报,2003,17(6):25-30. 被引量:30

二级参考文献31

  • 1林尧璃 马少平.人工智能导论[M].北京:清华大学出版社,1989..
  • 2[16]Hobbs J,Appelt D,Bear J et al.FASTUS:A Cascaded Finite-State Transducer for Extracting Information from Natural-Language Text[C].In:Roche,Schabes eds. Finite State Devices for Natural Language Processing, MIT Press,Cambridge MA, 1996
  • 3[17]Appelt D E.Introduction to Information Extraction[J].AI COMMUNICATIONS, 1999; 12(3)
  • 4[18]Yangarber R.Scenario Customization for Information Extraction[D].Ph D Thesis.New York University,2001-01
  • 5[19]Cowie J, Lehnert W.Information Extraction[J].Communications of the ACM, 1996;39(1)
  • 6[20]Grishman R Adaptive information extraction and sublangu age analysis[C].In:Proceedings of IJCAI-2001 Workshop on Adaptive Text Extraction and Mining,2001
  • 7[1]Applet D E,Israel D J.Introduction to Information Extraction Technology. A Tutorial for IJCAI-99,1999
  • 8[2]Gaizauskas R,Wilks Y.Information Extraction:Beyond Document Retrieval[J].Journal of Documentation, 1997
  • 9[3]Sager N.Natural Language Information Processing. Reading,Massachusetts:Addison Wesley, 1981
  • 10[4]Dejong G.An Overview of the FRUMP System[C].In:LEHNERT W,RINGLE M h eds. Strategies for Natural Language Processing,Lawrence Erlbaum, 1982:149~176

共引文献222

同被引文献189

引证文献21

二级引证文献170

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部