
Web Information Extraction Based on Simulated Annealing Algorithm and Hidden Markov Model (cited by: 4)
Abstract: A typical hidden Markov model (HMM) is highly sensitive to its initial parameters. When trained from random parameters it often becomes trapped in a local optimum, and it performs poorly when applied to Web information extraction. This article proposes a Web information extraction algorithm based on simulated annealing (SA) and the HMM. The best SA parameters are selected by experimental comparison, and SA is combined with the Baum-Welch algorithm to optimize the HMM, which is then applied to Web information extraction. Experimental results show that the new algorithm clearly improves both the precision and the recall of information extraction.
Source: Journal of University of South China (Science and Technology), 2011, No. 1, pp. 70-74 (5 pages)
Funding: Supported by a fund project of the Education Department of Hunan Province (O7C637)
Keywords: simulated annealing algorithm; hidden Markov model; Web information extraction
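
The abstract describes combining simulated annealing with Baum-Welch training so that the HMM is less likely to settle in a poor local optimum. The following Python sketch illustrates that general idea only; it is not the authors' algorithm, which this record does not specify. Everything in it is an assumption made for illustration: the function names (`forward`, `baum_welch_step`, `sa_baum_welch`), the multiplicative perturbation, the geometric cooling schedule, and the default values of `t0`, `cooling`, and `steps`. Observations are assumed to be integer-coded tokens of a Web page, and hidden states stand for the fields to be extracted.

```python
import math
import random

def normalize(row):
    s = sum(row)
    return [x / s for x in row]

def forward(pi, A, B, obs):
    """Scaled forward pass; returns (alphas, scales, log-likelihood)."""
    n = len(pi)
    alphas, scales, prev = [], [], None
    for t, o in enumerate(obs):
        if t == 0:
            a = [pi[i] * B[i][o] for i in range(n)]
        else:
            a = [sum(prev[i] * A[i][j] for i in range(n)) * B[j][o] for j in range(n)]
        c = sum(a)                      # scaling factor for step t
        prev = [x / c for x in a]
        alphas.append(prev)
        scales.append(c)
    return alphas, scales, sum(math.log(c) for c in scales)

def baum_welch_step(pi, A, B, obs, eps=1e-6):
    """One Baum-Welch re-estimation; eps is an assumed smoothing constant."""
    n, m, T = len(pi), len(B[0]), len(obs)
    alphas, scales, _ = forward(pi, A, B, obs)
    betas = [[1.0] * n for _ in range(T)]
    for t in range(T - 2, -1, -1):      # scaled backward pass
        for i in range(n):
            betas[t][i] = sum(A[i][j] * B[j][obs[t + 1]] * betas[t + 1][j]
                              for j in range(n)) / scales[t + 1]
    gamma = [[alphas[t][i] * betas[t][i] for i in range(n)] for t in range(T)]
    new_A = [normalize([eps + sum(alphas[t][i] * A[i][j] * B[j][obs[t + 1]]
                                  * betas[t + 1][j] / scales[t + 1]
                                  for t in range(T - 1))
                        for j in range(n)]) for i in range(n)]
    new_B = [normalize([eps + sum(gamma[t][i] for t in range(T) if obs[t] == k)
                        for k in range(m)]) for i in range(n)]
    new_pi = normalize([eps + g for g in gamma[0]])
    return new_pi, new_A, new_B

def perturb(rows, rng, scale):
    """Random multiplicative noise on each row, then renormalize (assumed move)."""
    return [normalize([x * math.exp(rng.gauss(0, scale)) for x in row]) for row in rows]

def sa_baum_welch(obs, n_states, n_symbols, t0=1.0, cooling=0.9, steps=30, seed=0):
    rng = random.Random(seed)
    def rand_rows(r, c):
        return [normalize([rng.random() + 0.1 for _ in range(c)]) for _ in range(r)]
    cur = (rand_rows(1, n_states)[0], rand_rows(n_states, n_states),
           rand_rows(n_states, n_symbols))
    cur_ll = forward(cur[0], cur[1], cur[2], obs)[2]
    best, best_ll, temp = cur, cur_ll, t0
    for _ in range(steps):
        # Perturb the current model, then refine it with one Baum-Welch step.
        cand = (perturb([cur[0]], rng, temp)[0], perturb(cur[1], rng, temp),
                perturb(cur[2], rng, temp))
        cand = baum_welch_step(cand[0], cand[1], cand[2], obs)
        cand_ll = forward(cand[0], cand[1], cand[2], obs)[2]
        # Metropolis rule: always accept improvements, sometimes accept worse models.
        if cand_ll >= cur_ll or rng.random() < math.exp((cand_ll - cur_ll) / temp):
            cur, cur_ll = cand, cand_ll
        if cur_ll > best_ll:
            best, best_ll = cur, cur_ll
        temp *= cooling                 # geometric cooling (assumed schedule)
    return best, best_ll
```

Calling `sa_baum_welch(obs, n_states, n_symbols)` on an integer-coded token sequence returns the best `(pi, A, B)` found and its log-likelihood. The values of `t0`, `cooling`, and `steps` here are placeholders; the abstract states that the SA parameters were chosen by experimental comparison.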


