期刊文献+

一种面向大规模短特征集的字符串匹配技术 被引量:1

String matching technology for large-scale short pattern set
在线阅读 下载PDF
导出
摘要 面向大规模特征集的字符串匹配技术在病毒检测、内容过滤等问题上的应用愈加广泛,而短模式串一直是阻碍性能提升的重要瓶颈。针对短模式串进行分析讨论,基于跳跃算法优化,采用了动态块大小和动态Hash处理以及Hash函数设计场景化的策略,同时探讨了多核处理器与多线程设计之间的关系。实验数据证明改进的算法策略具有支撑百万级特征集字符串匹配的能力。 Large-scale pattern set for the string matching technology in virus detection, content filtering and other applica-tions become widespread increasingly, while the short pattern has been a major bottleneck impeding performance improve-ments. Based on optimization of the jump algorithm, short pattern strings are analyzed and discussed, and strategies of dynamic block size, dynamic Hash processing and the scene of the Hash function design are applied, besides, the relationship between multi-core processors and multi-threading design is explored. Experimental data prove that the improved algo-rithm is available to support one million size pattern set for string matching.
作者 李志文 张伟
出处 《计算机工程与应用》 CSCD 2014年第1期105-110,129,共7页 Computer Engineering and Applications
基金 北京市教育委员会科技计划面上项目(No.KM201110772014)
关键词 大规模特征集 字符串匹配 短模式串 HASH函数 多线程技术 large-scale pattern set string matching short pattern Hash function multi-threading technology
  • 相关文献

参考文献6

二级参考文献55

共引文献47

同被引文献15

  • 1Commentz-Walter B.A string matching algorithm fast on the average[M]//Automata,Languages and Programming.Berlin:Springer,1979:118-132.
  • 2Wu Sun,Manber U.A fast algorithm for multi-pattern searching,TR94-17[R].Tuscon:University of Arizona,1994.
  • 3Aho A V,Corasick M J.Efficient string matching:an aid to bibliographic search[J].Communications of the ACM ,1975,18(6):333-340.
  • 4Horspool R N.Practical fast searching in strings[J].Software-Practice & Experience,1980,10(6):501-506.
  • 5Boyer R S,Moore J S.A fast string searching algorithm[J].Communications of the ACM,1977,20(10):762-772.
  • 6Norton M.Optimizing pattern matching for instrusion detection[EB/OL].(2006-05-11)[2006-11-09].http://does.idsresearch.org/OptimizingPatternMatching-ForIDS.pdf.
  • 7Navarro G,Raffinot M.Flexible pattern matching in strings[M].[S.l.] :The Press Syndicate of the University of Cambridge,2002.
  • 8Navarro G.Improved approximate pattern matching on hypertext[J].Theoretical Computer Science,2000,237(1):455-463.
  • 9Karp K M,Rabin M O.Efficient randomized patternmatching algorithms[J].IBM Journal of Research and Development,1987,31(2):249-260.
  • 10Salmela L,Tarhio J,Kytjoki J.Multipattern string matching with q-grams[J].ACM Journal of Experimental Algorithmics,2006,11(1):1-19.

引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部