期刊文献+

支持文本识别与取词的DVD播放软件的设计与实现

DESIGN AND IMPLEMENTATION OF DVD PLAYING SOFTWARE WITH OCR AND WORD RETRIEVAL SUPPORT
在线阅读 下载PDF
导出
摘要 DVD视频提供丰富的文本资源,但是由于其文本信息是以图片的形式存储的,目前的播放软件不能提供针对文本信息的识别和查询。通过对包含字幕数据的私有流1的分析,总结了字幕数据的存储格式和访问接口,给出了字幕流的提取和解码算法,提出了自动化的文本识别方法,并且以MPEG-2解码器为核心设计了一种支持文本识别和查询的DVD播放软件,最后利用Di-rectX技术实现了该播放软件。 DVD video contains plenty of text information, however, current playing software can not provide text information with recognition and searching because it is stored in form of the picture in DVD. By analysing the structure of private stream 1 that includes the subtitle stream data, the storage structure and accessing interface of subtitle data are summarized. The extraction and decoding algorithm for subtitle is described, and the method of automated text recognition is proposed. The DVD playing software supporting text recognition and searching is designed with MPEG-2 decoder as its core and is implemented with the DirectX technologies.
出处 《计算机应用与软件》 CSCD 2009年第9期95-98,共4页 Computer Applications and Software
基金 河南省科技攻关项目(0324500015)资助
关键词 数字视频光盘 播放器 私有流 字幕流 文本识别 DIRECTX Digital video disk Player Private stream Subtitle stream Text Recognition DirectX
  • 相关文献

参考文献8

  • 1蓝波,林小竹,籍俊伟.一种改进的RLE算法在图像数据编码中的应用[J].微电子学与计算机,2004,21(5):101-103. 被引量:9
  • 2芦亚亚,丁维龙,古辉.由行程编码改进的一种通用性压缩算法[J].浙江工业大学学报,2007,35(1):60-64. 被引量:9
  • 3Versatile Logic. Is it easy to understand DVD format? [ EB/OL]. (2001 - 10 - 24 ) [ 2008 - 02 - 15 ]. http ://web. archive. org/web/ 20011024182535/dvdpro. com/dvd. htm.
  • 4DVD-Replica. Unofficial DVD Specifications[ EB/OL ]. [ 2008 -02 - 15 ]. http ://www. dvd-replica. com/.
  • 5Jim Taylor. DVD Frequently Asked Questions ( and Answers) [ EB/ OL]. ( 2008 - 01- 04 ) [ 2008 - 02 - 15 ]. http://www, dvddemysti- fied. com/dvdfaq. html.
  • 6SoundWare Associates. DVD-Video Information [ EB/OL ]. [ 2008 - 02 - 15 ]. http ://www. mpucoder. com/DVD/index.html.
  • 7庄越挺,刘骏伟,吴飞,潘云鹤,张引.基于支持向量机的视频字幕自动定位与提取[J].计算机辅助设计与图形学学报,2002,14(8):750-753. 被引量:38
  • 8陆其明.DIRECTSHOW开发指南[M].北京:科学出版社,2004.

二级参考文献23

  • 1冯志全,范平,张少白,王玉茹,成谢锋.一种无失真图像数据压缩算法[J].计算机应用,2001,21(z1):134-134. 被引量:3
  • 2孙学岩,叶海建,韩玉坤.数字图像压缩原理及常用压缩编码方法[J].农机化研究,2005,27(3):128-130. 被引量:3
  • 3[1]Y Wang, Z Liu, J Huang. Multimedia content analysis using audio and visual information[J]. IEEE Signal Processing Magazine, 2000, 17(6):12~36
  • 4[2]R Lienhart, F Stuber. Automatic text recognition in digital videos[A]. In: Proceedings of ACM Multimedia, Boston, 1996.11~20
  • 5[3]Zhong Yu, Zhang Hongjiang, Jain Anil K. Automatic caption localization in compressed video[J]. Pattern Analysis and Machine Intelligence, 2000, 22(4):385~392
  • 6[4]V Vapnik. The Nature of Statistical Learning Theory[M]. New York: Springer, 1995
  • 7[5]M Schmidt. Identifying speaker with support vector networks[A]. In: Proceedings of Interface'96, Sydney, 1996
  • 8[6]T Joachims. Text categorization with support vector machines: Learning with many relevant features[A]. In: Proceedings of the 10th European Conference on Machine Learning, Chemnitz, Germany, 1998.137~142
  • 9[7]Yuan Qi. Learning algorithms for video and audio processing: Independent component analysis and support vector machine based approaches[R].College Park: University of Maryland at College Park, LAMP-TR-056(CAR-TR-951), 2000
  • 10[8]Edgar Osuna, Robert Freund, Federico Girosi. Training support vector machines: An application to face detection[A]. In: Proceedings of Computer Vision and Pattern Recognition, Puerto Rico, 1997.130~136

共引文献50

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部