摘要
Web上的数据量急剧膨胀使得进行Web数据挖掘成为数据挖掘技术研究的热点。而XML能够为Web挖掘提供半结构化的数据模型,解决了Web挖掘中的数据源问题。介绍了XML的和Web文本挖掘的概念,提出了一种基于XML的Web文本挖掘模型,剖析了该模型的各个组成部分,给出了该模型的特点。
With the flood of the data on the web, web data mining has become the focus of the data mining technology. XML can provide a semi-structrual data model for web data mining, resolving the difficult of data source for web mining. The definition of XML and web text mining is introduced generally, a model of XML-based web text mining is designed, and the parts of the model is analysed, and finally the characteristics of the model is presented.
出处
《计算机工程与设计》
CSCD
北大核心
2007年第10期2287-2290,共4页
Computer Engineering and Design