摘要
"大数据"是2012年排名第二的热词,本文试图从数据库研究者的视角来解读大数据,说明"大数据"这个概念的诞生、内涵和外延以及它和传统数据库的关系。将在现今语境下重新审视"数据库研究",即如何理解"数据库"这个概念以及数据库研究的本质问题。还将讨论Hadoop与大数据的关系,"数据库研究"和"大数据研究"的关系。通过回顾Hadoop的起源和发展,从数据处理的角度说明Hadoop发展的偶然性和必然性,以及它所处的地位。基本观点是:"大数据"是个笼统的概念,对其进行分类有助于深入理解;大数据研究的显著特征是它与应用密切相关;Hadoop是数据管理研究回到文件系统这一原点后的一个有益探索;"大数据"和传统的数据库在研究理念和方法学上是一脉相承的。
"Big Data" is one of the hottest topics in 2012.We try to detangle Big Data from the view of database researcher,and describe the concept of Big Data and the relationship between Big Data and traditional databases.Revisiting database research in the Big Data scene includes reinvestigating the concept of database and essential issues of database research,discussing the relationship between Hadoop and Big Data,database research and Big Data research as well.Through tracking the inspiration and development of Hadoop,we try to explain why it has been such a big deal in Big Data.The basic ideas of the report are:(1) Big Data is a general concept.Classification of Big Data is helpful to have a deep understanding of it; (2) Big Data research closely correlates to its applications; (3) Hadoop is an enlightening exploration for database research going back to file system; (4) the philosophy and methodology of Big Data is consistent with those of traditional databases.
出处
《计算机工程与科学》
CSCD
北大核心
2013年第10期1-11,共11页
Computer Engineering & Science