OLAP Analyzing System of Coal Sale Data Based on Hadoop Platform (cited by: 2)
Abstract To address the problem that coal sales data is large in volume but yields little information, an OLAP analysis system for coal sales data was developed on the Hadoop platform. The paper introduces the system's design concept and architecture and, taking sales-volume statistics as an example, describes the concrete process by which deep, fast data mining and intuitive display are implemented. The system uses the Hadoop cloud platform to perform ETL processing on the data, creates a Hive distributed data warehouse, and uses Hive's HQL language for OLAP statistical analysis. It can quickly and accurately carry out multi-level, multi-angle, in-depth data mining, statistics, and analysis of sales-volume information, and it presents the analysis results intuitively and from multiple angles.
Source Journal of Mine Automation (《工矿自动化》), Peking University Core Journal, 2012, No. 11, pp. 77-80 (4 pages)
Keywords coal sale; Hadoop platform; OLAP technique; cloud computing; data mining; data warehouse