摘要
针对互联网技术快速发展,用户对各种网站访问量急剧加大,日志数据急剧增加的现状,采用Hbase数据库,Flume、Kafka分布式发布订阅消息系统和Spark Streaming流计算框架,设计实现基于Spark Sreaming的网站流量实时分析系统,深入探讨了网站流量的分析角度和指标,展示了网站的运营情况,从而引导网站开发、运营人员作出相关决策来改进网站的服务,为网站维护、制定网站营销策略提供有力的依据。
In response to the rapid development of Internet technology,users have greatly increased the number of visits to various websites and the rapid increase of log data. The Hbase database,Flume,Kafka distributed publish and subscribe message system and Spark Streaming flow computing framework are designed and implemented based on Spark Sreaming. Website traffic real-time analysis system,in-depth discussion of the analysis angle and indicators of website traffic analysis,showing the operation of the website,thereby guiding the website development,the operators make relevant decisions to improve the website’s services,and provide website marketing strategies for website maintenance. A strong basis.
作者
刘珍
方明
LIU Zhen;FANG Ming(School of Computer Science,Xi'an Shiyou University,Xi'an 710065,China)
出处
《智能计算机与应用》
2019年第6期201-205,共5页
Intelligent Computer and Applications