摘要
互联网技术发展迅速,各种各样网站开始大量的出现在互联网上,网站的PV统计显得尤为重要。目前网站PV的统计方法有多种,例如在网页上使用计数器等。这些统计方法适用于浏览量不高的网站,但是对与一些浏览量比较高的网站,来说就显得力不从心。针对上述问题,基于Hadoop-Streaming框架,设计开发一个流量统计工具以及数据可视化平台,可以对网站的访问情况,访问时间,访问页面等进行多维度统计,为网站调整运营架构提供可靠的数据依据。
Internet technology has developed rapidly, electricity providers' website, corporate portal, government agencies website, UGC and other sites began to appear on the Internet in large numbers. Therefore, the site's PV statistics is particularly important. The current website PV statistical methods are different, such as the use of the counter on the website, placed in the target site embedded script. These statistical methods for some of the pages of the page are not high, such as: enterprises, government agencies can be applied to the site, but with some relatively high views of the site, such as: large UGC, large-scale electricity providers' website seems powerless, There will be many problems. Based on Hadoop-Streaming framework, designs a large-scale website for the flow of statistical tools and visualization of these data web platform, the site can visit, visit time, visit the page for muhi-dimensional statistics, Adjusts the operating structure, further enriches the content, enhances service capabilities, enhances the interaction and so on to provide reliable data basis.
出处
《现代计算机》
2018年第1期73-77,共5页
Modern Computer