摘要
伴随网络信息时代的发展,网民人数持续增加,人们可以通过各种方式查找信息,但数据量太大导致获取个性化信息变得困难,耗时变长。基于此,借助Python爬虫技术,采用Scrapy框架,创建针对旅游信息的数据抓取项目。主要介绍了爬取数据的基本流程,给出了爬取数据的具体实例,对爬取数据的持久化存储进行了相关论述。
With the development of the Internet information age,the number of Internet users continues to increase,and people can find information through various ways,but the large amount of data makes it difficult to obtain personalized information and takes longer.Based on this,with the help of Python crawler technology,Scrapy framework is adopted to create a data scraping project for tourism information.This paper mainly introduces the basic flow of crawling data,gives a concrete example of crawling data,and discusses the persistent storage of crawling data.
作者
郭晨灏
柳箐
姜澳
赵美娇
徐子薇
王博
GUO Chen-hao;LIU Qing;JIANG Ao;ZHAO Mei-jiao;XU Zi-wei;WANG Bo(College of Applied Technology,University of Science and Technology Liaoning,Anshan 114000,China)
出处
《电脑与信息技术》
2024年第5期71-74,90,共5页
Computer and Information Technology
基金
辽宁科技大学大学生创新创业训练计划项目(项目编号:X202310146056)。