Python Web Scraping: Requesting Data with the request.Request Class

左手的ㄟ右手 2021-10-13 03:58

## About the request.Request class: ##

![watermark_type_ZmFuZ3poZW5naGVpdGk_shadow_10_text_aHR0cHM6Ly9ibG9nLmNzZG4ubmV0L2xpbmRhaTMyOQ_size_16_color_FFFFFF_t_70][]

Code:

```python
from urllib import request

# Build the request headers
headers = {
    'User-Agent': 'Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/70.0.3538.77 Safari/537.36'
}
# Wrap the URL and headers in a Request object
rq = request.Request('https://www.baidu.com/', headers=headers)
# Send the request and read the response
resp = request.urlopen(rq)
print(resp.read())
```

In practice: fetching Maoyan box-office data

```python
from urllib import request

# Build the request headers
headers = {
    'User-Agent': 'Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/70.0.3538.77 Safari/537.36'
}
rq = request.Request('http://piaofang.maoyan.com/dashboard', headers=headers)
resp = request.urlopen(rq)
print(resp.read())
```

Result: this is not what we want, because the response does not contain the real-time data. It turns out the data is stored in a JSON file.

![20190821173141231.png][]

![20190821173202361.png][]

So request the JSON file directly to get the data:

```python
from urllib import request

# Build the request headers
headers = {
    'User-Agent': 'Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/70.0.3538.77 Safari/537.36'
}
# The data lives in a JSON file, so request the JSON file's URL directly
rq = request.Request('https://box.maoyan.com/promovie/api/box/second.json', headers=headers)
resp = request.urlopen(rq)
# read() returns bytes; decode them to get readable text
print(resp.read().decode('utf-8'))
```

![watermark_type_ZmFuZ3poZW5naGVpdGk_shadow_10_text_aHR0cHM6Ly9ibG9nLmNzZG4ubmV0L2xpbmRhaTMyOQ_size_16_color_FFFFFF_t_70 1][]

[watermark_type_ZmFuZ3poZW5naGVpdGk_shadow_10_text_aHR0cHM6Ly9ibG9nLmNzZG4ubmV0L2xpbmRhaTMyOQ_size_16_color_FFFFFF_t_70]: /images/20211013/c325131789ed450d895bff0ff6c9cc3c.png
[20190821173141231.png]: /images/20211013/8d22b598f4ff4e409757db540938e8b4.png
[20190821173202361.png]: /images/20211013/83263060f68949d3ae7098284bf82b2e.png
[watermark_type_ZmFuZ3poZW5naGVpdGk_shadow_10_text_aHR0cHM6Ly9ibG9nLmNzZG4ubmV0L2xpbmRhaTMyOQ_size_16_color_FFFFFF_t_70 1]: /images/20211013/a74764949b9a45c99a9223560109ea5a.png
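
The last example above prints the JSON response as plain text. As a possible next step, the text can be parsed into Python objects with the standard `json` module. The following is only a minimal sketch: the helper names `build_request`, `parse_json_body`, and `fetch_json`, and the 10-second timeout, are assumptions made for illustration and are not part of the original post. It also makes no assumption about which fields the Maoyan JSON contains.

```python
import json
from urllib import request

# Same User-Agent string as in the examples above.
HEADERS = {
    'User-Agent': 'Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/70.0.3538.77 Safari/537.36'
}


def build_request(url, headers=HEADERS):
    """Wrap the URL and headers in a request.Request object (hypothetical helper)."""
    return request.Request(url, headers=headers)


def parse_json_body(raw, encoding='utf-8'):
    """Decode raw response bytes and parse them as JSON (hypothetical helper)."""
    return json.loads(raw.decode(encoding))


def fetch_json(url, timeout=10):
    """Send the request and return the parsed JSON, or None on failure.

    The timeout value is an arbitrary choice for this sketch.
    """
    rq = build_request(url)
    try:
        with request.urlopen(rq, timeout=timeout) as resp:
            return parse_json_body(resp.read())
    except (OSError, ValueError) as exc:
        # URLError and HTTPError are subclasses of OSError;
        # JSON and Unicode decoding errors are subclasses of ValueError.
        print('request failed:', exc)
        return None
```

Calling `fetch_json('https://box.maoyan.com/promovie/api/box/second.json')` would then return a Python object (typically a dict or list) instead of a string, assuming the endpoint still responds with valid JSON.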