Scraping Images

痛定思痛。 2022-06-07 02:54 367 views 0 likes


Scraping images from Baidu Tieba:

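# coding=utf-8
import re
import urllib.request


def getHtml(url):
    # Download the page and decode it so the regex can work on text
    page = urllib.request.urlopen(url)
    html = page.read().decode('utf-8', errors='ignore')
    return html


def getImg(html):
    # Capture the src value of every tag where src="....jpg" is followed by pic_ext
    reg = r'src="(.+?\.jpg)" pic_ext'
    imgre = re.compile(reg)
    imglist = imgre.findall(html)
    # Save the images as 0.jpg, 1.jpg, ... in the current directory
    for x, imgurl in enumerate(imglist):
        urllib.request.urlretrieve(imgurl, '%s.jpg' % x)
    return imglist


html = getHtml("http://tieba.baidu.com/p/2460150866")
print(getImg(html))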
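
The extraction step can be checked without touching the network. The sketch below applies the same pattern to a hand-written HTML fragment. The fragment, its example.com URLs, and the helper name `extract_image_urls` are made up for illustration, and the real Tieba markup may differ.

```python
import re

# Same pattern as getImg: capture the src value of tags that carry pic_ext
IMG_PATTERN = re.compile(r'src="(.+?\.jpg)" pic_ext')


def extract_image_urls(html):
    """Return every .jpg URL whose src attribute is followed by pic_ext."""
    return IMG_PATTERN.findall(html)


# Hypothetical fragment, one tag per line, written to resemble a post page
sample = "\n".join([
    '<img class="BDE_Image" src="http://example.com/a.jpg" pic_ext="jpeg">',
    '<img src="http://example.com/logo.jpg">',
    '<img class="BDE_Image" src="http://example.com/b.jpg" pic_ext="jpeg">',
])

urls = extract_image_urls(sample)
for index, url in enumerate(urls):
    # getImg would save this URL under the name "<index>.jpg"
    print('%s.jpg <- %s' % (index, url))
```

The tags sit on separate lines on purpose: `.` does not match a newline, so `.+?` cannot run past the tag without `pic_ext` into the next one. If several tags shared one line, a stricter class such as `[^"]+?` in place of `.+?` would probably be the safer choice.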

#————————————————————————————————————————————————————-

Scraping images from Douban:

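#!/usr/bin/env python3
# encoding:utf-8
import re
import urllib.request

# Fetch the Douban Movie home page and decode it to text
a = urllib.request.urlopen('https://movie.douban.com/').read()
a = a.decode('utf-8', errors='ignore')
# Non-greedy match so that each .jpg URL is captured on its own
b = re.findall(r'https://.+?\.jpg', a)
try:
    for i, c in enumerate(b):
        req = urllib.request.urlopen(c)
        buf = req.read()
        # Write each image to 0.jpg, 1.jpg, ...
        with open(str(i) + '.jpg', 'wb') as f:
            f.write(buf)
except Exception as e:
    print(e)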
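
A more defensive take on the same script is sketched below. It rests on two assumptions that the original does not state: that image URLs never contain quotes, whitespace, or angle brackets (so a character class can stand in for `.+`), and that the site may reject urllib's default User-Agent, which is why each request carries a browser-like header. The helper names, the header value, and the sample fragment are illustrative only.

```python
import re
import urllib.request

# Stop at quotes, whitespace, and angle brackets so that one match
# cannot swallow a neighbouring URL on the same line
JPG_URL = re.compile(r'https://[^\s"\'<>]+?\.jpg')

# Assumed header value; any browser-like string would serve the same purpose
HEADERS = {'User-Agent': 'Mozilla/5.0'}


def find_jpg_urls(html):
    """Return the distinct .jpg URLs in html, in order of first appearance."""
    seen = []
    for url in JPG_URL.findall(html):
        if url not in seen:
            seen.append(url)
    return seen


def build_request(url):
    """Wrap url in a Request that sends the headers defined above."""
    return urllib.request.Request(url, headers=HEADERS)


def download_all(urls):
    """Save each URL as 0.jpg, 1.jpg, ...; skip the ones that fail."""
    for i, url in enumerate(urls):
        try:
            with urllib.request.urlopen(build_request(url), timeout=10) as resp:
                data = resp.read()
        except OSError as e:
            # URLError, HTTPError, and socket timeouts are all OSError subclasses
            print('skipped %s: %s' % (url, e))
            continue
        with open('%d.jpg' % i, 'wb') as f:
            f.write(data)


# Hand-written fragment: a non-image link, two images, and a duplicate, all on one line
sample = (
    '<a href="https://example.com/page"><img src="https://example.com/1.jpg"></a>'
    '<img src="https://example.com/2.jpg"><img src="https://example.com/1.jpg">'
)
print(find_jpg_urls(sample))
```

Calling `download_all(find_jpg_urls(page_text))` on a fetched page would then save the images. Placing the `try` inside the loop means one failed download is reported and skipped instead of ending the whole run.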
