问题描述
- python3 爬取https指向的图片链接 问题
- 初学python爬虫部分,昨天动手做了做爬取网页图片并且下载的小程序。
发现网页中有的图片可以被下载并且保存在本地,有的却不能。
查看了下网页代码原来可以被爬取的是
而不能被爬取的是
用urllib包好像处理不了请问如何解决
附上程序报错提示
Traceback (most recent call last):
File ""D:Python34liburllib
equest.py"" line 1182 in do_open
h.request(req.get_method() req.selector req.data headers)
File ""D:Python34libhttpclient.py"" line 1088 in request
self._send_request(method url body headers)
File ""D:Python34libhttpclient.py"" line 1126 in send_request
self.endheaders(body)
File ""D:Python34libhttpclient.py"" line 1084 in endheaders
self._send_output(message_body)
File ""D:Python34libhttpclient.py"" line 922 in _send_output
self.send(msg)
File ""D:Python34libhttpclient.py"" line 857 in send
self.connect()
File ""D:Python34libhttpclient.py"" line 1231 in connect
server_hostname=server_hostname)
File ""D:Python34libssl.py"" line 365 in wrap_socket
_context=self)
File ""D:Python34libssl.py"" line 583 in __init_
self.do_handshake()
File ""D:Python34libssl.py"" line 810 in do_handshake
self._sslobj.do_handshake()
ssl.SSLError: [SSL: CERTIFICATE_VERIFY_FAILED] certificate verify failed (_ssl.c:600)During handling of the above exception another exception occurred:
Traceback (most recent call last):
File ""D:py_practicepachong_rumen_1024.py"" line 45 in
bytes = urllib.request.urlopen(url);
File ""D:Python34liburllib
equest.py"" line 161 in urlopen
return opener.open(url data timeout)
File ""D:Python34liburllib
equest.py"" line 463 in open
response = self._open(req data)
File ""D:Python34liburllib
equest.py"" line 481 in _open
'_open' req)
File ""D:Python34liburllib
equest.py"" line 441 in _call_chain
result = func(*args)
File ""D:Python34liburllib
equest.py"" line 1225 in https_open
context=self._context check_hostname=self._check_hostname)
File ""D:Python34liburllib
equest.py"" line 1184 in do_open
raise URLError(err)
urllib.error.URLError:
解决方案
解决方案二:
Python3爬取图片
解决方案三:
图片的网址对比一下是否有问题,两者图片地址用浏览器分别打开试试
解决方案四:
写爬虫,用神箭手云爬虫吧,比其他爬虫框架快多了,官方客服mm很热情