1. 程式人生 > >Python爬蟲抓取煎蛋(jandan.net)無聊圖

Python爬蟲抓取煎蛋(jandan.net)無聊圖

下載 logs start input req com read ref color

 1 #!/usr/bin/python
 2 #encoding:utf-8
 3 ‘‘‘
 4 @python 3.6.1
 5 @author: [email protected]
 6 @create date: 20170611
 7 
 8 ‘‘‘
 9 import requests
10 import urllib
11 import re
12 import os
13 
14 #當前最大頁數
15 url=http://jandan.net/pic
16 data=urllib.request.urlopen(url).read().decode(utf-8)
17
max_page_num=re.findall(r<span class.*current-comment-page.*?>\[(.*?)\]</span>,data)[1] 18 19 def download_pic(start_page,stop_page,download_file): 20 for num in range(start_page,stop_page): 21 os.chdir(download_file)#圖片保存目錄 22 url=http://jandan.net/pic/page-%s % num
23 data=urllib.request.urlopen(url).read().decode(utf-8) 24 pics=re.findall(r<a href="//(.*?)" target.*?</a><br />,data, re.I|re.S|re.M) 25 for i in pics: 26 i=http://+i 27 r=requests.get(i) 28 pic_name=i[28:100] 29 with open(pic_name,
wb) as f: 30 f.write(r.content) 31 f.close() 32 33 if __name__==__main__: 34 print(當前最大頁數為: %s % max_page_num) 35 page1,page2=input(請輸入要下載頁數範圍,例如 500,513 :).split(,) 36 download_pic(int(page1),int(page2),rC:\Users\samsung\Desktop\jd_pics) 37

Python爬蟲抓取煎蛋(jandan.net)無聊圖