程式人生 > Python爬取妹子網圖片

Python爬取妹子網圖片

提取文章標題

import requests
from bs4 import BeautifulSoup


url = 'http://www.mzitu.com/26685'
header = {'User-Agent': 'Mozilla/5.0 (Windows NT 10.0; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) '
                        'Chrome/50.0.2661.102 UBrowser/6.1.2107.204 Safari/537.36'}

# Fetch the gallery list page.  A timeout keeps a dead connection from
# hanging the script forever, and raise_for_status() turns an HTTP error
# into an exception instead of silently parsing an error page.
html = requests.get(url, headers=header, timeout=10)
html.raise_for_status()

soup = BeautifulSoup(html.text, 'html.parser')
# Every gallery link in the post list opens in a new tab (target="_blank").
all_a = soup.find('div', class_='postlist').find_all('a', target='_blank')
for a in all_a:
    title = a.get_text()  # extract the link text (the gallery title)
    print(title)

程式原始碼

import requests
from bs4 import BeautifulSoup
import os

all_url = 'http://www.mzitu.com'

# HTTP request headers: Hostreferer for HTML page requests, Picreferer for
# the image downloads themselves — its Referer defeats the site's hotlink
# protection (原文: 此請求頭破解盜鏈).
Hostreferer = {
    'User-Agent': 'Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1)',
    'Referer': 'http://www.mzitu.com',
}
Picreferer = {
    'User-Agent': 'Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1)',
    'Referer': 'http://i.meizitu.net',
}

# Root save directory; each gallery gets its own sub-directory under it.
path = '/Users/mubai888/Desktop/meizi/'


def get_soup(url, headers):
    """GET *url* and return the parsed document.

    A timeout prevents a dead connection from stalling the whole crawl,
    and raise_for_status() surfaces HTTP errors instead of letting the
    scraper parse an error page as if it were content.
    """
    resp = requests.get(url, headers=headers, timeout=10)
    resp.raise_for_status()
    return BeautifulSoup(resp.text, 'html.parser')


def save_gallery(href, title):
    """Download every image of the gallery at *href* into its own directory."""
    # Windows cannot create a directory whose name contains '?', so strip it.
    dir_name = os.path.join(path, title.strip().replace('?', ''))
    already_existed = os.path.exists(dir_name)
    os.makedirs(dir_name, exist_ok=True)

    mess = get_soup(href, Hostreferer)
    # The 11th <span> on a gallery page holds the page count.
    # NOTE(review): magic index kept from the original — fragile; confirm
    # against the live page layout before relying on it.
    pic_max = int(mess.find_all('span')[10].text)

    # Resume support: skip galleries already fully downloaded earlier.
    if already_existed and len(os.listdir(dir_name)) >= pic_max:
        print('已經儲存完畢,跳過')
        return

    for num in range(1, pic_max + 1):
        page = get_soup(href + '/' + str(num), Hostreferer)
        pic_url = page.find('img', alt=title)['src']
        print(pic_url)
        img = requests.get(pic_url, headers=Picreferer, timeout=10)
        img.raise_for_status()
        file_name = pic_url.split('/')[-1]
        # Write to an absolute path instead of os.chdir() (a process-global
        # side effect); 'with' guarantees the handle is closed on any error.
        with open(os.path.join(dir_name, file_name), 'wb') as f:
            f.write(img.content)
    print('完成')


def main():
    """Walk every list page of the site and download each gallery found."""
    # The second-to-last pagination link on the front page is the last page.
    soup = get_soup(all_url, Hostreferer)
    page = soup.find_all('a', class_='page-numbers')
    max_page = int(page[-2].text)

    same_url = 'http://www.mzitu.com/page/'
    for n in range(1, max_page + 1):
        soup = get_soup(same_url + str(n), Hostreferer)
        all_a = soup.find('div', class_='postlist').find_all('a', target='_blank')
        for a in all_a:
            title = a.get_text()
            if title != '':
                print('準備扒取:' + title)
                save_gallery(a['href'], title)
        print('第', n, '頁完成')


if __name__ == '__main__':
    main()