爬蟲 requests.post

阿新 • • 發佈：2018-11-01

爬蟲 requests.post

可以模擬網頁向伺服器傳送訊息，獲取想要的內容

1.無返回值

開啟並登陸豆瓣
這裡寫圖片描述

這裡寫程式碼片

這裡寫圖片描述

模擬豆瓣登陸

import requests

postUrl = 'https://www.douban.com/accounts/login'
id = '******' #賬戶
passwd = '*****' #密碼
headers = {
    'Referer':'https://www.douban.com/',
    'User-Agent':'Mozilla/5.0 (Windows NT 10.0; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/58.0.3029.110 Safari/537.36' 

}
postData ={
    'source':'index_nav',
'form_email':id,
'form_password':passwd,
'captcha-solution':'sponge',
'captcha-id':'T65SuHhM8GeYaQb8QFGsmI2H:ens'
}
responseRes = requests.post(postUrl, data=postData, headers=headers)
if (responseRes.status_code == 200):
    print("模擬登陸成功")

2.返回html

爬取的某大學的本學期的成績

# code=utf-8
import requests
from bs4 import BeautifulSoup
import csv

userAgent = "Mozilla/5.0 (Windows NT 10.0; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) "
header = {
    "Referer": "http://210.44.176.116/cjcx/zcjcx_login.html",
    'User-Agent': userAgent,
}


def To_csv(id, html):
    soup = BeautifulSoup(html, features="html.parser" 
)
    stu = soup.table
    stu_table = stu.table
    stu_label = stu_table.find_all("th")
    stu_info = stu_table.find_all("td")
    print("學生基本資訊：")
    for i in range(len(stu_label)):
        if (stu_info[i].text != " "):
            print(stu_label[i].text + ":" + stu_info[i].text)

    score_table = stu.find_all("table")[1]
    label_list = []
    for label in score_table.find_all("th"):
        label_list.append(label.text)

    score_list = []
    score_tr = score_table.find_all("tr")
    for row in range(1, len(score_tr)):
        course = score_tr[row]
        dist = {}
        i = 0;
        for score in course.find_all("td"):
            dist[label_list[i]] = score.text
            i += 1
        score_list.append(dist)
    print("開始寫入csv")
    with open(id + '（本學期）.csv', 'w', encoding='utf-8-sig') as csvfile:
        fieldnames = label_list
        writer = csv.DictWriter(csvfile, fieldnames=fieldnames)
        writer.writeheader()
        for list in score_list:
            writer.writerow(list)
    print("寫入成功")


def Login(account):
    print("開始獲取" + account + "的成績")

    postUrl = "http://210.44.176.116/cjcx/dqcjcx_list.php"

    postData = {
        "post_xuehao": account,
        "Submit": "提交"
    }
    responseRes = requests.post(postUrl, data=postData, headers=header)
    if (responseRes.status_code == 200):
        print("成績爬取成功")
    return responseRes.text


if __name__ == "__main__":
    id = "******" #學號
    text = Login(id)
    To_csv(id, text)

3.返回josn

爬取某旅遊網站的列表資訊

import requests
import json

# post取內容
post_url = 'http://www.mafengwo.cn/mdd/base/list/pagedata_citylist'

form = {
    'mddid': '13061',
    'page': 1
}
# 模擬Post請求form
response_json = requests.post(post_url, data=form).text
text = json.loads(response_json)

li_text = text['list']
print(li_text)

爬蟲 requests.post

爬蟲 requests.post 可以模擬網頁向伺服器傳送訊息，獲取想要的內容 1.無返回值開啟並登陸豆瓣模擬豆瓣登陸 import requests postUrl = 'https://www.douban.com/accounts/logi

[Python爬蟲]requests模組使用post方法提交表單

使用requests庫中的post(url,params)方法,先通過觀察表單的網頁原始碼,或者是通過逆向工程的方法獲取表單提交的欄位,構造引數params,就能實現模擬登入操作. 例如: url =

Python爬蟲學習4：requests.post模擬登入豆瓣（包括獲取驗證碼）

1. 在豆瓣登入網頁嘗試登入後開啟開發者工具，可以查詢後去Headers和Form Data資訊。2. 實現程式碼import requests import html5lib import re from bs4 import BeautifulSoup s = re

requests post一個json數據

accep safari cati size ica not gzip eee content # post一個json數據 import requests headers={ "Accept":"application/json, text/plain, */*",

網絡爬蟲--requests庫中兩個重要的對象

resp head ppa except 代碼 http http響應 sts _for 當我們使用resquests.get（）時，返回的時response的對象，他包含服務器返回的所有信息，也包含請求的request的信息。首先： response對象的屬性有以下幾個

Python Requests post並將得到結果轉換為json

request blog req pre AS log details class ocs Python Requests post並將得到結果轉換為json 學習了：https://blog.csdn.net/sinat_28680819/article/details/

python 爬蟲 requests+BeautifulSoup 爬取巨潮資訊公司概況代碼實例

pan 字符 selenium 5.0 target 自我 color list tails 第一次寫一個算是比較完整的爬蟲，自我感覺極差啊，代碼low，效率差，也沒有保存到本地文件或者數據庫，強行使用了一波多線程導致數據順序發生了變化。。。貼在這裏，引以為戒吧。 #

關於爬蟲的日常復習（13）—— 爬蟲requests的初級高級的基本用法

bubuko req src http ima 基本爬蟲用法 image 關於爬蟲的日常復習（13）—— 爬蟲requests的初級高級的基本用法

Python爬蟲之post請求

對象 parse ... src pytho clas open 網址源代碼暑假放假在家沒什麽事情做，所以在學習了爬蟲，在這個博客園裏整理記錄一些學習的筆記。構建表單數據（以http://www.iqianyue.com/mypost 這個簡單的網頁為例）查看源代碼

Python之爬蟲-- Requests

目錄 Requests-獻給人類一、簡介二、安裝方式三、 GET請求四、POST請求五、顯示json檔案六、代理（proxies引數）七、使用者驗證八、Cookies 和 Session 1、Cookies 2、Se

requests.post()方法中的data引數和json引數

json和dict python中的dict型別要轉換為json格式的資料需要用到json庫： import json <json> = json.dumps(<dict>) <dict> = json.loads(<json>) 需要

Python requests.post方法中data與json引數區別

在通過requests.post()進行POST請求時，傳入報文的引數有兩個，一個是data，一個是json。 data與json既可以是str型別，也可以是dict型別。區別： 1、不管json是str還是dict，如果不指定headers中的content-type，預設為application/

python:爬蟲之Post請求以及動態Ajax資料的爬取（3）

#爬蟲的post方式作用：對引數進行打包反饋給伺服器 import urllib.request import urllib.parse #對引數打包 url = "http://www.sunck.wang:8085/form" data = { "use

爬蟲---requests錯誤

import requests r = requests.get("http://www.baidu.com/") print(r) 結果： <提示：如果你的抓取軟體沒有認證，認證後也不會出現錯誤，這個屬於衝突，個人理解> <Response [200]>

python爬蟲---requests庫的用法

href 分享圖片三方庫 put src from ges 2.x con requests是python實現的簡單易用的HTTP庫，使用起來比urllib簡潔很多因為是第三方庫，所以使用前需要cmd安裝 pip install requests 安裝完成後imp

爬蟲 | Requests: 讓 HTTP 服務人類

Requests官方文件： http://cn.python-requests.org/zh_CN/latest/ 例子：獲取豆瓣短評頁面原始碼 import requests url='https://book.douban.com/subject/1084

Python爬蟲——Requests庫

Python爬蟲——Requests庫 Requests庫 HTTP協議在說爬蟲之前，先了解了解什麼是HTTP協議。 HTTP–Hyper Text Transfer Protocol，超文字傳輸協議，是一種建立在TCP上的無狀態連線，整個基本的工作流

.net爬蟲獲取post資料

用Fiddler抓下請求的資訊，在http頭裡面會看到下面的資料： Accept: */* Accept-Encoding: gzip, deflate Accept-Language: zh-CN User-Agent: Mozilla/4.0 (compatible;

爬蟲Requests基本使用

Requests基本使用安裝 pip install requests 一、Requests模組請求獲取網頁(不帶引數) r = requests.get('http://www.chinahufei.com') r = requests.post('http://www.chinahufei.c

爬蟲-requests模組

引入 Requests 唯一的一個非轉基因的 Python HTTP 庫，人類可以安全享用。警告：非專業使用其他 HTTP 庫會導致危險的副作用，包括：安全缺陷症、冗餘程式碼症、重新發明輪子症、啃文件症、抑鬱、頭疼、甚至死亡。今日概要基於requests的get請求基於r

爬蟲 requests.post

爬蟲 requests.post

1.無返回值

2.返回html

3.返回josn

相關推薦