python的多執行緒程式設計

阿新 • • 發佈：2018-12-22

1，python中一個執行緒對應於c語言中的一個執行緒
gil使得同一個時刻只有一個執行緒在一個cpu上執行位元組碼, 無法將多個執行緒對映到多個cpu上執行
gil會根據執行的位元組碼行數以及時間片釋放gil，gil在遇到io的操作時候主動釋放

total = 0

def add():
    #1. dosomething1
    #2. io操作
    # 1. dosomething3
    global total
    for i in range(1000000):
        total += 1
def desc():
    global total
    for i in 
 range(1000000):
        total -= 1

import threading
thread1 = threading.Thread(target=add)
thread2 = threading.Thread(target=desc)
thread1.start()
thread2.start()

thread1.join()
thread2.join()
print(total)

每一次執行的結果都會不一樣，所以有GIL的python執行緒也不是安全的，但是python遇到io操作的話，會等到io操作時候主動釋放GIL,

2，多執行緒程式設計

①對於io操作來說，多執行緒和多程序效能差別不大

----------------------------------------------------

方式1：

通過Thread類來例項化

import time
import threading

def get_detail_html(url):
    print("get detail html started")
    time.sleep(2)
    print("get detail html end")


def get_detail_url(url):
    print("get detail url started")
    time.sleep( 
4)
    print("get detail url end")


if __name__ =="__main__":
    thread1 = threading.Thread(target=get_detail_html, args=("",))
    thread2 = threading.Thread(target=get_detail_url, args=("",))
    start_time = time.time()
    thread1.start()
    thread2.start()
    print("last time {}".format(time.time()-start_time))

get detail html started
get detail url started
last time 0.0010006427764892578
get detail html end
get detail url end

執行時間居然是0，兩個執行緒並行時間不應該是2秒嗎？其實實際上這是有3個執行緒，可以通過pycharm的IDE中進行debug

可以看得到其實是三個執行緒的

那就意味著三個執行緒並行，2個執行緒睡2秒，但第三個執行緒依舊可以繼續向下進行，因為他們是並行的，因此，時間才會接近於0，

但是此時雖然主執行緒結束了，但是並沒有退出！子執行緒依舊可以執行，如何設定主執行緒退出之後立即kill掉子執行緒呢？

thread1 = threading.Thread(target=get_detail_html, args=("",))
    thread2 = threading.Thread(target=get_detail_url, args=("",))
    start_time = time.time()
    thread1.setDaemon(True)  # setDaemon 設定為True是將其設定為守護執行緒
    thread2.setDaemon(True)
    thread1.start()
    thread2.start()

但是如何讓這個主執行緒等待其餘2個子執行緒結束之後再去執行呢？

thread1 = threading.Thread(target=get_detail_html, args=("",))
    thread2 = threading.Thread(target=get_detail_url, args=("",))
    start_time = time.time()
    thread1.start()
    thread2.start()
    thread1.join()
    thread2.join()

join()就是設定主執行緒必須等待子執行緒結束之後才能夠退出，注意：必須在start()之後寫

那如何簡化多執行緒程式設計呢？（繼承Thread類）

②通過繼承Thread來實現多執行緒

class GetDetailHtml(threading.Thread):
    def __init__(self, name):
        super().__init__(name=name)

    def run(self):  過載run方法
        print("get detail html started")
        time.sleep(2)
        print("get detail html end")


class GetDetailUrl(threading.Thread):
    def __init__(self, name):
        super().__init__(name=name)

    def run(self):
        print("get detail url started")
        time.sleep(4)
        print("get detail url end")

if __name__ == "__main__":
    thread1 = GetDetailHtml("get_detail_html")
    thread2 = GetDetailUrl("get_detail_url")
    start_time = time.time()
    thread1.start()
    thread2.start()

    thread1.join()
    thread2.join()

    # 當主執行緒退出的時候， 子執行緒kill掉
    print("last time: {}".format(time.time() - start_time))

那歸根到底就能夠自定義很多複雜的邏輯了

---------------------------------------------------------

執行緒間的通訊和共享變數

從第一個例子中我們就公用了同一個total變數

但是共享變數會導致變數被反覆修改

# 通過queue的方式進行執行緒間同步
from queue import Queue
import time
import threading


def get_detail_html(queue):
    # 爬取文章詳情頁
    while True:
        url = queue.get()  # queue是一個阻塞方法，佇列中沒有值得時候他會一直阻塞
        # for url in detail_url_list:
        print("get detail html started")
        time.sleep(2)
        print("get detail html end")


def get_detail_url(queue):
    # 爬取文章列表頁
    while True:
        print("get detail url started")
        time.sleep(4)
        for i in range(20):
            queue.put("http://projectsedu.com/{id}".format(id=i))  # 佇列滿了也會阻塞住
        print("get detail url end")


# 1. 執行緒通訊方式- 共享變數

if __name__ == "__main__":
    detail_url_queue = Queue(maxsize=1000)  # 宣告最大值的訊息佇列，執行緒是安全的

    thread_detail_url = threading.Thread(target=get_detail_url, args=(detail_url_queue,))
    for i in range(10):
        html_thread = threading.Thread(target=get_detail_html, args=(detail_url_queue,))
        html_thread.start()
    # # thread2 = GetDetailUrl("get_detail_url")
    start_time = time.time()
    # thread_detail_url.start()
    # thread_detail_url1.start()
    #
    # thread1.join()
    # thread2.join()
    detail_url_queue.task_done()  # 必須呼叫
    detail_url_queue.join()  # 和執行緒一致

    # 當主執行緒退出的時候， 子執行緒kill掉
    print("last time: {}".format(time.time() - start_time))

因此，當涉及到共享變數的時候，首先推薦採用queue來完成

1，執行緒安全

2，對於可以採用task_done 隨時停止

-----------------------------------------------------------------------------------------

4，執行緒同步：（鎖機制）

# -*- coding:UTF-8 -*-
__autor__ = 'zhouli'
__date__ = '2018/12/18 21:44'


from threading import Lock


total = 0
lock = RLock()


def add():
    # 1. dosomething1
    # 2. io操作
    # 1. dosomething3
    global lock
    global total
    for i in range(1000000):
        lock.acquire()
        total += 1
        lock.release()


def desc():
    global total
    global lock
    for i in range(1000000):
        lock.acquire()
        total -= 1
        lock.release()


import threading

thread1 = threading.Thread(target=add)
thread2 = threading.Thread(target=desc)
thread1.start()
thread2.start()

#
thread1.join()
thread2.join()
print(total)

# 1. 用鎖會影響效能
# 2. 鎖會引起死鎖
# 死鎖的情況 A（a，b）

加鎖一定要釋放！！否則死鎖!!

因為使用鎖的情況下會很繞，所以python給我們重新定義了一個Rlock（可重入的鎖）

# 在同一個執行緒裡面，可以連續呼叫多次acquire， 一定要注意acquire的次數要和release的次數相等

程式碼修改如下：

from threading import Lock, RLock, Condition  # 可重入的鎖

# 在同一個執行緒裡面，可以連續呼叫多次acquire， 一定要注意acquire的次數要和release的次數相等
total = 0
lock = RLock()


def add():
    # 1. dosomething1
    # 2. io操作
    # 1. dosomething3
    global lock
    global total
    for i in range(1000000):
        lock.acquire()
        lock.acquire()
        total += 1
        lock.release()
        lock.release()


def desc():
    global total
    global lock
    for i in range(1000000):
        lock.acquire()
        total -= 1
        lock.release()


import threading

thread1 = threading.Thread(target=add)
thread2 = threading.Thread(target=desc)
thread1.start()
thread2.start()

#
thread1.join()
thread2.join()
print(total)

# 1. 用鎖會影響效能
# 2. 鎖會引起死鎖
# 死鎖的情況 A（a，b）
"""
A(a、b)
acquire (a)
acquire (b)

B(a、b)
acquire (a)
acquire (b)
"""

在同一個執行緒裡面才是如此，不同執行緒之間還是一個互相競爭的關係！

多執行緒的難點：condition（條件變數）

他是多執行緒中用於複雜的多執行緒通訊中的鎖，條件變數

通過原始碼可知其中的wait和notify方法

其中wait()方法是等待執行緒的的啟動，notify去通知另一個執行緒的啟動

import threading


# 條件變數， 用於複雜的執行緒間同步
# class XiaoAi(threading.Thread):
#     def __init__(self, lock):
#         super().__init__(name="小愛")
#         self.lock = lock
#
#     def run(self):
#         self.lock.acquire()
#         print("{} : 在 ".format(self.name))
#         self.lock.release()
#
#         self.lock.acquire()
#         print("{} : 好啊 ".format(self.name))
#         self.lock.release()
#
#
# class TianMao(threading.Thread):
#     def __init__(self, lock):
#         super().__init__(name="天貓精靈")
#         self.lock = lock
#
#     def run(self):
#         self.lock.acquire()
#         print("{} : 小愛同學 ".format(self.name))
#         self.lock.release()
#
#         self.lock.acquire()
#         print("{} : 我們來對古詩吧 ".format(self.name))
#         self.lock.release()


# 通過condition完成協同讀詩

class XiaoAi(threading.Thread):
    def __init__(self, cond):
        super().__init__(name="小愛")
        self.cond = cond

    def run(self):
        with self.cond:  # 一定要使用with語句
            self.cond.wait()  # 後說話使用先要等待
            print("{} : 在 ".format(self.name))
            self.cond.notify()  # 去通知

            self.cond.wait()
            print("{} : 好啊 ".format(self.name))
            self.cond.notify()

            self.cond.wait()
            print("{} : 君住長江尾 ".format(self.name))
            self.cond.notify()

            self.cond.wait()
            print("{} : 共飲長江水 ".format(self.name))
            self.cond.notify()

            self.cond.wait()
            print("{} : 此恨何時已 ".format(self.name))
            self.cond.notify()

            self.cond.wait()
            print("{} : 定不負相思意 ".format(self.name))
            self.cond.notify()


class TianMao(threading.Thread):
    def __init__(self, cond):
        super().__init__(name="天貓精靈")
        self.cond = cond

    def run(self):
        with self.cond:
            print("{} : 小愛同學 ".format(self.name))
            self.cond.notify()  # 先去通知
            self.cond.wait()  # 等待

            print("{} : 我們來對古詩吧 ".format(self.name))
            self.cond.notify()
            self.cond.wait()

            print("{} : 我住長江頭 ".format(self.name))
            self.cond.notify()
            self.cond.wait()

            print("{} : 日日思君不見君 ".format(self.name))
            self.cond.notify()
            self.cond.wait()

            print("{} : 此水幾時休 ".format(self.name))
            self.cond.notify()
            self.cond.wait()

            print("{} : 只願君心似我心 ".format(self.name))
            self.cond.notify()
            self.cond.wait()


if __name__ == "__main__":
    from concurrent import futures

    cond = threading.Condition()
    xiaoai = XiaoAi(cond)
    tianmao = TianMao(cond)

    # 啟動順序很重要
    # 在呼叫with cond之後才能呼叫wait或者notify方法
    # condition有兩層鎖， 一把底層鎖會線上程呼叫了wait方法的時候釋放， 上面的鎖會在每次呼叫wait的時候分配一把並放入到cond的等待佇列中，等到notify方法的喚醒
    xiaoai.start()
    tianmao.start()

5,Semaphore的使用

# Semaphore 是用於控制進入數量的鎖
# 檔案， 讀、寫， 寫一般只是用於一個執行緒寫，讀可以允許有多個

# 做爬蟲
import threading
import time


class HtmlSpider(threading.Thread):
    def __init__(self, url, sem):
        super().__init__()
        self.url = url
        self.sem = sem

    def run(self):
        time.sleep(2)
        print("got html text success")
        self.sem.release()  # 一定要注意鎖的釋放的位置，一旦鎖被釋放sem就會增加1


class UrlProducer(threading.Thread):
    def __init__(self, sem):
        super().__init__()
        self.sem = sem

    def run(self):
        for i in range(20):
            self.sem.acquire()
            html_thread = HtmlSpider("https://baidu.com/{}".format(i), self.sem)
            html_thread.start()


if __name__ == "__main__":
    sem = threading.Semaphore(3)
    url_producer = UrlProducer(sem)
    url_producer.start()

6，執行緒池

from concurrent.futures import ThreadPoolExecutor

為什麼要執行緒池？

主執行緒中可以獲取某一個執行緒的狀態或者某一個任務的狀態，以及返回值

當一個執行緒完成的時候我們主執行緒能立即知道

futures可以讓多執行緒和多程序編碼介面一致

from concurrent.futures import ThreadPoolExecutor, as_completed, wait, FIRST_COMPLETED


# 未來物件，task的返回容器


# 執行緒池， 為什麼要執行緒池
# 主執行緒中可以獲取某一個執行緒的狀態或者某一個任務的狀態，以及返回值
# 當一個執行緒完成的時候我們主執行緒能立即知道
# futures可以讓多執行緒和多程序編碼介面一致
import time


def get_html(times):
    time.sleep(times)
    print("get page {} success".format(times))
    return times


executor = ThreadPoolExecutor(max_workers=2)
# 通過submit函式提交執行的函式到執行緒池中, submit 是立即返回
task1 = executor.submit(get_html, (3,))  # 第一個引數是函式名稱，第二個引數是引數
task2 = executor.submit(get_html, (2,))  # submit的返回時是非常重要，用於判斷是否執行成功等
print(task1.done)  # 判斷任務是否完成

結果

當然task1.result()方法也是可以的，檢視task的結果

實際上我們也可以將某一個任務關閉掉，但是要注意，任務在執行中或者是執行完成時是無法取消的，只有未開始執行才會被cancel()掉

# 要獲取已經成功的task的返回
urls = [3, 2, 4]
all_task = [executor.submit(get_html, (url,)) for url in urls]

for future in as_completed(all_task):  # as_completed 實際上是一個生成器，將已經完成的返回
    data = future.result()
    print("get {} page".format(data))

這個執行結果順序是誰先完成任務誰先出來

或者

# 要獲取已經成功的task的返回
urls = [3, 2, 4]
all_task = [executor.submit(get_html, (url,)) for url in urls]
# for future in as_completed(all_task):  # as_completed 實際上是一個生成器，將已經完成的返回
#     data = future.result()
#     print("get {} page".format(data))
# 通過executor的map獲取已經完成的task的值
for data in executor.map(get_html, urls):  # map方法更加簡單
    print("get {} page".format(data))

但是這樣和上面的不一樣的是，這邊直接返回的就是結果了，也就是data = future.result()這一步被省略了

而且map方法返回的順序是列表的順序

wait 方法：（讓主執行緒進行阻塞）

# 要獲取已經成功的task的返回
urls = [3, 2, 4]
all_task = [executor.submit(get_html, (url,)) for url in urls]
wait(all_task, return_when=FIRST_COMPLETED)  # 讓主執行緒阻塞，如果沒有return_when引數 預設是等待全部任務結束放行
print("main")
# for future in as_completed(all_task):  # as_completed 實際上是一個生成器，將已經完成的返回
#     data = future.result()
#     print("get {} page".format(data))
# 通過executor的map獲取已經完成的task的值
for data in executor.map(get_html, urls):  # map方法更加簡單
    print("get {} page".format(data))

放上完整版

from concurrent.futures import ThreadPoolExecutor, as_completed, wait, FIRST_COMPLETED
from concurrent.futures import Future
from multiprocessing import Pool

# 未來物件，task的返回容器


# 執行緒池， 為什麼要執行緒池
# 主執行緒中可以獲取某一個執行緒的狀態或者某一個任務的狀態，以及返回值
# 當一個執行緒完成的時候我們主執行緒能立即知道
# futures可以讓多執行緒和多程序編碼介面一致
import time


def get_html(times):
    time.sleep(times)
    print("get page {} success".format(times))
    return times


executor = ThreadPoolExecutor(max_workers=2)
# 通過submit函式提交執行的函式到執行緒池中, submit 是立即返回
task1 = executor.submit(get_html, (3,))  # 第一個引數是函式名稱，第二個引數是引數
task2 = executor.submit(get_html, (2,))  # submit的返回時是非常重要，用於判斷是否執行成功等


# 要獲取已經成功的task的返回
urls = [3, 2, 4]
all_task = [executor.submit(get_html, (url,)) for url in urls]
wait(all_task, return_when=FIRST_COMPLETED)  # 讓主執行緒阻塞，如果沒有return_when引數 預設是等待全部任務結束放行
print("main")
# for future in as_completed(all_task):  # as_completed 實際上是一個生成器，將已經完成的返回
#     data = future.result()
#     print("get {} page".format(data))
# 通過executor的map獲取已經完成的task的值
for data in executor.map(get_html, urls):  # map方法更加簡單
    print("get {} page".format(data))


# #done方法用於判定某個任務是否完成
# print(task1.done())
# print(task2.cancel())
# time.sleep(3)
# print(task1.done())
#
# #result方法可以獲取task的執行結果
# print(task1.result())

Python多執行緒程式設計,執行緒鎖

1 2 3 from threading import Thread 4 import time 5 6 class MyThread(Thread): 7 name1 = 'MyThread-1' 8 def __init__(self,target,args

Python多執行緒程式設計,執行緒鎖,以及補充上一篇多程序文章

程序補充程序間的訊號訊號是唯一的非同步通訊方法一個程序向另一個程序傳送一個訊號來傳遞某種資訊，接受者根據傳遞的資訊來做相應的事 $ kill -l檢視系統訊號說明 $ kill -9 pid號對程序傳送訊號訊號名稱說明

Python多執行緒程式設計

#!/usr/bin/python #!coding=utf-8 import threading import time exitFlag = 0 class MyThread(threading.Thread): def __init__(self, threadID, name, counte

談談python多執行緒程式設計

談談python多執行緒程式設計 Python中GIL概念 Python(CPython)不是執行緒安全的，所以我們需要一個GIL(Global interpreter Lock)，來保證資料完性和安全性。也就是同一時間內同一核CPU中只能有一個GIL。 Threading的GI

python多執行緒程式設計之Queue---put/get 方法的阻塞

python 中，佇列是執行緒間最常用的交換資料的形式。Queue模組是提供佇列操作的模組，雖然簡單易用，但是不小心的話，還是會出現一些意外。 1. 阻塞模式導致資料汙染 import Queue q = Queue.Queue(10) for

一文學會 Python 多執行緒程式設計

import logging import threading class MyThread(threading.Thread): def __init__(self, number, logger): threading.Thread.__init__(self)

python 多執行緒程式設計（一個經典例子）

python 多執行緒經典案例（摘自《python核心程式設計》）使用佇列的資料結構，生產者生產商品，消費者選取商品，且時間均不固定 from random import randint from time import sleep from queu

python 多執行緒程式設計

import threading #當前執行緒列印執行緒名 t=threading.current_thread() print(t.name) #活動執行緒數 print(threading.active_count()) #當前主執行緒 t=threading

python多執行緒程式設計(4): 死鎖和可重入鎖

線上程間共享多個資源的時候，如果兩個執行緒分別佔有一部分資源並且同時等待對方的資源，就會造成死鎖。儘管死鎖很少發生，但一旦發生就會造成應用的停止響應。下面看一個死鎖的例子： # encoding: UTF-8import threadingimport timec

python多執行緒程式設計(3): 使用互斥鎖同步執行緒

問題的提出上一節的例子中，每個執行緒互相獨立，相互之間沒有任何關係。現在假設這樣一個例子：有一個全域性的計數num，每個執行緒獲取這個全域性的計數，根據num進行一些處理，然後將num加1。很容易寫出這樣的程式碼： # encoding: UTF-8import

[Python]多執行緒程式設計&執行緒間共享變數&消費者生產者問題的解決

由於單程序爬蟲的種種弊端，以及大量獲取資料的需要，我最近開始寫分散式爬蟲。儘管網上已經有比較現成的方案，如scrapy+rq等，但是出於種種原因考慮，比如部署的難易程度，任務比較單一，以及想自己練練手等，還是決定由自己實現儘可能多的功能。在寫的過程中，不可避

python多執行緒程式設計(8):執行緒的合併和後臺執行緒

threading import random import time class MyThread(threading.Thread): def run(self): wait_time=random.randrange(1,10) print "%s will

用Python 多程序程式設計解決python多執行緒程式設計CPU利用率低的問題

之前用python寫了個多執行緒，但發現四核的電腦，CPU利用率卻用了不到30%，後來使用多程序程式設計，四核全開，CPU利用率達到了100%！python中的多執行緒其實並不是真正的多執行緒，如果想要充分地使用多核CPU的資源，在python中大部分情況需要使用多程序。Py

python多執行緒程式設計（二）--threading模組

threading模組物件物件描述 Thread 一個執行緒的執行物件 Lock 鎖物件 RLock 可重入鎖物件，使單執行緒可以再次獲得已經獲得了的鎖（遞迴鎖定） Condition 條件變數，讓一個執行緒停下來，等待其它執行緒滿足了某個條件 Event

Python Threading 多執行緒程式設計

寫在篇前 threading模組是python多執行緒處理包，使用該模組可以很方便的實現多執行緒處理任務，本篇文章的基礎是需要掌握程序、執行緒基本概念，對PV原語、鎖等傳統同步處理方法有一定的瞭解。另外，threading模組的實現是參考java多執行緒處理方式，並且只實現了其中的一

Python實戰之多執行緒程式設計thread模組

分享一下我老師大神的人工智慧教程！零基礎，通俗易懂！http://blog.csdn.net/jiangjunshow 也歡迎大家轉載本篇文章。分享知識，造福人民，實現我們中華民族偉大復興！

風火程式設計--python多執行緒下載檔案

多執行緒下載檔案出現異常的執行緒會自動重新下載, 所有的進度會在同一位置輪換顯示 import os import datetime from urllib import request url_list = ["url1","url2","url3"

Python中的多執行緒程式設計，執行緒安全與鎖(一) 聊聊Python中的GIL 聊聊Python中的GIL python基礎之多執行緒鎖機制 python--threading多執行緒總結 Python3入門之執行緒threading常用方法

1. 多執行緒程式設計與執行緒安全相關重要概念在我的上篇博文聊聊Python中的GIL 中，我們熟悉了幾個特別重要的概念：GIL，執行緒，程序，執行緒安全，原子操作。以下是簡單回顧，詳細介紹請直接看聊聊Python中的GIL GIL:&n

Python中的多執行緒程式設計，執行緒安全與鎖(二) Python中的多執行緒程式設計，執行緒安全與鎖(一)

在我的上篇博文Python中的多執行緒程式設計，執行緒安全與鎖(一)中，我們熟悉了多執行緒程式設計與執行緒安全相關重要概念， Threading.Lock實現互斥鎖的簡單示例，兩種死鎖（迭代死鎖和互相等待死鎖）情況及處理。今天我們將聚焦於Python的Threading模組總結和執行緒同步問題。

Python實戰之多執行緒程式設計threading Thread

在Python中可以使用繼承threading.Thread類來實現多執行緒程式設計，其中子類可以重寫父類的__init__和run方法來實現使用者執行緒的邏輯，如下是一個簡單的多執行緒類實現[python] view plain copy print?import threa

python的多執行緒程式設計

相關推薦