使用imgaug--python影象資料增強庫進行Bounding Boxes影像增強

阿新 • • 發佈：2018-11-25

使用imgaug影象資料增強庫進行Bounding Boxes影像增強

簡介
imgaug安裝
Bounding Boxes實現

讀取原影像bounding boxes座標
生成變換後的bounding boxe座標檔案
生成變換序列
bounding box 變化後坐標計算

使用示例

資料準備
設定檔案路徑
設定增強次數
設定增強引數
輸出

簡介

相較於Augmentor，imgaug具有更多的功能，比如對影像增強的同時，對keypoint, bounding box進行相應的變換。例如在目標檢測的過程中，訓練集包括影像及其對應的bounding box檔案，在對影像增強的時候，同時解算出bounding box 相應變換的座標生成對應的bounding box檔案。

程式碼

imgaug安裝

imgaug使用文件

安裝依賴庫

pip install six numpy scipy matplotlib scikit-image opencv-python imageio

安裝imgaug

方式一（安裝github最新版本）：

pip install　git+https://github.com/aleju/imgaug

方式二（安裝pypi版本）：

pip install imgaug

Bounding Boxes實現

讀取原影像bounding boxes座標

讀取xml檔案並使用ElementTree對xml檔案進行解析，找到每個object的座標值。

def read_xml_annotation(root, image_id):
    in_file = open(os.path.join(root, image_id))
    tree = ET.parse(in_file)
    root = tree.getroot()
    bndboxlist = []

    for object in root.findall('object'):  # 找到root節點下的所有object節點
        bndbox = object.find('bndbox')  # 子節點下節點rank的值

        xmin = 
 int(bndbox.find('xmin').text)
        xmax = int(bndbox.find('xmax').text)
        ymin = int(bndbox.find('ymin').text)
        ymax = int(bndbox.find('ymax').text)
        # print(xmin,ymin,xmax,ymax)
        bndboxlist.append([xmin,ymin,xmax,ymax])
        # print(bndboxlist)

    bndbox = root.find('object').find('bndbox')
    return bndboxlist

生成變換後的bounding boxe座標檔案

傳入目標變換後的bounding boxe座標，將原座標替換成新座標並生成新的xml檔案。

def change_xml_list_annotation(root, image_id, new_target,saveroot,id):

    in_file = open(os.path.join(root, str(image_id) + '.xml'))  # 這裡root分別由兩個意思
    tree = ET.parse(in_file)
    xmlroot = tree.getroot()
    index = 0

    for object in xmlroot.findall('object'):  # 找到root節點下的所有country節點
        bndbox = object.find('bndbox')  # 子節點下節點rank的值

        new_xmin = new_target[index][0]
        new_ymin = new_target[index][1]
        new_xmax = new_target[index][2]
        new_ymax = new_target[index][3]

        xmin = bndbox.find('xmin')
        xmin.text = str(new_xmin)
        ymin = bndbox.find('ymin')
        ymin.text = str(new_ymin)
        xmax = bndbox.find('xmax')
        xmax.text = str(new_xmax)
        ymax = bndbox.find('ymax')
        ymax.text = str(new_ymax)

        index = index + 1

    tree.write(os.path.join(saveroot, str(image_id) + "_aug_" + str(id) + '.xml'))

生成變換序列

產生一個處理圖片的Sequential。

seq = iaa.Sequential([
        iaa.Flipud(0.5),  # vertically flip 20% of all images
        iaa.Fliplr(0.5),  # 映象
        iaa.Multiply((1.2, 1.5)),  # change brightness, doesn't affect BBs
        iaa.GaussianBlur(sigma=(0, 3.0)),
        # iaa.GaussianBlur(0.5),
        iaa.Affine(
            translate_px={"x": 15, "y": 15},
            scale=(0.8, 0.95),
            rotate=(-30, 30)
        )  # translate by 40/60px on x/y axis, and scale to 50-70%, affects BBs
    ])

bounding box 變化後坐標計算

先讀取該影像對應xml檔案，獲取所有目標的bounding boxes，然後依次計算每個box變化後的座標。

bndbox = read_xml_annotation(XML_DIR, name)
for epoch in range(AUGLOOP):
    seq_det = seq.to_deterministic()  # 保持座標和影象同步改變，而不是隨機

    # 讀取圖片
    img = Image.open(os.path.join(IMG_DIR, name[:-4] + '.jpg'))
    img = np.array(img)

    # bndbox 座標增強
    for i in range(len(bndbox)):
        bbs = ia.BoundingBoxesOnImage([
        ia.BoundingBox(x1=bndbox[i][0], y1=bndbox[i][1], x2=bndbox[i][2], y2=bndbox[i][3]),
        ], shape=img.shape)

        bbs_aug = seq_det.augment_bounding_boxes([bbs])[0]
        boxes_img_aug_list.append(bbs_aug)

使用示例

資料準備

輸入資料為兩個資料夾一個是需要增強的影像資料（JPEGImages），一個是對應的xml檔案（Annotations）。注意：影像檔名需和xml檔名相對應！

Annotations

JPEGImages

設定檔案路徑

    IMG_DIR = "./dataset/JPEGImages" #輸入的影像資料夾路徑
    XML_DIR = "./dataset/Annotations" # 輸入的XML資料夾路徑


    AUG_XML_DIR = "./dataset/AUG_XML" # 儲存增強後的XML資料夾路徑
    mkdir(AUG_XML_DIR)

    AUG_IMG_DIR = "./dataset/AUG_IMG" # 儲存增強後的影像資料夾路徑
    mkdir(AUG_IMG_DIR)

設定增強次數

    AUGLOOP = 10 # 每張影像增強的數量

設定增強引數

通過修改Sequential函式引數進行設定，具體設定參考imgaug使用文件

seq = iaa.Sequential([
        iaa.Flipud(0.5),  # vertically flip 50% of all images
        iaa.Fliplr(0.5),  # 映象
        iaa.Multiply((1.2, 1.5)),  # change brightness, doesn't affect BBs
        iaa.GaussianBlur(sigma=(0, 0.5)),
         # iaa.GaussianBlur(0.5),
        iaa.Affine(
            translate_px={"x": 15, "y": 15},
            scale=(0.8, 0.95),
            rotate=(-30, 30)
        )  
    ])

輸出

執行XMLaug.py ，執行結束後即可得到增強的影像和對應的xml資料夾
在這裡插入圖片描述

在這裡插入圖片描述

使用imgaug--python影象資料增強庫進行Bounding Boxes影像增強

使用imgaug影象資料增強庫進行Bounding Boxes影像增強簡介 imgaug安裝 Bounding Boxes實現讀取原影像bounding boxes座標生成變換後的bounding boxe座標檔案

Python大資料處理庫PySpark實戰

https://cloud.tencent.com/developer/article/1096712 Spark的安裝和使用(Python版) http://dblab.xmu.edu.cn/blog/1689-2/ https://blog.csdn.net/qq_14959801/

python常用資料處理庫的安裝（numpy pandas matplotlib）

這篇文章記錄的不錯，轉載一把https://www.cnblogs.com/lxmhhy/p/6029465.htmlpip install matplotlib -i http://pypi.douban.com/simple --trusted-host pypi.dou

機器視覺 OpenCV—python 影象資料集獲取工具（視訊取幀）

一、前言之前在做影象分類的時候，人臉識別（開原始碼）的練手，資料集獲取麻煩（沒人願意將自己照片給人家做資料集），於是就用自己造資料集，但是拍照拍幾百張訓練效果不好，也嫌麻煩，乾脆就是視訊取幀的方式，在這之前使用專門的軟體。不過opencv自帶了視訊處理的API

python地理資料處理庫geopy

python地理位置處理python地理編碼地址以及用來處理經緯度的庫GeoDjango – 世界級地理圖形 web 框架。GeoIP – MaxMind GeoIP Legacy 資料庫的Python API。geojson – GeoJSON 的 Python 繫結及工具

Ubuntu16.04安裝Python的資料分析庫numpy，pandas，scipy,matplotlib

1. 安裝依賴庫 sudo apt-get install python-dev 2. 使用pip方式安裝 sudo pip install numpy sudo pip install scipy sudo pip install pandas sudo pi

深度學習之資料增強庫imgaug使用方法

在上一篇文章中，介紹了常用的資料增強的方法，並提到了實現這些方法的一個庫imgaug，這篇文章就對該庫的使用方法進行一個總結。 1 介紹 imgaug是一個用於機器學習實驗中影象增強的python庫，支援python2.7和3.4以上的版本。它支援多種增強技術，允許輕鬆組合這些技術，具

深度學習資料增廣庫imgaug——Bounding Boxes變換

imgaug在影象變換的同時變換影象中的bound box。 bounding的支援包括: 將bounding box封裝成物件對bounding box進行變換將bounding box畫在影象上移動bounding box的位置,將變換後的bounding

Win7下安裝Python影象處理庫PIL、pytesser、tesseract進行驗證碼識別

前言今天看見一個關於Python進行驗證碼識別的文章，其中程式碼很短，但是感覺很有趣，加上最近也在學習一些簡單的Python知識，所以決定實驗一下準備工作 PIL版本選擇從網上搜索得知，PIL官方只有32位的安裝檔案，安裝時會提示找不到py

python 讀寫txt文件並用jieba庫進行中文分詞

mage 亂碼技術分享流行 ictclas 函數結果 class 配置 python用來批量處理一些數據的第一步吧。對於我這樣的的萌新。這是第一步。 #encoding=utf-8 file=‘test.txt‘ fn=open(file,"r") print f

Python對數據庫進行操作

服務器ip connector pass 校驗環境 execute odi man commit 步驟一：安裝好python開發環境步驟二：確定數據庫類型，下載響應的數據庫jar包，將數據庫jar包放入Python的包目錄。（百度解決）步驟三：編寫Python腳本，下例

Python影象處理庫PIL中影象格式轉換（二）

參考：https://blog.csdn.net/icamera0/article/details/50843196?utm_source=blogxgwz0 接上一篇《Python影象處理庫PIL中影象格式轉換（一）》二、其他不同模式轉換為“RGB”模式模式“RGB”為24位彩色影

影象資料增強實戰

by 小韓 (翻譯自：https://towardsdatascience.com/image-augmentation-examples-in-python-d552c26f2873) 我目前正在做影象資料增強的深度和有效性的研究。這項研究的目的是學習怎樣增加只有有限或少

影象資料增強的若干方法

影象分類的資料集非常大。儘管如此，依然需要資料增強來提高模型泛化能力。資料增強一般包括重新縮放影象的隨機裁剪、隨機水平翻轉、隨機 RGB 顏色與亮度變換等技術。此外，也存在不同的縮放、裁剪等技術（即單尺度訓練 vs 多尺度訓練）。在測試階段進行多裁剪評估也是經常使用的途徑，不過該方案的計

初探：Python中使用request和BeautifulSoup庫進行網路爬蟲

說起網路爬蟲，Python中最底層的應該是urllib，但是語法結構有些繁瑣，需要使用正則。而使用request和BeautifulSoup庫進行網路爬蟲，發現這真的是web開發人員的福音。凡是懂一些前端知識的人來說，使用request和BeautifulSoup庫進行爬蟲，真的有一種開心而愉快

Python 資料處理庫 pandas 入門教程

Python 資料處理庫 pandas 入門教程2018/04/17 · 工具與框架 · Pandas, Python 原文出處：強波的技術部落格 pandas是一個Python語言的軟體包，在我們使用Python語言進行機器學習程式設計的時候，這是一個非常常用的基礎程式設計庫。本文是對它的一個入門教程。p

如何使用python對資料夾中的檔案進行批量改名（增、刪、改字串欄位）

【時間】2018.10.12 【題目】如何使用python對資料夾中的檔案進行批量改名（增、刪、改字串欄位）【問題描述】今天需要對資料夾中的檔案進行批量改名，主要是因為名字中多出了自己不想要的字元段“data”想要將其刪除。這裡便以刪除名字中的字元段為例，至於增、改道理類

AugGAN：基於GAN的影象資料增強

資料增強方法無疑是需要重點研究的基本任務之一，因為我們的主流深度學習演算法還是一個有監督過程。臺灣國立清華大學在ECCV2018發表了一篇AugGAN開始把GAN用在資料增強方面了，當然，這並不是這個領域的第一篇。不過很具有參考意義，也很能解決實際問題。所以特地寫一個blog研究一番。讀本文需

python: c_char_p指向的bitmap影象資料，通過c_char_Array最終賦值給PIL的Image物件

def GetCurrentImage(self): ok, bitmap, buff_len = self.GetCurrentFrameBitmap() #呼叫C函式，返回點陣圖資料的指標. bitmap是c_char_p型別 if not ok:

學習OpenCV-Python——影象增強

影象增強影象增強可以分為兩種：領域處理技術。對畫素點及其周圍的點進行處理，即使用卷積核。點處理技術。只對單個畫素進行處理。歸一化 cv2.normalize(src, dst, alpha, beta, norm_type, dtype, mask

使用imgaug--python影象資料增強庫進行Bounding Boxes影像增強

使用imgaug影象資料增強庫進行Bounding Boxes影像增強

簡介

imgaug安裝

Bounding Boxes實現

讀取原影像bounding boxes座標

生成變換後的bounding boxe座標檔案

生成變換序列

bounding box 變化後坐標計算

使用示例

資料準備

設定檔案路徑

設定增強次數

設定增強引數

輸出

相關推薦