[python] Tiling a high-resolution image for small-object detection and remapping the bounding-box coordinates

阿新 • Published: 2018-12-21

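In small-object detection it is often impractical to feed an entire high-resolution image to the detector. This is especially true for one-stage methods: SSD takes inputs of size 300 or 512 and YOLO takes 416, whereas high-resolution images are typically several thousand pixels on each side. The script below therefore splits a high-resolution image into tiles and, for each tile, writes the bounding boxes of the objects it contains to an xml file, with the coordinates shifted into the tile's own frame.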
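
The core idea can be sketched in a few lines: a box that lies entirely inside a tile keeps its size, and its coordinates are shifted by the tile's top-left corner. The function name `shift_box_to_tile` and the tuple layout are assumptions made for illustration; they do not appear in the script itself.

```python
def shift_box_to_tile(box, c, r, win_size):
    """Return the box in tile-local coordinates, or None if it is not fully inside.

    box is (xmin, ymin, xmax, ymax) in full-image pixels; (c, r) is the
    tile's top-left corner (column, row); the tile is win_size x win_size.
    """
    xmin, ymin, xmax, ymax = box
    inside = (xmin >= c and ymin >= r and
              xmax <= c + win_size and ymax <= r + win_size)
    if not inside:
        return None
    # Subtract the tile origin so (0, 0) is the tile's top-left pixel.
    return (xmin - c, ymin - r, xmax - c, ymax - r)


print(shift_box_to_tile((300, 400, 340, 450), c=256, r=384, win_size=256))
# (44, 16, 84, 66)
print(shift_box_to_tile((250, 400, 340, 450), c=256, r=384, win_size=256))
# None
```

The second call returns `None` because the box starts at x=250, to the left of the tile that begins at x=256.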

# Only the modules the script actually uses are imported.
import os

import cv2
import numpy as np
import xml.etree.ElementTree as ET
from xml.dom.minidom import Document
from tqdm import tqdm
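origin_dir = 'path/to/original/images'       # directory of the original images
target_dir1 = 'path/to/tile/images'          # output directory for the image tiles
annota_dir = 'path/to/original/annotations'  # directory of the original bounding-box xml files
target_dir2 = 'path/to/tile/annotations'     # output directory for the tile xml files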
def clip_img(No, oriname):
    from_name = os.path.join(origin_dir, oriname+'.jpg')
    img = cv2.imread(from_name)
    h_ori,w_ori, _ =img.shape  # remember the size of the original image
    img = cv2.resize(img, (2048, 2048))  # resizing is optional; adjust to your data
    h, w, _ = img.shape
    xml_name = os.path.join(annota_dir, oriname+'.xml')  # annotation file of this image
    xml_ori = ET.parse(xml_name).getroot()
    res = np.empty((0,5))  # one row per object: four box coordinates plus the class name
    for obj in xml_ori.iter('object'):
        difficult = int(obj.find('difficult').text) == 1
        if difficult:
            continue
        name = obj.find('name').text.lower().strip()
        bbox = obj.find('bndbox')
        pts = ['xmin', 'ymin', 'xmax', 'ymax']
        bndbox = []
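        for i, pt in enumerate(pts):
            # shift to 0-based pixel indices (VOC-style annotations are 1-based)
            cur_pt = int(bbox.find(pt).text) - 1
            # rescale to the resized image: y values (odd index) by height, x values by width
            cur_pt = int(cur_pt*h/h_ori) if i%2==1 else int(cur_pt * w / w_ori)
            bndbox.append(cur_pt)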
        #label_idx = self.class_to_ind[name]
        bndbox.append(name)
        res = np.vstack((res, bndbox))
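    i = 0  # running index of the tiles written for this image
    win_size = 256  # tile size in pixels
    stride = 128  # step between tiles; a stride smaller than win_size makes the tiles overlap
    for r in range(0, h - win_size + 1, stride):  # +1 so the last row of tiles is included
        for c in range(0, w - win_size + 1, stride):
            flag = np.zeros([1, res.shape[0]])  # flag[0][k] == 1 if object k lies fully inside this tile
            youwu = False  # True once the tile contains at least one whole object
            xiefou = True  # False if some object is cut by the tile border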
            tmp = img[r: r+win_size, c: c+win_size]
            for re in range(res.shape[0]):
                xmin,ymin,xmax,ymax,label = res[re]
                if int(xmin)>=c and int(xmax) <=c+win_size and int(ymin)>=r and int(ymax)<=r+win_size:
                    # the box lies entirely inside the tile
                    flag[0][re] = 1
                    youwu = True
                elif int(xmax)<=c or int(xmin)>=c+win_size or int(ymax)<=r or int(ymin)>=r+win_size:
                    # the box lies entirely outside the tile
                    pass
                else:
                    # the box is cut by the tile border
                    xiefou = False
                    break
            if xiefou:  # skip the tile if any object is cut by its border
                if youwu:  # write the xml file only if the tile contains an object
                    doc = Document()
                    annotation = doc.createElement('annotation')
                    doc.appendChild(annotation)
                    for re in range(res.shape[0]):
                        xmin,ymin,xmax,ymax,label = res[re]
                        xmin=int(xmin)
                        ymin=int(ymin)
                        xmax=int(xmax)
                        ymax=int(ymax)
                        if flag[0][re] == 1:
                            xmin=str(xmin-c)
                            ymin=str(ymin-r)
                            xmax=str(xmax-c)
                            ymax=str(ymax-r)
                            object_charu = doc.createElement('object')
                            annotation.appendChild(object_charu)
                            name_charu = doc.createElement('name')
                            name_charu_text = doc.createTextNode(label)
                            name_charu.appendChild(name_charu_text)
                            object_charu.appendChild(name_charu)
                            dif = doc.createElement('difficult')
                            dif_text = doc.createTextNode('0')
                            dif.appendChild(dif_text)
                            object_charu.appendChild(dif)
                            bndbox = doc.createElement('bndbox')
                            object_charu.appendChild(bndbox)
                            xmin1 = doc.createElement('xmin')
                            xmin_text = doc.createTextNode(xmin)
                            xmin1.appendChild(xmin_text)
                            bndbox.appendChild(xmin1)
                            ymin1 = doc.createElement('ymin')
                            ymin_text = doc.createTextNode(ymin)
                            ymin1.appendChild(ymin_text)
                            bndbox.appendChild(ymin1)
                            xmax1 = doc.createElement('xmax')
                            xmax_text = doc.createTextNode(xmax)
                            xmax1.appendChild(xmax_text)
                            bndbox.appendChild(xmax1)
                            ymax1 = doc.createElement('ymax')
                            ymax_text = doc.createTextNode(ymax)
                            ymax1.appendChild(ymax_text)
                            bndbox.appendChild(ymax1)
                        else:
                            continue
                    # zero-padded index; '%3d' would pad with spaces and put blanks in the file name
                    xml_name = oriname+'_%03d.xml' % (i)
                    to_xml_name = os.path.join(target_dir2, xml_name)
                    with open(to_xml_name, 'wb+') as f:
                        f.write(doc.toprettyxml(indent="\t", encoding='utf-8'))
                    img_name = oriname+'_%03d.jpg' %(i)
                    to_name = os.path.join(target_dir1, img_name)
                    i = i+1
                    cv2.imwrite(to_name, tmp)
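for No, name in tqdm(enumerate(os.listdir(origin_dir))):
    if not name.endswith('.jpg'):
        continue  # clip_img appends '.jpg', so other files are skipped
    # splitext removes only the extension; rstrip('.jpg') would also strip trailing '.', 'j', 'p', 'g' characters from the name
    clip_img(No, os.path.splitext(name)[0])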

This splits a large image into tiles and saves the corresponding box coordinates as xml files.
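
When tiling with a fixed stride, the windows reach the right and bottom borders only if `size - win_size` is a multiple of the stride and the loop bound includes that last offset. For other image sizes, one common workaround is to add an extra window flush with the border. The helper below is a sketch of that idea; `tile_origins` is a hypothetical name and is not part of the script.

```python
def tile_origins(size, win_size, stride):
    """Start offsets of windows along one axis so that they cover [0, size)."""
    if size <= win_size:
        return [0]
    origins = list(range(0, size - win_size + 1, stride))
    # Add a final window aligned with the border if the stride did not land on it.
    if origins[-1] != size - win_size:
        origins.append(size - win_size)
    return origins


print(len(tile_origins(2048, 256, 128)))  # 15
print(tile_origins(600, 256, 128))        # [0, 128, 256, 344]
```

For a 2048×2048 image with 256-pixel tiles and a stride of 128, this gives 15 offsets per axis, that is 225 candidate tiles.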
Note that the xml tag names depend on your dataset; the ones used here are not universal.
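
Because the tag names vary between datasets, it can help to make them parameters and to tolerate missing optional tags. The following is a minimal sketch under those assumptions; `read_boxes` and its parameters `object_tag` and `box_tag` are hypothetical names, and a missing `difficult` tag is treated as 0.

```python
import xml.etree.ElementTree as ET


def read_boxes(root, object_tag='object', box_tag='bndbox'):
    """Collect (xmin, ymin, xmax, ymax, name) tuples from an annotation root."""
    boxes = []
    for obj in root.iter(object_tag):
        # findtext returns None when the tag is missing; treat that as "not difficult".
        if int(obj.findtext('difficult') or '0') == 1:
            continue
        name = (obj.findtext('name') or '').strip().lower()
        bbox = obj.find(box_tag)
        if bbox is None:
            continue  # object without a box: nothing to tile
        coords = [int(float(bbox.findtext(k))) for k in ('xmin', 'ymin', 'xmax', 'ymax')]
        boxes.append((*coords, name))
    return boxes


sample = ET.fromstring(
    '<annotation><object><name> Car </name>'
    '<bndbox><xmin>10</xmin><ymin>20</ymin><xmax>30</xmax><ymax>40</ymax></bndbox>'
    '</object></annotation>'
)
print(read_boxes(sample))  # [(10, 20, 30, 40, 'car')]
```

If your annotations use different names for the object or box elements, pass them through `object_tag` and `box_tag` instead of editing the loop.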
