mask rcnn實現教程

阿新 • • 發佈：2019-02-12

一，首先去github上下載mask-rcnn原始碼，這裡提供一個百度網盤地址

連結：https://pan.baidu.com/s/1htJYyNy 密碼：0r2b

二，下載對應的mask_rcnn_coco.h5模型，這裡給出百度網盤下載地址

連結：https://pan.baidu.com/s/1drKvfg 密碼：yer9

三，執行如下程式碼，根據提示安裝相應的庫

import os
import sys
import random
import math
import numpy as np
import skimage.io
import matplotlib
import matplotlib.pyplot as plt

import coco
import utils
import model as modellib
import visualize

對於pycocotools庫安裝方法如下

git clone https://github.com/pdollar/coco

cd coco/PythonAPI

將makefile中的python 改為python3

然後先執行安裝python3-dev

然後命令列輸入

make -j8

然後將pycocotools資料夾複製到mask-rcnn下

最後再

sudo pip3 install h5py

四，當編譯器不再報錯時執行如下程式

import os
import sys
import random
import math
import numpy as np
import skimage.io
import matplotlib
import matplotlib.pyplot as plt

import coco
import utils
import model as modellib
import visualize

#%matplotlib inline 

# Root directory of the project
ROOT_DIR = os.getcwd()

# Directory to save logs and trained model
MODEL_DIR = os.path.join(ROOT_DIR, "logs")

# Local path to trained weights file
COCO_MODEL_PATH =  "mask_rcnn_coco.h5"


# Directory of images to run detection on
IMAGE_DIR = os.path.join(ROOT_DIR, "images")





class InferenceConfig(coco.CocoConfig):
    # Set batch size to 1 since we'll be running inference on
    # one image at a time. Batch size = GPU_COUNT * IMAGES_PER_GPU
    GPU_COUNT = 1
    IMAGES_PER_GPU = 1

config = InferenceConfig()
config.display()





# Create model object in inference mode.
model = modellib.MaskRCNN(mode="inference", model_dir=MODEL_DIR, config=config)

# Load weights trained on MS-COCO
model.load_weights(COCO_MODEL_PATH, by_name=True)





# COCO Class names
# Index of the class in the list is its ID. For example, to get ID of
# the teddy bear class, use: class_names.index('teddy bear')
class_names = ['BG', 'person', 'bicycle', 'car', 'motorcycle', 'airplane',
               'bus', 'train', 'truck', 'boat', 'traffic light',
               'fire hydrant', 'stop sign', 'parking meter', 'bench', 'bird',
               'cat', 'dog', 'horse', 'sheep', 'cow', 'elephant', 'bear',
               'zebra', 'giraffe', 'backpack', 'umbrella', 'handbag', 'tie',
               'suitcase', 'frisbee', 'skis', 'snowboard', 'sports ball',
               'kite', 'baseball bat', 'baseball glove', 'skateboard',
               'surfboard', 'tennis racket', 'bottle', 'wine glass', 'cup',
               'fork', 'knife', 'spoon', 'bowl', 'banana', 'apple',
               'sandwich', 'orange', 'broccoli', 'carrot', 'hot dog', 'pizza',
               'donut', 'cake', 'chair', 'couch', 'potted plant', 'bed',
               'dining table', 'toilet', 'tv', 'laptop', 'mouse', 'remote',
               'keyboard', 'cell phone', 'microwave', 'oven', 'toaster',
               'sink', 'refrigerator', 'book', 'clock', 'vase', 'scissors',
               'teddy bear', 'hair drier', 'toothbrush']



# Load a random image from the images folder
file_names = next(os.walk(IMAGE_DIR))[2]
image = skimage.io.imread(os.path.join(IMAGE_DIR, random.choice(file_names)))

# Run detection
results = model.detect([image], verbose=1)

# Visualize results
r = results[0]
visualize.display_instances(image, r['rois'], r['masks'], r['class_ids'], 
                            class_names, r['scores'])
print('OK')

至此mask-rcnn完成

五，用mask-rcnn訓練自己資料

這裡提供一個最新原始碼（沒積分的留言聯絡我，我發給你的郵箱）

這裡我們主要用到原始碼提供的coco.py

首先我們去如下兩個網址下載coco資料集

http://images.cocodataset.org/zips/train2014.zip

http://images.cocodataset.org/zips/val2014.zip

接著我們下載對應的json檔案

https://dl.dropboxusercontent.com/s/o43o90bna78omob/instances_minival2014.json.zip?dl=0

https://dl.dropboxusercontent.com/s/s3tw5zcg7395368/instances_valminusminival2014.json.zip?dl=0

可能上面連結失效，這裡提供instances_minival2014.json和instances_valminusminival2014.json一個csdn的下載地址json下載

instances_train2014.json和instances_val2014.json百度雲下載連結為：

連結：https://pan.baidu.com/s/1qHoeAOULbsAFiPnBr4a8JA 密碼：fk62

將上面下載的原始碼解壓，將sample中coco/coco.py複製到Mask_RCNN-master 根目錄下，新建一個資料夾coco用來存放我們上面下載的資料圖片及json檔案

進入coco資料夾中解壓train2014.zip和val2014.zip 到當前目錄下

解壓上面的包含json檔案的zip，這裡我們只需要

instances_minival2014.json instances_train2014.json instances_val2014.json instances_valminusminival2014.json

這四個json ,在coco目錄下新建一個資料夾annotations用來存放上面的四個json,

最終目錄如下：

在home目錄存放預訓練模型mask_rcnn_coco.h5

此時我們可以回到Mask_RCNN-master目錄下，執行命令

python3 coco.py train --dataset=coco/ --model=coco

然後我們會看到如下介面

Logs處就是我們儲存訓練後的模型所在目錄

到此，我們成功開始訓練coco資料集

六，分析coco資料集

1，為了更好地分析coco資料集，這裡我們準備一個工具labelme，這是一個打標的工具

安裝方法如下：

pip3 install labelme

安裝完成之後開啟

labelme

第二幅圖就是我們自己給圖片打標註後，我們進行儲存會生成一個json檔案,開啟生成的json檔案我們可以看到標註的所有點的x,y座標

這個工具可以用來標註我們自己的資料集，然後進行訓練。

2，獲取coco標註檔案內容

coco標註檔案比較大，一個json有500M多，我們用普通的記事本是打不開的，這裡我們要用到coco官網提供的一個python API包，該api是抽象的，封裝了各裝函式用來獲取json的資料，我們分析後發現該json相當於一個字典檔案，鍵值對形式呈現。

# The following API functions are defined:
#  COCO       - COCO api class that loads COCO annotation file and prepare data structures.
#  decodeMask - Decode binary mask M encoded via run-length encoding.
#  encodeMask - Encode binary mask M using run-length encoding.
#  getAnnIds  - Get ann ids that satisfy given filter conditions.
#  getCatIds  - Get cat ids that satisfy given filter conditions.
#  getImgIds  - Get img ids that satisfy given filter conditions.
#  loadAnns   - Load anns with the specified ids.
#  loadCats   - Load cats with the specified ids.
#  loadImgs   - Load imgs with the specified ids.
#  annToMask  - Convert segmentation in an annotation to binary mask.
#  showAnns   - Display the specified annotations.
#  loadRes    - Load algorithm results and create API for accessing them.
#  download   - Download COCO images from mscoco.org server.
# Throughout the API "ann"=annotation, "cat"=category, and "img"=image.
# Help on each functions can be accessed by: "help COCO>function".

# See also COCO>decodeMask,
# COCO>encodeMask, COCO>getAnnIds, COCO>getCatIds,
# COCO>getImgIds, COCO>loadAnns, COCO>loadCats,
# COCO>loadImgs, COCO>annToMask, COCO>showAnns

from pycocotools.coco import COCO
from pycocotools.cocoeval import COCOeval
from pycocotools import mask as maskUtils
coco=COCO("pycocotools/instances_train2014.json")

3，分析coco中的segemention

我們提取其中一幅圖片的segemention用如下程式碼將其按labelme要求的json格式寫入test.txt檔案中

l = [345.28, 220.68, 348.17, 269.8, 355.4, 307.36, 377.07, 318.92, 395.85, 370.93, 444.97, 565.96, 473.86, 616.52, 478.19, 628.08, 431.96, 628.08, 401.63, 581.85, 377.07, 477.83, 375.62, 529.84, 387.18, 600.63, 397.29, 628.08, 325.06, 623.75, 216.7, 622.3, 216.7, 606.41, 251.38, 529.84, 223.93, 529.84, 209.48, 528.4, 202.26, 505.28, 193.59, 485.06, 167.58, 375.26, 179.14, 334.81, 203.7, 324.7, 229.71, 313.14, 209.48, 278.47, 193.59, 248.13, 208.04, 188.89, 223.93, 175.89, 236.93, 168.67, 258.6, 162.89, 294.72, 168.67, 310.61, 174.45, 326.5, 197.56]
l0 = []
l1 = []
l3 = []
l4 = []
for i in range(len(l)):
	if i%2==0:
		l0.append(l[i])
	else:
		l1.append(l[i])
for i in range(len(l)):
	if i%2==0:
		l3.append(l[i])
	else:
		l3.append(l[i])
		l4.append(l3)
		l3 = []
print(l0)
print(l1)
print(l4)
f = open("test.txt","w")
for e in l4:
	f.write('\n        [\n          ')
	f.write(str(e[0]))
	f.write(',\n          ')
	f.write(str(e[1]))
	f.write('\n        ],')
f.close()
a = input()

然後我們將之前labelme儲存的json檔案中的位置座標進行替換，我們得到如下圖片：

mask rcnn實現教程

一，首先去github上下載mask-rcnn原始碼，這裡提供一個百度網盤地址連結：https://pan.baidu.com/s/1htJYyNy 密碼：0r2b 二，下載對應的mask_rcnn_coco.h5模型，這裡給出百度網盤下載地址連結：https:

Mask RCNN 實現視訊和圖片中的多人姿態檢測

Mask RCNN是目標分割檢測框架--擴充套件到人體關鍵點檢測對於原理不清晰的同學，建議你去看一下Kaming He的論文:https://arxiv.org/pdf/1703.06870.pdf 我的部落格裡也有論文的翻譯版:Mask R-CNN 論文翻譯對於視訊中的多人進行姿態估計，

用自己的資料集訓練Mask-RCNN實現過程中的坑

本文僅僅是自己實現過程的筆記記錄，僅僅用來交流的。在網上大量蒐集資料後，實現Mask-RCNN，但是過程中還是出現了很多很多的問題，所以將過程記錄如下，方便日後學習。一、實驗前準備 1. COCO資料集 COCO的全稱是Common Objects in COn

Pytorch mask-rcnn 實現細節

DataLoader Dataset不能滿足需求需自定義繼承torch.utils.data.Dataset時需要override __init__, __getitem__, __len__ ，否則DataLoader匯入自定義Dataset時缺少上述函式

keras版本的Mask-RCNN裡的形狀目標檢測例子跑通教程 keras版本的Mask-RCNN裡的形狀目標檢測例子跑通教程

原 keras版本的Mask-RCNN裡的形狀目標檢測例子跑通教程 2018年06月27日 15:12:21 yangdashi888 閱讀數：279

c++/python opencv實現mask Rcnn

OpenCV中使用Mask R-CNN進行基於深度學習的物件檢測和例項分割（Python / C ++）我覺得可以嘗試一下幾個星期前，我們用YOLOv3寫了一篇關於物體檢測的文章。物件檢測器的輸出是在影象或視訊幀中檢測到的物件周圍的邊界框陣列，但我們沒有得到關於邊

谷歌開源的TensorFlow Object Detection API視頻物體識別系統實現教程

cti blog tail xiaoxiao pan clas post ont 谷歌教程：http://blog.csdn.net/xiaoxiao123jun/article/details/76605928 全部代碼：https://github.com/lyj83

RHEL6.9_Mysql5.6 for MHA 0.56 With VIP實現教程

mha安裝MHA是一套Mysql故障切換方案，來保障數據庫的高可用性，它的功能是能在0-30s之內實現主Mysql故障轉移（failover），MHA故障轉移可以很好的幫我們解決從庫數據的一致性問題，同時最大化挽回故障發生後數據的一致性。操作系統版本:Red Hat Enterprise Linux Serv

Mask RCNN 學習筆記

目標泛化插值留言筆記步長 roi 閱讀開始涉及到的知識點補充：FasterRCNN：https://www.cnblogs.com/wangyong/p/8513563.html RoIPooling、RoIAlign：https://www.cnblogs.

Mask RCNN 原理

adding 保留 rgb 固定特征添加原理尺度 obj 轉自：https://blog.csdn.net/ghw15221836342/article/details/80084861 https://blog.csdn.net/g

Mask-RCNN數據集制作

window rom ash 當前 enc 直接 clas glob 參數轉自https://blog.csdn.net/pingushen2100/article/details/80513043 一.Mask-RCNN數據集

『計算機視覺』RCNN學習_其二：Mask-RCNN

參考檢測語義 tail font 技術 src spa sta 參考資料 Mask R-CNN Mask R-CNN詳解開源代碼： Tensorflow版本代碼鏈接； Keras and TensorFlow版本代碼鏈接； MxNet版本代碼鏈接

【Mask RCNN】《Mask R-CNN》

ICCV-2017 目錄目錄 1 Motivation 2 Innovation 3 Advantages 4 Methods

學習筆記-目標檢測、定位、識別（RCNN，Fast-RCNN, Faster-RCNN，Mask-RCNN，YOLO，SSD 系列）

0. 前言說到深度學習的目標檢測，就要提到傳統的目標檢測方法。傳統的目標檢測流程： 1）區域選擇（窮舉策略：採用滑動視窗，且設定不同的大小，不同的長寬比對影象進行遍歷，時間複雜度高） 2）特徵提取（SIFT、HOG等；形態多樣性、光照變化多樣性、背景多樣性使得特徵魯棒性差）

Mask RCNN 顯微鏡下細胞檢測示例測試

參考連結： https://blog.csdn.net/l297969586/article/details/79140840/ https://cloud.tencent.com/developer/news/189753 1、首先參考細胞檢測對應連結：https://github.c

ubuntu16.04 Mask-RCNN-tf GPU demo測試

1、安裝CUDA和CUDNN 安裝詳細請參考：https://blog.csdn.net/oMoDao1/article/details/83303385 2、下載mask-rcnn原始碼： https://github.com/matterport/Mask_RCNN 3、安裝Py

mask-RCNN筆記——標註工具以及各式轉換

使用標註工具lablem 1、下載 https://github.com/wkentaro/labelme 2、安裝使用環境，ubuntu16.04，python3 進入labelme資料夾下，開啟終端，進入虛擬環境，執行程式碼 pip install pyqt5 # p

mask-RCNN筆記——inspect_data使用

mask-RCNN的使用，其中程式碼inspect_data的使用，用於dateset的讀取 1、下載coco資料庫，安裝coco工具包pycocotools https://blog.csdn.net/Diana_Z/article/details/83576598 2、新建一個*.p

mask-RCNN筆記——coco安裝及使用

mask-rcnn的coco資料集使用： coco安裝： 1、下載程式碼，clone或者直接download https://github.com/waleedka/coco 2、我使用的是python，進入目錄，並執行make 進入有makefile的資料夾在終端開啟，執

Mask Rcnn使用篇-訓練自己的資料集

首先膜拜一下何凱明大神，其實首次知道FCN做語義分割的時候，也產生過是否可以與Faster Rcnn結合的想法，不過也就那麼一個念頭閃過而已，沒具體想估計也想不明白。看了Mask Rcnn後有種豁然開朗的感覺，除了膜拜沒別的想法了。這篇只寫怎麼使用，原理後面在寫吧。必要的開發環境我就不

mask rcnn實現教程

相關推薦