OpenCV Development (2): Neural Network Usage Examples


OpenCV 3.4's neural network functionality mainly falls into the following three categories:

  1. The multi-layer perceptron (Artificial Neural Networks - Multi-Layer Perceptrons) in the ml module, which provides functions such as the following for creating and training an MLP and setting its parameters (a short Python sketch is given after this listing):

    
    static Ptr< ANN_MLP >   create ()
        Creates empty model. 
    static Ptr< ANN_MLP >   load (const String &filepath)
        Loads and creates a serialized ANN from a file. 
    void    setAnnealFinalT (double val)
    void    setAnnealInitialT (double val)
    void    setAnnealItePerStep (int val)
    virtual void    setBackpropMomentumScale (double val)=0
    virtual void    setBackpropWeightScale (double val)=0
    virtual void    setLayerSizes (InputArray _layer_sizes)=0
    virtual void    setRpropDW0 (double val)=0
    virtual void    setRpropDWMax (double val)=0
    
    enum    ActivationFunctions { 
        IDENTITY = 0, 
        SIGMOID_SYM = 1, 
        GAUSSIAN = 2, 
        RELU = 3, 
        LEAKYRELU = 4 
    }
    
    enum    TrainFlags { 
        UPDATE_WEIGHTS = 1, 
        NO_INPUT_SCALE = 2, 
        NO_OUTPUT_SCALE = 4 
    }
    enum    TrainingMethods { 
        BACKPROP =0, 
        RPROP = 1, 
        ANNEAL = 2 
    }

See the help documentation for details.
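
To see how these calls look from Python (the language used in the examples below), here is a minimal, hedged sketch that creates and configures an MLP and then serializes it with save()/ANN_MLP_load(); the training data and the file name mlp_demo.xml are made up purely for illustration:

    import cv2
    import numpy as np

    # Minimal sketch (assumed Python equivalents of the C++ API listed above).
    mlp = cv2.ml.ANN_MLP_create()                        # create(): empty model
    mlp.setLayerSizes(np.array([3, 6, 4]))               # 3 inputs, 6 hidden units, 4 outputs
    mlp.setActivationFunction(cv2.ml.ANN_MLP_SIGMOID_SYM)
    mlp.setTrainMethod(cv2.ml.ANN_MLP_RPROP)             # TrainingMethods: BACKPROP / RPROP / ANNEAL
    mlp.setRpropDW0(0.1)                                 # initial update value for RPROP
    mlp.setTermCriteria((cv2.TERM_CRITERIA_EPS | cv2.TERM_CRITERIA_COUNT, 100, 1e-4))

    # Tiny dummy training pass so that save() has weights to serialize.
    samples = np.random.rand(8, 3).astype(np.float32)
    responses = np.random.rand(8, 4).astype(np.float32)
    mlp.train(samples, cv2.ml.ROW_SAMPLE, responses)

    # save() and the static ANN_MLP_load() correspond to the serialization functions above;
    # 'mlp_demo.xml' is a hypothetical file name.
    mlp.save('mlp_demo.xml')
    mlp2 = cv2.ml.ANN_MLP_load('mlp_demo.xml')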

  2. The DNN module, which provides many classes and methods for creating and loading deep networks, setting their parameters, and importing TensorFlow, Caffe, and Torch models, such as the following (a short sketch of blobFromImages and NMSBoxes follows this listing):

    class   cv::dnn::BackendNode
        Derivatives of this class encapsulates functions of certain backends.
    class   cv::dnn::BackendWrapper
        Derivatives of this class wraps cv::Mat for different backends and targets.
    class   cv::dnn::Dict
        This class implements name-value dictionary, values are instances of DictValue.
    struct      cv::dnn::DictValue
        This struct stores the scalar value (or array) of one of the following type: double, cv::String or int64. 
    class   cv::dnn::Layer
        This interface class allows to build new Layers - are building blocks of networks.
    class   cv::dnn::LayerParams
        This class provides all data needed to initialize layer. 
    class   cv::dnn::Net
        This class allows to create and manipulate comprehensive artificial neural networks.
    Mat     cv::dnn::blobFromImages (const std::vector< Mat > &images, double scalefactor=1.0, Size size=Size(), const Scalar &mean=Scalar(), bool swapRB=true, bool crop=true)
        Creates 4-dimensional blob from series of images. Optionally resizes and crops images from center, subtract mean values, scales values by scalefactor, swap Blue and Red channels.
    void    cv::dnn::NMSBoxes (const std::vector< Rect > &bboxes, const std::vector< float > &scores, const float score_threshold, const float nms_threshold, std::vector< int > &indices, const float eta=1.f, const int top_k=0)
        Performs non maximum suppression given boxes and corresponding scores.
    
    Net     cv::dnn::readNetFromCaffe (const String &prototxt, const String &caffeModel=String())
        Reads a network model stored in Caffe framework's format.
    Net     cv::dnn::readNetFromDarknet (const String &cfgFile, const String &darknetModel=String())
        Reads a network model stored in Darknet model files.
    Net     cv::dnn::readNetFromTensorflow (const String &model, const String &config=String())
        Reads a network model stored in TensorFlow framework's format. 
    Net     cv::dnn::readNetFromTorch (const String &model, bool isBinary=true)

    See the help documentation.
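
blobFromImages and NMSBoxes from the listing above are not exercised in the examples that follow, so here is a minimal, hedged sketch of typical calls; the images, boxes, and scores are dummy values used only for illustration:

    import cv2 as cv
    import numpy as np

    # blobFromImages: pack several images into one 4-dimensional NCHW blob.
    # Positional arguments: scalefactor, size, mean, swapRB, crop.
    imgs = [np.zeros((480, 640, 3), dtype=np.uint8) for _ in range(2)]
    blob = cv.dnn.blobFromImages(imgs, 1.0, (300, 300), (104.0, 177.0, 123.0), False, False)
    print(blob.shape)  # (2, 3, 300, 300)

    # NMSBoxes: non-maximum suppression over (x, y, w, h) boxes and their scores.
    # Positional arguments: score_threshold, nms_threshold.
    boxes = [(10, 10, 100, 100), (12, 12, 100, 100), (200, 200, 80, 80)]
    scores = [0.9, 0.8, 0.7]
    indices = cv.dnn.NMSBoxes(boxes, scores, 0.5, 0.4)
    print(indices)  # indices of the boxes kept after suppression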

  3. Third-party deep network tools; see the help documentation for details.

Two examples are given below.
1. MLP-based classification. This program artificially generates data for four classes of animals, trains a model with an MLP network, and predicts the class of test samples.

    #exam1.py 
    import cv2
    import numpy as np
    from random import randint
    # Create the MLP network; set the training method, activation function, layer sizes,
    # and iteration termination criteria.
    animals_net = cv2.ml.ANN_MLP_create()
    # Note: UPDATE_WEIGHTS is a TrainFlags value (see the enum listed above); since RPROP and
    # UPDATE_WEIGHTS both equal 1, the OR below still selects the RPROP training method.
    animals_net.setTrainMethod(cv2.ml.ANN_MLP_RPROP | cv2.ml.ANN_MLP_UPDATE_WEIGHTS)
    animals_net.setActivationFunction(cv2.ml.ANN_MLP_SIGMOID_SYM)
    animals_net.setLayerSizes(np.array([3, 6, 4]))
    animals_net.setTermCriteria(( cv2.TERM_CRITERIA_EPS | cv2.TERM_CRITERIA_COUNT, 10, 1 ))
    # Generate the four classes of animal data and their class labels
    def dog_sample():
        return [randint(10, 20), 1, randint(38, 42)]
    def dog_class():
        return [1, 0, 0, 0]
    def condor_sample():
        return [randint(3,10), randint(3,5), 0]
    def condor_class():
        return [0, 1, 0, 0]
    def dolphin_sample():
        return [randint(30, 190), randint(5, 15), randint(80, 100)]
    def dolphin_class():
        return [0, 0, 1, 0]
    def dragon_sample():
        return [randint(1200, 1800), randint(30, 40), randint(160, 180)]
    def dragon_class():
        return [0, 0, 0, 1]
    # Combine an animal sample and its class label into one record (sample)
    def record(sample, classification):
        return (np.array([sample], dtype=np.float32), np.array([classification], dtype=np.float32))
    # Build the training records: 5000 rounds, one sample of each class per round
    records = []
    RECORDS = 5000
    for x in range(0, RECORDS):
        records.append(record(dog_sample(), dog_class()))
        records.append(record(condor_sample(), condor_class()))
        records.append(record(dolphin_sample(), dolphin_class()))
        records.append(record(dragon_sample(), dragon_class()))
    # Train the MLP network
    EPOCHS = 2
    for e in range(0, EPOCHS):
        print("Epoch %d:" % e)
        for t, c in records:
            animals_net.train(t, cv2.ml.ROW_SAMPLE, c)
    # Predict the class of the test samples
    TESTS = 100
    dog_results = 0
    for x in range(0, TESTS):
        clas = int(animals_net.predict(np.array([dog_sample()], dtype=np.float32))[0])
        print("class: %d" % clas)
        if (clas) == 0:
            dog_results += 1
    condor_results = 0
    for x in range(0, TESTS):
        clas = int(animals_net.predict(np.array([condor_sample()], dtype=np.float32))[0])
        print("class: %d" % clas)
        if (clas) == 1:
            condor_results += 1
    dolphin_results = 0
    for x in range(0, TESTS):
        clas = int(animals_net.predict(np.array([dolphin_sample()], dtype=np.float32))[0])
        print("class: %d" % clas)
        if (clas) == 2:
            dolphin_results += 1
    dragon_results = 0
    for x in range(0, TESTS):
        clas = int(animals_net.predict(np.array([dragon_sample()], dtype=np.float32))[0])
        print("class: %d" % clas)
        if (clas) == 3:
            dragon_results += 1
    # Print the test accuracy (TESTS == 100, so each count is also a percentage)
    print("Dog accuracy: %f%%" % (dog_results))
    print("condor accuracy: %f%%" % (condor_results))
    print("dolphin accuracy: %f%%" % (dolphin_results))
    print("dragon accuracy: %f%%" % (dragon_results))

2. DNN-based detection. This program loads a pre-trained Caffe model and detects faces in frames captured from the camera.

import numpy as np
# If an ImportError occurs, configure the PYTHONPATH environment variable so that it points
# to the directory containing the OpenCV Python module (see the message below).
# If that does not resolve the problem, update the relevant packages (or uninstall and reinstall them).
try:
    import cv2 as cv
except ImportError:
    raise ImportError('Can\'t find OpenCV Python module. If you\'ve built it from sources without installation, '
                      'configure environment variable PYTHONPATH to "opencv_build_dir/lib" directory (with "python3" subdirectory if required)')
# Import the DNN module
from cv2 import dnn
inWidth = 300
inHeight = 300
confThreshold = 0.5
# deploy.prototxt ships with OpenCV 3.4 in the opencv3.4\sources\samples\dnn\face_detector
# directory (relative to the OpenCV 3.4 download or installation directory).
prototxt = 'face_detector/deploy.prototxt'
# The Caffe model file must be downloaded separately; see the text file in the
# opencv3.4\sources\samples\dnn\face_detector directory for instructions.
caffemodel = 'face_detector/res10_300x300_ssd_iter_140000.caffemodel'
# Load the Caffe model and capture frames from the camera
if __name__ == '__main__':
    net = dnn.readNetFromCaffe(prototxt, caffemodel)
    cap = cv.VideoCapture(0)
    while True:
        ret, frame = cap.read()
        if not ret:
            break
        cols = frame.shape[1]
        rows = frame.shape[0]
        # Set the captured frame as the network input, run a forward pass, and detect faces
        net.setInput(dnn.blobFromImage(frame, 1.0, (inWidth, inHeight), (104.0, 177.0, 123.0), False, False))
        detections = net.forward()
        perf_stats = net.getPerfProfile()
        print('Inference time, ms: %.2f' % (perf_stats[0] / cv.getTickFrequency() * 1000))
        for i in range(detections.shape[2]):
            confidence = detections[0, 0, i, 2]
            if confidence > confThreshold:
                xLeftBottom = int(detections[0, 0, i, 3] * cols)
                yLeftBottom = int(detections[0, 0, i, 4] * rows)
                xRightTop = int(detections[0, 0, i, 5] * cols)
                yRightTop = int(detections[0, 0, i, 6] * rows)
                cv.rectangle(frame, (xLeftBottom, yLeftBottom), (xRightTop, yRightTop),
                             (0, 255, 0))
                label = "face: %.4f" % confidence
                labelSize, baseLine = cv.getTextSize(label, cv.FONT_HERSHEY_SIMPLEX, 0.5, 1)
                cv.rectangle(frame, (xLeftBottom, yLeftBottom - labelSize[1]),
                                    (xLeftBottom + labelSize[0], yLeftBottom + baseLine),
                                    (255, 255, 255), cv.FILLED)
                cv.putText(frame, label, (xLeftBottom, yLeftBottom),
                           cv.FONT_HERSHEY_SIMPLEX, 0.5, (0, 0, 0))
        cv.imshow("detections", frame)
        if cv.waitKey(1) != -1:
            break
    cap.release()
    cv.destroyAllWindows()
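
If no camera is available, the same detector can be run on a single image file; the sketch below is a minimal variant of the script above, with test.jpg as a hypothetical input path and the same prototxt/caffemodel paths:

    import cv2 as cv

    net = cv.dnn.readNetFromCaffe('face_detector/deploy.prototxt',
                                  'face_detector/res10_300x300_ssd_iter_140000.caffemodel')
    frame = cv.imread('test.jpg')                     # hypothetical input image
    rows, cols = frame.shape[:2]
    net.setInput(cv.dnn.blobFromImage(frame, 1.0, (300, 300), (104.0, 177.0, 123.0), False, False))
    detections = net.forward()                        # shape (1, 1, N, 7)
    for i in range(detections.shape[2]):
        confidence = detections[0, 0, i, 2]
        if confidence > 0.5:
            x1 = int(detections[0, 0, i, 3] * cols)
            y1 = int(detections[0, 0, i, 4] * rows)
            x2 = int(detections[0, 0, i, 5] * cols)
            y2 = int(detections[0, 0, i, 6] * rows)
            cv.rectangle(frame, (x1, y1), (x2, y2), (0, 255, 0))
    cv.imwrite('detections.jpg', frame)               # write the annotated result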
