Object Detection （5）Faster RCNN Keras 釋出為api

阿新 • • 發佈：2018-11-11

本文基於git專案做二次開發：

改造後git地址：https://github.com/xvshu/keras-frcnn-web

原git地址:https://github.com/yhenon/keras-frcnn

1，改造檢測類

思路:從拿到img開始截斷，釋出為一個方法，引數為img

在test-frcnn.py中，新增程式碼:

def tf_fit_img(img):
	X, ratio = format_img(img, C)

	if K.image_dim_ordering() == 'tf':
		X = np.transpose(X, (0, 2, 3, 1))

	# get the feature maps and output from the RPN
	print(X)
	[Y1, Y2, F] = model_rpn.predict(X)


	R = roi_helpers.rpn_to_roi(Y1, Y2, C, K.image_dim_ordering(), overlap_thresh=0.7)

	# convert from (x1,y1,x2,y2) to (x,y,w,h)
	R[:, 2] -= R[:, 0]
	R[:, 3] -= R[:, 1]

	# apply the spatial pyramid pooling to the proposed regions
	bboxes = {}
	probs = {}

	for jk in range(R.shape[0]//C.num_rois + 1):
		ROIs = np.expand_dims(R[C.num_rois*jk:C.num_rois*(jk+1), :], axis=0)
		if ROIs.shape[1] == 0:
			break

		if jk == R.shape[0]//C.num_rois:
			#pad R
			curr_shape = ROIs.shape
			target_shape = (curr_shape[0],C.num_rois,curr_shape[2])
			ROIs_padded = np.zeros(target_shape).astype(ROIs.dtype)
			ROIs_padded[:, :curr_shape[1], :] = ROIs
			ROIs_padded[0, curr_shape[1]:, :] = ROIs[0, 0, :]
			ROIs = ROIs_padded

		[P_cls, P_regr] = model_classifier_only.predict([F, ROIs])

		for ii in range(P_cls.shape[1]):

			if np.max(P_cls[0, ii, :]) < bbox_threshold or np.argmax(P_cls[0, ii, :]) == (P_cls.shape[2] - 1):
				continue

			cls_name = class_mapping[np.argmax(P_cls[0, ii, :])]

			if cls_name not in bboxes:
				bboxes[cls_name] = []
				probs[cls_name] = []

			(x, y, w, h) = ROIs[0, ii, :]

			cls_num = np.argmax(P_cls[0, ii, :])
			try:
				(tx, ty, tw, th) = P_regr[0, ii, 4*cls_num:4*(cls_num+1)]
				tx /= C.classifier_regr_std[0]
				ty /= C.classifier_regr_std[1]
				tw /= C.classifier_regr_std[2]
				th /= C.classifier_regr_std[3]
				x, y, w, h = roi_helpers.apply_regr(x, y, w, h, tx, ty, tw, th)
			except:
				pass
			bboxes[cls_name].append([C.rpn_stride*x, C.rpn_stride*y, C.rpn_stride*(x+w), C.rpn_stride*(y+h)])
			probs[cls_name].append(np.max(P_cls[0, ii, :]))

	all_dets = []

	for key in bboxes:
		bbox = np.array(bboxes[key])

		new_boxes, new_probs = roi_helpers.non_max_suppression_fast(bbox, np.array(probs[key]), overlap_thresh=0.5)
		for jk in range(new_boxes.shape[0]):
			(x1, y1, x2, y2) = new_boxes[jk,:]

			(real_x1, real_y1, real_x2, real_y2) = get_real_coordinates(ratio, x1, y1, x2, y2)

			cv2.rectangle(img,(real_x1, real_y1), (real_x2, real_y2), (int(class_to_color[key][0]), int(class_to_color[key][1]), int(class_to_color[key][2])),2)

			textLabel = '{}: {}'.format(key,int(100*new_probs[jk]))
			all_dets.append((key,100*new_probs[jk]))

			(retval,baseLine) = cv2.getTextSize(textLabel,cv2.FONT_HERSHEY_COMPLEX,1,1)
			textOrg = (real_x1, real_y1-0)

			cv2.rectangle(img, (textOrg[0] - 5, textOrg[1]+baseLine - 5), (textOrg[0]+retval[0] + 5, textOrg[1]-retval[1] - 5), (0, 0, 0), 2)
			cv2.rectangle(img, (textOrg[0] - 5,textOrg[1]+baseLine - 5), (textOrg[0]+retval[0] + 5, textOrg[1]-retval[1] - 5), (255, 255, 255), -1)
			cv2.putText(img, textLabel, textOrg, cv2.FONT_HERSHEY_DUPLEX, 1, (0, 0, 0), 1)

	# print('Elapsed time = {}'.format(time.time() - st))
	print(all_dets)
	return json.dumps(all_dets)

2，釋出httpapi介面

新增frcnn_api.py

from flask import Flask, jsonify, abort, make_response, request, url_for,render_template
from flask_httpauth import HTTPBasicAuth
import test_frcnn
import os
from werkzeug.utils import secure_filename
import cv2
from scipy import misc

app = Flask(__name__)
# 圖片最大為16M
app.config['MAX_CONTENT_LENGTH'] = 16 * 1024 * 1024
auth = HTTPBasicAuth()

#設定post請求中獲取的圖片儲存的路徑
UPLOAD_FOLDER = 'pic_tmp/'
if not os.path.exists(UPLOAD_FOLDER):
    os.makedirs(UPLOAD_FOLDER)
else:
    pass
ALLOWED_EXTENSIONS = set(['png', 'jpg', 'jpeg'])
app.config['UPLOAD_FOLDER'] = UPLOAD_FOLDER


@app.route('/')
def index():
    return render_template("img_fit.html")

@app.route('/img/fit', methods=['POST'])
def face_insert():
    #分別獲取post請求中的圖片資訊
    upload_files = request.files['imagefile']
    #從post請求圖片儲存到本地路徑中
    file = upload_files
    filename = secure_filename(file.filename)
    file.save(os.path.join(app.config['UPLOAD_FOLDER'], filename))
    image_path = os.path.join(app.config['UPLOAD_FOLDER'], filename)
    print(image_path)
    img = cv2.imread(os.path.expanduser(image_path))
    # img = misc.imread(os.path.expanduser(image_path), mode='RGB')

    return test_frcnn.tf_fit_img(img)


@auth.get_password
def get_password(username):
    if username == 'root':
        return 'root'
    return None


@auth.error_handler
def unauthorized():
    return make_response(jsonify({'error': 'Unauthorized access'}), 401)


@app.errorhandler(400)
def not_found(error):
    return make_response(jsonify({'error': 'Invalid data!'}), 400)

if __name__ == '__main__':
    app.run(host='172.30.53.250', port=8099)

3，開發頁面，測試

測試頁面：

img_fit.html

<!DOCTYPE html>
<html lang="en">

</body>
</html>

<!DOCTYPE html>
<html lang="en">
<head>
    <meta charset="UTF-8">
    <title>可樂識別</title>
    <link rel="stylesheet" href="/static/css/user_form.css">
</head>
<body>
<header>
    <div class="header-line"></div>
</header>
<div class="content">
    <img class="content-logo" src="/static/img/logo.png" alt="logo">
    <h1 class="content-title">圖片識別</h1>
    <div class="content-form">
        <form method="post"  action="/img/fit" enctype="multipart/form-data" >
            <div id="change_margin_1">
                圖片： <input class="user" type="file" name="imagefile" style="width:160px;" />
            </div>
            <p id="remind_2"></p>
            <div id="change_margin_3">
                <input class="content-form-signup" type="submit" value="鑑定">
            </div>
        </form>
    </div>
</div>
</body>
</html>

測試結果為：

單獨執行 python test-frcnn.py -p $yourtestpath

通過頁面進行檢測：

主頁：

檢測結果：

Object Detection （5）Faster RCNN Keras 釋出為api

目錄 Object Detection （1）VOC2007資料集製作 Object Detection （2）Faster RCNN詳解

Object Detection （4）Faster RCNN Keras 原理+程式碼第二部分

目錄 Object Detection （1）VOC2007資料集製作 Object Detection （2）Faster RCNN詳解 &

Object Detection （3）Faster RCNN Keras 原理+程式碼第一部分

目錄 Object Detection （1）VOC2007資料集製作 Object Detection （2）Faster RCNN詳解 &

Object Detection（一）RCNN

近段時間，筆者開始系統的看了看目標檢測方面的文章，以後可能會在這個方向上發展。所以這裡準備寫個目標檢測系列，先是RCNN系列（從RCNN一直到Mask RCNN，以及YOLO和SSD）。之後可能還會寫語義分割，目標追蹤，3D檢測方向blog。區域卷積神經

（原）faster rcnn的tensorflow程式碼的理解

轉載請註明出處：參考網址：論文：https://arxiv.org/abs/1506.01497 tf的第三方faster rcnn：https://github.com/endernewton/tf-faster-rcnn IOU：https://www.cnblogs.com/

長短期記憶（LSTM）系列_LSTM的資料準備（5）——如何配置Keras中截斷反向傳播預測的輸入序列步長

導讀：這篇文章是介紹了BPTT的概念，說明了資料截斷的原因和方法，即提高網路的學習效率。以及如何找到最好的截斷方法，即利用網格搜尋。文中都是一些概念介紹，這裡直接把原文貼上來了。原文連結：https://machinelearningmastery.com/truncated-ba

【目標檢測】（三）Faster RCNN

Faster R-CNN與RCNN,fast RCNN最大的區別在於，提出RPN網路取代Selective Search演算法使得檢測任務可以由神經網路端到端地完成。fast RCNN先進行提取特徵再結合候選框進行後續步驟，這使得RCNN中重複特徵提取造成的計算量大的缺點得到了解決。而Faster-RCNN

Tensorflow框架下Faster-RCNN實踐（一）——Faster-RCNN所需資料集製作（附程式碼）

最近剛實現了在Ubuntu16.04、Tensorfllow1.0下 Faster R-CNN 從資料製作到訓練再到利用生成的模型檢測的測試圖片的全過程，現在將具體的過程記錄在部落格，方便遇到困惑或者需要的朋友檢視。製作資料集利用Fast

論文閱讀筆記（二十二）：Feature Pyramid Networks for Object Detection（FPN）

Feature pyramids are a basic component in recognition systems for detecting objects at different scales. But recent deep learning o

論文閱讀筆記（六）Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks

采樣分享最終產生 pre 運算減少 att 我們作者：Shaoqing Ren, Kaiming He, Ross Girshick, and Jian SunSPPnet、Fast R-CNN等目標檢測算法已經大幅降低了目標檢測網絡的運行時間。可是盡管如此，仍然

深度學習目標檢測(object detection)系列（四） Faster R-CNN

Faster R-CNN簡介 RBG團隊在2015年，與Fast R-CNN同年推出了Faster R-CNN，我們先從頭回顧下Object Detection任務中各個網路的發展，首先R-CNN用分類+bounding box解決了目標檢測問題，SP

[深度學習]Object detection物體檢測之Faster R-CNN(5)

目錄 1.綜述 2.Region Proposal Networks （RPN） Anchor（錨） loss function Training RPNs 3.Sharing Features for RPN and Fast R-CNN 1.交替訓練&nb

Keras入門（5）——卷積padding的補0策略

0. 前言作為最基礎的卷積層——CNN，我們應當對他最為熟悉。但是在實現的時候，忽然發現對於其第一步驟，就有困惑的地方，那就是padding，也就是補0策略。在Keras中，卷積層的定義是如下： keras.layers.convolutional.Conv1D(filt

object detection（物體檢測）系列論文梳理

object detection論文閱讀梳理： 1、R-CNN：Rich feature hierarchies for accurate object detection and semantic segmentation 技術路線：selective s

Enhancement of SSD by concatenating feature maps for object detection（R-SSD）

github地址：https://github.com/soo89/Rainbow-SSD摘要我們提出了一種物件檢測方法，該方法提高了傳統SSD（Single Shot Multibox Detector）的準確性，該方法是精度和速度兩方面的頂級目標檢測演算法之一。深度網路的

Object-C高階程式設計讀書筆記（5）——Block的物件型別擷取

在之前的部落格中，我們探討了Block對於普通型別資料的擷取，其實現很簡單，就是在Block物件中儲存一份值拷貝。那麼，對於OC中的物件型別（包括系統自帶型別NSArray，NSString和自定義物件型別），Block又是怎麼儲存的呢？在《OC高階程式設計》書中對於該部

程序猿的量化交易之路（17）--Cointrader之Temporal實體（5）

eas 建表 times create bject cloud temp 存儲時間轉載須要註明：http://blog.csdn.net/minimicall，http://cloudtrader.top/ 這一小節說明一個時間實體Temporal實體，它的代碼非常

C++傳智筆記（5）：C++完整demo

內部 urn else clas spa char log getx system MyPoint.h #pragma once class MyPoint { private: double x0, y0; //點坐標 public: void setPoint(d

Windows Phone開發（5）：室內裝修

表示 index can 進行解釋技術面板啟動垂直為什麽叫室內裝修呢？呵呵，其實說的是布局，具體些嘛，就是在一個頁面中，你如何去擺放你的控件，如何管理它們，你說，像不像我們剛搬進新住所，要“裝修”一番？買一套什麽樣的茶幾和杯具（我說的“杯具”指的是原意，不要理解

構建之法——讀書筆記（5）

exp 時間微軟 padding 層次結構敏捷參加解決問題企業第七章 MSF What is MSF?——Microsoft Solution Framework（微軟解決方案框架）即一個方法論，也就是微軟推薦的軟件開發方法。 MSF基本原則： MSF沒有像敏捷

Object Detection （5）Faster RCNN Keras 釋出為api

1，改造檢測類

2，釋出httpapi介面

3，開發頁面，測試

相關推薦