Tensorflow學習——結合ROS呼叫模型實現目標識別

阿新 • • 發佈：2018-12-31

環境：Ubuntu16.04+Tensorflow-cpu-1.6.0+ROS Kinetic+OpenCV3.3.1

前期準備：

完成Object Detection api配置
完成OpenCV配置

完成模型訓練後就是模型的應用，這裡通過ROS利用Object Detection api呼叫模型實現目標物體的識別。

一、模型匯入

模型路徑設定如下圖所示，注意設定目標物件型別數目。

		#Get models
		rospy.loginfo("begin initialization...")
		self.PATH_TO_CKPT = '../frozen_inference_graph.pb'
		self.PATH_TO_LABELS = '../bottel.pbtxt'
		self.NUM_CLASSES = 2
		self.detection_graph = self._load_model()
		self.category_index = self._load_label_map()
		self.image_tensor = self.detection_graph.get_tensor_by_name('image_tensor:0')
		self.boxes = self.detection_graph.get_tensor_by_name('detection_boxes:0')
		self.scores = self.detection_graph.get_tensor_by_name('detection_scores:0')
		self.classes = self.detection_graph.get_tensor_by_name('detection_classes:0')
		self.num_detections = self.detection_graph.get_tensor_by_name('num_detections:0')

二、資料處理

呼叫模型識別目標物件前需進行資料處理，流程如下圖所示。

相機獲取的影象資訊會以ROSImage Message的格式釋出在ROS平臺上，然後通過CvBridge對獲取的影象資訊進行轉換，將其從ROSImage Message格式轉變為Mat格式。
通過OpenCV對獲取影象資料進行預處理後轉換為numpy陣列，然後呼叫ObjectDetection API進行識別。
完成影象中目標物體的識別後，識別結果以陣列的形式釋出到相關話題中，同時視覺識別程式會將識別出來的目標物體使用帶有顏色的矩形框出來並在其上方標識識別物體的標籤及其概率，然後在轉換為ROSImage Message格式釋出到相應話題中。

程式碼實現

	# detect object from the image		
	def imgprogress(self, image_msg):
		with self.detection_graph.as_default():
			with tf.Session(graph=self.detection_graph) as sess:
				#translate image_msg data
				cv_image = self._cv_bridge.imgmsg_to_cv2(image_msg, "rgb8")
				pil_img = Image.fromarray(cv_image)
				(im_width, im_height) = pil_img.size
				image_np =np.array(pil_img.getdata()).reshape((im_height, im_width, 3)).astype(np.uint8)
				# Expand dimensions since the model expects images to have shape: [1, None, None, 3]
				image_np_expanded = np.expand_dims(image_np, axis=0)

				# Actual detection.
				(boxes, scores, classes, num_detections) = sess.run([self.boxes, self.scores, self.classes, self.num_detections],feed_dict={self.image_tensor: image_np_expanded})
				
				# Visualization of the results of a detection.
				vis_util.visualize_boxes_and_labels_on_image_array(image_np,np.squeeze(boxes),np.squeeze(classes).astype(np.int32),np.squeeze(scores),
				self.category_index,
    		    use_normalized_coordinates=True,
     		 	line_thickness=8)
				
				#public img_msg
				ROSImage_pro=self._cv_bridge.cv2_to_imgmsg(image_np,encoding="rgb8")
				self._pub.publish(ROSImage_pro)

三、觸發識別

因通過Object Detection API進行物體識別需要佔用大量資源，所以採用動態識別的會非常卡，這裡採用觸發器進行觸發識別，本程式設定了一個訂閱器self._sub用於獲取用於識別的圖片，當需要進行識別時，釋出圖片到image_topic即可觸發程式，同時結果會通過self._pub釋出到object_detection話題中。

		# Subscribe to judge
		self._sub = rospy.Subscriber(image_topic, ROSImage, self.imgprogress, queue_size=10)
		 
		# Subscribe to the image
		self._pub = rospy.Publisher('object_detection', ROSImage, queue_size=1)

完整程式

#!/usr/bin/env python

import rospy
from sensor_msgs.msg import Image as ROSImage
from cv_bridge import CvBridge
import cv2
import matplotlib
import numpy as np
import os
import six.moves.urllib as urllib
import sys
import tarfile
import tensorflow as tf
import zipfile
import uuid
from collections import defaultdict
from io import StringIO
from PIL import Image
from math import isnan

# This is needed since the notebook is stored in the object_detection folder.
from object_detection.utils import label_map_util
from object_detection.utils import visualization_utils as vis_util

class ObjectDetectionDemo():
	def __init__(self):
		rospy.init_node('tfobject')

	    # Set the shutdown function (stop the robot)
		rospy.on_shutdown(self.shutdown)
		camera_topic = "/camera/rgb/image_raw" #rospy.get_param("~image_topic", "")
		image_topic = "/image/rgb/object"

		self.vfc=0
		self._cv_bridge = CvBridge()

		#Get models
		rospy.loginfo("begin initialization...")
		self.PATH_TO_CKPT = '../frozen_inference_graph.pb'
		self.PATH_TO_LABELS = '../bottel.pbtxt'
		self.NUM_CLASSES = 2
		self.detection_graph = self._load_model()
		self.category_index = self._load_label_map()
		self.image_tensor = self.detection_graph.get_tensor_by_name('image_tensor:0')
		self.boxes = self.detection_graph.get_tensor_by_name('detection_boxes:0')
		self.scores = self.detection_graph.get_tensor_by_name('detection_scores:0')
		self.classes = self.detection_graph.get_tensor_by_name('detection_classes:0')
		self.num_detections = self.detection_graph.get_tensor_by_name('num_detections:0')

		# Subscribe to judge
		self._sub = rospy.Subscriber(image_topic, ROSImage, self.imgprogress, queue_size=10)
		 
		# Subscribe to the image
		self._pub = rospy.Publisher('object_detection', ROSImage, queue_size=1)
		rospy.loginfo("initialization has finished...")
	
	def _load_model(self):
		detection_graph = tf.Graph()
		with detection_graph.as_default():
			od_graph_def = tf.GraphDef()
			with tf.gfile.GFile(self.PATH_TO_CKPT, 'rb') as fid:
				serialized_graph = fid.read()
				od_graph_def.ParseFromString(serialized_graph)
				tf.import_graph_def(od_graph_def, name='')
		return detection_graph
	
	def _load_label_map(self):
		label_map = label_map_util.load_labelmap(self.PATH_TO_LABELS)
		categories = label_map_util.convert_label_map_to_categories(label_map,max_num_classes=self.NUM_CLASSES,use_display_name=True)
		category_index = label_map_util.create_category_index(categories)
		return category_index
	
	# detect object from the image		
	def imgprogress(self, image_msg):
		with self.detection_graph.as_default():
			with tf.Session(graph=self.detection_graph) as sess:
				#translate image_msg data
				cv_image = self._cv_bridge.imgmsg_to_cv2(image_msg, "rgb8")
				pil_img = Image.fromarray(cv_image)
				(im_width, im_height) = pil_img.size
				image_np =np.array(pil_img.getdata()).reshape((im_height, im_width, 3)).astype(np.uint8)
				# Expand dimensions since the model expects images to have shape: [1, None, None, 3]
				image_np_expanded = np.expand_dims(image_np, axis=0)

				# Actual detection.
				(boxes, scores, classes, num_detections) = sess.run([self.boxes, self.scores, self.classes, self.num_detections],feed_dict={self.image_tensor: image_np_expanded})
				
				# Visualization of the results of a detection.
				vis_util.visualize_boxes_and_labels_on_image_array(image_np,np.squeeze(boxes),np.squeeze(classes).astype(np.int32),np.squeeze(scores),
				self.category_index,
    		    use_normalized_coordinates=True,
     		 	line_thickness=8)
				
				#public img_msg
				ROSImage_pro=self._cv_bridge.cv2_to_imgmsg(image_np,encoding="rgb8")
				self._pub.publish(ROSImage_pro)
	
	# stop node
	def shutdown(self):
		rospy.loginfo("Stopping the tensorflow object detection...")
		rospy.sleep(1) 
	
if __name__ == '__main__':
    try:
        ObjectDetectionDemo()
        rospy.spin()
    except rospy.ROSInterruptException:
        rospy.loginfo("RosTensorFlow_ObjectDetectionDemo has started.")

Tensorflow學習——結合ROS呼叫模型實現目標識別

環境：Ubuntu16.04+Tensorflow-cpu-1.6.0+ROS Kinetic+OpenCV3.3.1前期準備：完成Object Detection api配置完成OpenCV配置完成模型訓練後就是模型的應用，這裡通過ROS利用Object Detection

tensorflow學習筆記（二）實現MNIST

import tensorflow as tf from tensorflow.contrib import rnn import numpy as np import input_data input_vec_size = lstm_size = 28 time_st

tensorflow學習（2.網路模型的儲存以及提取）

第一篇學習了CNN網路的構建以及程式碼的基礎結構，第二篇則是實際專案過程中需要的網路模型的儲存先放上儲存的程式碼： #tf可以認為是全域性變數，從該變數為類，從中取input_data變數 import tensorflow.examples.tutorials.mni

Tensorflow學習筆記：VGG16模型——Finetuning，貓狗大戰，VGGNet的重新針對訓練

這一篇介紹一下VGG16模型的修改 Step 1: 對模型的修改首先是對模型的修改（VGG16_model.py檔案），在這裡原先的輸出結果是對1000個不同的類別進行判定，而在此是對2個影象，也就是貓和狗的判斷，因此首先第一步就是修改輸出層的全連線資料。

TensorFlow學習筆記（5）--實現卷積神經網路（MNIST資料集）

這裡使用TensorFlow實現一個簡單的卷積神經網路，使用的是MNIST資料集。網路結構為：資料輸入層–卷積層1–池化層1–卷積層2–池化層2–全連線層1–全連線層2（輸出層），這是一個簡單但非常有代表性的卷積神經網路。 import tensorflow

TensorFlow學習筆記（4）--實現多層感知機（MNIST資料集）

前面使用TensorFlow實現一個完整的Softmax Regression，並在MNIST資料及上取得了約92%的正確率。現在建含一個隱層的神經網路模型（多層感知機）。 import tensorflow as tf import numpy as np

tensorflow學習筆記(一)-基礎模型

Tensor tensor基本可以視作矩陣處理，如下面的程式碼就構造了一個1x2的0矩陣。 import tensorflow as tf # 在下面所有程式碼中，都去掉了這一行，預設已經匯入 a = tf.zeros(shape=[1,2]) Variable Variable表示變數，下面的程式碼

TensorFlow學習筆記（7）--實現卷積神經網路（同(5),不同的程式風格）

import tensorflow as tf import numpy as np import input_data mnist = input_data.read_data_sets('data/', one_hot=True) print("MNIST

Tensorflow 學習筆記之使用LSTM實現MNIST資料集

LSTM實現MNIST手寫集識別這幾天剛好看了RNN之後瞭解了LSTM（原理可以去參考這個）。雖然LSTM主要用於處理自然語言、語音、機器人翻譯等領域，但圖片也可以看做一個有序列的資料。所以用LSTM

深度學習-tensorflow學習筆記(2)-MNIST手寫字體識別

image utf-8 詳情識別標簽 ins AI tor 第一個　　　　　　　　　　深度學習-tensorflow學習筆記(2)-MNIST手寫字體識別　　這是tf入門的第一個例子。minst應該是內置的數據集。　　前置知識在學習筆記(1)裏面講過了　　這裏直

機器學習實戰：用nodejs實現人臉識別

機器學習實戰：用nodejs實現人臉識別在本文中，我將向你展示如何使用face-recognition.js執行可靠的人臉檢測和識別。我曾經試圖找一個能夠精確識別人臉的Node.js庫，但是

用TensorFlow基於最近鄰域法實現影象識別

1、匯入程式設計庫 import random import numpy as np import tensorflow as tf import matplotlib.pyplot as plt from PIL import Image from tens

opencv筆記（3）——模板匹配實現目標識別與跟蹤

1 知識補充 1.1 回撥函式在影象處理時，如果我們需要實現實時的改變值，並重新開始程式，就需要我們自己實現回撥函式，其中，對於滑鼠事件的回撥，需要我們重寫滑鼠回撥函式void onMouse(int event, int x, int y, int flags, void* us

java呼叫face++實現人臉識別

首先為什麼我會選擇曠視的face++呢，face++在人工智慧尤其是在人臉識別是業內做的比較好的一家公司，又有自己的技術群，每天為人們解答各種疑難，為開發人員提供的好的幫助，我的這篇主要是應用於微信公眾號開發當中的一個小功能程式碼如下：/** * 人臉識別工具類 * 耿直

TensorFlow學習筆記(二)：手寫數字識別之多層感知機

在【TensorFlow學習筆記(一)：手寫數字識別之softmax迴歸】中：我使用softmax迴歸演算法識別mnist資料集的手寫數字，在我機器上的mnist測試集上最好結果是 92.9% 。

Python+Opencv呼叫攝像頭實現人臉識別並儲存視訊及儲存空檔案問題解決

前言本文介紹了利用Python+Opencv實現呼叫攝像頭、進行人臉識別、並儲存為avi格式視訊的程式方法。至於python+opencv測試環境的搭建，回頭再開一個帖子進行介紹。前期配置 Python2.7+Opencv2.4.13 實現過程先貼程式碼：

在 C/C++ 中使用 TensorFlow 預訓練好的模型—— 直接呼叫Ｃ++ 介面實現

現在的深度學習框架一般都是基於 Python 來實現，構建、訓練、儲存和呼叫模型都可以很容易地在 Python 下完成。但有時候，我們在實際應用這些模型的時候可能需要在其他程式語言下進行，本文將通過直接呼叫 TensorFlow 的 C/C++ 介面來匯入 T

Tensorflow學習筆記：變數作用域、模型的載入與儲存、執行緒與佇列實現多執行緒讀取樣本

# tensorflow變數作用域用上下文語句規定作用域 with tf.variable_scope("作用域_name") ......

深度學習之影象分類模型AlexNet結構分析和tensorflow實現

在ImageNet上的影象分類challenge上，Hinton和他的學生Alex Krizhevsky提出的AlexNet網路結構模型贏得了2012屆的冠軍，重新整理了Image Classification的機率。因此，要研究CNN型別深度學習模型在影象分

Tensorflow學習筆記（五）——結構化模型及Skip-gram模型的實現

一、結構化模型結構化我們的模型，可以方便我們Debug和良好的視覺化。一般我們的模型都是由以下兩步構成，第一步是構建計算圖，第二步是執行計算圖。 Assemble Graph Define placeholders for Inp

Tensorflow學習——結合ROS呼叫模型實現目標識別

相關推薦