Tensorflow object detection API 原始碼閱讀筆記：架構

阿新 • • 發佈：2019-01-08

在之前的博文中介紹過用tf提供的預訓練模型進行inference，非常簡單。這裡我們深入原始碼，瞭解檢測API的程式碼架構，每個部分的深入閱讀留待後續。

'''構建自己模型的介面是虛基類DetectionModel，具體有5個抽象函式需要實現。
'''
object_detection/core/model.py
  def groundtruth_lists(self, field):
    """Access list of groundtruth tensors."""
  def groundtruth_has_field(self, field):
    """Determines whether the groundtruth includes the given field." 
""
  def provide_groundtruth(self,
                          groundtruth_boxes_list,
                          groundtruth_classes_list,
                          groundtruth_masks_list=None,
                          groundtruth_keypoints_list=None):
    """Provide groundtruth tensors."""

  @abstractmethod 

  def preprocess(self, inputs):

  @abstractmethod
  def predict(self, preprocessed_inputs)

  @abstractmethod
  def postprocess(self, prediction_dict, **params)

  @abstractmethod
  def loss(self, prediction_dict)

  @abstractmethod
  def restore_map(self, from_detection_checkpoint=True)

object_detection/meta_architectures/faster_rcnn_meta_arch.py

class 
 FasterRCNNFeatureExtractor(object):
  """Faster R-CNN Feature Extractor definition."""
  def __init__(self,
               is_training,
               first_stage_features_stride,
               batch_norm_trainable=False,
               reuse_weights=None,
               weight_decay=0.0)

  @abstractmethod
  def preprocess(self, resized_inputs):
    """Feature-extractor specific preprocessing (minus image resizing)."""

  def extract_proposal_features(self, preprocessed_inputs, scope):
    """Extracts first stage RPN features."""
  @abstractmethod
  def _extract_proposal_features(self, preprocessed_inputs, scope):

  def extract_box_classifier_features(self, proposal_feature_maps, scope):
    """Extracts second stage box classifier features."""
  @abstractmethod
  def _extract_box_classifier_features(self, proposal_feature_maps, scope):
    """Extracts second stage box classifier features, to be overridden."""

  def restore_from_classification_checkpoint_fn(
      self,
      first_stage_feature_extractor_scope,
      second_stage_feature_extractor_scope):
    """Returns a map of variables to load from a foreign checkpoint."""

class FasterRCNNMetaArch(model.DetectionModel):
  """Faster R-CNN Meta-architecture definition."""
  """暫時主要看哪些地方呼叫了feature_extractor: A FasterRCNNFeatureExtractor object.換一個cnn還是比較簡單的，只需要重寫一個faster_rcnn_new_cnn_feature_extractor。最終構建的檢測模型是這個類的物件。"""

  def preprocess(self, inputs):
  """For Faster R-CNN, we perform image resizing in the base class --- each
    class subclassing FasterRCNNMetaArch is responsible for any additional
    preprocessing (e.g., scaling pixel values to be in [-1, 1]).
    見下面程式碼塊中實現的preprocess函式"""

object_detection/models/faster_rcnn_resnet_v1_feature_extractor.py
"""這一塊和slim結合緊密，我們仔細看看。
"""

class FasterRCNNResnetV1FeatureExtractor(
    faster_rcnn_meta_arch.FasterRCNNFeatureExtractor):
  """Faster R-CNN Resnet V1 feature extractor implementation."""
    def __init__(self,
               architecture,
               resnet_model,
               is_training,
               first_stage_features_stride,
               batch_norm_trainable=False,
               reuse_weights=None,
               weight_decay=0.0):

    def preprocess(self, resized_inputs):
    """Faster R-CNN Resnet V1 preprocessing."""
        channel_means = [123.68, 116.779, 103.939]
        return resized_inputs - [[channel_means]]

    def _extract_proposal_features(self, preprocessed_inputs, scope):
    """Extracts first stage RPN features.
    使用endpoints輸出resnet block3的值。
    """

    def _extract_box_classifier_features(self, proposal_feature_maps, scope):
    """Extracts second stage box classifier features.
    拆分出resnet的block4。注意variable_scope和arg_scope的使用。
    """

class FasterRCNNResnet152FeatureExtractor(FasterRCNNResnetV1FeatureExtractor):
  """Faster R-CNN Resnet 152 feature extractor implementation."""

  def __init__(self,
               is_training,
               first_stage_features_stride,
               batch_norm_trainable=False,
               reuse_weights=None,
               weight_decay=0.0):
    """Constructor.
    Args:
      is_training: See base class.
      first_stage_features_stride: See base class.
      batch_norm_trainable: See base class.
      reuse_weights: See base class.
      weight_decay: See base class.
    Raises:
      ValueError: If `first_stage_features_stride` is not 8 or 16,
        or if `architecture` is not supported.
    """
    super(FasterRCNNResnet152FeatureExtractor, self).__init__(
        'resnet_v1_152', resnet_v1.resnet_v1_152, is_training,
        first_stage_features_stride, batch_norm_trainable,
        reuse_weights, weight_decay)
    """往前看各個類的init，'resnet_v1_152', resnet_v1.resnet_v1_152只用在了上面的class FasterRCNNResnetV1FeatureExtractor"""

同樣建議跑一跑test指令碼。會遇到如下檔案，按照test中出現的順序逐個閱讀這些檔案，以及對應的test指令碼。

"""Builder function to construct tf-slim arg_scope for convolution, fc ops.
看一下這個指令碼的test，很容易理解超引數配置是怎麼讀取的了，類似OpenFOAM中的dict。object_detection.protos.hyperparams_pb2.Hyperparams。
"""
from object_detection.builders import hyperparams_builder

"""Contains routines for printing protocol messages in text format.
同樣是上面這個test指令碼，目前主要用在    
conv_hyperparams_proto = hyperparams_pb2.Hyperparams()
text_format.Merge(conv_hyperparams_text_proto, conv_hyperparams_proto)
其中conv_hyperparams_text_proto是包含引數配置的字串，conv_hyperparams_proto是hyperparams.proto object，hyperparams_builder.build的第一個引數。
"""
from google.protobuf import text_format

"""Function to build box predictor from configuration.
Box predictors are classes that take a high level
image feature map as input and produce two predictions,
(1) a tensor encoding box locations, and
(2) a tensor encoding classes for each box.
object_detection/core/box_predictor.py留待後續研讀。注意conv_hyperparams_text_proto是放進box_predictor_text_proto然後一起傳遞給class ConvolutionalBoxPredictor(BoxPredictor)的。
"""
from object_detection.builders import box_predictor_builder

"""Generates grid anchors on the fly as used in Faster RCNN.
下次細看。
"""
from object_detection.anchor_generators import grid_anchor_generator

"""Builder function for post processing operations."""
from object_detection.builders import post_processing_builder

"""Classification and regression loss functions for object detection."""
from object_detection.core import losses

"""proto檔案，下次再結合相應的core和builder來具體研究如何編寫和讀取這些檔案"""
from object_detection.protos import box_predictor_pb2
from object_detection.protos import hyperparams_pb2
from object_detection.protos import post_processing_pb2

"""A function to build a DetectionModel from configuration.
很多內容在faster_rcnn_meta_arch_test_lib.py測試過了。
"""
object_detection/builders/model_builder.py

Tensorflow object detection API 原始碼閱讀筆記：架構

在之前的博文中介紹過用tf提供的預訓練模型進行inference，非常簡單。這裡我們深入原始碼，瞭解檢測API的程式碼架構，每個部分的深入閱讀留待後續。 '''構建自己模型的介面是虛基類DetectionModel，具體有5個抽象函式需要實現。 ''' o

Tensorflow object detection API 原始碼閱讀筆記：RPN

Update: 建議先看從程式設計實現角度學習Faster R-CNN，比較直觀。這裡由於原始碼抽象程度較高，顯得比較混亂。 faster_rcnn_meta_arch.py中這兩個對應知乎文章中RPN包含的3*3和1*1卷積： rpn_box_pred

Tensorflow object detection API 原始碼閱讀筆記：RFCN

有了前面Faster R-CNN的基礎，RFCN就比較容易了。 """object_detection/meta_architectures/rfcn_meta_arch.py The R-FCN

初窺Tensorflow Object Detection API 原始碼之(1) FeatureExtractor

models/research/object_detection/models/faster_rcnn_resnet_v1_feature_extractor.py models/research/object_detection/meta_a

谷歌開源Tensorflow Object Detection API學習筆記

谷歌宣佈開源其內部使用的 TensorFlow Object Detection API 物體識別系統。本教程針對ubuntu16.04系統，快速搭建環境以及實現視訊物體識別系統功能。 https://yq.aliyun.com/ziliao/405237 https://www.cnblo

配置tensorflow object detection api

could ror blog test creat not pre setup.py python 3：安裝tensorflow model 以及slim 版本號為1.4以上的，model和slim均在research 文件夾下打開research文件目錄 python

谷歌開源的TensorFlow Object Detection API視頻物體識別系統實現教程

cti blog tail xiaoxiao pan clas post ont 谷歌教程：http://blog.csdn.net/xiaoxiao123jun/article/details/76605928 全部代碼：https://github.com/lyj83

#tensorflow object detection api 源碼分析

clas fas mask api 錯誤眼界沒有 lan 入門深度學習前言 Tensorflow 推出的 Object Detection API是一套抽象程度極高的目標檢測框架，可以快速用於生產部署。但網絡上大多數相關的中英文文章均只局限於應用層面的分析，對於該套

TensorFlow object detection API

storage 系統 pipeline -s doc 直接下載 and 獲取數據 ons https://github.com/tensorflow/models/blob/master/research/object_detection/g3doc/running_pet

TensorFlow object detection API應用一

ofo ash png figure lin 調用安裝包 pat eight 目標檢測在圖形識別的基礎上有了更進一步的應用，但是代碼也更加繁瑣，TensorFlow專門為此開設了一個object detection API，接下來看看怎麽使用它。一、object det

Ubuntu 16.04 安裝Tensorflow Object Detection API遇到的問題解決

** Ubuntu 16.04 安裝Tensorflow Object Detection API ** 本篇的內容主要參考以下連結：https://blog.csdn.net/pkokocl/article/details/82596089，該博主描述的比較清楚，對於解決實際

Tensorflow Object Detection API之MaskRCNN-資料處理篇

TensorFlow官網介紹：Run an Instance Segmentation Model 要求將資料處理為PNG Instance Segmentation Masks格式以下部分為處理單張Mask圖片的方式： from PIL import Image, ImageDr

Tensorflow object detection API--修改visualization_utils檔案,裁剪並儲存bounding box部分

任務描述：用Tensorflow object detection API檢測出來的結果是一整張圖片，想要把檢測出的bounding box部分單獨截取出來並儲存執行環境：spyder 效果展示：測試圖片：test_images --> 檢測圖片：testsave_images -

基於TensorFlow Object Detection API進行相關開發的步驟

1/安裝或升級protoc 2/編譯proto檔案 protoc object_detection/protos/*.proto --python_out=. 3將slim加入PYTHONPATH export PYTHONPATH="$PYTHONPATH:/home/user/DL

Tensorflow Object Detection API安裝與使用

一、簡介《21個專案玩轉深度學習：基於Tensorflow的實踐詳解》第五章實踐 win10、jupyter notebook、python3.6， Tensorflow Object Detection API專案地址：https://github.com/tensorflow/mo

Tensorflow object detection API(1)---環境搭建與測試

參考： https://blog.csdn.net/dy_guox/article/details/79081499 https://blog.csdn.net/u010103202/article/details/79899293 https://blog.csdn.n

windows+tensorflow object detection api 深度學習目標檢測實踐

1、在github上下載tensorflow/model專案 1. 首先把protoc-win32資料夾下面的protoc.exe移至protobuf-python/src目錄下。 2. 在cmd中進入protobuf-python/python目錄，先執行a

TensorFlow Object Detection API 超詳細教程和踩坑過程（安裝）

目錄 cuda安裝 cudnn安裝 anaconda安裝並建立環境 tensorflow環境 Tensorflow.models下載 Protobuf配置與測試 1.配置環境首先說一下我

TensorFlow Object Detection API 超詳細教程和踩坑過程（資料準備和訓練）

1.準備資料 object detection的資料是需要tfrecord格式的，但是一般我們還是先製作voc格式的資料更加方便。 1.voc格式資料的準備：github上下載一個label-img：然後選擇VOC格式，開始漫長的資料

基於谷歌開源的TensorFlow Object Detection API視訊物體識別系統實現教程

安裝Python 進入Python3.6.2下載頁，選擇 Files 中Windows平臺的Python安裝包，下載並安裝（本人安裝的是3.6.2版本的python，可根據實際情況下載不同版本的python）安裝TensorFlow 進入TensorFlow

Tensorflow object detection API 原始碼閱讀筆記：架構

相關推薦