
Implementing ResNet in TensorFlow (timing the forward pass of the ResNet-152 architecture) (repost)

The code below defines the ResNet-50, ResNet-101, ResNet-152, and ResNet-200 architectures; because of how long they take to run, only the forward pass of ResNet-152 is benchmarked.

# coding:UTF-8
"""

Typical use:

   from tensorflow.contrib.slim.nets import resnet_v2

ResNet-101 for image classification into 1000 classes:

   # inputs has shape [batch, 224, 224, 3]
   with slim.arg_scope(resnet_v2.resnet_arg_scope(is_training)):
      net, end_points = resnet_v2.resnet_v2_101(inputs, 1000)

ResNet-101 for semantic segmentation into 21 classes:

   # inputs has shape [batch, 513, 513, 3]
   with slim.arg_scope(resnet_v2.resnet_arg_scope(is_training)):
      net, end_points = resnet_v2.resnet_v2_101(inputs,
                                                21,
                                                global_pool=False,
                                                output_stride=16)
"""
import collections # Python's built-in collections library
import tensorflow as tf
slim = tf.contrib.slim # use the handy contrib.slim library to help build ResNet



class Block(collections.namedtuple('Block', ['scope', 'unit_fn', 'args'])):
  '''
  collections.namedtuple is used to define the basic named tuple for a ResNet block
  group, and this class is built on top of it. It only holds a data structure, no
  methods. A typical Block takes three fields:
  scope: the name of the Block
  unit_fn: the ResNet V2 residual learning unit
  args: the args of the Block.
  '''
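# For illustration, a Block can be constructed like this (this exact instance is built
# below in resnet_v2_50): three bottleneck units, the first two with stride 1 and the
# last with stride 2:
#   Block('block1', bottleneck, [(256, 64, 1)] * 2 + [(256, 64, 2)])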


######## Define a subsampling helper ########
def subsample(inputs, factor, scope=None): 
  """Subsamples the input along the spatial dimensions.
  Args:
    inputs: A `Tensor` of size [batch, height_in, width_in, channels].
    factor: The subsampling factor.(取樣因子)
    scope: Optional variable_scope.

  Returns:
    output: 如果factor為1,則不做修改直接返回inputs;如果不為1,則使用
    slim.max_pool2d最大池化來實現,通過1*1的池化尺寸,stride作步長,實
    現降取樣。
  """
  if factor == 1:
    return inputs
  else:
    return slim.max_pool2d(inputs, [1, 1], stride=factor, scope=scope)
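# Illustrative example (assuming a feature map x of shape [32, 56, 56, 256] and the
# SAME pooling padding set in resnet_arg_scope below):
#   subsample(x, 1)  ->  x unchanged, shape [32, 56, 56, 256]
#   subsample(x, 2)  ->  slim.max_pool2d with a 1*1 window and stride 2, shape [32, 28, 28, 256]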


######## Create a convolution layer ########
def conv2d_same(inputs, num_outputs, kernel_size, stride, scope=None): 
  """
  Args:
    inputs: A 4-D tensor of size [batch, height_in, width_in, channels].
    num_outputs: An integer, the number of output filters.
    kernel_size: An int with the kernel_size of the filters.
    stride: An integer, the output stride.
    scope: Scope.

  Returns:
    output: A 4-D tensor of size [batch, height_out, width_out, channels] with
      the convolution output.
  """
  if stride == 1:
    return slim.conv2d(inputs, num_outputs, kernel_size, stride=1,
                       padding='SAME', scope=scope)
  else: # if stride is not 1, explicitly zero-pad; the total padding is kernel_size - 1
    #kernel_size_effective = kernel_size + (kernel_size - 1) * (rate - 1)
    pad_total = kernel_size - 1
    pad_beg = pad_total // 2
    pad_end = pad_total - pad_beg
    inputs = tf.pad(inputs, # zero-pad the input tensor
                    [[0, 0], [pad_beg, pad_end], [pad_beg, pad_end], [0, 0]])
    # since zero padding has already been applied, a slim.conv2d with padding 'VALID' is enough for this convolution
    return slim.conv2d(inputs, num_outputs, kernel_size, stride=stride,
                       padding='VALID', scope=scope)
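# Worked example (this is the conv1 case used in resnet_v2 below): for a 224*224 input
# with kernel_size=7 and stride=2, pad_total = 6, pad_beg = 3, pad_end = 3, so the padded
# height/width is 224 + 6 = 230 and the VALID convolution yields (230 - 7) // 2 + 1 = 112,
# i.e. a 112*112 output -- the same size a stride-2 SAME convolution would give, but with
# explicit padding that does not depend on the input size.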


######## Define the function that stacks Blocks ########
@slim.add_arg_scope
def stack_blocks_dense(net, blocks,
                       outputs_collections=None):
  """
  Args:
    net: A `Tensor` of size [batch, height, width, channels]. The input.
    blocks: A list of the Block class defined above.
    outputs_collections: Collections used to collect the end_points.

  Returns:
    net: Output tensor 

  """
  # two nested loops stack the residual units one by one
  for block in blocks: # the two tf.variable_scope levels name each residual unit in the form block1/unit_1
    with tf.variable_scope(block.scope, 'block', [net]) as sc:
      for i, unit in enumerate(block.args):

        with tf.variable_scope('unit_%d' % (i + 1), values=[net]):
          # in the inner loop, take the args of each residual unit in the block and unpack them into the three parameters below
          unit_depth, unit_depth_bottleneck, unit_stride = unit
          net = block.unit_fn(net, # use the unit generator function to create and chain all residual units in order
                              depth=unit_depth,
                              depth_bottleneck=unit_depth_bottleneck,
                              stride=unit_stride)
      net = slim.utils.collect_named_outputs(outputs_collections, sc.name, net) # add the block's output net to the collection

  return net # once every residual unit in every block has been stacked, return the final net as the result of stack_blocks_dense
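# With the scopes opened above, the variables of, for example, the first unit of block1
# in resnet_v2_152 end up under names like
#   resnet_v2_152/block1/unit_1/bottleneck_v2/conv1/weights
# and each block's output is collected under an alias such as 'resnet_v2_152/block1'.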


# Create the common ResNet arg_scope; an arg_scope defines default argument values for certain functions
def resnet_arg_scope(is_training=True, # training flag
                     weight_decay=0.0001, # weight decay rate
                     batch_norm_decay=0.997, # batch-norm decay rate
                     batch_norm_epsilon=1e-5, # batch-norm epsilon, default 1e-5
                     batch_norm_scale=True): # batch-norm scale, default True

  batch_norm_params = { # parameter dictionary for batch normalization
      'is_training': is_training,
      'decay': batch_norm_decay,
      'epsilon': batch_norm_epsilon,
      'scale': batch_norm_scale,
      'updates_collections': tf.GraphKeys.UPDATE_OPS,
  }

  with slim.arg_scope( # use slim.arg_scope to set several default arguments of [slim.conv2d]
      [slim.conv2d],
      weights_regularizer=slim.l2_regularizer(weight_decay), # L2 weight regularization
      weights_initializer=slim.variance_scaling_initializer(), # weight initializer
      activation_fn=tf.nn.relu, # activation function
      normalizer_fn=slim.batch_norm, # normalizer set to batch norm
      normalizer_params=batch_norm_params):
    with slim.arg_scope([slim.batch_norm], **batch_norm_params):
      with slim.arg_scope([slim.max_pool2d], padding='SAME') as arg_sc: # the original ResNet paper uses VALID; SAME makes feature alignment simpler
        return arg_sc # return the nested arg_scope as the result



# Define the core bottleneck residual learning unit
@slim.add_arg_scope
def bottleneck(inputs, depth, depth_bottleneck, stride,
               outputs_collections=None, scope=None):
  """
  Args:
    inputs: A tensor of size [batch, height, width, channels].
    depth、depth_bottleneck:、stride三個引數是前面blocks類中的args
    rate: An integer, rate for atrous convolution.
    outputs_collections: 是收集end_points的collection
    scope: 是這個unit的名稱。
  """
  with tf.variable_scope(scope, 'bottleneck_v2', [inputs]) as sc: # slim.utils.last_dimension gets the last dimension of the input, i.e. its channel count
    depth_in = slim.utils.last_dimension(inputs.get_shape(), min_rank=4) # require at least four dimensions
    # batch-normalize the input with slim.batch_norm and pre-activate it with ReLU
    preact = slim.batch_norm(inputs, activation_fn=tf.nn.relu, scope='preact')

    if depth == depth_in:
      shortcut = subsample(inputs, stride, 'shortcut')
      # if the unit's input and output channel counts match, just subsample inputs by the stride
    else:
      shortcut = slim.conv2d(preact, depth, [1, 1], stride=stride,
                             normalizer_fn=None, activation_fn=None,
                             scope='shortcut')
      # otherwise use a 1*1 convolution with the given stride to change the channel count so that input and output depths match

    # first a 1*1 convolution with stride 1 and depth_bottleneck output channels
    residual = slim.conv2d(preact, depth_bottleneck, [1, 1], stride=1,
                           scope='conv1')
    # then a 3*3 convolution with the given stride and depth_bottleneck output channels
    residual = conv2d_same(residual, depth_bottleneck, 3, stride,
                           scope='conv2')
    # finally a 1*1 convolution with stride 1 and depth output channels gives the final residual; this last layer has no normalizer and no activation
    residual = slim.conv2d(residual, depth, [1, 1], stride=1,
                           normalizer_fn=None, activation_fn=None,
                           scope='conv3')

    output = shortcut + residual # add the (possibly subsampled) shortcut and the residual

    return slim.utils.collect_named_outputs(outputs_collections, # add output to the collection and return it as the function result
                                            sc.name,
                                            output)
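# Illustrative shape flow (assuming an input of shape [32, 56, 56, 256] and a unit with
# depth=256, depth_bottleneck=64, stride=2, so depth == depth_in):
#   shortcut = subsample(inputs, 2)                    -> [32, 28, 28, 256]
#   conv1: 1*1, 64 channels, stride 1                  -> [32, 56, 56, 64]
#   conv2: 3*3, 64 channels, stride 2 (conv2d_same)    -> [32, 28, 28, 64]
#   conv3: 1*1, 256 channels, stride 1                 -> [32, 28, 28, 256]
#   output = shortcut + residual                       -> [32, 28, 28, 256]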


######## Define the main function that generates resnet_v2 ########
def resnet_v2(inputs, # A tensor of size [batch, height_in, width_in, channels]. The input.
              blocks, # a list of the Block instances defined above
              num_classes=None, # number of classes of the final output
              global_pool=True, # whether to add a final global average pooling layer
              include_root_block=True, # whether to add the 7*7 convolution and max pooling that usually open a ResNet
              reuse=None, # whether to reuse variables
              scope=None): # name of the whole network
  # first set up the variable_scope and the end_points_collection
  with tf.variable_scope(scope, 'resnet_v2', [inputs], reuse=reuse) as sc:
    end_points_collection = sc.original_name_scope + '_end_points' # define end_points_collection
    with slim.arg_scope([slim.conv2d, bottleneck,
                         stack_blocks_dense],
                        outputs_collections=end_points_collection): # default the outputs_collections of these three ops to end_points_collection

      net = inputs
      if include_root_block: # depending on the flag
        with slim.arg_scope([slim.conv2d],
                            activation_fn=None, normalizer_fn=None):
          net = conv2d_same(net, 64, 7, stride=2, scope='conv1') # the leading 7*7 convolution with 64 output channels and stride 2
        net = slim.max_pool2d(net, [3, 3], stride=2, scope='pool1') # followed by max pooling
        # after two stride-2 layers the spatial size is reduced to 1/4
      net = stack_blocks_dense(net, blocks) # build the groups of residual learning units
      net = slim.batch_norm(net, activation_fn=tf.nn.relu, scope='postnorm')

      if global_pool: # add a global average pooling layer if requested
        net = tf.reduce_mean(net, [1, 2], name='pool5', keep_dims=True) # tf.reduce_mean implements global average pooling more efficiently than avg_pool
      if num_classes is not None:  # if a number of classes was given
        net = slim.conv2d(net, num_classes, [1, 1], activation_fn=None, # no activation function or normalizer
                          normalizer_fn=None, scope='logits') # add a 1*1 convolution with num_classes output channels
      end_points = slim.utils.convert_collection_to_dict(end_points_collection) # convert the collection into a python dict
      if num_classes is not None:
        end_points['predictions'] = slim.softmax(net, scope='predictions') # the network's softmax predictions
      return net, end_points
#------------------------------ The ResNet generator function is now defined ----------------------------------------



def resnet_v2_50(inputs, # the image size is reduced by a factor of 32
                 num_classes=None,
                 global_pool=True,
                 reuse=None, # whether to reuse variables
                 scope='resnet_v2_50'):
  blocks = [
      Block('block1', bottleneck, [(256, 64, 1)] * 2 + [(256, 64, 2)]),
      # Args:
      # 'block1': the name (scope) of the Block
      # bottleneck: the ResNet V2 residual learning unit
      # [(256, 64, 1)] * 2 + [(256, 64, 2)]: the Block's args, a list in which every element
      #                                      corresponds to one bottleneck unit. The first two
      #                                      elements are (256, 64, 1) and the last is (256, 64, 2);
      #                                      each element is a triple (depth, depth_bottleneck, stride).
      # (256, 64, 2), for example, builds a bottleneck residual unit (three convolution layers)
      # whose third layer has depth 256 output channels, whose first two layers have
      # depth_bottleneck 64 output channels, and whose middle layer has stride 2. The unit's
      # structure is [(1*1/s1, 64), (3*3/s2, 64), (1*1/s1, 256)].
      Block(
          'block2', bottleneck, [(512, 128, 1)] * 3 + [(512, 128, 2)]),
      Block(
          'block3', bottleneck, [(1024, 256, 1)] * 5 + [(1024, 256, 2)]),
      Block(
          'block4', bottleneck, [(2048, 512, 1)] * 3)]
  return resnet_v2(inputs, blocks, num_classes, global_pool,
                   include_root_block=True, reuse=reuse, scope=scope)
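# Depth check for ResNet-50: the blocks above hold 3 + 4 + 6 + 3 = 16 bottleneck units, each
# with 3 convolution layers, giving 48 layers; together with the initial 7*7 conv1 and the
# final 1*1 logits convolution this makes the 50 weighted layers of the name.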


def resnet_v2_101(inputs, # the additional units (compared with ResNet-50) go mainly into block3
                  num_classes=None,
                  global_pool=True,
                  reuse=None,
                  scope='resnet_v2_101'):
  """ResNet-101 model of [1]. See resnet_v2() for arg and return description."""
  blocks = [
      Block(
          'block1', bottleneck, [(256, 64, 1)] * 2 + [(256, 64, 2)]),
      Block(
          'block2', bottleneck, [(512, 128, 1)] * 3 + [(512, 128, 2)]),
      Block(
          'block3', bottleneck, [(1024, 256, 1)] * 22 + [(1024, 256, 2)]),
      Block(
          'block4', bottleneck, [(2048, 512, 1)] * 3)]
  return resnet_v2(inputs, blocks, num_classes, global_pool,
                   include_root_block=True, reuse=reuse, scope=scope)


def resnet_v2_152(inputs, # the additional units go mainly into block3
                  num_classes=None,
                  global_pool=True,
                  reuse=None,
                  scope='resnet_v2_152'):
  """ResNet-152 model of [1]. See resnet_v2() for arg and return description."""
  blocks = [
      Block(
          'block1', bottleneck, [(256, 64, 1)] * 2 + [(256, 64, 2)]),
      Block(
          'block2', bottleneck, [(512, 128, 1)] * 7 + [(512, 128, 2)]),
      Block(
          'block3', bottleneck, [(1024, 256, 1)] * 35 + [(1024, 256, 2)]),
      Block(
          'block4', bottleneck, [(2048, 512, 1)] * 3)]
  return resnet_v2(inputs, blocks, num_classes, global_pool,
                   include_root_block=True, reuse=reuse, scope=scope)
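# Depth check for ResNet-152: 3 + 8 + 36 + 3 = 50 bottleneck units * 3 convolutions = 150
# layers, plus conv1 and the logits convolution, for 152 weighted layers in total.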


def resnet_v2_200(inputs, # the additional units go mainly into block2
                  num_classes=None,
                  global_pool=True,
                  reuse=None,
                  scope='resnet_v2_200'):
  """ResNet-200 model of [2]. See resnet_v2() for arg and return description."""
  blocks = [
      Block(
          'block1', bottleneck, [(256, 64, 1)] * 2 + [(256, 64, 2)]),
      Block(
          'block2', bottleneck, [(512, 128, 1)] * 23 + [(512, 128, 2)]),
      Block(
          'block3', bottleneck, [(1024, 256, 1)] * 35 + [(1024, 256, 2)]),
      Block(
          'block4', bottleneck, [(2048, 512, 1)] * 3)]
  return resnet_v2(inputs, blocks, num_classes, global_pool,
                   include_root_block=True, reuse=reuse, scope=scope)



from datetime import datetime
import math
import time


#------------------- Benchmark function ---------------------------------
# Measure the forward-pass performance of the 152-layer ResNet
def time_tensorflow_run(session, target, info_string):
    num_steps_burn_in = 10 # warm-up iterations, excluded from the statistics
    total_duration = 0.0
    total_duration_squared = 0.0
    for i in range(num_batches + num_steps_burn_in): # num_batches is a module-level global set below
        start_time = time.time()
        _ = session.run(target)
        duration = time.time() - start_time
        if i >= num_steps_burn_in:
            if not i % 10:
                print ('%s: step %d, duration = %.3f' %
                       (datetime.now(), i - num_steps_burn_in, duration))
            total_duration += duration
            total_duration_squared += duration * duration
    mn = total_duration / num_batches # mean time per batch
    vr = total_duration_squared / num_batches - mn * mn # variance = E[t^2] - mean^2
    sd = math.sqrt(vr) # standard deviation
    print ('%s: %s across %d steps, %.3f +/- %.3f sec / batch' %
           (datetime.now(), info_string, num_batches, mn, sd))

batch_size = 32
height, width = 224, 224
inputs = tf.random_uniform((batch_size, height, width, 3))
with slim.arg_scope(resnet_arg_scope(is_training=False)): # is_training set to False for inference
   net, end_points = resnet_v2_152(inputs, 1000)

init = tf.global_variables_initializer()
sess = tf.Session()
sess.run(init)  
num_batches = 100 # number of timed batches
time_tensorflow_run(sess, net, "Forward") 

# The forward pass only takes roughly 50% longer than VGGNet and Inception V3, so this is a practical convolutional network.

It runs very slowly, so for now here are screenshots of the partial output:
(screenshots of the timing output)