Loading and Using a Pretrained Model in TensorFlow: inception_v3 as an Example

阿新 • Published: 2019-01-26

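While loading a pretrained model in TensorFlow for fine-tuning, I kept running into errors when reading the official inception_v3 checkpoint. This post records the correct way to load it, along with the mistakes that cause each error.
The key code is as follows: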

import tensorflow as tf
import tensorflow.contrib.slim as slim
from tensorflow.contrib.slim.python.slim.nets import inception_v3

.....................................................

# Build the network
with slim.arg_scope(inception_v3.inception_v3_arg_scope()):
    logits, end_points = inception_v3.inception_v3(imgs, num_classes=class_num, is_training=is_training_pl)

....................................................

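with tf.Session() as sess:
    # Initialize every variable first, so that variables which are not
    # restored from the checkpoint do not cause errors later
    init = tf.global_variables_initializer()
    sess.run(init)
    # Load the pretrained model
    print('Loading model check point from {:s}'.format(Pretrained_model_dir))

    # The exclusions are the layers that must NOT be restored. The Logits layers
    # in the checkpoint are sized for its own class count (1001 outputs per the
    # error log below), so restoring them fails when class_num is different.
    exclusions = ['InceptionV3/Logits',
                  'InceptionV3/AuxLogits']
    # List every variable to restore, i.e. everything outside the exclusions
    inception_except_logits = slim.get_variables_to_restore(exclude=exclusions)
    # Build a function that assigns the checkpoint values to the variables in that list
    init_fn = slim.assign_from_checkpoint_fn(Pretrained_model_dir, inception_except_logits,
                                             ignore_missing_vars=True)
    # Run the function
    init_fn(sess)
    print('Loaded.')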

The dotted lines (.....) mark omitted code that is unrelated to this article.

The errors you may run into are as follows:
Error 1

InvalidArgumentError (see above for traceback): Assign requires shapes of both tensors to match. lhs shape= [5] rhs shape= [1001]
     [[Node: save_1/Assign_8 = Assign[T=DT_FLOAT, _class=["loc:@InceptionV3/AuxLogits/Conv2d_2b_1x1/biases"], use_locking=true, validate_shape=true, _device="/job:localhost/replica:0/task:0/device:GPU:0"](InceptionV3/AuxLogits/Conv2d_2b_1x1/biases, save_1/RestoreV2_8/_2319)]]

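Cause:
The pretrained checkpoint was trained with 1001 output classes (rhs shape=[1001] in the error above), while class_num here is 5 (lhs shape=[5]), so restoring the complete model fails on the classifier layers.
Solution:
Do not restore the Logits and AuxLogits layers, whose shapes depend on the number of classes: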

exclusions = ['InceptionV3/Logits','InceptionV3/AuxLogits']
inception_except_logits = slim.get_variables_to_restore(exclude=exclusions)
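
To make the effect of the exclusion list concrete, here is a minimal pure-Python sketch of scope-based filtering on variable names. The helper variables_to_restore is hypothetical and only approximates what slim.get_variables_to_restore does (slim works on variable objects and its matching may differ in details); the names in graph_names are a small illustrative sample, not the full model.

```python
def variables_to_restore(all_names, exclusions):
    """Keep the names that do not live under any excluded scope.

    Hypothetical helper: an approximation of scope filtering, not slim's code.
    """
    def is_excluded(name):
        return any(name == scope or name.startswith(scope + '/')
                   for scope in exclusions)
    return [name for name in all_names if not is_excluded(name)]


# Illustrative sample of variable names in the graph
graph_names = [
    'InceptionV3/Conv2d_1a_3x3/weights',
    'InceptionV3/Mixed_6c/Branch_2/Conv2d_0b_7x1/weights',
    'InceptionV3/Logits/Conv2d_1c_1x1/weights',
    'InceptionV3/Logits/Conv2d_1c_1x1/biases',
    'InceptionV3/AuxLogits/Conv2d_2b_1x1/biases',
]
exclusions = ['InceptionV3/Logits', 'InceptionV3/AuxLogits']

kept = variables_to_restore(graph_names, exclusions)
for name in kept:
    print(name)
```

Only the two backbone variables remain in kept; the class-dependent layers are left with their freshly initialized values and are trained from scratch.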

Error 2
Tensor name "xxxx" not found in checkpoint files

NotFoundError (see above for traceback): Tensor name "InceptionV3/Mixed_6c/Branch_2/Conv2d_0b_7x1/biases" not found in checkpoint files E:\DeepLearning\TensorFlow\Inception\inception_v3_2016_08_28\inception_v3.ckpt
     [[Node: save_1/RestoreV2_180 = RestoreV2[dtypes=[DT_FLOAT], _device="/job:localhost/replica:0/task:0/device:CPU:0"](_arg_save_1/Const_0_0, save_1/RestoreV2_180/tensor_names, save_1/RestoreV2_180/shape_and_slices)]]
     [[Node: save_1/RestoreV2_277/_109 = _Recv[client_terminated=false, recv_device="/job:localhost/replica:0/task:0/device:GPU:0", send_device="/job:localhost/replica:0/task:0/device:CPU:0", send_device_incarnation=1, tensor_name="edge_854_save_1/RestoreV2_277", tensor_type=DT_FLOAT, _device="/job:localhost/replica:0/task:0/device:GPU:0"]()]]

The tensor name here can be that of any variable in inception_v3. The possible causes and their fixes are:
1. The graph was built without arg_scope, like this:

logits, end_points = inception_v3.inception_v3(imgs, num_classes=class_num, is_training=is_training_pl)

Solution:
Wrap the call in an arg_scope, using the inception_v3_arg_scope that ships with the library:

with slim.arg_scope(inception_v3.inception_v3_arg_scope()):
    logits, end_points = inception_v3.inception_v3(imgs, num_classes=class_num, is_training=is_training_pl)
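
Why does the arg_scope change variable names? A plausible reading, consistent with the error messages in this article, is that the arg_scope attaches batch normalization to each convolution, so a layer owns BatchNorm/* variables instead of biases. The sketch below is pure Python and only models the naming; conv_variable_names is a hypothetical helper, and the exact set of BatchNorm variables (beta, moving_mean, moving_variance) is an assumption about the default settings.

```python
def conv_variable_names(scope, use_batch_norm):
    """Return the variable names one conv layer is expected to create.

    Hypothetical model of the naming, not code taken from slim.
    """
    names = [scope + '/weights']
    if use_batch_norm:
        # Assumed defaults: a beta offset plus moving statistics, no biases
        names += [scope + '/BatchNorm/' + suffix
                  for suffix in ('beta', 'moving_mean', 'moving_variance')]
    else:
        names.append(scope + '/biases')
    return names


scope = 'InceptionV3/Mixed_6c/Branch_2/Conv2d_0b_7x1'
# Names stored in the checkpoint, under the assumption that it was trained with batch norm
checkpoint_names = set(conv_variable_names(scope, use_batch_norm=True))

without_scope = conv_variable_names(scope, use_batch_norm=False)
missing = [name for name in without_scope if name not in checkpoint_names]
print(missing)
```

Under these assumptions the only name absent from the checkpoint is the layer's biases variable, which matches the tensor name reported in the NotFoundError.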

2. Not all variables were initialized before reading the checkpoint, i.e. the following was never run:

init = tf.global_variables_initializer()
sess.run(init)

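As a result, variables that do not exist in the checkpoint are never initialized, for example the per-layer Momentum variables created when training with Momentum.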
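
The interaction between initialization and restoring can be modelled with plain sets. This is a toy simulation, not TensorFlow code: uninitialized_after_restore is a hypothetical helper, and the two variable names are illustrative.

```python
def uninitialized_after_restore(graph_names, checkpoint_names, run_init_first):
    """List graph variables that still have no value after the restore step."""
    graph = set(graph_names)
    # Running the global initializer gives every graph variable a value
    initialized = set(graph) if run_init_first else set()
    # Restoring only covers names that exist in the checkpoint
    restored = graph & set(checkpoint_names)
    return sorted(graph - initialized - restored)


graph_names = [
    'InceptionV3/Conv2d_1a_3x3/weights',
    'InceptionV3/Conv2d_1a_3x3/weights/Momentum',  # optimizer variable
]
checkpoint_names = ['InceptionV3/Conv2d_1a_3x3/weights']

print(uninitialized_after_restore(graph_names, checkpoint_names, run_init_first=False))
print(uninitialized_after_restore(graph_names, checkpoint_names, run_init_first=True))
```

Without the initializer the Momentum variable is left without a value; with it, every variable has one, and the restore then overwrites the ones found in the checkpoint.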

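3. slim.assign_from_checkpoint_fn() was called without ignore_missing_vars=True. The default is ignore_missing_vars=False, so with an optimizer other than plain SGD (Momentum, RMSProp, etc.) the restore reports that the optimizer's Momentum or RMSProp variables cannot be found in the checkpoint, for example:
With Momentum: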

NotFoundError (see above for traceback): Tensor name "InceptionV3/Mixed_6e/Branch_2/Conv2d_0c_1x7/BatchNorm/beta/Momentum" not found in checkpoint files E:\DeepLearning\TensorFlow\Inception\inception_v3_2016_08_28\inception_v3.ckpt
     [[Node: save_1/RestoreV2_397 = RestoreV2[dtypes=[DT_FLOAT], _device="/job:localhost/replica:0/task:0/device:CPU:0"](_arg_save_1/Const_0_0, save_1/RestoreV2_397/tensor_names, save_1/RestoreV2_397/shape_and_slices)]]
     [[Node: save_1/RestoreV2_122/_2185 = _Recv[client_terminated=false, recv_device="/job:localhost/replica:0/task:0/device:GPU:0", send_device="/job:localhost/replica:0/task:0/device:CPU:0", send_device_incarnation=1, tensor_name="edge_2096_save_1/RestoreV2_122", tensor_type=DT_FLOAT, _device="/job:localhost/replica:0/task:0/device:GPU:0"]()]]

With RMSProp:

NotFoundError (see above for traceback): Tensor name "InceptionV3/Mixed_6b/Branch_1/Conv2d_0b_1x7/BatchNorm/beta/RMSProp" not found in checkpoint files E:\DeepLearning\TensorFlow\Inception\inception_v3_2016_08_28\inception_v3.ckpt
     [[Node: save_1/RestoreV2_257 = RestoreV2[dtypes=[DT_FLOAT], _device="/job:localhost/replica:0/task:0/device:CPU:0"](_arg_save_1/Const_0_0, save_1/RestoreV2_257/tensor_names, save_1/RestoreV2_257/shape_and_slices)]]
     [[Node: save_1/Assign_463/_3950 = _Recv[client_terminated=false, recv_device="/job:localhost/replica:0/task:0/device:GPU:0", send_device="/job:localhost/replica:0/task:0/device:CPU:0", send_device_incarnation=1, tensor_name="edge_3478_save_1/Assign_463", tensor_type=DT_FLOAT, _device="/job:localhost/replica:0/task:0/device:GPU:0"]()]]

The fix is simple: set ignore_missing_vars=True:

init_fn = slim.assign_from_checkpoint_fn(Pretrained_model_dir, inception_except_logits,ignore_missing_vars=True)

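Note: only set this to True after the previous steps are all in place. Otherwise, if every variable name is wrong, all the variables in the checkpoint are silently skipped and no parameters are loaded at all.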
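
One way to guard against a silently empty restore is to compare the graph against the checkpoint before enabling ignore_missing_vars. The sketch below is pure Python and works on name-to-shape mappings. check_restore_plan is a hypothetical helper; in a real script the checkpoint mapping could presumably be built with something like tf.train.list_variables, if your TensorFlow version provides it. The shapes are illustrative, except [5] and [1001], which come from the error log in this article.

```python
def check_restore_plan(graph_vars, checkpoint_vars):
    """Split graph variables into matched, missing and shape-mismatched names.

    Both arguments map a variable name to its shape (hypothetical helper).
    """
    matched, missing, mismatched = [], [], []
    for name in sorted(graph_vars):
        if name not in checkpoint_vars:
            missing.append(name)        # would be skipped by ignore_missing_vars
        elif tuple(graph_vars[name]) != tuple(checkpoint_vars[name]):
            mismatched.append(name)     # must be excluded, as with the Logits layers
        else:
            matched.append(name)
    return matched, missing, mismatched


graph_vars = {
    'InceptionV3/Conv2d_1a_3x3/weights': (3, 3, 3, 32),
    'InceptionV3/Conv2d_1a_3x3/weights/Momentum': (3, 3, 3, 32),
    'InceptionV3/AuxLogits/Conv2d_2b_1x1/biases': (5,),
}
checkpoint_vars = {
    'InceptionV3/Conv2d_1a_3x3/weights': (3, 3, 3, 32),
    'InceptionV3/AuxLogits/Conv2d_2b_1x1/biases': (1001,),
}

matched, missing, mismatched = check_restore_plan(graph_vars, checkpoint_vars)
print('matched:', matched)
print('missing:', missing)
print('mismatched:', mismatched)
if not matched:
    raise ValueError('No variable matches the checkpoint; check the variable names')
```

If matched comes back empty, the variable names are wrong and ignore_missing_vars=True would hide the problem, so it is worth stopping there instead of training from random weights.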

Those are the problems I ran into; I hope this helps.
