
16. Saving and Restoring TensorFlow Model Parameters

The simplest way to save and restore a model is to use a tf.train.Saver() object. It adds save and restore ops to the graph for all of the variables, or for those specified in a list. The tf.train.Saver() object provides methods to run these ops, and lets you specify the paths for reading and writing the checkpoint files.

I. The tf.train.Saver() class

tf.train.Saver(
    var_list=None,
    reshape=False,
    sharded=False,
    max_to_keep=5,
    keep_checkpoint_every_n_hours=10000.0,
    name=None,
    restore_sequentially=False,
    saver_def=None,
    builder=None,
    defer_build=False,
    allow_empty=False,
    write_version=tf.train.SaverDef.V2,
    pad_step_number=False,
    save_relative_paths=False,
    filename=None
)

1. Constructor arguments

  • var_list
    • Specifies the variables that will be saved and restored. If None, defaults to the list of all saveable objects. It can be passed as a dict or a list:
    • A dict of names to variables: the keys are the names that will be used to save or restore the variables in the checkpoint files.
    • A list of variables: the variables will be keyed with their op name in the checkpoint files.
  • For example:
v1 = tf.Variable(..., name='v1')
v2 = tf.Variable(..., name='v2')

# 1. Pass them as a list; such a list can also be used to save or load
#    only a subset of the variables:
saver = tf.train.Saver([v1, v2])

# 2. Pass the variables as a dict:
saver = tf.train.Saver({'v1': v1, 'v2': v2})

# 3. Passing a list is equivalent to passing a dict with the variable
#    op names as keys:
saver = tf.train.Saver({v.op.name: v for v in [v1, v2]})

# 4. Rename variables when saving or loading:
v1 = tf.Variable(..., name='other_v1')
v2 = tf.Variable(..., name='other_v2')
saver = tf.train.Saver({'v1': v1, 'v2': v2})  # saved/restored under 'v1', 'v2'
print(v1.name)  # prints: other_v1:0
  • max_to_keep
    • Indicates the maximum number of recent checkpoint files to keep.
    • As new files are created, older files are deleted.
    • If None or 0, all checkpoint files are kept. Defaults to 5 (that is, the 5 most recent checkpoint files are kept).
    • Setting max_to_keep=1 keeps only the latest model. Alternatively, calling save() with global_step=None always writes to the same filename, which also keeps only the latest model (see the sketch below).
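
A minimal sketch of both options; the variable v and the path /tmp/demo are placeholders for illustration:

import os
import tensorflow as tf

v = tf.get_variable("v", shape=[1], initializer=tf.zeros_initializer())
saver = tf.train.Saver(max_to_keep=1)  # keep only the most recent checkpoint

os.makedirs('/tmp/demo', exist_ok=True)
with tf.Session() as sess:
    sess.run(tf.global_variables_initializer())
    saver.save(sess, '/tmp/demo/model', global_step=10)  # writes model-10.*
    saver.save(sess, '/tmp/demo/model', global_step=20)  # model-10.* is deleted
    saver.save(sess, '/tmp/demo/model')  # global_step=None: always overwrites model.*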

2. Common methods

# Returns a string, path at which the variables were saved.
save(
    sess,
    save_path,
    global_step=None,
    latest_filename=None,
    meta_graph_suffix='meta',
    write_meta_graph=True,
    write_state=True
)

# The variables to restore do not have to have been initialized, as restoring is itself a way to initialize variables.
restore(
    sess,
    save_path
)
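
Continuing the sketch above, restore() can run without any prior initialization; a minimal example (the save path comes from the earlier sketch and is illustrative):

import tensorflow as tf

v = tf.get_variable("v", shape=[1], initializer=tf.zeros_initializer())
saver = tf.train.Saver()

with tf.Session() as sess:
    # No sess.run(init_op) needed: restore() itself initializes the variables.
    saver.restore(sess, '/tmp/demo/model')
    print(sess.run(v))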

II. Saving and restoring parameters

1. Checkpoint files

  • Variables are saved in binary files that, essentially, contain a mapping from variable names to tensor values.
  • When you create a Saver object, you can optionally choose names for the variables in the checkpoint files. By default, the value of each variable's tf.Variable.name property is used. (It is the tensor values that are the model's parameters; the variable names are merely the keys they are stored under.)
  • With saver = tf.train.Saver(max_to_keep=3), the files written to the checkpoint directory are:
    • The checkpoint file, which stores a list of the paths of all model files in the directory.
    • The .data file, which stores our model (all the values of the weights, biases, and other saved variables).
    • The .index file, an index mapping the variable names into the .data file.
    • The .meta file, which stores the structure of the computation graph (all variables, operations, collections, etc.).

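For reference, with max_to_keep=3 and the step number appended to the name, the directory typically contains files like the following (the step values are illustrative):

checkpoint
my_model-900.data-00000-of-00001
my_model-900.index
my_model-900.meta
my_model-950.data-00000-of-00001
my_model-950.index
my_model-950.meta
my_model-1000.data-00000-of-00001
my_model-1000.index
my_model-1000.meta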

2. Saving & restoring variables

  • A boolean variable isTrain can be used to switch between the two phases: True for training, False for testing.
  • The tf.train.Saver() class supports renaming variables when restoring them (overriding the name argument the original variables were created with).
#!/usr/bin/env python3
# -*- coding: utf-8 -*-

import tensorflow as tf

# Create some variables.
w = tf.get_variable("weight", shape=[2], initializer=tf.zeros_initializer())
b = tf.get_variable("bias", shape=[3], initializer=tf.zeros_initializer())

inc_w = w.assign(w + 1)
dec_b = b.assign(b - 1)

# Add an op to initialize the variables.
init_op = tf.global_variables_initializer()

# Add ops to save and restore all the variables.
saver = tf.train.Saver(max_to_keep=3)

isTrain = False  # True for training, False for testing
train_steps = 1000
checkpoint_steps = 50
checkpoint_dir = 'checkpoint/save&restore/'
model_name = 'my_model'

# Later, launch the model, initialize the variables, do some work, and save the
# variables to disk.
with tf.Session() as sess:
    sess.run(init_op)
    if isTrain:
        # Do some work with the model.
        for step in range(train_steps):
            inc_w.op.run()
            dec_b.op.run()
            if (step + 1) % checkpoint_steps == 0:
                # Append the step number to the checkpoint name:
                saved_path = saver.save(
                    sess,
                    checkpoint_dir + model_name,
                    global_step=step + 1  # when None, only the latest model is kept
                )
    else:
        print('Before restore:')
        print(sess.run(w))
        print(sess.run(b))
        ckpt = tf.train.get_checkpoint_state(checkpoint_dir)
        # Get the latest model file
        if ckpt and ckpt.model_checkpoint_path:
            print("Successfully loaded %s." % ckpt.model_checkpoint_path)
            saver.restore(sess, ckpt.model_checkpoint_path)
        else:
            # No checkpoint found; the variables keep their initialized values.
            pass
        print('After restore:')
        print(sess.run(w))
        print(sess.run(b))

# Test output
Before restore:
[ 0.  0.]
[ 0.  0.  0.]
Successfully loaded checkpoint/save&restore/my_model-1000.
After restore:
[ 1000.  1000.]
[-1000. -1000. -1000.]


# Conclusion: restore effectively re-initializes all of the variables
# (it overwrites whatever values they currently hold)

# Analysis of the conclusion
Although the official docs say that you do not need to run init_op before a restore, sess.run(init_op) is deliberately placed above the `if isTrain:` statement here (so it runs in both the training and testing phases) in order to verify the conclusion that restore effectively re-initializes all of the variables; the test output above confirms this.
# In practice, sess.run(init_op) can be moved inside the `if isTrain:` branch (so that it only runs during training).

3. Reading trainable parameter values & extracting a layer's features

sess = tf.Session()

# Returns all variables created with trainable=True in a var_list
var_list = tf.trainable_variables()

print("Trainable variables:------------------------")

# Print the index, shape, and name of every trainable parameter
for idx, v in enumerate(var_list):
    print("param {:3}: {:15}   {}".format(idx, str(v.get_shape()), v.name))


# Sample output for one network
Trainable variables:------------------------
  param   0: (5, 5, 3, 32)     conv2d/kernel:0
  param   1: (32,)             conv2d/bias:0
  param   2: (5, 5, 32, 64)    conv2d_1/kernel:0
  param   3: (64,)             conv2d_1/bias:0
  param   4: (3, 3, 64, 128)   conv2d_2/kernel:0
  param   5: (128,)            conv2d_2/bias:0
  param   6: (3, 3, 128, 128)  conv2d_3/kernel:0
  param   7: (128,)            conv2d_3/bias:0
  param   8: (4608, 1024)      dense/kernel:0
  param   9: (1024,)           dense/bias:0
  param  10: (1024, 512)       dense_1/kernel:0   ---> parameters of the dense2 layer
  param  11: (512,)            dense_1/bias:0
  param  12: (512, 5)          dense_2/kernel:0
  param  13: (5,)              dense_2/bias:0


# Extract the parameters W and b of the last fully connected layer
W = sess.run(var_list[12])
b = sess.run(var_list[13])

# Extract the output of the second fully connected layer as features
# (dense2 is that layer's output tensor; x and img are the input placeholder
#  and input image defined elsewhere in the network)
feature = sess.run(dense2, feed_dict={x: img})
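
If the Python handle dense2 is not in scope (for example, after restoring a saved graph), the layer's output tensor can be looked up by name instead. A minimal sketch, assuming the layer was built with tf.layers.dense and kept its default ReLU op name:

graph = tf.get_default_graph()
# "dense_1/Relu:0" is an assumed tensor name; verify it against the actual
# graph with [op.name for op in graph.get_operations()]
dense2_out = graph.get_tensor_by_name("dense_1/Relu:0")
feature = sess.run(dense2_out, feed_dict={x: img})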

III. Continuing training & fine-tuning a layer

1. Continuing training (all parameters)

# Define a global object for reading flag values; reference them in the
# program as, e.g., FLAGS.checkpoint_dir
FLAGS = tf.app.flags.FLAGS


# Define the command-line flags: the arguments are the flag's name, its
# default value, and its description
tf.app.flags.DEFINE_string(
    "checkpoint_dir",
    "/path/to/checkpoint_save_dir/",
    "Directory name to save the checkpoints [checkpoint]"
)
tf.app.flags.DEFINE_boolean(
    "continue_train",
    False,
    "True for continue training. [False]"
)

saver = tf.train.Saver()

with tf.Session() as sess:
    sess.run(tf.global_variables_initializer())
    if FLAGS.continue_train:
        # Automatically get the latest model file
        model_file = tf.train.latest_checkpoint(FLAGS.checkpoint_dir)
        saver.restore(sess, model_file)
        print("Successfully loaded %s." % model_file)

2. Fine-tuning a layer

  • To change which of the network's weights and biases get updated, set the trainable argument to False on every variable that should stay fixed during training,
  • then continue training with the code above (a sketch follows this list).
    eg: my_non_trainable = tf.get_variable("my_non_trainable", shape=(3, 3), trainable=False)
  • Restore a meta checkpoint (TODO: to be summarized)
    • use the TF helper tf.train.import_meta_graph()