Torch7深度學習教程（七）

阿新 • • 發佈：2019-01-18

這一章主要分享預測的基本操作，並且先將前面分享的內容總結下，完整地實現CNN影象分類的例項

require 'paths';
require 'nn';

---Load TrainSet
paths.filep("/home/ubuntu64/cifar10torchsmall.zip"); 

trainset = torch.load('cifar10-train.t7');
testset = torch.load('cifar10-train.t7');
classes = {'airplane', 'automobile', 'bird', 'cat',
           'deer' 
, 'dog', 'frog', 'horse', 'ship', 'truck'};

---Add size() function and Tensor index operator 
setmetatable(trainset, 
    {__index = function(t, i) 
                    return {t.data[i], t.label[i]} 
                end}
);
trainset.data = trainset.data:double() 

function trainset:size() 
    return 
 self.data:size(1) 
end

---Normalize data
mean = {}
stdv = {}
for i=1,3 do
    mean[i] = trainset.data[{ {}, {i}, {}, {}  }]:mean()
    print('Channel ' .. i .. ', Mean: ' .. mean[i])
    trainset.data[{ {}, {i}, {}, {}  }]:add(-mean[i])

    stdv[i] = trainset.data[{ {}, {i}, {}, {}  }]:std()
    print 
('Channel ' .. i .. ', Standard Deviation:' .. stdv[i])
    trainset.data[{ {}, {i}, {}, {}  }]:div(stdv[i])
end

資料的預處理

net = nn.Sequential()

--change 1 channel to 3 channels
--net:add(nn.SpatialConvolution(1, 6, 5, 5))
net:add(nn.SpatialConvolution(3, 6, 5, 5)) 

net:add(nn.ReLU())                       
net:add(nn.SpatialMaxPooling(2,2,2,2))     
net:add(nn.SpatialConvolution(6, 16, 5, 5))
net:add(nn.ReLU())                       
net:add(nn.SpatialMaxPooling(2,2,2,2))
net:add(nn.View(16*5*5))                    
net:add(nn.Linear(16*5*5, 120))         
net:add(nn.ReLU())                       
net:add(nn.Linear(120, 84))
net:add(nn.ReLU())                       
net:add(nn.Linear(84, 10))                  
net:add(nn.LogSoftMax())

與之前建立好的網路有一點不同是將原來的1通道變為3通道，輸入的資料集是3通道的彩色影象

criterion = nn.ClassNLLCriterion();

trainer = nn.StochasticGradient(net, criterion)
trainer.learningRate = 0.001
trainer.maxIteration = 5

trainer:train(trainset)

訓練的過程與前面一樣，再重複下，加深印象。
定義損失函式，選擇優化的方法，將網路和損失函式傳入，設定學習率和最大迭代次數，開始訓練，結果如下圖。
這裡寫圖片描述
現在我們的CNN網路已經訓練完了,迭代次數可以多一些，效果不一樣，讓我測試下看看效果如何。

這裡寫圖片描述
將測試的影象資料用先前的平均值和方差進行同訓練集一樣的中心化和歸一化。

選取第100張圖片輸入，前兩行分別為列印其對應的標籤和顯示圖片，第三行，執行網路的前向傳播演算法，返回值賦值給predicted。直接列印predicted給出的不是概率而是對數概率。在這十個概率中值最大的為識別的結果，這個是倒數第二個值最大，對應label標籤為9，即這張圖片被識別成了ship
這裡寫圖片描述
這張最大的是第5個值，即label對應的5，為deer識別正確。

前面的輸出的格式還是有些不好，利用torch的sort排序方法將結果排序，返回值中confidences是屬於各類的可信度從大到小的排序結果，indices是可信度對應的類的標籤，看下列印結果圖，一目瞭然。
這裡寫圖片描述

再進一步完善格式，只選取張量的第一個元素列印，並且利用先前定義的classes陣列輸出名字。

最後是驗證總的識別率，for迴圈，提取測試集中每張照片的標籤，每張測試集使用網路net的前向傳播forward獲得預測結果，將預測結果排序,true表示按遞減排序，對比預測結果同真是標籤，正確計數器correct加1，將correct除以測試總數10000乘以100得出百分比，這是我迭代10次的正確率為62.23%。
這裡寫圖片描述
這是統計每類對應的正確率，宣告一個數組計數器，用來記錄每一類正確識別的個數，改動if語句裡面的賦值語句即可完成計數，其他同上。

for迴圈1到classes的陣列長度，即10。class_performance[i]裡儲存的是第i個類的真確識別個數，由於每類有1000張，除以1000，列印結果。

預測的原始碼

--normalize test data
testset.data = testset.data:double();

for i = 1,3 do
    testset.data[{ {}, {i}, {}, {} }]:add(-mean[i])
    testset.data[{ {}, {i}, {}, {} }]:div(stdv[i])
end

--predict test and print confidences
print(classes[testset.label[400]])
itorch.image(testset.data[400])
predicted = net:forward(testset.data[400])
print(predicted:exp())

--sort confidence and print predicted result
confidences, indices = torch.sort(predicted, true)
print(confidences[1])
print(indices[1])
print(classes[indices[1]])

--correct rate in total
correct = 0
for i=1,10000 do
    local groundtruth = testset.label[i]
    local prediction = net:forward(testset.data[i])
    local confidences, indices = torch.sort(prediction, true)

    if groundtruth == indices[1] then
        correct = correct + 1
    end
end

print(correct, 100*correct/10000 .. '%')


--correct rate every class
class_performance = {0,0,0,0,0,0,0,0,0,0}
for i = 1,10000 do
    local groundtruth = testset.label[i]
    local prediction = net:forward(testset.data[i])
    local confidences, indices = torch.sort(prediction, true)

    if groundtruth == indices[1] then
        class_performance[groundtruth] = class_performance[groundtruth] + 1
    end 
end


for i = 1, #classes do 
    print(classes[i], 100*class_performance[i]/1000 .. '%')
end

如果想利用GPU訓練該例項用下面的程式碼替代上文中的訓練部分即可

require 'cunn'

net = net:cuda()
criterion = criterion:cuda()
trainset.data = trainset.data:cuda()
trainset.label = trainset.label:cuda()

修改這些就可以使用GPU程式設計。

Torch7深度學習教程（七）

Torch7深度學習教程（七）

Torch7深度學習教程（五）

Torch7深度學習教程（二）

吳恩達深度學習筆記（七） —— Batch Normalization

深度學習方法（七）：最新SqueezeNet 模型詳解，CNN模型引數降低50倍，壓縮461倍！

深度學習基礎（七）—— Activation Function

深度學習系列（七）：自編碼網路與PCA特徵學習的分類對比實驗

TensorFlow：實戰Google深度學習框架（七）迴圈神經網路

基於PyTorch的深度學習入門教程（七）——PyTorch重點綜合實踐

python基礎教程（第三版)學習筆記（七）

GPU（CUDA）學習日記（七）------ Parallel Nsight 雙機除錯經驗以及一些比較基礎的教程

設計模式學習總結（七）適配器模式(Adapter)

mysql學習筆記（七）—— MySQL內連接和外連接

Spring 學習筆記（七）—— 切入點表達式

Nhibernate學習教程（2）-- 第一個NHibernate程序

EF學習筆記（七）：讀取關聯數據

python基礎教程（七）

JavaScript學習日誌（七）：表單腳本

深度學習筆記（九）感受野計算

Java語言基礎學習筆記（七）

Torch7深度學習教程（七）

相關推薦