
【Reading 1】【2017】MATLAB and Deep Learning: The ReLU Function (2)

This part of the code starts from the delta of the output node, calculates the error of the hidden nodes, and uses it for the next error calculation.

It repeats the same steps through delta3, delta2, and delta1.

e = d - y;              % output error
delta = e;              % delta of the output node

e3 = W4'*delta;         % error propagated back to the third hidden layer
delta3 = (v3 > 0).*e3;  % multiply by the ReLU derivative

e2 = W3'*delta3;        % second hidden layer
delta2 = (v2 > 0).*e2;

e1 = W2'*delta2;        % first hidden layer
delta1 = (v1 > 0).*e1;
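
For reference, the following is a minimal sketch (an assumption, not the book's DeepReLU.m listing) of how these deltas typically drive the weight updates; the learning rate alpha and the layer outputs x, y1, y2, and y3 from the forward pass are assumed.

alpha = 0.01;            % learning rate (assumed value)

dW4 = alpha*delta*y3';   % gradient-style update driven by the output delta
W4 = W4 + dW4;

dW3 = alpha*delta3*y2';
W3 = W3 + dW3;

dW2 = alpha*delta2*y1';
W2 = W2 + dW2;

dW1 = alpha*delta1*x';
W1 = W1 + dW1;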

Something noticeable in this code is the derivative of the ReLU function.

For example, in the calculation of delta3, the delta of the third hidden layer, the derivative of the ReLU function is coded as follows:

(v3 > 0)

Let's see how this line becomes the derivative of the ReLU function.

MATLAB returns one if the expression in the brackets is true and zero if it is false.

Therefore, this line yields 1 if v3 > 0 and 0 otherwise.

This produces the same result as the definition of the derivative of the ReLU function shown here:

φ'(x) = 1 if x > 0, and 0 otherwise
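
As a quick illustrative check (made-up values, not part of the book's listing), the comparison can be evaluated on a small vector to see the 1/0 mask it produces and how it gates the back-propagated error:

v3 = [-2; 0.5; 3; -0.1];   % example pre-activation values (made up)
e3 = [4; 4; 4; 4];         % example back-propagated errors (made up)
mask = (v3 > 0)            % element-wise logical result: [0; 1; 1; 0]
delta3 = mask.*e3          % errors pass only where the ReLU was active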

The following listing shows the TestDeepReLU.m file, which tests the DeepReLU function.

This program calls the DeepReLU function and trains the network 10,000 times.

It then feeds the training data into the trained network and displays the output.

We verify the adequacy of the training by comparing this output with the correct output.

clear all

X = zeros(5, 5, 5);            % five 5x5 training images of the digits 1-5

X(:, :, 1) = [ 0 1 1 0 0;
               0 0 1 0 0;
               0 0 1 0 0;
               0 0 1 0 0;
               0 1 1 1 0
             ];

X(:, :, 2) = [ 1 1 1 1 0;
               0 0 0 0 1;
               0 1 1 1 0;
               1 0 0 0 0;
               1 1 1 1 1
             ];

X(:, :, 3) = [ 1 1 1 1 0;
               0 0 0 0 1;
               0 1 1 1 0;
               0 0 0 0 1;
               1 1 1 1 0
             ];

X(:, :, 4) = [ 0 0 0 1 0;
               0 0 1 1 0;
               0 1 0 1 0;
               1 1 1 1 1;
               0 0 0 1 0
             ];

X(:, :, 5) = [ 1 1 1 1 1;
               1 0 0 0 0;
               1 1 1 1 0;
               0 0 0 0 1;
               1 1 1 1 0
             ];

D = [ 1 0 0 0 0;               % correct outputs (one-hot labels)
      0 1 0 0 0;
      0 0 1 0 0;
      0 0 0 1 0;
      0 0 0 0 1
    ];

W1 = 2*rand(20, 25) - 1;       % weights drawn uniformly from [-1, 1]
W2 = 2*rand(20, 20) - 1;
W3 = 2*rand(20, 20) - 1;
W4 = 2*rand( 5, 20) - 1;

for epoch = 1:10000            % train
  [W1, W2, W3, W4] = DeepReLU(W1, W2, W3, W4, X, D);
end

N = 5;                         % inference
for k = 1:N
  x  = reshape(X(:, :, k), 25, 1);
  v1 = W1*x;
  y1 = ReLU(v1);
  v2 = W2*y1;
  y2 = ReLU(v2);
  v3 = W3*y2;
  y3 = ReLU(v3);
  v  = W4*y3;
  y  = Softmax(v)
end

As this code is almost identical to the previous test programs, a detailed explanation is omitted.
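
For reference, the script assumes ReLU.m and Softmax.m helper functions on the MATLAB path. A minimal sketch consistent with the standard definitions is shown below (each function saved in its own file; the book's exact listings may differ, and the max-subtraction in Softmax is an added safeguard rather than part of the minimal definition):

function y = ReLU(x)
  % Element-wise rectified linear unit
  y = max(0, x);
end

function y = Softmax(x)
  % Softmax over the output vector; subtracting max(x) guards
  % against overflow for large inputs (added assumption)
  ex = exp(x - max(x));
  y = ex / sum(ex);
end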

This code occasionally fails to train properly and yields wrong outputs, something that never happened with the sigmoid activation function.

The sensitivity of the ReLU function to the initial weight values appears to cause this anomaly.
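
One way to observe this (an illustrative check, not from the book) is to count how many of the five training images the trained network classifies correctly and flag a failed run:

correct = 0;
for k = 1:5
  x = reshape(X(:, :, k), 25, 1);
  y = Softmax(W4*ReLU(W3*ReLU(W2*ReLU(W1*x))));
  [~, predicted] = max(y);           % index of the largest output
  correct = correct + (predicted == k);
end

if correct < 5
  disp('Training failed; re-initialize the weights and train again.')
end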

This article is translated from "MATLAB Deep Learning" by Phil Kim.
