人工智慧（4）- 實現多層神經網路

阿新 • • 發佈：2018-12-06

1.單層神經網路

2.多層神經網路

3.MLP的3個步驟

MLP learning procedure in three simple steps:

Starting at the input layer, we forward propagate the patterns of the training data through the network to generate an output.
Based on the network's output, we calculate the error that we want to minimize using a cost function that we will describe later.

We backpropagate the error, find its derivative with respect to each weight inthe network, and update the model.

前向演算法

隱藏層中的每個單元連結所有輸入層，計算隱藏層的啟用單元

輸出也是同樣的方法

4.Obtaining the MNIST dataset

獲取60000個訓練集和10000個測試集，將原始的資料轉換成784（28*28）畫素的資料。

# -*- coding: utf-8 -*-
"""
Created on Sat Nov 10 14:30:38 2018

@author:YRP
"""
import os 
import struct
import numpy as np

#Load_mnist返回兩個值樣品和特徵
def load_mnist(path, kind='train'):
    """Load MNIST data from `path`""" 
    labels_path = os.path.join(path,
                               '%s-labels.idx1-ubyte' % kind) 
    images_path = os.path.join(path,
                               '%s-images.idx3-ubyte' % kind)
    
    with open(labels_path, 'rb') as lbpath:
        magic, n = struct.unpack('>II',
                                 lbpath.read(8)) 
        labels = np.fromfile(lbpath,
                             dtype=np.uint8)
    
    with open(images_path, 'rb') as imgpath:
        magic, num, rows, cols = struct.unpack(">IIII",
                                               imgpath.read(16))
        images = np.fromfile(imgpath,
                             dtype=np.uint8).reshape( 
                             len(labels), 784)
        images = ((images / 255.) - .5) * 2
    
    return images, labels
#讀取60000個訓練集和10000個測試集
X_train, y_train = load_mnist('', kind='train')
print('Rows: %d, columns: %d'
      % (X_train.shape[0], X_train.shape[1])) 
X_test, y_test = load_mnist('', kind='t10k')
print('Rows: %d, columns: %d'
      % (X_test.shape[0], X_test.shape[1]))
#顯示影象中的1到9
import matplotlib.pyplot as plt
fig, ax = plt.subplots(nrows=2, ncols=5,
                       sharex=True, sharey=True)
ax = ax.flatten()
for i in range(10):
    img = X_train[y_train == i][0].reshape(28, 28)
    ax[i].imshow(img, cmap='Greys')
ax[0].set_xticks([])
ax[0].set_yticks([])
plt.tight_layout()
plt.show()
#儲存訓練和測試集到檔案中
np.savez_compressed('mnist_scaled.npz',
	X_train=X_train,
	y_train=y_train,
	X_test=X_test,
	y_test=y_test)

#將檔案讀取
mnist = np.load('mnist_scaled.npz')

影象顯示結果

5.區分手寫資料 Classifying handwritten digits

實現MLP包括一層輸入、一層隱藏、一層輸出，來對MNIST的資料集進行識別

對55000個數據進行訓練，留下5000個數據進行驗證

在NeuralNetMLP中設定引數

- - l2: This is the l parameter for L2 regularization to decrease the degree of overfitting.
  - epochs: This is the number of passes over the training set.
  - eta: This is the learning rate h .
  - shuffle: This is for shuffling the training set prior to every epoch to prevent that the algorithm gets stuck in circles.
  - seed: This is a random seed for shuffling and weight initialization.
  - minibatch_size: This is the number of training samples in each mini-batch when splitting of the training data in each epoch for stochastic gradient descent. The gradient is computed for each mini-batch separately instead of the entire training data for faster learning.

通過得到200個epochs的cost，繪製出如下圖表

得到200Epochs的驗證和訓練精度

最後通過分析驗證集和訓練集的精度評估模型的泛化能力

Test accuracy: 97.54%

觀察一個5*5的子圖矩陣，其中副標題中的第一個數字表示圖索引，第二個數字表示真正的類標籤(t)，第三個數字表示預測的類標籤(p):

import os
import mlp
import numpy as np
import matplotlib.pyplot as plt

mnist = np.load('./mnist/mnist_scaled.npz')
X_train, y_train, X_test, y_test = [mnist[f] for f in mnist.files]

n_epochs = 200


if 'TRAVIS' in os.environ:
    n_epochs = 20

nn = mlp.NeuralNetMLP(n_hidden=100, 
                  l2=0.01, 
                  epochs=n_epochs, 
                  eta=0.0005,
                  minibatch_size=100, 
                  shuffle=True,
                  seed=1)

nn.fit(X_train=X_train[:55000], 
       y_train=y_train[:55000],
       X_valid=X_train[55000:],
       y_valid=y_train[55000:])

plt.plot(range(nn.epochs), nn.eval_['cost'])
plt.ylabel('Cost')
plt.xlabel('Epochs')
plt.savefig('images/costEpochs.png', dpi=300)
plt.show()

plt.plot(range(nn.epochs), nn.eval_['train_acc'], label='training')
plt.plot(range(nn.epochs), nn.eval_['valid_acc'], label='validation', linestyle='--')
plt.ylabel('Accuracy')
plt.xlabel('Epochs')
plt.legend()
plt.savefig('images/accuracyEpochs.png', dpi=300)
plt.show()

y_test_pred = nn.predict(X_test)
acc = (np.sum(y_test == y_test_pred)
       .astype(np.float) / X_test.shape[0])

print('Test accuracy: %.2f%%' % (acc * 100))

miscl_img = X_test[y_test != y_test_pred][:25]
correct_lab = y_test[y_test != y_test_pred][:25]
miscl_lab = y_test_pred[y_test != y_test_pred][:25]

fig, ax = plt.subplots(nrows=5, ncols=5, sharex=True, sharey=True,)
ax = ax.flatten()
for i in range(25):
    img = miscl_img[i].reshape(28, 28)
    ax[i].imshow(img, cmap='Greys', interpolation='nearest')
    ax[i].set_title('%d) t: %d p: %d' % (i+1, correct_lab[i], miscl_lab[i]))

ax[0].set_xticks([])
ax[0].set_yticks([])
plt.tight_layout()
plt.savefig('images/misclassifying.png', dpi=300)
plt.show()

參考資料：《Python Machine Learning（2th）》

人工智慧（4）- 實現多層神經網路

1.單層神經網路 2.多層神經網路 3.MLP的3個步驟 MLP learning procedure in three simple steps: Starting at the input layer, we forward propagate the patt

TensorFlow學習筆記（4）--實現多層感知機（MNIST資料集）

前面使用TensorFlow實現一個完整的Softmax Regression，並在MNIST資料及上取得了約92%的正確率。現在建含一個隱層的神經網路模型（多層感知機）。 import tensorflow as tf import numpy as np

DeepLearning tutorial（4）CNN卷積神經網路原理簡介+程式碼詳解

分享一下我老師大神的人工智慧教程！零基礎，通俗易懂！http://blog.csdn.net/jiangjunshow 也歡迎大家轉載本篇文章。分享知識，造福人民，實現我們中華民族偉大復興！

TensorFlow學習筆記（5）--實現卷積神經網路（MNIST資料集）

這裡使用TensorFlow實現一個簡單的卷積神經網路，使用的是MNIST資料集。網路結構為：資料輸入層–卷積層1–池化層1–卷積層2–池化層2–全連線層1–全連線層2（輸出層），這是一個簡單但非常有代表性的卷積神經網路。 import tensorflow

[手把手系列之二]實現多層神經網路

完整程式碼：>>點我歡迎star,fork,一起學習網路用途或者說應用場景：使用單層神經網路來識別一張圖片是否是貓咪的圖片。數學表示給定一張圖片XX 送到網路中，判斷這張圖片是否是貓咪的照片？網路架構多層

跟著吳恩達學深度學習：用Scala實現神經網路-第二課：用Scala實現多層神經網路

上一章我們講了如何使用Scala實現LogisticRegression，這一張跟隨著吳恩達的腳步我們用Scala實現基礎的深度神經網路。順便再提一下，吳恩達對於深度神經網路的解釋是我如今聽過的最清楚的課，感嘆一句果然越是大牛知識解釋得越清晰明瞭。本文分為以下四個部分。

TensorFlow學習筆記（7）--實現卷積神經網路（同(5),不同的程式風格）

import tensorflow as tf import numpy as np import input_data mnist = input_data.read_data_sets('data/', one_hot=True) print("MNIST

Python20行程式碼實現多層神經網路的學習

轉載自：python小練習（062）：python20行程式碼實現多層神經網路的機器學習（一）http://bbs.fishc.com/thread-81849-1-1.html(出處: 魚C論壇)今天在魚C論壇看到一個很好的入門機器學習的小例子，分享給大家。現在神經網路、

TensorFlow 訓練 MNIST （2）—— 多層神經網路

　　在我的上一篇隨筆中，採用了單層神經網路來對MNIST進行訓練，在測試集中只有約90%的正確率。這次換一種神經網路（多層神經網路）來進行訓練和測試。 1、獲取MNIST資料　　MNIST資料集只要一行程式碼就可以獲取的到，非常方便。關於MNIST的基本資訊可以參考我的上一篇隨筆。 mnist = i

深度學習基礎（二）—— 從多層感知機（MLP）到卷積神經網路（CNN）

經典的多層感知機（Multi-Layer Perceptron）形式上是全連線（fully-connected）的鄰接網路（adjacent network）。 That is, every neuron in the network is connec

深度學習實踐（二）——多層神經網路

#一、準備為了更深入的理解神經網路，筆者基本採用純C++的手寫方式實現，其中矩陣方面的運算則呼叫opencv，資料集則來自公開資料集a1a。實驗環境： Visual studio 2017 opencv3.2.0 a1a資料集本文緊跟上篇文章深度

TensorFlow實戰4：實現簡單的多層神經網路案例

這篇文章記錄一下使用TensorFlow實現卷積神經網路的過程，資料集採用的還是MNIST資料集，使用了兩層的卷積來進行計算，整個過程在jupyter notebook中完成，具體步驟和程式碼展示如下： 1.環境設定 import numpy as np

理解神經網路，從簡單的例子開始（2）使用python建立多層神經網路

這篇文章將講解如何使用python建立多層神經網路。在閱讀這篇文章之前，建議先閱讀上一篇文章:理解神經網路，從簡單的例子開始。講解的是單層的神經網路。如果你已經閱讀了上一篇文章，你會發現這篇文章的程式碼和上一篇基本相同，理解起來也相對容易。上一篇文章使用了9

企業實戰（4）-實現基於Haproxy負載均衡集群的電子商務網站架構

haproxy keepalived 企業實戰：逐步實現企業各種情景下的需求企業情景四：隨著公司業務的發展，公司負載均衡服務已經實現四層負載均衡，但業務的復雜程度提升，公司要求把mobile手機站點作為單獨的服務提供，不在和pc站點一起提供服務，此時需要做7層規則負載均衡，運維總監要求，能否用一種服務

Cocos2d-x學習筆記（六）例項——多層佈景

【關於多層佈景】在遊戲開發中，一般會把遊戲分為兩部分：一部分是遊戲介面部分，也就是常說得UI部分；另一部分就是遊戲本身部分。有時UI有很多頁面，在頁面中用的圖也不是很多，不需要進行場景切換，只需把不同頁面做成不同的佈景，然後切換佈景層。那麼就需要一個“管理者”來管理這些介面，這時

python3.5進階（三）-------------實現多工之協程（生成器，迭代器）

1.迭代器：迭代是訪問集合元素的一種方式，迭代器是可以記住遍歷的位置的物件，迭代器物件從集合的第一個元素開始訪問，直到所有訪問結束，迭代器只能前進不能後退。判斷一個數據型別是否可以迭代，看是否能for迴圈。如（字串，列表，元祖...）序列可以迭代，數字不能迭代，或通過isintance([11,12

python3.5進階（三）-------------實現多工之程序

1. 程式：硬碟上的exe，是靜態的（一段程式碼程式碼）。通俗的說，程式在硬碟上執行起來（如雙擊qq.exe）就是程序，一般一個程式，可以有多個程序，如一個QQ程式，可以同時開啟登入多個QQ號程序。 2. 程序與執行緒的區別：都能實現多工。程式執行時，先將靜態程式碼

DeepLearning tutorial（3）MLP多層感知機原理簡介+程式碼詳解

VAE（4）——實現

終於到了實現的地方。前面乾燥乏味的公式推導和理論闡述已經讓很多人昏昏欲睡了，下面我們要提起精神，來看看這個模型的一個比較不錯的實現——GitHub - cdoersch/vae_tutorial: Caffe code to accompany my Tutorial on Var

Tensorflow實戰（五）經典卷積神經網路之實現VGGNet

演算法原理： VGGNet探索了卷積神經網路深度與其效能之間的關係，通過反覆的堆疊3*3的小型卷積核和2*2的最大池化層，VGGNet成功的構建了16-19層深的卷積神經網路。。 VGGNet擁有5段卷積，每一段內有2-3個卷積層，同時尾部會連線一

人工智慧（4）- 實現多層神經網路

相關推薦