MNIST Handwritten Digit Recognition (Linear Regression and CNN, Implemented by Hand and with Frameworks) (To Be Continued)

阿新 • Published: 2018-06-26


0-Background

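MNIST is the Hello World project of deep learning, so it is worth working through at least once.

Code: Github. The blog and the code will keep being updated as the exercise progresses.

This is my first blog post, so the wording may be unclear in places. Suggestions are very welcome. Thanks!

0.1 Background knowledge: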

Linear regression
CNN

LeNet-5
AlexNet
ResNet
VGG
Various regularization methods

0.2 Catalog

1-Prepare
2-MNIST
3-LinearRegression

1-Prepare

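Numpy: an open-source numerical computing library
matplotlib: a 2D plotting library for Python

TensorFlow: an open-source machine learning system
Keras: a high-level neural network API that runs on top of the TensorFlow, Theano and CNTK backends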

2-MNIST

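MNIST is a subset of the larger NIST collection of handwritten digits; its training set was written by roughly 250 different people. It contains 60,000 training samples and 10,000 test samples.
Loading MNIST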

import numpy as np
import os
import struct
import matplotlib.pyplot as plt


class load:
    def __init__(self,
                 path='mnist'):
        # Directory that holds the four uncompressed MNIST files
        self.path = path

    def load_mnist(self):
        """Read train and test dataset and labels from path"""

        train_image_path = 'train-images.idx3-ubyte'
        train_label_path = 'train-labels.idx1-ubyte'

        test_image_path = 't10k-images.idx3-ubyte'
        test_label_path = 't10k-labels.idx1-ubyte'

        # Label files: 8-byte big-endian header (magic, count), then one byte per label
        with open(os.path.join(self.path, train_label_path), 'rb') as labelpath:
            magic, n = struct.unpack('>II', labelpath.read(8))
            labels = np.fromfile(labelpath, dtype=np.uint8)
            train_labels = labels.reshape(len(labels), 1)

        # Image files: 16-byte big-endian header (magic, count, rows, cols), then one byte per pixel
        with open(os.path.join(self.path, train_image_path), 'rb') as imgpath:
            magic, num, rows, cols = struct.unpack('>IIII', imgpath.read(16))
            images = np.fromfile(imgpath,
                                 dtype=np.uint8).reshape(len(train_labels), 784)
            train_images = images

        with open(os.path.join(self.path, test_label_path), 'rb') as labelpath:
            magic, n = struct.unpack('>II', labelpath.read(8))
            labels = np.fromfile(labelpath,
                                 dtype=np.uint8)
            test_labels = labels.reshape(len(labels), 1)

        with open(os.path.join(self.path, test_image_path), 'rb') as imgpath:
            magic, num, rows, cols = struct.unpack('>IIII', imgpath.read(16))
            images = np.fromfile(imgpath, dtype=np.uint8).reshape(len(test_labels), 784)
            test_images = images

        return train_images, train_labels, test_images, test_labels


if __name__ == '__main__':
    train_images, train_labels, test_images, test_labels = load().load_mnist()
    print('train_images shape:%s' % str(train_images.shape))
    print('train_labels shape:%s' % str(train_labels.shape))
    print('test_images shape:%s' % str(test_images.shape))
    print('test_labels shape:%s' % str(test_labels.shape))

    np.random.seed(1024)

    # Pick 4 random training samples and 2 random test samples
    trainImage = np.random.randint(60000, size=4)
    testImage = np.random.randint(10000, size=2)

    # Collect (image, label) pairs: training samples first, then test samples
    samples = [(train_images[i].reshape(28, 28), train_labels[i]) for i in trainImage]
    samples += [(test_images[i].reshape(28, 28), test_labels[i]) for i in testImage]

    plt.figure(num='mnist', figsize=(2, 3))

    # Draw the six digits on a 2 x 3 grid, each titled with its label
    for position, (img, label) in enumerate(samples, start=1):
        plt.subplot(2, 3, position)
        plt.title(label)
        plt.imshow(img)
    plt.show()

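Running the script produces the following output:

[Figure: 2 x 3 grid of sample MNIST digits, each titled with its label]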
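The loader above reads past the IDX headers with struct.unpack before handing the rest of the file to Numpy. As a minimal sketch of what those headers contain, the snippet below builds a tiny label file and image file in memory and parses them back. The helper names parse_idx_labels and parse_idx_images and the synthetic data are illustrative assumptions, not part of the original code; 2049 and 2051 are the magic numbers the MNIST label and image files start with.

```python
import io
import struct

import numpy as np


def parse_idx_labels(stream):
    """Parse an IDX1 label file: 8-byte big-endian header, then one byte per label."""
    magic, n = struct.unpack('>II', stream.read(8))
    if magic != 2049:
        raise ValueError('unexpected magic number %d for a label file' % magic)
    return np.frombuffer(stream.read(n), dtype=np.uint8).reshape(n, 1)


def parse_idx_images(stream):
    """Parse an IDX3 image file: 16-byte big-endian header, then rows * cols bytes per image."""
    magic, num, rows, cols = struct.unpack('>IIII', stream.read(16))
    if magic != 2051:
        raise ValueError('unexpected magic number %d for an image file' % magic)
    pixels = np.frombuffer(stream.read(num * rows * cols), dtype=np.uint8)
    return pixels.reshape(num, rows * cols)


# Synthetic stand-ins for the real files: 3 labels and 3 blank 28 x 28 images
fake_labels = io.BytesIO(struct.pack('>II', 2049, 3) + bytes([5, 0, 4]))
fake_images = io.BytesIO(struct.pack('>IIII', 2051, 3, 28, 28) + bytes(3 * 28 * 28))

labels = parse_idx_labels(fake_labels)
images = parse_idx_images(fake_images)
print(labels.ravel(), images.shape)
```

Checking the magic number is a cheap way to catch a swapped or still-compressed file before the reshape fails with a less helpful error.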

3-LinearRegression

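This section trains a recognizer on the MNIST dataset using the linear regression approach.
It uses a 2-layer network whose hidden layer has four neurons; Tanh and ReLU are each tried as the hidden-layer activation function.

Because MNIST is a multi-class classification problem, the output layer uses Softmax as its activation function and cross entropy as the loss function.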
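To make the Softmax and cross-entropy pairing concrete, here is a minimal sketch on toy numbers. It assumes one-hot labels and the column-per-sample layout (classes, m); the function names softmax and cross_entropy, the small eps constant and the toy scores are illustrative choices, not taken from the original code.

```python
import numpy as np


def softmax(Z):
    """Column-wise softmax; Z has shape (n_classes, m)."""
    shifted = Z - Z.max(axis=0, keepdims=True)  # subtract the column max for numerical stability
    exp = np.exp(shifted)
    return exp / exp.sum(axis=0, keepdims=True)


def cross_entropy(A, Y_one_hot):
    """Mean cross entropy over m samples; both arrays have shape (n_classes, m)."""
    m = Y_one_hot.shape[1]
    eps = 1e-12  # guards against log(0)
    return -np.sum(Y_one_hot * np.log(A + eps)) / m


# Toy scores for 2 samples and 3 classes (each column is one sample)
Z = np.array([[2.0, 0.0],
              [1.0, 0.0],
              [0.0, 0.0]])
# One-hot targets: sample 0 belongs to class 0, sample 1 to class 2
Y = np.array([[1.0, 0.0],
              [0.0, 0.0],
              [0.0, 1.0]])

A = softmax(Z)
loss = cross_entropy(A, Y)
print(A.sum(axis=0), loss)
```

Each column of A sums to 1, so it can be read as a probability distribution over the classes, and the loss only looks at the probability assigned to the true class of each sample.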

3.1 使用Numpy實現

3.1.1 Get the layer sizes from the training data and labels

Code:

def layer_size(X, Y):
    """
    Get number of input and output size, and set hidden layer size
    :param X: input dataset, shape (m, 784)
    :param Y: input labels, shape (m, 1)
    :return:
    n_x -- the size of the input layer
    n_h -- the size of the hidden layer
    n_y -- the size of the output layer
    """

    n_x = X.T.shape[0]
    n_h = 4
    n_y = Y.T.shape[0]

    return n_x, n_h, n_y
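With labels of shape (m, 1), as the loader returns them, layer_size yields n_y = 1. A 10-way output layer needs one column per digit, so the sketch below assumes the labels are one-hot encoded before being passed in. The one_hot helper and the stand-in arrays are hypothetical additions, not part of the original code.

```python
import numpy as np


def layer_size(X, Y):
    """Copy of the function above so the snippet runs on its own."""
    n_x = X.T.shape[0]
    n_h = 4
    n_y = Y.T.shape[0]
    return n_x, n_h, n_y


def one_hot(labels, num_classes=10):
    """Hypothetical helper: turn (m, 1) integer labels into an (m, num_classes) 0/1 matrix."""
    return np.eye(num_classes, dtype=np.float64)[labels.ravel()]


# Stand-in data with the same layout as the loader's output
X = np.zeros((5, 784), dtype=np.uint8)
Y = np.array([[3], [0], [9], [1], [3]], dtype=np.uint8)

sizes_raw = layer_size(X, Y)               # labels passed in as (m, 1)
sizes_one_hot = layer_size(X, one_hot(Y))  # labels passed in as (m, 10)
print(sizes_raw, sizes_one_hot)
```

Indexing the identity matrix with the label values picks one row per sample, which is a compact way to build the one-hot matrix without a loop.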

3.1.2 Initialize parameters

Initialize W1, b1, W2 and b2:

W is initialized to small random non-zero values

b is initialized to 0

Code:

def initialize_parameters(n_x, n_h, n_y):
    """
    Initialize parameters
    :param n_x: the size of the input layer
    :param n_h: the size of the hidden layer
    :param n_y: the size of the output layer
    :return: dictionary of parameters
    """

    W1 = np.random.randn(n_h, n_x) * 0.01
    b1 = np.zeros((n_h, 1))
    W2 = np.random.randn(n_y, n_h) * 0.01
    b2 = np.zeros((n_y, 1))

    parameters = {"W1": W1,
                  "b1": b1,
                  "W2": W2,
                  "b2": b2
                  }

    return parameters
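The reason W should start from random non-zero values is symmetry: if every row of W1 is the same, every hidden unit computes the same output and receives the same gradient, so the units never become different from one another. The sketch below illustrates the first half of that argument on made-up inputs; the constant 0.01 matrix and the random batch are assumptions chosen only for the demonstration.

```python
import numpy as np

rng = np.random.default_rng(0)
X = rng.random((784, 3))  # 3 made-up samples stored as columns

n_h, n_x = 4, 784

# Identical rows in W1: every hidden unit computes the same value
W1_constant = np.full((n_h, n_x), 0.01)
A1_constant = np.tanh(W1_constant @ X)

# Small random W1, in the spirit of initialize_parameters: hidden units differ
W1_random = rng.standard_normal((n_h, n_x)) * 0.01
A1_random = np.tanh(W1_random @ X)

# Compare every hidden unit (row) against the first one
print(np.allclose(A1_constant, A1_constant[0]))
print(np.allclose(A1_random, A1_random[0]))
```

The biases can safely start at zero because the random weights already make the hidden units distinct.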

3.1.3 Forward Propagation

ReLU is implemented as \((|Z|+Z)/2\):

def ReLu(Z):
    return (abs(Z) + Z) / 2
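The formula works because for Z >= 0 we have |Z| = Z, giving (Z + Z) / 2 = Z, while for Z < 0 we have |Z| = -Z, giving 0. As a quick check, the sketch below compares it with the more common np.maximum(Z, 0) form on a few hand-picked values; the test matrix is an arbitrary example.

```python
import numpy as np


def ReLu(Z):
    """Copy of the function above so the snippet runs on its own."""
    return (abs(Z) + Z) / 2


# Hand-picked values covering negative, zero and positive inputs
Z = np.array([[-2.0, -0.5, 0.0],
              [0.5, 3.0, -7.0]])

print(ReLu(Z))
print(np.array_equal(ReLu(Z), np.maximum(Z, 0)))
```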

def forward_propagation(X, parameters, activation="tanh"):
    """
    Compute the forward propagation
    :param X: input data (m, n_x)
    :param parameters: parameters from initialize_parameters
    :param activation: activation function name, has "tanh" and "relu"
    :return:
        A2: sigmoid output
        cache: caches of forward result
    """

    X = X.T

    W1 = parameters["W1"]
    b1 = parameters["b1"]
    W2 = parameters["W2"]
    b2 = parameters["b2"]

    Z1 = np.dot(W1, X) + b1
    if activation == "tanh":
        A1 = np.tanh(Z1)
    elif activation == "relu":
        A1 = ReLu(Z1)
    else:
        raise ValueError('Activation function is not found!')
    Z2 = np.dot(W2, A1) + b2
    A2 = 1 / (1 + np.exp(-Z2))

    cache = {"Z1": Z1,
             "A1": A1,
             "Z2": Z2,
             "A2": A2}

    return A2, cache
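To make the shapes in the forward pass concrete, the sketch below repeats the same four steps inline on a made-up batch and prints the shape of each intermediate result. The batch size m = 5 and the random inputs are assumptions; n_y = 1 is the value layer_size returns for labels of shape (m, 1).

```python
import numpy as np

np.random.seed(0)

m, n_x, n_h, n_y = 5, 784, 4, 1  # n_y = 1 matches labels of shape (m, 1)

X = np.random.rand(m, n_x)       # made-up input batch, one sample per row
W1 = np.random.randn(n_h, n_x) * 0.01
b1 = np.zeros((n_h, 1))
W2 = np.random.randn(n_y, n_h) * 0.01
b2 = np.zeros((n_y, 1))

Z1 = np.dot(W1, X.T) + b1        # (n_h, n_x) x (n_x, m) -> (n_h, m); b1 broadcasts over columns
A1 = np.tanh(Z1)                 # (n_h, m)
Z2 = np.dot(W2, A1) + b2         # (n_y, n_h) x (n_h, m) -> (n_y, m)
A2 = 1 / (1 + np.exp(-Z2))       # (n_y, m), every entry strictly between 0 and 1

for name, value in [("Z1", Z1), ("A1", A1), ("Z2", Z2), ("A2", A2)]:
    print(name, value.shape)
```

Because X is transposed at the start, every intermediate array keeps one column per sample, which is why the biases are stored as column vectors.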

3.1.4 Compute Cost
