
Recognizing MNIST Handwritten Digits: CNN Edition

Here's an example of training a CNN with nolearn and lasagne. The dataset is, of course, MNIST. I haven't looked into keras yet, but training a CNN with nolearn really is super, super convenient.

A quick rundown of the CNN's structure. The input layer is a 1*28*28 image matrix. Convolving it with 32 filters of size 5*5*1 gives a 32*24*24 hidden layer, and a (2, 2) max-pool on that yields a 32*12*12 hidden layer. Convolving again with 17 filters of size 32*5*5 gives a 17*8*8 hidden layer, and another (2, 2) max-pool yields 17*4*4. This hidden layer is then projected onto a hidden layer of 56 neurons (at this point it works just like an ordinary neural network, hence the name "dense layer"), and finally the output layer has 10 units, with softmax making the call. This is a bit different from Andrew Ng's assignment on Coursera: there, classification is done with 10 binary logistic regression classifiers, so the cost contributed by a single training example is a sum of 10 parts, whereas with softmax the cost of a single training example is a single quantity.
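The shape bookkeeping is easy to verify: an unpadded ("valid") 5*5 convolution shrinks each spatial dimension by 4 (output = input - filter + 1), and each (2, 2) max-pool halves it. A minimal sketch of the arithmetic in plain Python:

size = 28
size = size - 5 + 1   # conv2d1: 28 -> 24, giving 32*24*24
size = size // 2      # maxpool1: 24 -> 12, giving 32*12*12
size = size - 5 + 1   # conv2d2: 12 -> 8,  giving 17*8*8
size = size // 2      # maxpool2: 8 -> 4,  giving 17*4*4
print(size)           # 4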

# coding=utf-8
# Adapted from someone else's code
# Copyright unknown; piracy will not be pursued
# typhoonbxq
# the University of Hong Kong

from urllib import urlretrieve
import cPickle as pickle
import os
import gzip
import numpy as np
# import theano
import lasagne
import csv

from lasagne import layers
from lasagne.updates import nesterov_momentum
from nolearn.lasagne import NeuralNet

def load_dataset():
    url = 'http://deeplearning.net/data/mnist/mnist.pkl.gz'
    filename = 'mnist.pkl.gz'
    if not os.path.exists(filename):
        print("Downloading MNIST dataset...")
        urlretrieve(url, filename)
    with gzip.open(filename, 'rb') as f:
        data = pickle.load(f)
    X_train, y_train = data[0]
    X_val, y_val = data[1]
    X_test, y_test = data[2]
    X_train = X_train.reshape((-1, 1, 28, 28))
    X_val = X_val.reshape((-1, 1, 28, 28))
    X_test = X_test.reshape((-1, 1, 28, 28))
    y_train = y_train.astype(np.uint8)
    y_val = y_val.astype(np.uint8)
    y_test = y_test.astype(np.uint8)
    return X_train, y_train, X_val, y_val, X_test, y_test
X_train, y_train, X_val, y_val, X_test, y_test = load_dataset()
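
# Optional sanity check (not in the original script): the deeplearning.net
# pickle splits MNIST into 50000 training / 10000 validation / 10000 test
# examples, so these should print (50000, 1, 28, 28) and (50000,).
print(X_train.shape)
print(y_train.shape)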

# Set the parameters for the CNN
net1 = NeuralNet(
    layers=[('input', layers.InputLayer),
            ('conv2d1', layers.Conv2DLayer),
            ('maxpool1', layers.MaxPool2DLayer),
            ('conv2d2', layers.Conv2DLayer),
            ('maxpool2', layers.MaxPool2DLayer),
           # ('dropout1', layers.DropoutLayer),
            ('dense', layers.DenseLayer),
            #('dropout2', layers.DropoutLayer),
            ('output', layers.DenseLayer),
            ],
    # input layer
    input_shape=(None, 1, 28, 28),
    # layer conv2d1
    conv2d1_num_filters=32,
    conv2d1_filter_size=(5, 5),
    conv2d1_nonlinearity=lasagne.nonlinearities.rectify,
    conv2d1_W=lasagne.init.GlorotUniform(),
    # layer maxpool1
    maxpool1_pool_size=(2, 2),
    # layer conv2d2
    conv2d2_num_filters=17,
    conv2d2_filter_size=(5, 5),
    conv2d2_nonlinearity=lasagne.nonlinearities.rectify,
    # layer maxpool2
    maxpool2_pool_size=(2, 2),
    # dropout1
    #dropout1_p=0.5,
    # dense
    dense_num_units=56,
    dense_nonlinearity=lasagne.nonlinearities.rectify,
    # dropout2
    #dropout2_p=0.5,
    # output
    output_nonlinearity=lasagne.nonlinearities.softmax,
    output_num_units=10,
    # optimization method params
    update=nesterov_momentum,
    update_learning_rate=0.01,
    update_momentum=0.9,

# max_epochs is a very important parameter: increasing it generally
# improves prediction accuracy. It is the number of full passes over the
# training set, so it also caps how long training runs.

    max_epochs=5,
    verbose=1,
    )
# Train the network
nn = net1.fit(X_train, y_train)

preds = net1.predict(X_test)

l = len(preds)
count = 0

f0 = open('F:\\result.csv','wb')
f1 = csv.writer(f0)


Y = y_test.tolist()
for i in range(0, l):
    f1.writerow([Y[i], preds[i]])
    if preds[i] == y_test[i]:
        count = count + 1
acc = count * 100.0 / l
print "The accuracy is %.2f%%" % acc
f0.close()
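
Incidentally, since preds and y_test are both numpy arrays, the counting loop above can be collapsed into a one-liner (the CSV writing aside):

acc = 100.0 * np.mean(preds == y_test)
print "The accuracy is %.2f%%" % acc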



Here's the output from one run:


As you can see, the accuracy is very high, which shows that... (switching into lab-report mode)

Finally, I'd like to attach a sheet of paper showing how this 30307 is calculated; call it a fundamentals exercise.
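For reference, 30307 is the network's total number of learnable parameters (the figure nolearn prints at the start of training), and it can be recomputed from the architecture above, counting weights plus biases for each layer:

conv2d1 = 32 * (1 * 5 * 5) + 32       # 32 filters of 1*5*5, plus biases    -> 832
conv2d2 = 17 * (32 * 5 * 5) + 17      # 17 filters of 32*5*5, plus biases   -> 13617
dense   = (17 * 4 * 4) * 56 + 56      # 272 inputs to 56 units, plus biases -> 15288
output  = 56 * 10 + 10                # 56 inputs to 10 units, plus biases  -> 570
print(conv2d1 + conv2d2 + dense + output)   # 30307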