Handwritten Digit Recognition with Softmax Logistic Regression

A complete walkthrough of implementing handwritten digit recognition with a softmax logistic regression network.

1 - Importing Modules

import numpy as np
import matplotlib.pyplot as plt
from ld_mnist import load_digits

%matplotlib inline

2 - Loading and Preprocessing the Data

mnist = load_digits()
Extracting C:/Users/marsggbo/Documents/Code/ML/TF Tutorial/data/MNIST_data\train-images-idx3-ubyte.gz
Extracting C:/Users/marsggbo/Documents/Code/ML/TF Tutorial/data/MNIST_data\train-labels-idx1-ubyte.gz
Extracting C:/Users/marsggbo/Documents/Code/ML/TF Tutorial/data/MNIST_data\t10k-images-idx3-ubyte.gz
Extracting C:/Users/marsggbo/Documents/Code/ML/TF Tutorial/data/MNIST_data\t10k-labels-idx1-ubyte.gz
print("Train: "+ str(mnist.train.images.shape))
print("Train: "+ str(mnist.train.labels.shape))
print("Test: "+ str(mnist.test.images.shape))
print("Test: "+ str(mnist.test.labels.shape))
Train: (55000, 784)
Train: (55000, 10)
Test: (10000, 784)
Test: (10000, 10)

The MNIST data is loaded with a TensorFlow helper function. From the output above we can see that the training set X_train has 55000 examples, each a flattened vector of length 784 (28*28).
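ld_mnist is a local helper module whose source is not shown here; given the "Extracting ..." messages above, it presumably wraps TensorFlow 1.x's tutorial loader. A minimal sketch of what it might look like (the module body and the data path are assumptions):

# ld_mnist.py -- hypothetical sketch of the local loader
from tensorflow.examples.tutorials.mnist import input_data

def load_digits():
    # one_hot=True yields (n, 10) label matrices, matching the shapes printed above
    return input_data.read_data_sets("data/MNIST_data", one_hot=True)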

Also, because the dataset is fairly large, TensorFlow provides a way to draw the data in batches, which greatly speeds things up:

x_batch, y_batch = mnist.train.next_batch(100)
print(x_batch.shape)
print(y_batch.shape)

>>>
(100, 784)
(100, 10)
x_train, y_train, x_test, y_test = mnist.train.images, mnist.train.labels, mnist.test.images, mnist.test.labels

Because the training set is large, it can be further split into a training set, a validation set, and a test set of 30000, 15000, and 10000 examples, roughly a 6:3:2 ratio:

x_train_batch, y_train_batch = mnist.train.next_batch(30000)
x_cv_batch, y_cv_batch = mnist.train.next_batch(15000)
x_test_batch, y_test_batch = mnist.train.next_batch(10000)
print(x_train_batch.shape)
print(y_cv_batch.shape)
print(y_test_batch.shape)
(30000, 784)
(15000, 10)
(10000, 10)

Displaying some handwritten digits

nums = 6
# draw the first few training digits side by side
for i in range(1,nums+1):
    plt.subplot(1,nums,i)
    plt.imshow(x_train[i].reshape(28,28), cmap="gray")

[Figure: the first six training digits displayed as 28x28 grayscale images]

3 - The Algorithm

3.1 Algorithm

For a single training example $x^{(i)}$:

$$z^{(i)} = w^T x^{(i)} + b \tag{1}$$

$$\hat{y}^{(i)} = a^{(i)} = \mathrm{softmax}(z^{(i)}) \tag{2}$$

The loss function for one example is the cross-entropy

$$L(\hat{y}^{(i)}, y^{(i)}) = -\sum_{j} y_j^{(i)} \log \hat{y}_j^{(i)} \tag{3}$$

and the total cost over the training set (with an L2 regularization term, coefficient $c$, matching the code below) is

$$J(\theta) = \frac{1}{m} \sum_{i=1}^{m} L(\hat{y}^{(i)}, y^{(i)}) + \frac{c}{2} \sum_{j,k} w_{jk}^2 \tag{4}$$

Note that the product $w^T x^{(i)}$ in equation (1) depends on how the data is laid out and must be adapted accordingly. In this project $x \in \mathbb{R}^{55000 \times 784}$, $w \in \mathbb{R}^{784 \times 10}$ and $y \in \mathbb{R}^{55000 \times 10}$, so in code the computation becomes $z^{(i)} = x^{(i)} w + b$.
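A quick shape check of that layout (a small sketch; note that the code in this post trains without an explicit bias term $b$):

X = np.zeros((55000, 784))   # one flattened 28x28 image per row
w = np.zeros((784, 10))      # one weight column per output class
Z = np.dot(X, w)             # (55000, 784) x (784, 10) -> (55000, 10)
print(Z.shape)               # one score vector of length 10 per example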

Key steps
- Initialize the model parameters
- Minimize the cost function with respect to those parameters
- Use the learned parameters to predict
- Analyze the results and summarize

3.2 Initializing the Model Parameters

# initialize the model parameters
def init_params(dim1, dim2):
    '''
    dim1, dim2: dimensions of the weight matrix w; they should match
    x_train.shape[1] and y_train.shape[1] respectively
    '''
    w = np.zeros((dim1,dim2))
    return w
w  = init_params(2,1)
print(w)
[[ 0.]
 [ 0.]]

3.3 Defining the Softmax Function

def softmax(x):
    """
    Compute the softmax function for each row of the input x.

    Arguments:
    x -- an N-dimensional vector or an M x N numpy matrix.

    Return:
    x -- the softmax of x (row-wise for matrix input)
    """
    orig_shape = x.shape

    if len(x.shape) > 1:
        # Matrix: subtract each row's max before exponentiating (numerical
        # stability), then divide each row by its sum
        exp_minmax = lambda x: np.exp(x - np.max(x))
        denom = lambda x: 1.0 / np.sum(x)
        x = np.apply_along_axis(exp_minmax,1,x)
        denominator = np.apply_along_axis(denom,1,x) 

        if len(denominator.shape) == 1:
            denominator = denominator.reshape((denominator.shape[0],1))

        x = x * denominator
    else:
        # Vector: same max-shift trick, then normalize
        x_max = np.max(x)
        x = x - x_max
        numerator = np.exp(x)
        denominator = 1.0 / np.sum(numerator)
        x = numerator * denominator

    assert x.shape == orig_shape
    return x
a = np.array([[1,2,3,4],[1,2,3,4]])
print(softmax(a))
np.sum(softmax(a))
[[ 0.0320586   0.08714432  0.23688282  0.64391426]
 [ 0.0320586   0.08714432  0.23688282  0.64391426]]
2.0
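For reference, the same row-wise softmax can be written more compactly with broadcasting instead of np.apply_along_axis (a sketch, equivalent for 2-D input):

def softmax_vectorized(x):
    # subtract each row's max for numerical stability, then normalize each row
    shifted = x - np.max(x, axis=1, keepdims=True)
    e = np.exp(shifted)
    return e / np.sum(e, axis=1, keepdims=True)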

3.4 - Forward and Backward Propagation

With the parameters initialized, we can implement forward propagation (FP) and backward propagation (BP) so the parameters can be learned.

Forward Propagation:
- Take the input data $X$
- Compute $A = \mathrm{softmax}(w^T X + b) = (a^{(0)}, a^{(1)}, \ldots, a^{(m-1)}, a^{(m)})$
- Compute the cost function $J$ as in equation (4):

def propagation(w, c, X, Y):
    '''
    Forward pass: compute the cost J and its gradient dw with respect to w
    (c is the L2 regularization coefficient)
    '''
    m = X.shape[0]
    A = softmax(np.dot(X,w))
    # cross-entropy cost plus L2 penalty
    J  = -1/m * np.sum(Y*np.log(A)) + 0.5*c*np.sum(w*w)
    # gradient of J with respect to w
    dw = -1/m * np.dot(X.T, (Y-A)) + c*w

    update = {"dw":dw, "cost": J}
    return update
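Before training, one can verify that dw agrees with a numerical gradient via centered finite differences (a sketch; grad_check and the toy data below are not part of the original code):

def grad_check(w, c, X, Y, eps=1e-6, n_checks=5):
    # compare the analytic gradient to centered finite differences
    # at a few randomly chosen entries of w
    dw = propagation(w, c, X, Y)['dw']
    for _ in range(n_checks):
        i = np.random.randint(w.shape[0])
        j = np.random.randint(w.shape[1])
        w[i, j] += eps
        J_plus = propagation(w, c, X, Y)['cost']
        w[i, j] -= 2 * eps
        J_minus = propagation(w, c, X, Y)['cost']
        w[i, j] += eps                       # restore the original entry
        numeric = (J_plus - J_minus) / (2 * eps)
        print(dw[i, j], numeric)             # the two values should agree closely

# example on a tiny random problem
X_t = np.random.rand(5, 4)
Y_t = np.eye(3)[np.random.randint(0, 3, 5)]  # random one-hot labels
w_t = np.random.rand(4, 3)
grad_check(w_t, 0.1, X_t, Y_t)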
def optimization(w, c, X, Y, learning_rate=0.1, iterations=1000, print_info=False):
    '''
    Backward pass: minimize the cost by batch gradient descent
    '''
    costs = []

    for i in range(iterations):
        update = propagation(w, c, X, Y)
        w -= learning_rate * update['dw']

        # record (and optionally print) the cost every 100 iterations
        if i % 100 == 0:
            costs.append(update['cost'])
            if print_info:
                print("Iteration " + str(i+1) + " Cost = " + str(update['cost']))

    results = {'w':w, 'costs': costs}
    return results
def predict(w, X):
    '''
    Predict: return the softmax probabilities for each example
    '''
    return softmax(np.dot(X, w))
def accuracy(y_hat, Y):
    '''
    Compute the classification accuracy
    '''
    # mark each example's predicted class with a 1 (note: this modifies y_hat
    # in place, which is why the y_hat printed below contains exact 1.0 entries)
    max_index = np.argmax(y_hat, axis=1)
    y_hat[np.arange(y_hat.shape[0]), max_index] = 1
    # a prediction is correct when the argmax of y_hat matches the one-hot label
    accuracy = np.sum(np.argmax(y_hat, axis=1)==np.argmax(Y, axis=1))
    accuracy = accuracy *1.0/Y.shape[0]
    return accuracy
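A toy sanity check of accuracy (a sketch; the two 3x2 arrays are made-up values, not project data):

y_hat_toy = np.array([[0.1, 0.9], [0.8, 0.2], [0.3, 0.7]])
Y_toy = np.array([[0, 1], [1, 0], [1, 0]])
print(accuracy(y_hat_toy, Y_toy))   # two of three argmaxes match -> 0.666...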
def model(w, c, X, Y, learning_rate=0.1, iterations=1000, print_info=False):
    results = optimization(w, c, X, Y, learning_rate, iterations, print_info)

    w = results['w']
    costs = results['costs']
    y_hat = predict(w, X)

    # use a distinct local name: assigning to `accuracy` here would shadow the
    # accuracy() function and raise UnboundLocalError when it is called
    acc = accuracy(y_hat, Y)
    print("After %d iterations,the total accuracy is %f"%(iterations, acc))
    results = {
        'w':w,
        'costs':costs,
        'accuracy':acc,
        'iterations':iterations,
        'learning_rate':learning_rate,
        'y_hat':y_hat,
        'c':c
    }
    return results

4 - Validating the Model

w = init_params(x_train_batch.shape[1], y_train_batch.shape[1])
c = 0
results_train = model(w, c, x_train_batch, y_train_batch, learning_rate=0.3, iterations=1000, print_info=True)
print(results_train)
Iteration 1 Cost = 2.30258509299
Iteration 101 Cost = 0.444039646187
Iteration 201 Cost = 0.383446527394
Iteration 301 Cost = 0.357022940232
Iteration 401 Cost = 0.341184601147
Iteration 501 Cost = 0.330260258921
Iteration 601 Cost = 0.322097106964
Iteration 701 Cost = 0.315671301537
Iteration 801 Cost = 0.310423971361
Iteration 901 Cost = 0.306020145234
After 1000 iterations,the total accuracy is 0.915800
{'w': array([[ 0.,  0.,  0., ...,  0.,  0.,  0.],
       [ 0.,  0.,  0., ...,  0.,  0.,  0.],
       [ 0.,  0.,  0., ...,  0.,  0.,  0.],
       ..., 
       [ 0.,  0.,  0., ...,  0.,  0.,  0.],
       [ 0.,  0.,  0., ...,  0.,  0.,  0.],
       [ 0.,  0.,  0., ...,  0.,  0.,  0.]]), 'costs': [2.302585092994045, 0.44403964618714781, 0.38344652739376933, 0.35702294023246306, 0.34118460114650634, 0.33026025892089478, 0.32209710696427363, 0.31567130153696982, 0.31042397136133199, 0.30602014523405535], 'accuracy': 0.91579999999999995, 'iterations': 1000, 'learning_rate': 0.3, 'y_hat': array([[  1.15531353e-03,   1.72628369e-09,   2.24683134e-03, ...,
          4.06392375e-08,   1.19337142e-04,   2.07493343e-06],
       [  1.41786837e-01,   1.11756123e-03,   2.79188805e-02, ...,
          6.80002693e-03,   1.00000000e+00,   1.25721652e-01],
       [  9.52758112e-05,   1.41141596e-06,   2.04835561e-03, ...,
          1.21014773e-04,   2.50044218e-02,   1.00000000e+00],
       ..., 
       [  1.79945865e-07,   6.74560778e-05,   1.53151951e-05, ...,
          2.44907396e-05,   1.71333912e-04,   1.08085629e-02],
       [  2.59724603e-05,   6.36785472e-10,   1.00000000e+00, ...,
          2.70273729e-08,   2.10287536e-06,   2.48876734e-08],
       [  1.00000000e+00,   9.96462215e-15,   5.55562364e-08, ...,
          2.01973615e-08,   1.57821049e-07,   3.37994451e-09]]), 'c': 0}
plt.plot(results_train['costs'])
[<matplotlib.lines.Line2D at 0x283b1d75ef0>]

[Figure: training cost versus iteration, recorded every 100 steps]

# [c, learning_rate] pairs to try on the validation set
params = [[0, 0.3],[0,0.5],[5,0.3],[5,0.5]]
results_cv = {}
for i in range(len(params)):
    # note: optimization updates w in place, so each run continues training
    # from the weights left by the previous one
    result = model(results_train['w'], params[i][0], x_cv_batch, y_cv_batch,
                   learning_rate=params[i][1], iterations=1000, print_info=False)
    print("{0} iteration done!".format(i))
    results_cv[i] = result
After 1000 iterations,the total accuracy is 0.931333
0 iteration done!
After 1000 iterations,the total accuracy is 0.936867
1 iteration done!
After 1000 iterations,the total accuracy is 0.940200
2 iteration done!
After 1000 iterations,the total accuracy is 0.942200
3 iteration done!
for i in range(len(params)):
    print("{0} iteration accuracy: {1} ".format(i+1, results_cv[i]['accuracy']))
for i in range(len(params)):
    plt.subplot(len(params), 1,i+1)
    plt.plot(results_cv[i]['costs'])
1 iteration accuracy: 0.9313333333333333 
2 iteration accuracy: 0.9368666666666666 
3 iteration accuracy: 0.9402 
4 iteration accuracy: 0.9422 

[Figure: validation cost curves for the four parameter settings]

Checking accuracy on the test set

y_hat_test = predict(w, x_test_batch)
accu = accuracy(y_hat_test, y_test_batch)
print(accu)
0.9111
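To see which digits are being confused, one can tabulate a confusion matrix from the same predictions (a sketch; this helper is not in the original code):

# rows: true digit, columns: predicted digit
confusion = np.zeros((10, 10), dtype=int)
true_labels = np.argmax(y_test_batch, axis=1)
pred_labels = np.argmax(y_hat_test, axis=1)
for t, p in zip(true_labels, pred_labels):
    confusion[t, p] += 1
print(confusion)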

5 - Testing on Real Handwritten Digits

Load the previously saved weights

# w = results_cv[3]['w']
# np.save('weights.npy',w)
w = np.load('weights.npy')
w.shape
(784, 10)
# the images have already been converted to txt format
files = ['3.txt','31.txt','5.txt','8.txt','9.txt','6.txt','91.txt']

# convert a txt file into an np.array
def pic2np(file):
    with open(file, 'r') as f:
        x = f.readlines()
        data = []

        # each line holds one row of the 28x28 image as a string of 0/1 characters
        for i in range(len(x)):
            x[i] = x[i].split('\n')[0]
            for j in range(len(x[0])):
                data.append(int(x[i][j]))
        data = np.array(data)
        return data.reshape(-1,784)
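The conversion from image to txt is not shown in the source; a hypothetical sketch of one way to produce such files, assuming Pillow is installed (the img2txt name and the binarization threshold are illustrative assumptions):

from PIL import Image

def img2txt(image_file, txt_file, threshold=128):
    # grayscale, resize to 28x28, and binarize into the 0/1 grid pic2np expects
    img = Image.open(image_file).convert('L').resize((28, 28))
    pixels = np.array(img)
    with open(txt_file, 'w') as f:
        for row in pixels:
            # dark (ink) pixels become 1, light background becomes 0
            f.write(''.join('1' if p < threshold else '0' for p in row) + '\n')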
# check the predictions on these images
i = 1
count = 0
for file in files:
    x = pic2np(file)
    y = np.argmax(predict(w, x))

    print("Actual {0} - Predicted {1}".format( int(file.split('.')[0][0]) , y) )
    if y == int(file.split('.')[0][0]):
        count += 1
    plt.subplot(2, len(files), i)
    plt.imshow(x.reshape(28,28))
    i += 1
print("Accuracy: {0}".format(count/len(files)))
Actual 3 - Predicted 6
Actual 3 - Predicted 3
Actual 5 - Predicted 3
Actual 8 - Predicted 3
Actual 9 - Predicted 3
Actual 6 - Predicted 6
Actual 9 - Predicted 7
Accuracy: 0.2857142857142857

My own handwritten digits

As the results above show, my own handwriting is apparently quite... distinctive: only 2 of the 7 digits were recognized correctly. Clearly the algorithm still has room for improvement.

6 - Deriving the Softmax Gradient

Derivation of the softmax gradient:
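A condensed derivation, consistent with the cross-entropy cost in equation (4) and the dw computed in propagation (regularization omitted, i.e. $c = 0$):

For logits $z = w^T x$ and $a = \mathrm{softmax}(z)$, i.e. $a_j = e^{z_j} / \sum_k e^{z_k}$, the softmax Jacobian is

$$\frac{\partial a_j}{\partial z_k} = a_j (\delta_{jk} - a_k)$$

With the cross-entropy loss $L = -\sum_j y_j \log a_j$ and a one-hot label $y$ (so $\sum_j y_j = 1$):

$$\frac{\partial L}{\partial z_k} = -\sum_j \frac{y_j}{a_j}\, a_j (\delta_{jk} - a_k) = a_k - y_k$$

so $\partial L / \partial z = a - y$. Applying the chain rule through $z = xw$ and averaging over all $m$ examples gives

$$\frac{\partial J}{\partial w} = -\frac{1}{m} X^T (Y - A)$$

which is exactly the dw computed in propagation, plus the extra $c\,w$ term when regularization is switched on.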