
Python implementation of exercise 5.7 from Machine Learning (the watermelon book): solving XOR with an RBF network

Credit: https://blog.csdn.net/Snoopy_Yuan/article/details/71024046
An RBF network first fixes the hidden-neuron centers c_i. For XOR, the training and test sets are built from two-dimensional data taking values in {0, 1}, so the four 0/1 points serve directly as the centers.
The weights w and scale coefficients β are then trained according to the formula on page 103, and the Gaussian radial basis outputs are combined into the network output.
After 10 training passes the error rate is already 0; after all, an RBF network can approximate any continuous function to arbitrary accuracy, so XOR is a piece of cake.
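For reference, the model on page 103 of the book, and the per-sample gradients that the code below implements, are:

φ(x) = Σᵢ wᵢ · ρ(x, cᵢ),   where ρ(x, cᵢ) = exp(−βᵢ · ‖x − cᵢ‖²)

For the squared error E = (φ(x) − y)² / 2 on one sample, differentiating gives

∂E/∂wᵢ = (φ(x) − y) · ρ(x, cᵢ)
∂E/∂βᵢ = −(φ(x) − y) · wᵢ · ‖x − cᵢ‖² · ρ(x, cᵢ)

which are exactly the update rules used in BackPropagateRBF below.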

Main script XOR_RBF.py

import numpy as np

# train set
X_trn = np.random.randint(0, 2, (100, 2))   # random ints >= 0 and < 2
y_trn = np.logical_xor(X_trn[:, 0], X_trn[:, 1])
# test set
X_tst = np.random.randint(0, 2, (100, 2))
y_tst = np.logical_xor(X_tst[:, 0], X_tst[:, 1])

'''
implementation of RBF network
'''
from RBF_BP import *

# neuron centers c_i: since the inputs take values in {0, 1}, the four possible points below serve as the centers
centers = np.array([[0, 0], [0, 1], [1, 0], [1, 1]])

# construct the network
rbf_nn = RBP_network()  # instantiate the RBF network class
rbf_nn.CreateNN(4, centers, learningrate=0.05)  # build the network structure

# parameter training: 10 passes over the training set
e = []
for i in range(10):
    err, err_k = rbf_nn.TrainRBF(X_trn, y_trn)
    e.append(err)  # record the mean squared error of each pass
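
# optional (a sketch, assuming matplotlib is installed): plot the recorded
# per-pass mean squared errors to check that training converges
import matplotlib.pyplot as plt
plt.plot(e)
plt.xlabel('training pass')
plt.ylabel('mean squared error')
plt.show()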


'''
model testing 
'''
# threshold the real-valued outputs at 0.5 and count correct predictions
y_pred = rbf_nn.Batch_Pred(X_tst)
count = 0
for i in range(len(y_pred)):
    if y_pred[i] >= 0.5:
        y_pred[i] = True
    else:
        y_pred[i] = False
    if y_pred[i] == y_tst[i]: count += 1

tst_err = 1 - count / len(y_tst)
print("test error rate: %.3f" % tst_err)

The training module RBF_BP.py

class RBP_network:
    '''
    the definition of the RBF network class
    '''

    def __init__(self):

        '''
        initial variables
        '''
        # node number of each layer:
        # the input layer size equals the number of input variables,
        # and the output layer contains only one neuron
        self.h_n = 0  # hidden neuron number

        # output value for each layer
        self.b = []  # hidden layer
        self.y = 0.0  # output

        # parameters (w, b, c)
        self.w = []  # weight of the link between hidden neuron and output neuron
        self.beta = []  # scale index of Gaussian-RBF
        self.c = []  # center of Gaussian-RBF

        # initial the learning rate
        self.lr = 0.05

    def CreateNN(self, nh, centers, learningrate):
        '''
        build a RBF network structure and initial parameters
        @param nh: the number of hidden-layer neurons
        @param centers: matrix [h_n * i_n], the centers of the hidden-layer neurons
        @param learningrate: learning rate of gradient algorithm
        '''
        # dependent packages
        import numpy as np

        # assignment of hidden neuron number
        self.h_n = nh

        # initial value of output for each layer
        self.b = np.zeros(self.h_n)
        # self.y = 0.0

        # initial centers
        self.c = centers

        # initial weights for each link (random initialization)
        self.w = np.zeros(self.h_n)
        self.beta = np.zeros(self.h_n)
        for h in range(self.h_n):
            self.w[h] = rand(0, 1)
            self.beta[h] = rand(0, 1)

        # initial learning rate
        self.lr = learningrate

    def Pred(self, x):
        '''
        predict process through the network
        @param x: array, input array for input layer
        @return y: float, output of the network
        '''

        self.y = 0.0
        # activate hidden layer and calculating output
        for h in range(self.h_n):
            self.b[h] = RBF(x, self.beta[h], self.c[h])
            self.y += self.w[h] * self.b[h]

        return self.y

    def Batch_Pred(self, X):
        '''
        predict process through the network for batch data

        @param X: array, data set for input layer
        @return y_pred: array, outputs of the network
        '''

        y_pred = []
        # activate hidden layer and calculating output
        for i in range(len(X)):
            y_pred.append(self.Pred(X[i]))

        return y_pred

    def BackPropagateRBF(self, x, y):
        '''
        the implementation of the special BP algorithm for the RBF network, on a single sample
        @param x, y: array and float, input and output of the data sample
        '''

        # dependent packages
        import numpy as np

        # get current network output
        self.Pred(x)

        # calculate the gradient for hidden layer
        g = np.zeros(self.h_n)
        for h in range(self.h_n):
            g[h] = (self.y - y) * self.b[h]

        # update the parameters by gradient descent on E = (y_hat - y)^2 / 2;
        # note the squared norm: d(rho)/d(beta) = -||x - c||^2 * rho
        for h in range(self.h_n):
            self.beta[h] += self.lr * g[h] * self.w[h] * np.linalg.norm(x - self.c[h], 2) ** 2
            self.w[h] -= self.lr * g[h]

    def TrainRBF(self, data_in, data_out):
        '''
        BP training for RBF network
        @param data_in, data_out: arrays, input samples and target outputs of the training set
        @return e: mean accumulated error over the pass
        @return e_k: error array, one entry per sample
        '''
        e_k = []
        for k in range(len(data_in)):
            x = data_in[k]
            y = data_out[k]
            self.BackPropagateRBF(x, y)

            # error in train set for each step
            y_delta2 = (self.y - y) ** 2
            e_k.append(y_delta2 / 2)

        # mean error over the training pass
        e = sum(e_k) / len(e_k)

        return e, e_k


def RBF(x, beta, c):
    '''
    the definition of the Gaussian radial basis function (RBF)
    @param x: array, input variable
    @param beta: float, scale index
    @param c: array, center
    '''

    # dependent packages
    from numpy import exp
    from numpy.linalg import norm

    # Gaussian RBF: rho(x, c) = exp(-beta * ||x - c||^2);
    # the 2-norm ||x - c|| is the Euclidean distance to the center
    return exp(-beta * norm(x - c, 2) ** 2)


def rand(a, b):
    '''
    the definition of random function
    @param a, b: the lower and upper bounds of the random value
    '''

    # dependent packages
    from random import random

    return (b - a) * random() + a
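
A minimal self-test of the Gaussian RBF (a sketch; the expected values follow directly from the definition above):

if __name__ == '__main__':
    import numpy as np
    # the Gaussian RBF equals 1 exactly at its center ...
    print(RBF(np.array([0, 1]), 1.0, np.array([0, 1])))  # 1.0
    # ... and decays with the squared distance: exp(-1) at distance 1
    print(RBF(np.array([1, 1]), 1.0, np.array([0, 1])))  # ~0.368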