
Multi-Layer Feedforward Neural Network

I put together a neural network classifier. At first the step size (learning rate) was set to the reciprocal of the iteration count, which did not work well; after changing it to a fixed 0.2 the results improved noticeably. On a test case with a parabolic decision boundary, the accuracy is roughly 96% or higher.
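
For reference, the code below implements the standard sigmoid backpropagation rules. Writing o for a neurode's output, θ for its bias, t for the target, and η for the learning rate (the step size mentioned above), the forward pass and the updates are:

\[ o_j = \frac{1}{1 + e^{-\left(\sum_i w_{ij} o_i + \theta_j\right)}} \]
\[ \delta_j = o_j (1 - o_j)(t_j - o_j) \quad \text{(output layer)} \]
\[ \delta_j = o_j (1 - o_j) \sum_k \delta_k w_{jk} \quad \text{(hidden layers)} \]
\[ \Delta w_{ij} = \eta\, o_i \delta_j, \qquad \Delta\theta_j = \eta\, \delta_j \]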

public final class NeuroNetwork {
	
	private static class Neurode {
		double err;
		double output;
		double theta;
	}
	
	private static enum Status {
		NEW,
		TRAINED;
	}
	
	// status of this class, either NEW or TRAINED
	private Status status;
	
	// depth of network, layer 0 is input layer
	private int depth;
	
	// neurodes in each layer
	private Neurode[][] neurodes;
	
	// weights[i] is a two-dimensional array holding the weights between layer i and layer i+1
	private double[][][] weights;
	
	/**
	 * Initialize the neural network.
	 * 
	 * @param depth			: the number of layers
	 * @param numNeurodes	: the number of neurodes in each layer
	 */
	public NeuroNetwork(int depth, int[] numNeurodes) {
		
		this.depth = depth;
		
		// create and initialize neurodes
		neurodes = new Neurode[depth][];
		for ( int d=0; d<depth; d++ ) {
			neurodes[d] = new Neurode[numNeurodes[d]];
			for ( int i=0; i<numNeurodes[d]; i++) {
				neurodes[d][i] = new Neurode();
				neurodes[d][i].theta = Math.random();
			}
		}
		
		// initialize weights
		weights = new double[depth][][];
		for ( int d=0; d<depth-1; d++ ) {
			weights[d] = new double[numNeurodes[d]][numNeurodes[d+1]];
			for ( int i=0; i<numNeurodes[d]; i++) {
				for ( int j=0; j<numNeurodes[d+1]; j++ ) {
					weights[d][i][j] = Math.random();
				}
			}
		}
		
		status = Status.NEW;
		
	}
	
	/**
	 * Compute the output of every layer for a given input (forward pass).
	 * 
	 * @param data		: a vector representing the input
	 */
	private void calculateOutput(double[] data) {
		// initial output of layer 0
		for (int i=0; i<neurodes[0].length; i++ ) {
			neurodes[0][i].output = data[i];
		}
		
		// forward pass: compute the output of layers 1..depth-1
		for ( int d=1; d<depth; d++ ) {
			for ( int j=0; j<neurodes[d].length; j++) {
				double input = 0.0;
				for ( int i=0; i<neurodes[d-1].length; i++ ) {
					input += neurodes[d-1][i].output*weights[d-1][i][j];
				}
				input += neurodes[d][j].theta;
				neurodes[d][j].output = 1.0/(1.0+Math.exp(-input));	// sigmoid activation
			}
		}
	}
	
	/**
	 * Classify an input vector.
	 * 
	 * @param data		: a vector representing the input
	 * @param output	: a buffer that receives the output-layer activations
	 * @return			: the index of the neurode with the largest output
	 */
	public int predict(double[] data, double[] output) {
		
		if ( data.length != neurodes[0].length || output.length != neurodes[depth-1].length ) {
			throw new IllegalArgumentException();
		}
		
		calculateOutput(data);
		
		double x = neurodes[depth-1][0].output;
		int label = 0;
		for ( int i=0; i<neurodes[depth-1].length; i++ ) {
			output[i] = neurodes[depth-1][i].output;
			if ( x < output[i] ) {
				x = output[i];
				label = i;
			}
		}
		
		return label;
	}
	
	/**
	 * Train the neural network.
	 * 
	 * @param data		: training matrix, with data[i] representing the ith sample
	 * @param target	: label matrix, with target[i] representing the ith label vector
	 * @param maxIteration : maximum number of iterations
	 * @param threshold : convergence threshold for the weight updates
	 * @param errorRate : threshold for the error rate (currently unused)
	 * @return			: true if the weight updates fell below threshold within maxIteration rounds
	 */
	public boolean train(double[][] data, double[][] target, int maxIteration, double threshold, double errorRate) {
		
		// check status
		if ( status == Status.TRAINED ){
			throw new IllegalStateException();
		}
		
		// check input arguments and input parameters
		if ( data.length <=0 || data[0].length != neurodes[0].length ||
				target.length == 0 || target[0].length != neurodes[depth-1].length ) {
			throw new IllegalArgumentException();
		}
		
		int round = 1;
		boolean convergence = false;
		
		while ( round <= maxIteration && ! convergence ) {
			
			double rate = 0.2;	// fixed learning rate; 1.0/round (decaying with iterations) did not work as well
			double delta = 0.0;
			for ( int r=0; r<data.length; r++) {
				double res = trainWithOneSample(data[r], target[r], rate);
				delta = (delta<res)?res:delta;
			}
			
			convergence = (delta<threshold);
			round++;
			
			System.out.printf(" %d round of train, delta is %f %n", round-1, delta);
		
		}
		
		status = Status.TRAINED;
		
		return convergence;

	}

	/**
	 * Train the neural network with one training sample.
	 * 
	 * @param data		: a vector representing one training sample
	 * @param target	: a vector representing the class label of the sample
	 * @param rate		: learning rate
	 * @return			: maximum absolute delta of the weight updates
	 */
	private double trainWithOneSample(double[] data, double[] target, double rate) {
		
		calculateOutput(data);
		
		// calculate error for the output layer (layer depth-1)
		for ( int j=0; j<neurodes[depth-1].length; j++ ) {
			double output = neurodes[depth-1][j].output;
			neurodes[depth-1][j].err = output*(1-output)*(target[j]-output);
		}
		
		// calculate error for hidden layers depth-2 ... 1
		for ( int d=depth-2; d>0; d-- ) {
			for ( int j=0; j<neurodes[d].length; j++ ) {
				double error = 0.0;
				for ( int k=0; k<neurodes[d+1].length; k++ ) {
					error += neurodes[d+1][k].err*weights[d][j][k];
				}
				double output = neurodes[d][j].output;
				neurodes[d][j].err = output*(1-output)*error;
			}
		}
		
		double maxDelta = 0.0;
		
		// update weights
		for ( int d=0; d<depth-1; d++ ) {
			for ( int i=0; i<neurodes[d].length; i++ ) {
				for ( int j=0; j<neurodes[d+1].length; j++ ) {
					double delta = neurodes[d][i].output*neurodes[d+1][j].err;
					weights[d][i][j] += rate*delta;
					if ( maxDelta < Math.abs(delta) ) {
						maxDelta = Math.abs(delta);
					}
				}
			}
		}
		
		// update theta
		for ( int d=1; d<depth; d++ ) {
			for ( int j=0; j<neurodes[d].length; j++ ) {
				neurodes[d][j].theta += rate*neurodes[d][j].err;
			}
		}
		
		return maxDelta;
	}

}
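
As a minimal usage sketch (the input values here are made up and not part of the original test): the caller supplies an output buffer sized to the last layer, and predict fills it with the output-layer activations and returns the index of the largest one.

public class PredictSketch {
	public static void main(String[] args) {
		// Hypothetical 2-3-3 network, the same shape as the test below.
		int[] layers = { 2, 3, 3 };
		NeuroNetwork net = new NeuroNetwork(layers.length, layers);
		// In practice net.train(...) would run first; see TestMain below.
		double[] outputs = new double[3];	// receives the output-layer activations
		int label = net.predict(new double[] { 0.3, 0.7 }, outputs);
		System.out.println("predicted class: " + label);
	}
}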

Test:

public class TestMain {
	
	public static double[][][] generateData(int m) {
		
		double[][][] res = new double[2][][];
		
		double[][] data = new double[m*m][2];
		double[][] label = new double[m*m][3];
		
		for ( int i=0; i<m; i++ ) {
			double x = i/(m-1.0);
			for ( int j=0; j<m; j++ ) {
				double y = j/(m-1.0);
				
				data[i*m+j][0] = x;
				data[i*m+j][1] = y;
				
				label[i*m+j][0] = label[i*m+j][1] = label[i*m+j][2] = 0; 
				if ( y > 4.0*(x-0.5)*(x-0.5) ) {
					label[i*m+j][0] = 1;
				} else if ( x < 0.5 ) {
					label[i*m+j][1] = 1;
				} else {
					label[i*m+j][2] = 1;
				}
				
			}
		}
		
		res[0] = data;
		res[1] = label;
		return res;
	}
	
	public static int calculateLabel(double x, double y) {
		if ( y > 4.0*(x-0.5)*(x-0.5) ) {
			return 0;
		} else if ( x < 0.5 ) {
			return 1;
		} else {
			return 2;
		}
	}

	/**
	 * @param args
	 */
	public static void main(String[] args) {
		
		int[] num = { 2, 3, 3 };
		
		int m = 10;	// training grid: m*m points
		
		NeuroNetwork inst = new NeuroNetwork(num.length, num);
		double[][][] trainData = generateData(m);
		inst.train(trainData[0], trainData[1], 1000000, 0.001, 0.8);

		int t=50, success = 0;
		double[][][] testData = generateData(t);
		for ( int i=0; i<t*t; i++ ) {
			double[] outputs = new double[3];	// buffer for the output-layer activations; keeps the label row intact
			int res = inst.predict(testData[0][i], outputs);
			int ans = calculateLabel(testData[0][i][0], testData[0][i][1]);
			if ( res == ans ) {
				success ++;
			}
			System.out.printf("<%f, %f> : %d %b%n",testData[0][i][0],testData[0][i][1],res,res==ans);
		}
		System.out.printf("Accuracy rate is %f%n", (success+0.0)/(t*t));
		
	}

}