Andrew Ng Machine Learning - Evaluating a Hypothesis

June 22, 2018
Copyright notice: if you find this post useful, please credit the source when reposting: https://blog.csdn.net/wyg1997/article/details/80778511
Exercise link: https://s3.amazonaws.com/spark-public/ml/exercises/on-demand/machine-learning-ex5.zip

Regularized Linear Regression

Visualizing the data:

Code:
load ('ex5data1.mat');
plot(X, y, 'rx', 'MarkerSize', 10, 'LineWidth', 1.5);
  
The result:

[Figure: training data - change in water level (x) vs. water flowing out of the dam (y)]

Cost function:

The formula (with regularization):

$$J(\theta) = \frac{1}{2m}\left(\sum_{i=1}^{m}\left(h_\theta(x^{(i)}) - y^{(i)}\right)^2\right) + \frac{\lambda}{2m}\left(\sum_{j=1}^{n}\theta_j^2\right)$$

Code (to be filled in linearRegCostFunction.m):
t = X*theta-y;
J = t'*t/(2.0*m) + lambda/(2.0*m)*theta(2:end)'*theta(2:end);
  
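As a quick sanity check (this mirrors what ex5.m does; the reference value is the one the assignment prints), you can evaluate the cost at theta = [1; 1] with lambda = 1:

m = size(X, 1);
theta = [1; 1];
J = linearRegCostFunction([ones(m, 1) X], y, theta, 1);
% The assignment expects J to be about 303.993 here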

Computing the regularized linear regression gradient

The formulas:

$$\frac{\partial J(\theta)}{\partial \theta_0} = \frac{1}{m}\sum_{i=1}^{m}\left(h_\theta(x^{(i)}) - y^{(i)}\right)x_0^{(i)}$$

$$\frac{\partial J(\theta)}{\partial \theta_j} = \left(\frac{1}{m}\sum_{i=1}^{m}\left(h_\theta(x^{(i)}) - y^{(i)}\right)x_j^{(i)}\right) + \frac{\lambda}{m}\theta_j \qquad (j \geq 1)$$

Code:
% t = X*theta - y, computed above in the cost section
grad(1) = (X(:,1)'*t)/m;
grad(2:end) = (X(:,2:end)'*t./m) + (lambda/m).*theta(2:end);
  
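The same check works for the gradient (again, the reference values are the ones ex5.m prints):

[J, grad] = linearRegCostFunction([ones(size(X, 1), 1) X], y, [1; 1], 1);
% The assignment expects grad to be about [-15.303016; 598.250744]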

Fitting the training data

The result looks like this:

[Figure: best-fit line over the training data]
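For reference, the fit above is produced by roughly the following (a sketch of what ex5.m does; trainLinearReg is the provided helper that minimizes the regularized cost with fmincg):

m = size(X, 1);
lambda = 0;                             % a single feature barely needs regularization
theta = trainLinearReg([ones(m, 1) X], y, lambda);

plot(X, y, 'rx', 'MarkerSize', 10, 'LineWidth', 1.5);
hold on;
plot(X, [ones(m, 1) X] * theta, '--', 'LineWidth', 2);
hold off;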


Bias and Variance

The relationship between the number of training examples and the training / cross-validation errors
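Both errors are just the unregularized cost: the training error is measured on the first i examples, and the cross-validation error is always measured on the entire validation set:

$$J_{train}(\theta) = \frac{1}{2i}\sum_{k=1}^{i}\left(h_\theta(x^{(k)}) - y^{(k)}\right)^2 \qquad J_{cv}(\theta) = \frac{1}{2m_{cv}}\sum_{k=1}^{m_{cv}}\left(h_\theta(x_{cv}^{(k)}) - y_{cv}^{(k)}\right)^2$$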

Code (learningCurve.m):
function [error_train, error_val] = ...
    learningCurve(X, y, Xval, yval, lambda)
%LEARNINGCURVE Generates the train and cross validation set errors needed
%to plot a learning curve
%   [error_train, error_val] = LEARNINGCURVE(X, y, Xval, yval, lambda)
%   returns the train and cross validation set errors for a learning
%   curve. In particular, it returns two vectors of the same length -
%   error_train and error_val. Then, error_train(i) contains the training
%   error for i examples (and similarly for error_val(i)).
%
%   In this function, you will compute the train and test errors for
%   dataset sizes from 1 up to m. In practice, when working with larger
%   datasets, you might want to do this in larger intervals.
%

% Number of training examples
m = size(X, 1);

% You need to return these values correctly
error_train = zeros(m, 1);
error_val   = zeros(m, 1);

% ====================== YOUR CODE HERE ======================
% Instructions: Fill in this function to return training errors in
%               error_train and the cross validation errors in error_val.
%
% Note: Evaluate the training error on the first i training examples
%       (i.e., X(1:i, :) and y(1:i)), but the cross-validation error on
%       the _entire_ cross validation set (Xval and yval).
%
% Note: When using linearRegCostFunction to compute the errors, call it
%       with lambda set to 0; lambda is still needed when running the
%       training to obtain the theta parameters.
%

% ---------------------- Sample Solution ----------------------

for i = 1:m
    % Train on the first i examples, then measure both errors
    theta = trainLinearReg(X(1:i,:), y(1:i), lambda);
    [error_train(i), ~] = linearRegCostFunction(X(1:i,:), y(1:i), theta, 0);
    [error_val(i), ~]   = linearRegCostFunction(Xval, yval, theta, 0);
end

% -------------------------------------------------------------
% =========================================================================
end
Plot (high bias):

[Figure: learning curve - both training error and cross-validation error flatten out at a high value]


Polynomial Regression

Feature expansion (expanding the single linear feature to powers up to p)

Code (polyFeatures.m):
function [X_poly] = polyFeatures(X, p)
%POLYFEATURES Maps X (1D vector) into the p-th power
%   [X_poly] = POLYFEATURES(X, p) takes a data matrix X (size m x 1) and
%   maps each example into its polynomial features where
%   X_poly(i, :) = [X(i) X(i).^2 X(i).^3 ...  X(i).^p];
%


% You need to return the following variables correctly.
X_poly = zeros(numel(X), p);

% ====================== YOUR CODE HERE ======================
% Instructions: Given a vector X, return a matrix X_poly where the p-th 
%               column of X_poly contains the values of X to the p-th power.
%
% 

for i = 1:p
    X_poly(:,i) = X.^i;
end

% =========================================================================

end
  
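The loop above is perfectly fine; as a style note, the same mapping can be written as a one-liner with bsxfun, which broadcasts the exponents 1..p across the column vector:

X_poly = bsxfun(@power, X, 1:p);   % column i holds X.^i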

Checking the result

A note on what happens when ex5 runs next. Because we expand the features up to the 8th power (in this exercise), the last feature takes on enormous values (for example, x = 40 gives x^8 ≈ 6.6 × 10^12), so we need to apply feature normalization here.

The script calls it like this (we could also write this part ourselves, it is not much trouble) (featureNormalize.m):
function [X_norm, mu, sigma] = featureNormalize(X)
%FEATURENORMALIZE Normalizes the features in X 
%   FEATURENORMALIZE(X) returns a normalized version of X where
%   the mean value of each feature is 0 and the standard deviation
%   is 1. This is often a good preprocessing step to do when
%   working with learning algorithms.

mu = mean(X);
X_norm = bsxfun(@minus, X, mu);            % subtract each column's mean

sigma = std(X_norm);
X_norm = bsxfun(@rdivide, X_norm, sigma);  % divide by each column's standard deviation


% ============================================================

end
  
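One detail worth stressing: the validation and test sets must be normalized with the mu and sigma computed from the training set, not with their own statistics, or the learned theta will not transfer. ex5.m does roughly this:

X_poly_val = polyFeatures(Xval, p);
X_poly_val = bsxfun(@minus, X_poly_val, mu);       % training-set mean
X_poly_val = bsxfun(@rdivide, X_poly_val, sigma);  % training-set std
X_poly_val = [ones(size(X_poly_val, 1), 1), X_poly_val];   % add intercept term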

Then we use the functions we wrote earlier to compute the cost:

The resulting plots:

[Figure: polynomial regression fit and its learning curve]

From the plots this looks like high variance (overfitting).

Now let's see how different values of λ affect the result (this step is not graded; it is just to help us understand).

Code (run directly in the console; only the first line needs to be changed):
lambda = 1;
[theta] = trainLinearReg(X_poly, y, lambda);

% Plot training data and fit
figure(1);
plot(X, y, 'rx', 'MarkerSize', 10, 'LineWidth', 1.5);
plotFit(min(X), max(X), mu, sigma, theta, p);
xlabel('Change in water level (x)');
ylabel('Water flowing out of the dam (y)');
title (sprintf('Polynomial Regression Fit (lambda = %f)', lambda));

figure(2);
[error_train, error_val] = ...
    learningCurve(X_poly, y, X_poly_val, yval, lambda);
plot(1:m, error_train, 1:m, error_val);

title(sprintf('Polynomial Regression Learning Curve (lambda = %f)', lambda));
xlabel('Number of training examples')
ylabel('Error')
axis([0 13 0 100])
legend('Train', 'Cross Validation')
  
With λ = 1 (a pretty good fit):

[Figure: polynomial fit with lambda = 1]

With λ = 32 (now underfitting):

[Figure: polynomial fit with lambda = 32]

So let's make it smaller. With λ = 0.1 (the penalty is too weak, so it still overfits a little):

[Figure: polynomial fit with lambda = 0.1]

Using the cross-validation set to select a suitable λ (plotting the λ-Error curve)
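The assignment implements this in validationCurve.m: train once per candidate λ, then evaluate both errors with λ set to 0. A minimal sketch (the candidate list is the one the assignment suggests):

lambda_vec = [0 0.001 0.003 0.01 0.03 0.1 0.3 1 3 10]';
error_train = zeros(length(lambda_vec), 1);
error_val   = zeros(length(lambda_vec), 1);

for i = 1:length(lambda_vec)
    lambda = lambda_vec(i);
    theta = trainLinearReg(X_poly, y, lambda);
    % Errors are always measured without regularization
    [error_train(i), ~] = linearRegCostFunction(X_poly, y, theta, 0);
    [error_val(i), ~]   = linearRegCostFunction(X_poly_val, yval, theta, 0);
end

plot(lambda_vec, error_train, lambda_vec, error_val);
legend('Train', 'Cross Validation');
xlabel('lambda'); ylabel('Error');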

The resulting plot:

[Figure: training and cross-validation error as a function of lambda]

The cross-validation error bottoms out around λ = 3.

Let's look at the fit with λ = 3:

[Figure: polynomial fit with lambda = 3]
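As an optional final check (suggested by the assignment), evaluate the test error once, with the λ chosen on the validation set; this assumes X_poly_test and ytest were prepared the same way as the validation set:

lambda = 3;
theta = trainLinearReg(X_poly, y, lambda);
[error_test, ~] = linearRegCostFunction(X_poly_test, ytest, theta, 0);
% The assignment notes a value of about 3.8599 for lambda = 3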
