1.15 Generative Learning Algorithm：GDA and Mixture of Gaussians/GMM

阿新 • Published: 2018-12-29

1. Generative learning algorithms (GLA) and discriminative learning algorithms (DLA):

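The regression-type models covered earlier are one kind of DLA: they model $\large p(y|x)$ directly. More precisely, a model without regularization models $\large p(y|X;\theta)$, where $\large \theta$ is a fixed but unknown parameter. A model with regularization instead treats $\large \theta$ as a random variable with prior $\large p(\theta)$ and works with $\large p(y|X,\theta)p(\theta)$, which is proportional to the posterior $\large p(\theta|X,y)$.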

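A GLA instead models $\large p(x|y)$, together with the class prior $\large p(y)$. Taking classification as an example, generative learning models the features of each class separately, which gives $\large p(x)=\sum_{i=1}^{k}p(x|y_i)p(y_i)$, and then classifies with Bayes' rule: $\large p(y|x)=\frac{p(x|y)p(y)}{p(x)}$.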
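
The Bayes-rule step can be sketched in a few lines of NumPy. This is only an illustration: the function name `posterior` and the likelihood and prior values are hypothetical, not taken from the post.

```python
import numpy as np

def posterior(likelihoods, priors):
    """Return p(y=i | x) for each class i, given p(x | y=i) and p(y=i)."""
    likelihoods = np.asarray(likelihoods, dtype=float)
    priors = np.asarray(priors, dtype=float)
    joint = likelihoods * priors   # p(x | y_i) p(y_i)
    evidence = joint.sum()         # p(x) = sum_i p(x | y_i) p(y_i)
    return joint / evidence

# Hypothetical numbers, chosen only for illustration.
post = posterior(likelihoods=[0.2, 0.6], priors=[0.7, 0.3])
predicted_class = int(np.argmax(post))
```

Because $\large p(x)$ is the same for every class, the predicted class can also be found from the unnormalized products $\large p(x|y_i)p(y_i)$ alone.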

So far I have learned two generative classification algorithms: GDA (Gaussian discriminant analysis) and Naive Bayes.

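When the class labels are unknown (unobserved), the classification problem becomes a clustering problem. Treating the unobserved label as a latent random variable, the clustering problem can be solved with the EM algorithm. This post records GDA for the classification setting and the mixture of Gaussians model for the setting where the labels are unobserved.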

2.GDA: binary classification:

(1) Model assumptions: $\large y \sim \mathrm{Bernoulli}(\phi) \\ x|y=0 \sim N(\mu_0,\Sigma)\\ x|y=1\sim N(\mu_1,\Sigma)$. The corresponding distributions, with $\large n$ the dimension of $\large x$, are: $\large p(y)=\phi^y(1-\phi)^{1-y}\\p(x|y=0)=\frac{1}{(2\pi)^{\frac{n}{2}}|\Sigma|^{\frac{1}{2}}}\exp\left(-\frac{1}{2}(x-\mu_0)^T\Sigma^{-1}(x-\mu_0)\right)\\ p(x|y=1)=\frac{1}{(2\pi)^{\frac{n}{2}}|\Sigma|^{\frac{1}{2}}}\exp\left(-\frac{1}{2}(x-\mu_1)^T\Sigma^{-1}(x-\mu_1)\right)$

Because $\large \phi$ is also unknown, maximum likelihood estimation is applied directly to the joint distribution $\large p(x,y)$. The log-likelihood is: $\large l(\phi,\mu_0,\mu_1,\Sigma)=\log\prod_{i=1}^{m}p(x^i,y^i;\phi,\mu_0,\mu_1,\Sigma)=$ $\large \log\prod_{i=1}^{m}p(x^i|y^i;\mu_0,\mu_1,\Sigma)p(y^i;\phi)$
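
As a worked step (a sketch of the derivation under the model above, not taken from the original post), substituting $\large p(y^i;\phi)=\phi^{y^i}(1-\phi)^{1-y^i}$ splits the log-likelihood into a part that depends only on $\large \phi$ and a part that does not:

$\large l=\sum_{i=1}^{m}\log p(x^i|y^i;\mu_0,\mu_1,\Sigma)+\sum_{i=1}^{m}\left[y^i\log\phi+(1-y^i)\log(1-\phi)\right]$

Setting the derivative with respect to $\large \phi$ to zero then gives the fraction of positive examples:

$\large \frac{\partial l}{\partial\phi}=\sum_{i=1}^{m}\left[\frac{y^i}{\phi}-\frac{1-y^i}{1-\phi}\right]=0\ \Rightarrow\ \phi=\frac{1}{m}\sum_{i=1}^{m}y^i$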

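When estimating any one of $\large \phi$, $\large \mu_0$, $\large \mu_1$, $\large \Sigma$, the terms of the log-likelihood that do not involve that parameter can be ignored. Setting the derivatives to zero gives the following estimates: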

$\large \phi = \frac{1}{m}\sum_{i=1}^{m}1\{y^i=1\},\ \mu_0 = \frac{\sum_{i=1}^{m}1\{y^i=0\}x^i}{\sum_{i=1}^{m}1\{y^i=0\}}\\ \mu_1= \frac{\sum_{i=1}^{m}1\{y^i=1\}x^i}{\sum_{i=1}^{m}1\{y^i=1\}},\ \Sigma = \frac{1}{m}\sum_{i=1}^{m}(x^i-\mu_{y^i})(x^i-\mu_{y^i})^T$
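
These closed-form estimates translate almost line for line into NumPy. The following is a minimal sketch, assuming `X` is an `(m, n)` array and `y` holds 0/1 labels; the function names `fit_gda` and `predict_gda` and the synthetic data are illustrative assumptions, not part of the original post.

```python
import numpy as np

def fit_gda(X, y):
    """Closed-form MLE for binary GDA with a shared covariance matrix."""
    X = np.asarray(X, dtype=float)
    y = np.asarray(y).astype(int)
    m = X.shape[0]
    phi = np.mean(y == 1)                    # fraction of positive labels
    mu = np.stack([X[y == 0].mean(axis=0),   # mu_0
                   X[y == 1].mean(axis=0)])  # mu_1
    centered = X - mu[y]                     # x^i - mu_{y^i}
    sigma = centered.T @ centered / m        # pooled covariance
    return phi, mu, sigma

def predict_gda(X, phi, mu, sigma):
    """Pick the class with the larger log p(x|y) + log p(y)."""
    X = np.asarray(X, dtype=float)
    sigma_inv = np.linalg.inv(sigma)
    log_prior = np.log([1.0 - phi, phi])
    scores = []
    for k in (0, 1):
        d = X - mu[k]
        # The Gaussian normalizer is shared by both classes, so it is omitted.
        maha = np.einsum("ij,jk,ik->i", d, sigma_inv, d)
        scores.append(-0.5 * maha + log_prior[k])
    return np.argmax(np.stack(scores, axis=1), axis=1)

# Synthetic data, for illustration only.
rng = np.random.default_rng(0)
X0 = rng.normal(loc=[-2.0, 0.0], scale=1.0, size=(200, 2))
X1 = rng.normal(loc=[2.0, 0.0], scale=1.0, size=(200, 2))
X = np.vstack([X0, X1])
y = np.concatenate([np.zeros(200, dtype=int), np.ones(200, dtype=int)])
phi, mu, sigma = fit_gda(X, y)
accuracy = np.mean(predict_gda(X, phi, mu, sigma) == y)
```

Indexing `mu[y]` picks the mean of each sample's own class, which is exactly the $\large \mu_{y^i}$ in the covariance formula.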

(2) Comparison with logistic regression:

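Logistic regression makes weaker assumptions than GDA. The GDA assumptions imply that $\large p(y=1|x)$ has the logistic form, but a logistic $\large p(y=1|x)$ does not imply that $\large x|y$ is Gaussian. GDA therefore tends to fit better with less data when the Gaussian assumption holds, while logistic regression is more robust when it does not.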
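
The relation between the two models can be made concrete with a short derivation (a sketch under the GDA assumptions of (1); the symbols $\large \theta$ and $\large \theta_0$ are introduced here only for illustration). The quadratic terms in $\large x$ cancel in the log-odds because both classes share $\large \Sigma$:

$\large \log\frac{p(x|y=1)\phi}{p(x|y=0)(1-\phi)}=(\mu_1-\mu_0)^T\Sigma^{-1}x-\frac{1}{2}\mu_1^T\Sigma^{-1}\mu_1+\frac{1}{2}\mu_0^T\Sigma^{-1}\mu_0+\log\frac{\phi}{1-\phi}$

Writing $\large \theta=\Sigma^{-1}(\mu_1-\mu_0)$ and collecting the constants into $\large \theta_0$ gives

$\large p(y=1|x)=\frac{1}{1+\exp(-(\theta^Tx+\theta_0))}$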

(3) Multiclass Gaussian discriminant analysis:

Let $\large z$ denote the class label. Then: $\large l(\phi,\mu,\Sigma)=\sum_{i=1}^{m}\left[\log p(x^i|z^i;\mu,\Sigma)+\log p(z^i;\phi)\right]$

The final result is: $\large \phi_j = \frac{1}{m}\sum_{i=1}^{m}1\{z^i=j\},\ \mu_j = \frac{\sum_{i=1}^{m}1\{z^i=j\}x^i}{\sum_{i=1}^{m}1\{z^i=j\}}\\ \Sigma_j = \frac{\sum_{i=1}^{m}1\{z^i=j\}(x^i-\mu_j)(x^i-\mu_j)^T}{\sum_{i=1}^{m}1\{z^i=j\}}$
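
The per-class estimates can be sketched as a loop over classes. This is a minimal illustration, assuming labels `z` take values `0..k-1` and every class has at least one sample; the function name `fit_multiclass_gda` and the synthetic data are hypothetical.

```python
import numpy as np

def fit_multiclass_gda(X, z, k):
    """Per-class MLE: prior phi_j, mean mu_j and covariance Sigma_j."""
    X = np.asarray(X, dtype=float)
    z = np.asarray(z).astype(int)
    m, n = X.shape
    phi = np.zeros(k)
    mu = np.zeros((k, n))
    sigma = np.zeros((k, n, n))
    for j in range(k):
        mask = (z == j)               # indicator 1{z^i = j}
        count = mask.sum()
        phi[j] = count / m
        mu[j] = X[mask].mean(axis=0)
        d = X[mask] - mu[j]
        sigma[j] = d.T @ d / count    # covariance of class j only
    return phi, mu, sigma

# Synthetic three-class data, for illustration only.
rng = np.random.default_rng(1)
centers = np.array([[0.0, 0.0], [4.0, 0.0], [0.0, 4.0]])
z = np.repeat(np.arange(3), 50)
X = centers[z] + rng.normal(size=(150, 2))
phi, mu, sigma = fit_multiclass_gda(X, z, k=3)
```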

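PS: One question here: why does the binary case use a single $\large \Sigma$, while the multiclass case computes one $\large \Sigma$ per class? (This is a modeling choice rather than a requirement: a shared $\large \Sigma$ gives linear decision boundaries, while per-class $\large \Sigma_j$ gives quadratic ones, and either choice can be used with any number of classes.)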

3.Mixture of Gaussians:

Using a mixture of Gaussians for clustering requires some knowledge of the EM algorithm; see 1.16.

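If the $\large z^i$ were observed, the optimization problem would be the same as the one recorded in 2 (3): $\large l(\phi,\mu,\Sigma)=\sum_{i=1}^{m}\left[\log p(x^i|z^i;\mu,\Sigma)+\log p(z^i;\phi)\right]$

But $\large z$ is a latent variable, so the likelihood has to sum over its possible values, $\large l(\phi,\mu,\Sigma)=\sum_{i=1}^{m}\log\sum_{j=1}^{k}p(x^i|z^i=j;\mu,\Sigma)p(z^i=j;\phi)$, which is hard to maximize directly. EM is therefore used to compute the estimates iteratively. I have not fully understood the derivation of the algorithm yet, so it is not recorded here.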

The algorithm proceeds as follows:

Repeat until convergence{

E-step: For each i, j, set $\large w_j^i=Q_i(z^i=j)=p(z^i=j|x^i;\phi,\mu,\Sigma)=\frac{p(x^i|z^i=j;\mu,\Sigma)p(z^i=j;\phi)}{\sum_{l=1}^{k}p(x^i|z^i=l;\mu,\Sigma)p(z^i=l;\phi)}$

M-step: Set $\large \phi_j = \frac{1}{m}\sum_{i=1}^{m}w_j^i ,\\\mu_j = \frac{\sum_{i=1}^{m}w_j^ix^i}{\sum_{i=1}^{m} w_j^i} \\\Sigma_j = \frac{\sum_{i=1}^{m}w^i_j(x^i-\mu_j)(x^i-\mu_j)^T}{\sum_{i=1}^{m}w_j^i}$

}
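
The loop above can be sketched in NumPy as follows. This is a minimal illustration rather than a production implementation. Several details are assumptions that the post does not specify: the initialization (random data points as means, the overall covariance for every component), the small ridge `1e-6` added to each covariance for numerical stability, the log-likelihood convergence test, and the names `gaussian_pdf` and `em_gmm`.

```python
import numpy as np

def gaussian_pdf(X, mu, sigma):
    """Density of N(mu, sigma) evaluated at each row of X."""
    n = X.shape[1]
    d = X - mu
    maha = np.einsum("ij,jk,ik->i", d, np.linalg.inv(sigma), d)
    norm = np.sqrt((2.0 * np.pi) ** n * np.linalg.det(sigma))
    return np.exp(-0.5 * maha) / norm

def em_gmm(X, k, n_iter=100, tol=1e-6, seed=0):
    X = np.asarray(X, dtype=float)
    m, n = X.shape
    rng = np.random.default_rng(seed)
    # Assumed initialization: random data points as means, the overall
    # covariance for every component, uniform mixing weights.
    mu = X[rng.choice(m, size=k, replace=False)].copy()
    sigma = np.stack([np.cov(X.T, bias=True) + 1e-6 * np.eye(n)] * k)
    phi = np.full(k, 1.0 / k)
    prev_ll = -np.inf
    for _ in range(n_iter):
        # E-step: w[i, j] = p(z^i = j | x^i; phi, mu, sigma)
        joint = np.stack([phi[j] * gaussian_pdf(X, mu[j], sigma[j])
                          for j in range(k)], axis=1)
        w = joint / joint.sum(axis=1, keepdims=True)
        # M-step: the GDA estimates with indicators replaced by weights
        nj = w.sum(axis=0)
        phi = nj / m
        mu = (w.T @ X) / nj[:, None]
        for j in range(k):
            d = X - mu[j]
            sigma[j] = (w[:, j, None] * d).T @ d / nj[j] + 1e-6 * np.eye(n)
        # Log-likelihood of the parameters used in this E-step
        ll = np.log(joint.sum(axis=1)).sum()
        if abs(ll - prev_ll) < tol:
            break
        prev_ll = ll
    return phi, mu, sigma, w

# Synthetic two-cluster data, for illustration only.
rng = np.random.default_rng(2)
X = np.vstack([rng.normal(loc=[-3.0, 0.0], size=(150, 2)),
               rng.normal(loc=[3.0, 0.0], size=(150, 2))])
phi, mu, sigma, w = em_gmm(X, k=2)
```

The M-step has the same form as the multiclass GDA estimates, with the hard indicator $\large 1\{z^i=j\}$ replaced by the soft weight $\large w_j^i$. A hard cluster assignment can be read off with `w.argmax(axis=1)`.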
