Dlib Machine Learning Library Study Series, Part 3: Face Alignment (Landmark Detection)

阿新 • Published: 2019-01-06

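This post is the third in my Dlib study series: face alignment. Creating and configuring the project works essentially the same way for face alignment as for face detection, so I won't repeat it here; see my previous post. Enough small talk, on to the substance.

Step 1: Create and configure the project, following the previous post.

Step 2: Download the shape model file, shape_predictor_68_face_landmarks.dat.bz2, from http://dlib.net/files/shape_predictor_68_face_landmarks.dat.bz2 and decompress it to get the .dat file.

Step 3: The code. This is also one of the examples that ships with dlib, with my own annotations added alongside the original comments!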

// The contents of this file are in the public domain. See LICENSE_FOR_EXAMPLE_PROGRAMS.txt
/*

This example program shows how to find frontal human faces in an image and
estimate their pose.  The pose takes the form of 68 landmarks.  These are
points on the face such as the corners of the mouth, along the eyebrows, on
the eyes, and so forth.
****In short: this example finds frontal faces in an image and estimates each face's pose, which is represented by 68 points.


//This face detector is made using the classic Histogram of Oriented
Gradients (HOG) feature combined with a linear classifier, an image pyramid,
and sliding window detection scheme.//
****This is how the face detector works.

The pose estimator was created by
using dlib's implementation of the paper://the program is written from this paper
One Millisecond Face Alignment with an Ensemble of Regression Trees by
Vahid Kazemi and Josephine Sullivan, CVPR 2014
and was trained on the iBUG 300-W face landmark dataset.

Also, note that you can train your own models using dlib's machine learning
tools.  See train_shape_predictor_ex.cpp to see an example.
****We can train our own model with train_shape_predictor_ex.exe


Finally, note that the face detector is fastest when compiled with at least
SSE2 instructions enabled.  So if you are using a PC with an Intel or AMD
chip then you should enable at least SSE2 instructions.  If you are using
cmake to compile this program you can enable them by using one of the
following commands when you create the build project:
cmake path_to_dlib_root/examples -DUSE_SSE2_INSTRUCTIONS=ON
cmake path_to_dlib_root/examples -DUSE_SSE4_INSTRUCTIONS=ON
cmake path_to_dlib_root/examples -DUSE_AVX_INSTRUCTIONS=ON
This will set the appropriate compiler options for GCC, clang, Visual
Studio, or the Intel compiler.  If you are using another compiler then you
need to consult your compiler's manual to determine how to enable these
instructions.  Note that AVX is the fastest but requires a CPU from at least
2011.  SSE4 is the next fastest and is supported by most current machines.
*/


#include <dlib/image_processing/frontal_face_detector.h>
#include <dlib/image_processing/render_face_detections.h>
#include <dlib/image_processing.h>
#include <dlib/gui_widgets.h>
#include <dlib/image_io.h>
#include <iostream>

using namespace dlib;
using namespace std;

// ----------------------------------------------------------------------------------------

int main(int argc, char** argv)
{
	try
	{
		// This example takes in a shape model file and then a list of images to
		// process.  We will take these filenames in as command line arguments.
		// Dlib comes with example images in the examples/faces folder so give
		// those as arguments to this program.
		// This example needs a shape model file followed by a list of images.
		if (argc == 1)
		{
			cout << "Call this program like this:" << endl;
			cout << "./face_landmark_detection_ex shape_predictor_68_face_landmarks.dat faces/*.jpg" << endl;
			cout << "\nYou can get the shape_predictor_68_face_landmarks.dat file from:\n";
			cout << "http://dlib.net/files/shape_predictor_68_face_landmarks.dat.bz2" << endl;//download the landmark model data from this URL
			return 0;
		}

		// We need a face detector.  We will use this to get bounding boxes for
		// each face in an image.
		//****We need a face detector to get a bounding box for each face
		frontal_face_detector detector = get_frontal_face_detector();

		// And we also need a shape_predictor.  This is the tool that will predict face
		// landmark positions given an image and face bounding box.  Here we are just
		// loading the model from the shape_predictor_68_face_landmarks.dat file you gave
		// as a command line argument.
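		//****We also need a shape predictor: a tool that predicts the landmark positions given an image and a face bounding box.
		//****Here we simply load the model from the shape_predictor_68_face_landmarks.dat file
		shape_predictor sp;//define an instance of the shape_predictor class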
		deserialize(argv[1]) >> sp;


		image_window win, win_faces;
		// Loop over all the images provided on the command line.
		// ****Loop over all the images
		for (int i = 2; i < argc; ++i)
		{
			cout << "processing image " << argv[i] << endl;
			array2d<rgb_pixel> img;//note the pixel type: rgb_pixel, a three-channel color image
			load_image(img, argv[i]);
			// Make the image larger so we can detect small faces.
			pyramid_up(img);

			// Now tell the face detector to give us a list of bounding boxes
			// around all the faces in the image.
			std::vector<rectangle> dets = detector(img);//detect the faces and get their bounding boxes
			cout << "Number of faces detected: " << dets.size() << endl;//number of faces detected

			// Now we will go ask the shape_predictor to tell us the pose of
			// each face we detected.
			//****Call the shape_predictor to get the pose of each face
			std::vector<full_object_detection> shapes;//note the type of the shape variable: full_object_detection
			for (unsigned long j = 0; j < dets.size(); ++j)
			{
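				full_object_detection shape = sp(img, dets[j]);//predict the pose; note the two inputs: the image, and a bounding box detected in that image
				cout << "number of parts: " << shape.num_parts() << endl;
				//cout << "pixel position of first part:  " << shape.part(0) << endl;//coordinates of the first point; note that indexing starts at 0
				//cout << "pixel position of second part: " << shape.part(1) << endl;//coordinates of the second point
				/* My own change: print every landmark (68 for this model) */
				// Use num_parts() instead of a hard-coded 69, and a fresh index
				// name so the outer image loop's i is not shadowed.
				for (unsigned long k = 0; k < shape.num_parts(); ++k)
				{
					cout << "Point " << (k + 1) << " coordinates: " << shape.part(k) << endl;
				}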
				// You get the idea, you can get all the face part locations if
				// you want them.  Here we just store them in shapes so we can
				// put them on the screen.
				shapes.push_back(shape);
			}

			// Now let's view our face poses on the screen.
			//**** Display the results
			win.clear_overlay();
			win.set_image(img);
			win.add_overlay(render_face_detections(shapes));

			// We can also extract copies of each face that are cropped, rotated upright,
			// and scaled to a standard size as shown here:
			//****We can also extract a copy of each face, cropped, rotated upright, and scaled to a standard size
			dlib::array<array2d<rgb_pixel> > face_chips;
			extract_image_chips(img, get_face_chip_details(shapes), face_chips);
			win_faces.set_image(tile_images(face_chips));

			cout << "Hit enter to process the next image..." << endl;
			cin.get();
		}
	}
	catch (exception& e)
	{
		cout << "\nexception thrown!" << endl;
		cout << e.what() << endl;
	}
}

// ----------------------------------------------------------------------------------------
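When reading the printed coordinates, it helps to know which part of the face each index belongs to. The following is a minimal sketch and not part of the dlib example: it assumes the model follows the conventional iBUG 300-W 68-point ordering (the dataset the header comment says the model was trained on), with left and right taken from the subject's point of view. The names LandmarkRegion, kRegions, and landmark_region are made up for this illustration and are not dlib identifiers.

```cpp
#include <cstddef>
#include <string>

// One contiguous run of landmark indices and a label for it.
// ASSUMPTION: the loaded model emits points in the conventional
// iBUG 300-W 68-point order; verify this against your own model.
struct LandmarkRegion {
    std::size_t first;  // first 0-based index in the region
    std::size_t last;   // last 0-based index in the region (inclusive)
    const char* name;   // label chosen for this sketch
};

// Left/right are from the subject's point of view (assumed convention).
static const LandmarkRegion kRegions[] = {
    { 0, 16, "jaw"},
    {17, 21, "right eyebrow"},
    {22, 26, "left eyebrow"},
    {27, 35, "nose"},
    {36, 41, "right eye"},
    {42, 47, "left eye"},
    {48, 67, "mouth"},
};

// Returns the region label for a 0-based landmark index,
// or "unknown" when the index falls outside 0..67.
std::string landmark_region(std::size_t index)
{
    for (const LandmarkRegion& r : kRegions)
    {
        if (index >= r.first && index <= r.last)
            return r.name;
    }
    return "unknown";
}
```

In the printing loop above, the same 0-based index that is passed to shape.part() could be passed to landmark_region() to print the label next to each coordinate.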

Everything else is the same as in the previous post. Good luck!
