
Re-ID: AlignedReID: Surpassing Human-Level Performance in Person Re-Identification (Paper Walkthrough)

  • A global feature (a C-d vector) is extracted by directly applying global pooling on the feature map.
  • In other words, the global feature is obtained by applying global pooling directly to the whole feature map.
  • For the local features, a horizontal pooling, which is a global pooling in the horizontal direction, is first applied to extract a local feature for each row, and a 1×1 convolution is then applied to reduce the channel number from C to c. In this way, each local feature (a c-d vector) represents a horizontal part of the image of a person.
  • That is, horizontal pooling is applied to the feature map row by row, followed by a 1×1 convolution; each resulting local feature describes one horizontal stripe of the body.
  • As a result, a person image is represented by a global feature and H local features.
  • In the end, an image of a person is represented by one global feature and H local features.
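To make this step concrete, below is a minimal PyTorch sketch of the two branches. PyTorch itself, the class name AlignedFeatureHead, the channel sizes, and the use of average pooling are illustrative assumptions, not the authors' released code.

```python
import torch
import torch.nn as nn


class AlignedFeatureHead(nn.Module):
    """Turn a (N, C, H, W) feature map into one global feature (C-d)
    and H local features (c-d each). Hypothetical sketch."""

    def __init__(self, in_channels=2048, local_channels=128):
        super().__init__()
        # 1x1 convolution that reduces the channel number from C to c
        self.reduce = nn.Conv2d(in_channels, local_channels, kernel_size=1)

    def forward(self, feature_map):
        # Global feature: global (average) pooling over the whole map -> (N, C)
        global_feat = feature_map.mean(dim=(2, 3))
        # Horizontal pooling: global pooling along the horizontal direction,
        # one feature per row of the feature map -> (N, C, H, 1)
        local_feat = feature_map.mean(dim=3, keepdim=True)
        # 1x1 conv reduces channels from C to c -> (N, c, H, 1) -> (N, H, c)
        local_feat = self.reduce(local_feat).squeeze(3).permute(0, 2, 1)
        return global_feat, local_feat


# Example: a ResNet-50-like feature map of size 2048 x 8 x 4
head = AlignedFeatureHead(in_channels=2048, local_channels=128)
g, l = head(torch.randn(2, 2048, 8, 4))   # g: (2, 2048), l: (2, 8, 128)
```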
  • The distance of two person images is the summation of their global and local distances.
  • The distance between two images is the sum of their global distance and their local distance.
  • The global distance is simply the L2 distance of the global features.
  • The global distance is simply the L2 distance between the two global features.
  • For the local distance, we dynamically match the local parts from top to bottom to find the alignment of local features with the minimum total distance.
  • The local distance is the length of the shortest path found by dynamic programming; this shortest path also determines which local features are aligned with each other.
  • This is based on a simple assumption that, for two images of the same person, the local feature from one body part of the first image is more similar to the semantically corresponding body part of the other image.
  • This metric rests on a simple assumption: for the same person, the same body part keeps a high similarity across different images.
  • Given the local features of two images, F = {f_1, ..., f_H} and G = {g_1, ..., g_H}, we first normalize the distance to [0, 1) by an element-wise transformation:
    • d_{i,j} = (e^{||f_i - g_j||_2} - 1) / (e^{||f_i - g_j||_2} + 1),   i, j ∈ {1, ..., H}
    • where d_{i,j} is the distance between the i-th vertical part of the first image and the j-th vertical part of the second image. A distance matrix D is formed based on these distances, where its (i, j)-element is d_{i,j}.
  • We define the local distance between the two images as the total distance of the shortest path from (1, 1) to (H, H) in the matrix D.
  • The formula above defines each element of the distance matrix D.
  • The distance can be calculated through dynamic programming as follows:
    • S_{1,1} = d_{1,1}
    • S_{i,1} = S_{i-1,1} + d_{i,1}  for i > 1
    • S_{1,j} = S_{1,j-1} + d_{1,j}  for j > 1
    • S_{i,j} = min(S_{i-1,j}, S_{i,j-1}) + d_{i,j}  for i > 1, j > 1
    • where S_{i,j} is the total distance of the shortest path when walking from (1, 1) to (i, j) in the distance matrix D, and S_{H,H} is the total distance of the final shortest path between the two images.
  • The formulas above are the state-transition equations used by the dynamic program to compute the shortest path.
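The normalization and the dynamic program can be put together as follows. This is a rough sketch assuming PyTorch; the function name local_distance is hypothetical, and a plain O(H²) loop is used for readability.

```python
import torch


def local_distance(F_parts, G_parts):
    """Aligned local distance between two images.

    F_parts, G_parts: (H, c) tensors holding the H local features of each
    image. Returns the total distance of the shortest path from (1, 1) to
    (H, H) in the normalized distance matrix D.
    """
    H = F_parts.shape[0]
    # Pairwise L2 distances between parts ...
    raw = torch.cdist(F_parts.unsqueeze(0), G_parts.unsqueeze(0)).squeeze(0)  # (H, H)
    # ... normalized element-wise to [0, 1): d = (e^x - 1) / (e^x + 1)
    D = (torch.exp(raw) - 1) / (torch.exp(raw) + 1)

    # Dynamic programming: S[i, j] is the shortest-path distance from (0, 0)
    # to (i, j) when we may only move right or down, so the top-to-bottom
    # order of the parts is preserved.
    S = torch.zeros_like(D)
    for i in range(H):
        for j in range(H):
            if i == 0 and j == 0:
                S[i, j] = D[i, j]
            elif i == 0:
                S[i, j] = S[i, j - 1] + D[i, j]
            elif j == 0:
                S[i, j] = S[i - 1, j] + D[i, j]
            else:
                S[i, j] = torch.min(S[i - 1, j], S[i, j - 1]) + D[i, j]
    return S[H - 1, H - 1]
```

The training distance of an image pair is then the L2 distance of the global features plus this local distance.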
  • Non-corresponding alignments are necessary to maintain the order of vertical alignment, as well as to make the corresponding alignments possible.
  • The shortest path may therefore contain non-corresponding alignments; they do not hurt the result, and they are essential for keeping the vertical alignment order and making the corresponding alignments possible.
  • The reason for using the global distance to mine hard samples is twofold:
    • First, the calculation of the global distance is much faster than that of the local distance.
    • Second, we observe that there is no significant difference in mining hard samples using both distances.
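A possible sketch of this mining step, using only the global L2 distances within a batch. The function name mine_hard_examples and the batch-hard rule (farthest positive, closest negative) are assumptions in the spirit of TriHard-style sampling, not a reproduction of the authors' code.

```python
import torch


def mine_hard_examples(global_feats, labels):
    """Batch-hard mining using only the global L2 distances.

    global_feats: (N, C) global features of a batch, labels: (N,) identity ids.
    Returns, for every anchor, the index of its hardest positive (farthest
    sample of the same identity) and hardest negative (closest sample of a
    different identity).
    """
    dist = torch.cdist(global_feats.unsqueeze(0),
                       global_feats.unsqueeze(0)).squeeze(0)      # (N, N)
    same_id = labels.unsqueeze(0) == labels.unsqueeze(1)          # (N, N)

    # Hardest positive: the largest distance among same-identity samples
    hardest_pos = dist.masked_fill(~same_id, float('-inf')).argmax(dim=1)
    # Hardest negative: the smallest distance among other-identity samples
    hardest_neg = dist.masked_fill(same_id, float('inf')).argmin(dim=1)
    return hardest_pos, hardest_neg
```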
  • Note that in the inference stage, we only use the global features to compute the similarity of two person images. We make this choice mainly because we unexpectedly observed that the global feature itself is also almost as good as the combined features.
  • This somehow counter-intuitive phenomenon might be caused by two factors:
    • the feature map jointly learned is better than learning the global feature only, because we have exploited the structure prior of the person image in the learning stage;
    • with the aid of local feature matching, the global feature can pay more attention to the body of the person, rather than overfitting the background.
  • The above explains why, at inference time, only the global feature distance is used rather than the local distance or a combination of both.
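A minimal sketch of this inference step (the function name rank_gallery is hypothetical; only the global features are compared):

```python
import torch


def rank_gallery(query_feat, gallery_feats):
    """Rank gallery images by the L2 distance between global features only.

    query_feat: (C,) global feature of the query image.
    gallery_feats: (M, C) global features of the gallery.
    Returns gallery indices ordered from best to worst match.
    """
    dists = torch.norm(gallery_feats - query_feat.unsqueeze(0), dim=1)  # (M,)
    return torch.argsort(dists)
```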
  • We apply mutual learning to train models for AlignedReID, which can further improve performance.
  • The authors train the models with mutual learning because it further improves performance.
  • A distillation-based model usually transfers knowledge from a pre-trained large teacher network to a small student network.
  • In conventional distillation, a large teacher network is pre-trained first, and its knowledge is then transferred to a small student network.
  • In this paper, we train a set of student models simultaneously, transferring knowledge between each other.
  • This paper instead trains several student models at the same time and lets them transfer knowledge to each other.
  • We propose a new mutual learning loss for metric learning.
    • The overall loss function includes the metric loss, the metric mutual loss, the classification loss, and the classification mutual loss.
    • The metric loss is decided by both the global distances and the local distances, while the metric mutual loss is decided only by the global distances.
    • The classification mutual loss is the KL divergence for classification.
  • The metric mutual loss is defined on the global distances produced by the two networks.
  • A zero gradient function, whose output equals its input but whose gradient is treated as zero during back-propagation, is applied inside this loss; applying it changes the second-order gradients of the loss.
  • We found that it speeds up the convergence and improves the accuracy compared to a mutual loss without the zero gradient function.
  • The paper thus defines a new mutual learning loss, and the zero gradient function inside it speeds up convergence and improves accuracy.
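As a rough illustration of the two mutual terms (assuming PyTorch): the KL divergence below matches the classification mutual loss described above, while the body of metric_mutual_loss and the use of detach() in place of the zero gradient function are my own approximations, not the paper's exact equation.

```python
import torch
import torch.nn.functional as F


def classification_mutual_loss(logits_a, logits_b):
    """Classification mutual loss: KL divergence between the class
    predictions of two student networks (the "teacher" side is detached)."""
    target = F.softmax(logits_b, dim=1).detach()
    return F.kl_div(F.log_softmax(logits_a, dim=1), target,
                    reduction='batchmean')


def metric_mutual_loss(dist_a, dist_b):
    """Illustrative metric mutual loss on the global distance matrices of
    the two networks. detach() plays the role of a zero gradient function:
    dist_b is treated as a constant, so no gradient flows into network b
    through this term. The exact formula in the paper may differ."""
    return ((dist_a - dist_b.detach()) ** 2).mean()
```

For each network, these mutual terms would be added to its own metric and classification losses to form the overall objective described above.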