Understanding Focal Loss

Axin • Published: 2019-01-04

Paper: "Focal Loss for Dense Object Detection"

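Focal Loss is a loss function proposed by Kaiming He and his co-authors to handle the extreme imbalance between foreground and background classes (for example 1:1000) that one-stage object detectors face during training. It is derived from the binary cross-entropy loss.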

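Standard cross-entropy

$$\mathrm{CE}(p, y) = \begin{cases} -\log(p) & \text{if } y = 1 \\ -\log(1 - p) & \text{otherwise} \end{cases}$$

Here p is the model's predicted probability that the sample belongs to class y = 1. For notational convenience, define:

$$p_t = \begin{cases} p & \text{if } y = 1 \\ 1 - p & \text{otherwise} \end{cases}$$

The cross-entropy CE can then be rewritten as:

$$\mathrm{CE}(p, y) = \mathrm{CE}(p_t) = -\log(p_t)$$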
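As a rough illustration, the sketch below computes p_t (p when y = 1, otherwise 1 - p) and the cross-entropy as -log(p_t) in plain Python. The function names `true_class_prob` and `cross_entropy` and the sample probabilities are assumptions made for this example, not part of the original post.

```python
import math


def true_class_prob(p: float, y: int) -> float:
    """Return p_t: p when y == 1, otherwise 1 - p."""
    return p if y == 1 else 1.0 - p


def cross_entropy(p: float, y: int) -> float:
    """Binary cross-entropy written as -log(p_t)."""
    return -math.log(true_class_prob(p, y))


# Illustrative values: the same prediction p = 0.9 scored against both labels.
easy = cross_entropy(0.9, 1)  # p_t = 0.9, confident and correct
hard = cross_entropy(0.9, 0)  # p_t = 0.1, confident and wrong
print(easy, hard)
```

Writing the loss in terms of p_t removes the case split on y, which is what makes the later weighted variants easy to express.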

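α-balanced cross-entropy

One way to address class imbalance is to introduce a weighting factor α in [0, 1]: use α when y = 1 and 1 - α when y = 0.

With this scheme the background class (y = 0) is weighted by 1 - α, so as α increases the background loss is down-weighted more strongly, which reduces the influence of the overwhelming number of background samples on training.

Defining α_t analogously to p_t, the α-balanced CE can be written as:

$$\mathrm{CE}(p_t) = -\alpha_t \log(p_t)$$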
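A minimal sketch of the α-balanced cross-entropy, -α_t log(p_t), where α_t = α for y = 1 and 1 - α for y = 0. The function name `alpha_balanced_ce` and the value α = 0.75 are assumptions chosen only to make the weighting visible.

```python
import math


def alpha_balanced_ce(p: float, y: int, alpha: float) -> float:
    """Cross-entropy weighted by alpha_t: alpha for y == 1, 1 - alpha for y == 0."""
    p_t = p if y == 1 else 1.0 - p
    alpha_t = alpha if y == 1 else 1.0 - alpha
    return -alpha_t * math.log(p_t)


# Same uncertainty (p_t = 0.5) for a positive and a negative sample.
# alpha = 0.75 is an illustrative value, not one taken from the post.
pos = alpha_balanced_ce(0.5, 1, alpha=0.75)
neg = alpha_balanced_ce(0.5, 0, alpha=0.75)
print(pos, neg)
```

With this α the positive sample contributes three times as much as the equally uncertain negative one, yet the weight ignores how easy or hard each sample is.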

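Definition of Focal Loss

Although α-CE balances the contributions of positive and negative samples to the loss, it cannot distinguish how much easy and hard samples contribute. Focal Loss addresses this and is defined as:

$$L_{fl} = \begin{cases} -\alpha (1 - y')^{\gamma} \log(y') & y = 1 \\ -(1 - \alpha)\, y'^{\gamma} \log(1 - y') & y = 0 \end{cases}$$

Here α and γ are constant hyperparameters, and y' is the model's predicted probability, with a value in (0, 1).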

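When y = 1 and y' -> 1, the sample is an easy positive, and its contribution to the loss -> 0;

When y = 0 and y' -> 0, the sample is an easy negative, and its contribution to the loss -> 0.

Focal Loss therefore reduces not only the weight of the background class but also the weight of easy positives and easy negatives.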
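The behaviour for easy and hard samples can be checked numerically. The sketch below evaluates the two-case focal loss for a single prediction y', using α = 0.25 and γ = 2.0 as in the PyTorch code from this post. The function name `focal_loss` and the sample probabilities are assumptions for illustration.

```python
import math


def focal_loss(y_pred: float, y: int, alpha: float = 0.25, gamma: float = 2.0) -> float:
    """Focal loss for one prediction y_pred in (0, 1) and a binary label y."""
    if y == 1:
        return -alpha * (1.0 - y_pred) ** gamma * math.log(y_pred)
    return -(1.0 - alpha) * y_pred ** gamma * math.log(1.0 - y_pred)


# Illustrative predictions: easy samples are confidently correct,
# hard samples are confidently wrong.
easy_pos = focal_loss(0.99, 1)
hard_pos = focal_loss(0.10, 1)
easy_neg = focal_loss(0.01, 0)
hard_neg = focal_loss(0.90, 0)
print(easy_pos, hard_pos, easy_neg, hard_neg)
```

Both easy cases come out several orders of magnitude smaller than the hard ones, so a large number of easy background anchors no longer dominates the summed loss.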

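γ modulates the loss function: when γ = 0, Focal Loss is equivalent to α-CE, and larger values of γ down-weight well-classified samples more strongly.

[Figure: effect of γ on Focal Loss]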
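To make the role of γ concrete, the sketch below tabulates the modulating factor (1 - p_t)^γ for a well-classified sample (p_t = 0.9) and a poorly classified one (p_t = 0.1). The function name `modulating_factor` and the chosen γ and p_t values are assumptions for illustration.

```python
def modulating_factor(p_t: float, gamma: float) -> float:
    """Factor (1 - p_t) ** gamma that scales the cross-entropy term."""
    return (1.0 - p_t) ** gamma


# Illustrative gamma values; gamma = 0 leaves the cross-entropy unchanged.
for gamma in (0.0, 0.5, 1.0, 2.0, 5.0):
    well_classified = modulating_factor(0.9, gamma)
    misclassified = modulating_factor(0.1, gamma)
    print(f"gamma={gamma}: p_t=0.9 -> {well_classified:.5f}, p_t=0.1 -> {misclassified:.5f}")
```

At γ = 2 the factor for p_t = 0.9 is about 0.01, so that sample's loss shrinks roughly a hundredfold, while the factor for p_t = 0.1 stays at about 0.81.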

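PyTorch implementation of Focal Loss

The FocalLoss module below returns two values: the focal (classification) loss and the box regression loss. The focal loss itself is the part under the comment "compute the loss for classification".

Code from: https://github.com/yhenon/pytorch-retinanet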

import torch
import torch.nn as nn


def calc_iou(a, b):
    """Pairwise IoU between boxes a (N x 4) and b (M x 4) in (x1, y1, x2, y2) format."""
    area = (b[:, 2] - b[:, 0]) * (b[:, 3] - b[:, 1])

    # Intersection width and height, broadcast to N x M
    iw = torch.min(torch.unsqueeze(a[:, 2], dim=1), b[:, 2]) - torch.max(torch.unsqueeze(a[:, 0], 1), b[:, 0])
    ih = torch.min(torch.unsqueeze(a[:, 3], dim=1), b[:, 3]) - torch.max(torch.unsqueeze(a[:, 1], 1), b[:, 1])
    iw = torch.clamp(iw, min=0)
    ih = torch.clamp(ih, min=0)
    intersection = iw * ih

    # Union area, clamped to avoid division by zero
    ua = torch.unsqueeze((a[:, 2] - a[:, 0]) * (a[:, 3] - a[:, 1]), dim=1) + area - intersection
    ua = torch.clamp(ua, min=1e-8)

    return intersection / ua


class FocalLoss(nn.Module):

    def forward(self, classifications, regressions, anchors, annotations):
        alpha = 0.25
        gamma = 2.0
        batch_size = classifications.shape[0]
        # Create new tensors on the input's device instead of hard-coding .cuda()
        device = classifications.device
        classification_losses = []
        regression_losses = []

        anchor = anchors[0, :, :]

        anchor_widths = anchor[:, 2] - anchor[:, 0]
        anchor_heights = anchor[:, 3] - anchor[:, 1]
        anchor_ctr_x = anchor[:, 0] + 0.5 * anchor_widths
        anchor_ctr_y = anchor[:, 1] + 0.5 * anchor_heights

        for j in range(batch_size):
            classification = classifications[j, :, :]
            regression = regressions[j, :, :]

            # Drop padding rows, which are marked with class id -1
            bbox_annotation = annotations[j, :, :]
            bbox_annotation = bbox_annotation[bbox_annotation[:, 4] != -1]

            if bbox_annotation.shape[0] == 0:
                regression_losses.append(torch.tensor(0.0, device=device))
                classification_losses.append(torch.tensor(0.0, device=device))
                continue

            # Keep probabilities away from 0 and 1 so the logs stay finite
            classification = torch.clamp(classification, 1e-4, 1.0 - 1e-4)

            IoU = calc_iou(anchor, bbox_annotation[:, :4])  # num_anchors x num_annotations
            IoU_max, IoU_argmax = torch.max(IoU, dim=1)  # best annotation per anchor

            # compute the loss for classification
            # Targets: -1 = ignore, 0 = background, 1 = object class
            targets = torch.full_like(classification, -1.0)
            targets[torch.lt(IoU_max, 0.4), :] = 0

            positive_indices = torch.ge(IoU_max, 0.5)
            num_positive_anchors = positive_indices.sum()

            assigned_annotations = bbox_annotation[IoU_argmax, :]

            targets[positive_indices, :] = 0
            targets[positive_indices, assigned_annotations[positive_indices, 4].long()] = 1

            # alpha_t: alpha for positives, 1 - alpha for negatives
            alpha_factor = torch.full_like(targets, alpha)
            alpha_factor = torch.where(torch.eq(targets, 1.), alpha_factor, 1. - alpha_factor)
            # (1 - p_t): 1 - p for positives, p for negatives
            focal_weight = torch.where(torch.eq(targets, 1.), 1. - classification, classification)
            focal_weight = alpha_factor * torch.pow(focal_weight, gamma)

            bce = -(targets * torch.log(classification) + (1.0 - targets) * torch.log(1.0 - classification))

            cls_loss = focal_weight * bce

            # Anchors with 0.4 <= IoU < 0.5 keep target -1 and contribute nothing
            cls_loss = torch.where(torch.ne(targets, -1.0), cls_loss, torch.zeros_like(cls_loss))

            # Normalize by the number of positive anchors (at least 1)
            classification_losses.append(cls_loss.sum() / torch.clamp(num_positive_anchors.float(), min=1.0))

            # compute the loss for regression
            if num_positive_anchors > 0:
                assigned_annotations = assigned_annotations[positive_indices, :]

                anchor_widths_pi = anchor_widths[positive_indices]
                anchor_heights_pi = anchor_heights[positive_indices]
                anchor_ctr_x_pi = anchor_ctr_x[positive_indices]
                anchor_ctr_y_pi = anchor_ctr_y[positive_indices]

                gt_widths = assigned_annotations[:, 2] - assigned_annotations[:, 0]
                gt_heights = assigned_annotations[:, 3] - assigned_annotations[:, 1]
                gt_ctr_x = assigned_annotations[:, 0] + 0.5 * gt_widths
                gt_ctr_y = assigned_annotations[:, 1] + 0.5 * gt_heights

                # clip widths and heights to at least 1
                gt_widths = torch.clamp(gt_widths, min=1)
                gt_heights = torch.clamp(gt_heights, min=1)

                # Box regression targets relative to the matched anchors
                targets_dx = (gt_ctr_x - anchor_ctr_x_pi) / anchor_widths_pi
                targets_dy = (gt_ctr_y - anchor_ctr_y_pi) / anchor_heights_pi
                targets_dw = torch.log(gt_widths / anchor_widths_pi)
                targets_dh = torch.log(gt_heights / anchor_heights_pi)

                targets = torch.stack((targets_dx, targets_dy, targets_dw, targets_dh))
                targets = targets.t()

                # Scale the targets by fixed per-coordinate factors
                targets = targets / torch.tensor([[0.1, 0.1, 0.2, 0.2]], device=device, dtype=targets.dtype)

                regression_diff = torch.abs(targets - regression[positive_indices, :])

                # Smooth L1 loss: quadratic below 1/9, linear above
                regression_loss = torch.where(
                    torch.le(regression_diff, 1.0 / 9.0),
                    0.5 * 9.0 * torch.pow(regression_diff, 2),
                    regression_diff - 0.5 / 9.0
                )
                regression_losses.append(regression_loss.mean())
            else:
                regression_losses.append(torch.tensor(0.0, device=device))

        return torch.stack(classification_losses).mean(dim=0, keepdim=True), torch.stack(regression_losses).mean(dim=0, keepdim=True)
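The regression branch in the code above uses a smooth L1 loss that is quadratic for differences up to 1/9 and linear beyond that. The scalar sketch below mirrors that piecewise rule in plain Python; the function name `smooth_l1` and the parameter name `beta` are assumptions introduced for this example and do not appear in the original code.

```python
def smooth_l1(diff: float, beta: float = 1.0 / 9.0) -> float:
    """Smooth L1 on a single difference: quadratic up to beta, linear above it."""
    diff = abs(diff)
    if diff <= beta:
        # Same as 0.5 * 9.0 * diff ** 2 when beta = 1/9
        return 0.5 * diff * diff / beta
    # Same as diff - 0.5 / 9.0 when beta = 1/9
    return diff - 0.5 * beta


# Illustrative differences on both sides of the threshold.
for d in (0.0, 0.05, 1.0 / 9.0, 0.5, 1.0):
    print(d, smooth_l1(d))
```

The two branches meet at diff = 1/9 with the same value, 1/18, so the loss is continuous there. Small errors get a gentle quadratic penalty and large errors grow only linearly.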
