【超解析度】python中的影象空間的轉換 RGB--YCBCR

阿新 • • 發佈：2018-12-06

由於人眼對顏色不敏感，而對光亮通道更加敏感。因此在超解析度任務中，我們通常需要將RGB通道轉換為Ycbcr通道。在Python的程式碼實現中，我發現opencv的RGB轉Ycbcr的計算方式和Matlab的實現方式有些不同，而NTIRE的評估往往是在matlab平臺的。因此，這裡需要注意。

Python RGB轉Ycbcr通道

對於Set5中的baby影象

Code:

img = cv2.imread(imgpath)
img = cv2.cvtColor(img, cv2.COLOR_BGR2YCR_CB)
img_y = img[:,:,0]

Result:

array([[253, 253, 253, ..., 254, 254, 254],
       [253, 253, 253, ..., 254, 254, 254],
       [253, 253, 253, ..., 254, 254, 254],
       ...,
       [ 62,  70,  72, ...,  67,  67,  67],
       [ 54,  58,  59, ...,  69,  68,  68],
       [ 49,  52,  53, ...,  70,  70,  69]], dtype=uint8)

實驗原理:
在這裡插入圖片描述

參考連結：https://docs.opencv.org/3.0.0/de/d25/imgproc_color_conversions.html

Matlab RGB轉Ycbcr通道

Code:

im  = imread(imgpath);
im = rgb2ycbcr(im);
im = im(:, :, 1);

Result:
在這裡插入圖片描述

Matlab實現方式:

function ycbcr = rgb2ycbcr(varargin)
%RGB2YCBCR Convert RGB color values to YCbCr color space.
%   YCBCRMAP = 
 RGB2YCBCR(MAP) converts the RGB values in MAP to the YCBCR
%   color space. MAP must be a M-by-3 array. YCBCRMAP is a M-by-3 matrix
%   that contains the YCBCR luminance (Y) and chrominance (Cb and Cr) color
%   values as columns.  Each row represents the equivalent color to the
%   corresponding row in the RGB colormap.
%
%   YCBCR = RGB2YCBCR(RGB) converts the truecolor image RGB to the
%   equivalent image in the YCBCR color space. RGB must be a M-by-N-by-3
%   array.
%
%   If the input is uint8, then YCBCR is uint8 where Y is in the range [16
%   235], and Cb and Cr are in the range [16 240].  If the input is a double,
%   then Y is in the range [16/255 235/255] and Cb and Cr are in the range
%   [16/255 240/255].  If the input is uint16, then Y is in the range [4112
%   60395] and Cb and Cr are in the range [4112 61680].
%
%   Class Support
%   -------------
%   If the input is an RGB image, it can be uint8, uint16, or double. If the
%   input is a colormap, then it must be double. The output has the same class
%   as the input.
%
%   Examples
%   --------
%   Convert RGB image to YCbCr.
%
%      RGB = imread('board.tif');
%      YCBCR = rgb2ycbcr(RGB);
%
%   Convert RGB color space to YCbCr.
%
%      map = jet(256);
%      newmap = rgb2ycbcr(map);
%
%   See also NTSC2RGB, RGB2NTSC, YCBCR2RGB.

%   Copyright 1993-2010 The MathWorks, Inc.  

%   References: 
%     C.A. Poynton, "A Technical Introduction to Digital Video", John Wiley
%     & Sons, Inc., 1996, p. 175
% 
%     Rec. ITU-R BT.601-5, "STUDIO ENCODING PARAMETERS OF DIGITAL TELEVISION
%     FOR STANDARD 4:3 AND WIDE-SCREEN 16:9 ASPECT RATIOS",
%     (1982-1986-1990-1992-1994-1995), Section 3.5.

rgb = parse_inputs(varargin{:});

%initialize variables
isColormap = false;

%must reshape colormap to be m x n x 3 for transformation
if (ndims(rgb) == 2)
  %colormap
  isColormap=true;
  colors = size(rgb,1);
  rgb = reshape(rgb, [colors 1 3]);
end

% This matrix comes from a formula in Poynton's, "Introduction to
% Digital Video" (p. 176, equations 9.6). 

% T is from equation 9.6: ycbcr = origT * rgb + origOffset;
origT = [65.481 128.553 24.966;...
     -37.797 -74.203 112; ...
     112 -93.786 -18.214];
origOffset = [16;128;128];

% The formula ycbcr = origT * rgb + origOffset, converts a RGB image in the range
% [0 1] to a YCbCr image where Y is in the range [16 235], and Cb and Cr
% are in that range [16 240]. For each class type (double,uint8,
% uint16), we must calculate scaling factors for origT and origOffset so that
% the input image is scaled between 0 and 1, and so that the output image is
% in the range of the respective class type.

scaleFactor.double.T = 1/255;      % scale output so in range [0 1].
scaleFactor.double.offset = 1/255; % scale output so in range [0 1].
scaleFactor.uint8.T = 1/255;       % scale input so in range [0 1].
scaleFactor.uint8.offset = 1;      % output is already in range [0 255].
scaleFactor.uint16.T = 257/65535;  % scale input so it is in range [0 1]  
                                   % and scale output so it is in range 
                                   % [0 65535] (255*257 = 65535).
scaleFactor.uint16.offset = 257;   % scale output so it is in range [0 65535].

% The formula ycbcr = origT*rgb + origOffset is rewritten as 
% ycbcr = scaleFactorForT * origT * rgb + scaleFactorForOffset*origOffset.  
% To use imlincomb, we rewrite the formula as ycbcr = T * rgb + offset, where T and
% offset are defined below.
classIn = class(rgb);
T = scaleFactor.(classIn).T * origT;
offset = scaleFactor.(classIn).offset * origOffset;

%initialize output
ycbcr = zeros(size(rgb),classIn);

for p = 1:3
  ycbcr(:,:,p) = imlincomb(T(p,1),rgb(:,:,1),T(p,2),rgb(:,:,2), ...
                         T(p,3),rgb(:,:,3),offset(p));
end  

if isColormap
  ycbcr = reshape(ycbcr, [colors 3 1]);
end

%%%
%Parse Inputs
%%%
function X = parse_inputs(varargin)

narginchk(1,1);
X = varargin{1};

if ndims(X)==2
  % For backward compatibility, this function handles uint8 and uint16
  % colormaps. This usage will be removed in a future release.

  validateattributes(X,{'uint8','uint16','double'},{'nonempty'},mfilename,'MAP',1);
  if (size(X,2) ~=3 || size(X,1) < 1)
    error(message('images:rgb2ycbcr:invalidSizeForColormap'))
  end
  if ~isa(X,'double')
    warning(message('images:rgb2ycbcr:notAValidColormap'))
    X = im2double(X);
  end

elseif ndims(X)==3
  validateattributes(X,{'uint8','uint16','double'},{},mfilename,'RGB',1);
  if (size(X,3) ~=3)
    error(message('images:rgb2ycbcr:invalidTruecolorImage'))
  end
else
  error(message('images:rgb2ycbcr:invalidInputSize'))
end

實驗可發現兩種實現方式的結果存在著不同，這是因為兩者的內部實現原理不同。這裡提供一個與Matlab的Ycbcr空間轉換類似的函式：


def rgb2ycbcr(img, only_y=True):
    '''same as matlab rgb2ycbcr
    only_y: only return Y channel
    Input:
        uint8, [0, 255]
        float, [0, 1]
    '''
    in_img_type = img.dtype
    img.astype(np.float32)
    if in_img_type != np.uint8:
        img *= 255.
    # convert
    if only_y:
        rlt = np.dot(img, [65.481, 128.553, 24.966]) / 255.0 + 16.0
    else:
        rlt = np.matmul(img, [[65.481, -37.797, 112.0], [128.553, -74.203, -93.786],
                              [24.966, 112.0, -18.214]]) / 255.0 + [16, 128, 128]
    if in_img_type == np.uint8:
        rlt = rlt.round()
    else:
        rlt /= 255.
    return rlt.astype(in_img_type)

【超解析度】python中的影象空間的轉換 RGB--YCBCR

由於人眼對顏色不敏感，而對光亮通道更加敏感。因此在超解析度任務中，我們通常需要將RGB通道轉換為Ycbcr通道。在Python的程式碼實現中，我發現opencv的RGB轉Ycbcr的計算方式和Matlab的實現方式有些不同，而NTIRE的評估往往是在matlab平臺的。因此，這裡需要注意

【超解析度】超解析度中的imresize函式（python, Matlab）

背景：超解析度挑戰賽Super Resolution Challenges (e.g. NTIRE) 降取樣（downscaling）- bicubic interpolation- 是利用Matlab的imresize()函式實現的。 Track info: Track 1:

【人人都是Pythoner】【超全】python的collections模組詳解

前言： python中內建容器包括list、dict、set、tuple，而python中的collections模組則另引入了五種資料結構，更好地滿足編碼需求。下文驗證資料型別方法用到的程式碼放在了我的github上，歡迎下載： AdvancingMsCat的github co

【DRF認證】Python中第三方庫rest_framework的用法

本文詳細講述了DRF認證元件的原理以及用法. 原始碼剖析上一篇部落格講解DRF版本的時候我們都知道了，在dispatch方法裡執行了initial方法來初始化我們的版本. 而在initial方法裡

【DRF頻率】Python中第三方庫rest_framework的用法

開發平臺的API介面呼叫需要限制其頻率，以節約伺服器資源和避免惡意的頻繁呼叫. DRF就為我們提供了一些頻率限制的方法. DRF中的版本、認證、許可權、頻率元件的原始碼是一個流程，且頻率元件再最後執行.

【超解析度】Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network

之前我一直在做基於CNN的超解析度研究。最近因為工作需要，需要研究基於生成對抗網路GAN的網路來做超解析度任務。在這段時間以來，我發現CNN和GAN兩類網路的側重點其實完全不同。CNN旨在於忠實的恢復影象的高頻資訊，而GAN在於生成更真實或者說更符合人眼的高

【超解析度】Laplacian Pyramid Networks（LapSRN）

看了眼作者，Jia-Bin Huang是傳統演算法領域（呃，自從深度學習出來後，我就將之前的演算法都算傳統方法了）的超解析度學習的專家大牛。然而有意思的是，這篇論文結合了傳統演算法laplacian pyramid 和 CNN網路，這也給我們這些研究者一

zhlan--【偷】Python中的賦值運算符

運算 alt ges 比較 images pytho 比較運算符賦值技術分享 >>>>Python中的賦值運算符： >>>>Python中的比較運算符： zhlan--【偷】Python中的賦值運算符

【Python學習】Python中的數據類型精度問題

類型一次 /usr logs int 第一次 pytho 整數問題 Python真的很神奇。。。神奇到沒有直接的數據類型概念，並且精度可以是任意精度。想當初，第一次接觸OI算法時，寫得第一個算法就是高精度加法，搗鼓了半天。一切在Python看來，僅僅三行代碼即可完成。

【轉載】python中math模塊常用的方法

sum tran magic 大於 mea 正弦 erlang his isnan 轉自：https://www.cnblogs.com/renpingsheng/p/7171950.html ceil #取大於等於x的最小的整數值，如果x是一個整數，則返回x ceil(x

【轉載】Python中的正則表達式教程

大小區別 some 操作按位或出了 sta 技術分享嘗試本文http://www.cnblogs.com/huxi/archive/2010/07/04/1771073.html 正則表達式經常被用到，而自己總是記不全，轉載一份完整的以備不時之需。 1.

【Python】Python中的列表操作

元素提取添加 sta 連接 not n個元素 none 格式 Python的列表操作可謂是功能強大且方便（相對於Java）簡單、常規的操作就不說了（這不是一個入門教程），介紹幾個很有特點的例子添加 # 追加到結尾(append) li = [1, 2, 3, 4, 5

【轉載】Python 中的 if name == 'main' 該如何理解

一個知識如果協程運行 pat 執行開始參考資料轉自曠世的憂傷 http://blog.konghy.cn/2017/04/24/python-entry-program/ 程序入口對於很多編程語言來說，程序都必須要有一個入口，比如 C，C++，以及完全面向

【轉】Python中操作mysql的pymysql模塊詳解

定義 padding 參數化查詢 finall 支持順序執行sql mysq syntax Python中操作mysql的pymysql模塊詳解前言 pymsql是Python中操作MySQL的模塊，其使用方法和MySQLdb幾乎相同。但目前pymysql支持p

【轉】python中獲取python版本號的方法

n) https href light nor body true print brush 原文 python3 #!/usr/bin/python # 第1種方法 import platform print(platform.python_version())

【轉】python中安裝包出現Retrying (Retry(total=4, connect=None, read=None, redirect=None, status=None))…

ted port 鏡像如果 after conf tab fun src 問題： python3安裝web.py安裝包出現Retrying (Retry(total=4, connect=None, read=None, redirect=None, status=Non

【Python-pip】Python中pip加速設置

https 文件 imp 國內技術分享 simple 技術 -h users 1：在C:\Users\Administrator\pip建一個文件pip.ini如果Administrator中沒有pip文件夾則自己新建一個，然後新建一個pip.ini文件 2：在pip.i

【python apply】python 中apply、map、applymap的用法

apply 用在dataframe上，用於對row或者column進行計算 applymap：作用在dataframe的每一個元素上 map （其實是python自帶的）用於series上，是元素級別的操作,map 跟apply 功能類似，用法差不多 #

【Python】Python中 sys.argv[]的用法簡明解釋

sys.argv[]說白了就是一個從程式外部獲取引數的橋樑，這個“外部”很關鍵，所以那些試圖從程式碼來說明它作用的解釋一直沒看明白。因為我們從外部取得的引數可以是多個，所以獲得的是一個列表（list)，也就是說sys.argv其實可以看作是

【轉】python中的os模塊

圖片註意 getmtime 獲取路徑測試 strong ipc 創建文件 .com 在自動化測試中，經常需要查找操作文件，比如說查找配置文件（從而讀取配置文件的信息），查找測試報告（從而發送測試報告郵件），經常要對大量文件和大量路徑進行操作，這就依賴於os模塊，所以今天

【超解析度】python中的影象空間的轉換 RGB--YCBCR

Python RGB轉Ycbcr通道

Matlab RGB轉Ycbcr通道

相關推薦