OpenCV-like image preprocessing functions in TensorFlow

阿新 • Published: 2019-02-16

"""
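TensorFlow supports the JPG and PNG image formats and the RGB and RGBA color spaces.
An image is represented by a tensor with the same dimensions as the image: [height, width, channels].
The channels of a pixel form a rank-1 tensor holding one scalar per channel, the amount of that color.
All pixels of an image are stored in a file on disk and must be loaded into memory.
Loading an image works the same way as loading any other binary file.
After loading, the image has to be decoded.
An input producer (tf.train.string_input_producer) finds the required files and loads them into a queue.
tf.WholeFileReader loads a complete image file into memory: WholeFileReader.read reads the image and tf.image.decode_jpeg decodes the JPEG data.
An image is a rank-3 tensor.
An RGB value is a rank-1 tensor.
A batch of loaded images has the shape [batch_size, image_height, image_width, channels].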
 
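If the images in a batch are too large or too numerous, they use too much memory and the system stops responding.
Large input images consume a lot of system memory.
Training a CNN takes a long time; loading large files adds to the training time, and such images are hard to fit into the GPU memory of most systems.
Large images also carry a lot of irrelevant detail, which hurts the model's ability to generalize.
tf.image.decode_jpeg decodes JPEG images.
tf.image.decode_png decodes PNG images.
The difference between the two is the alpha (transparency) information.
Setting the alpha value of a removed region to 0 helps to mark it as removed.
Repeatedly editing a JPEG image leaves artifacts.
PNG is lossless and keeps all the information of the original file (unless the image is resized or downsampled), at the cost of larger files.
TFRecord is TensorFlow's built-in file format; it stores the binary data and the training labels in the same file.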
 
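Convert images to the TFRecord format before training the model.
A TFRecord file is a protobuf-formatted file.
The data is not compressed, so it can be loaded into memory quickly.
Labels for multi-class (single-label) classification are expressed with one-hot encoding.
The image is loaded into memory, converted to a byte array, and added to a tf.train.Example; SerializeToString serializes the example to a binary string, which is saved to disk.
Serialization converts an in-memory object into a format that can be safely written to a file, loaded again, and deserialized back into the example format.
Loading TFRecord files directly saves training time.
A TFRecord file can hold multiple examples.
A TFRecordReader object reads TFRecord files.
tf.parse_single_example parses the TFRecord but does not decode the image; the image is read as raw bytes (tf.decode_raw).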
 
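tf.reshape restores the layout [image_height, image_width, image_channels]; together with a leading batch dimension this is the layout tf.nn.conv2d expects.
tf.expand_dims adds the batch_size dimension to input_batch.
tf.equal checks whether the same image was loaded.
sess.run(tf.cast(tf_record_features['label'], tf.string)) shows the label loaded from the TFRecord file.
When working with image data, storing the data and labels in TFRecord files is recommended.
Preprocess the images up front and save the results.
Image manipulation such as cropping, resizing, and adjusting color levels is best done in the preprocessing stage.
After loading, flipping and distorting images diversifies the training input and helps against overfitting.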
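PIL and OpenCV are image-processing frameworks for Python.
TensorFlow provides some image-processing methods of its own.
Cropping: tf.image.central_crop removes regions of the image and discards their information completely. It is similar to tf.slice (which removes components of a tensor), but returns a result centered on the image.
tf.image.crop_to_bounding_box crops from a given offset. It only accepts tensors with a defined shape, so the input image must first be evaluated in the graph. When cropping during training, if the background is useful, randomize the offset of the crop's starting position from the image center.
tf.image.pad_to_bounding_box pads the border with zeros so that the input image matches the expected size.
Images that are too small are padded at the border with zero-valued pixels.
tf.image.resize_image_with_crop_or_pad handles images that are too large or too small by cropping and padding at the same time, relative to the image center.
Flipping mirrors every pixel position horizontally or vertically.
Randomly flipping images can help prevent overfitting.
tf.slice selects a subset of the image data.
tf.image.flip_left_right flips horizontally.
tf.image.flip_up_down flips vertically.
The seed parameter controls the randomness of the random flips.
Training on edited images can mislead a CNN model.
Randomly altering image attributes lets a CNN match features in images that were edited or taken under different lighting.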
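tf.image.adjust_brightness adjusts brightness.
tf.image.adjust_contrast adjusts contrast.
When adjusting contrast, choose small deltas to avoid "overexposure"; once values hit the maximum they cannot be recovered, and the image may turn completely white or black.
tf.slice is used to pick out the changed pixels.
tf.image.adjust_hue adjusts the hue, making colors richer.
The delta parameter controls how far the hue is shifted.
tf.image.adjust_saturation adjusts saturation, emphasizing changes in color.
A grayscale image has a single color channel, so each pixel is a rank-1 tensor with a single component.
Reducing the color space can speed up training.
A grayscale pixel has a single component in the range [0, 255].
tf.image.rgb_to_grayscale converts an RGB image to grayscale.
The grayscale value of a pixel is an average over its color values (tf.image.rgb_to_grayscale uses a weighted average of the channels rather than a plain mean).
tf.image.rgb_to_hsv converts an RGB image to HSV. Hue, saturation, and value make up the HSV color space; each pixel is a rank-1 tensor with 3 components.
HSV is closer to how humans perceive color.
HSV is also called HSB, where B stands for brightness.
tf.image.hsv_to_rgb converts an HSV image to RGB; tf.image.grayscale_to_rgb converts a grayscale image to RGB.
python-colormath provides the LAB color space, in which color differences map closely to human perception: the Euclidean distance between two colors reflects the difference a person perceives.
tf.image.convert_image_dtype(image, dtype, saturate=False) changes the image data type and rescales the pixel values accordingly.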
"""
import tensorflow as tf  # TensorFlow 1.x API (sessions and input queues)
sess = tf.Session()
# A single red pixel as a rank-1 tensor (illustration only, not used below).
red = tf.constant([255, 0, 0])
file_names = ['./images/chapter-05-object-recognition-and-classification/working-with-images/test-input-image.jpg']
# Queue the file names, read each file whole, and decode it as JPEG.
filename_queue = tf.train.string_input_producer(file_names)
image_reader = tf.WholeFileReader()
_, image_file = image_reader.read(filename_queue)
image = tf.image.decode_jpeg(image_file)
sess.run(tf.global_variables_initializer())
# Queue runners fill the file name queue in background threads.
coord = tf.train.Coordinator()
threads = tf.train.start_queue_runners(sess=sess, coord=coord)
print(sess.run(image))
filename_queue.close(cancel_pending_enqueues=True)
coord.request_stop()
coord.join(threads)
print("------------------------------------------------------")
# Write the decoded image and its label to a TFRecord file.
image_label = b'\x01'
image_loaded = sess.run(image)
# Raw pixel bytes; the shape is not stored, so keep it for reshaping later.
image_bytes = image_loaded.tobytes()
image_height, image_width, image_channels = image_loaded.shape
writer = tf.python_io.TFRecordWriter("./output/training-image.tfrecord")
example = tf.train.Example(features=tf.train.Features(feature={
        'label': tf.train.Feature(bytes_list=tf.train.BytesList(value=[image_label])),
        'image': tf.train.Feature(bytes_list=tf.train.BytesList(value=[image_bytes]))
    }))
print(example)
writer.write(example.SerializeToString())
writer.close()
print("------------------------------------------------------")
# Read the TFRecord file back and parse the serialized example.
tf_record_filename_queue = tf.train.string_input_producer(["./output/training-image.tfrecord"])
tf_record_reader = tf.TFRecordReader()
_, tf_record_serialized = tf_record_reader.read(tf_record_filename_queue)
tf_record_features = tf.parse_single_example(
    tf_record_serialized,
    features={
        'label': tf.FixedLenFeature([], tf.string),
        'image': tf.FixedLenFeature([], tf.string),
    })
# The image was stored as raw bytes: decode to uint8 and restore its shape.
tf_record_image = tf.decode_raw(
    tf_record_features['image'], tf.uint8)
tf_record_image = tf.reshape(
    tf_record_image,
    [image_height, image_width, image_channels])
print(tf_record_image)
tf_record_label = tf.cast(tf_record_features['label'], tf.string)
print(tf_record_label)
print("------------------------------------------------------")
# Start a fresh session and check that both pipelines yield the same image.
sess.close()
sess = tf.Session()
sess.run(tf.global_variables_initializer())
coord = tf.train.Coordinator()
threads = tf.train.start_queue_runners(sess=sess, coord=coord)
print(sess.run(tf.equal(image, tf_record_image)))
# Show the label loaded from the TFRecord file.
print(sess.run(tf_record_label))
coord.request_stop()
coord.join(threads)
print("------------------------------------------------------")
# Keep the central 10% of the image.
print(sess.run(tf.image.central_crop(image, 0.1)))
# crop_to_bounding_box needs a defined shape, so evaluate the image first.
real_image = sess.run(image)
bounding_crop = tf.image.crop_to_bounding_box(
    real_image, offset_height=0, offset_width=0, target_height=2, target_width=1)
print(sess.run(bounding_crop))
print("------------------------------------------------------")
real_image = sess.run(image)
# Zero-pad to 4x4; the target must be at least as large as the image.
pad = tf.image.pad_to_bounding_box(
    real_image, offset_height=0, offset_width=0, target_height=4, target_width=4)
print(sess.run(pad))
print("------------------------------------------------------")
# Crop or pad around the image center to reach the target size.
crop_or_pad = tf.image.resize_image_with_crop_or_pad(
    real_image, target_height=2, target_width=5)
print(sess.run(crop_or_pad))
print("------------------------------------------------------")
sess.close()
sess = tf.Session()
# Take the top-left 2x2 pixels, then flip them horizontally and vertically.
top_left_pixels = tf.slice(image, [0, 0, 0], [2, 2, 3])
flip_horizon = tf.image.flip_left_right(top_left_pixels)
flip_vertical = tf.image.flip_up_down(flip_horizon)
sess.run(tf.global_variables_initializer())
coord = tf.train.Coordinator()
threads = tf.train.start_queue_runners(sess=sess, coord=coord)
print(sess.run([top_left_pixels, flip_vertical]))
print("------------------------------------------------------")
# Random flips: each call flips or leaves the input unchanged at random.
top_left_pixels = tf.slice(image, [0, 0, 0], [2, 2, 3])
random_flip_horizon = tf.image.random_flip_left_right(top_left_pixels)
random_flip_vertical = tf.image.random_flip_up_down(random_flip_horizon)
print(sess.run(random_flip_vertical))
print("------------------------------------------------------")
# Brightness: add a delta to a single example pixel.
example_red_pixel = tf.constant([254., 2., 15.])
adjust_brightness = tf.image.adjust_brightness(example_red_pixel, 0.2)
print(sess.run(adjust_brightness))
print("------------------------------------------------------")
# Contrast, hue and saturation; tf.slice picks a few pixels to inspect.
adjust_contrast = tf.image.adjust_contrast(image, -.5)
print(sess.run(tf.slice(adjust_contrast, [1, 0, 0], [1, 3, 3])))
print("------------------------------------------------------")
adjust_hue = tf.image.adjust_hue(image, 0.7)
print(sess.run(tf.slice(adjust_hue, [1, 0, 0], [1, 3, 3])))
print("------------------------------------------------------")
adjust_saturation = tf.image.adjust_saturation(image, 0.4)
print(sess.run(tf.slice(adjust_saturation, [1, 0, 0], [1, 3, 3])))
print("------------------------------------------------------")
# Color space conversions: RGB to grayscale, RGB to HSV, and back.
gray = tf.image.rgb_to_grayscale(image)
print(sess.run(tf.slice(gray, [0, 0, 0], [1, 3, 1])))
print("------------------------------------------------------")
# rgb_to_hsv expects float input, so convert the dtype first.
hsv = tf.image.rgb_to_hsv(tf.image.convert_image_dtype(image, tf.float32))
print(sess.run(tf.slice(hsv, [0, 0, 0], [3, 3, 3])))
print("------------------------------------------------------")
rgb_hsv = tf.image.hsv_to_rgb(hsv)
rgb_grayscale = tf.image.grayscale_to_rgb(gray)
# Prints the tensor objects themselves, not their values.
print(rgb_hsv, rgb_grayscale)
print("------------------------------------------------------")
# Stop the queue runner threads and release the session.
coord.request_stop()
coord.join(threads)
sess.close()
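
The color notes in the docstring (dtype scaling, grayscale conversion, HSV) can be tried out without TensorFlow. The following is a minimal pure-Python sketch on a single red pixel, not the TensorFlow implementation. The helper names (to_unit_float, gray_average, gray_weighted) are hypothetical, and LUMA_WEIGHTS are the commonly used luma weights, assumed here for illustration. colorsys is part of the Python standard library.

```python
import colorsys

# Commonly used luma weights for R, G, B (an assumption for this sketch).
LUMA_WEIGHTS = (0.2989, 0.5870, 0.1140)


def to_unit_float(pixel):
    """Scale 8-bit channel values in [0, 255] to floats in [0.0, 1.0]."""
    return [channel / 255.0 for channel in pixel]


def gray_average(pixel):
    """Grayscale value as the plain mean of the color channels."""
    return sum(pixel) / float(len(pixel))


def gray_weighted(pixel):
    """Grayscale value as a weighted sum of the color channels."""
    return sum(w * c for w, c in zip(LUMA_WEIGHTS, pixel))


red = [255, 0, 0]                      # one RGB pixel, a rank-1 "tensor"
red_unit = to_unit_float(red)          # same pixel rescaled to [0.0, 1.0]
avg = gray_average(red)                # plain mean of the three channels
weighted = gray_weighted(red)          # weighted sum, darker for pure red
hsv = colorsys.rgb_to_hsv(*red_unit)   # (hue, saturation, value)

print(red_unit, avg, weighted, hsv)
```

The plain mean and the weighted sum give different gray values for the same pixel, which is why it matters which one a library uses.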

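
The geometric operations from the notes (central crop, zero padding, flips) come down to index arithmetic. The sketch below shows them on a small single-channel image stored as nested lists. All function names are hypothetical, and the rounding in central_crop_box is an assumption for this sketch that has not been checked against TensorFlow's behavior.

```python
def central_crop_box(height, width, fraction):
    """Return (top, left, crop_h, crop_w) of a centered crop box."""
    top = int((height - height * fraction) / 2)
    left = int((width - width * fraction) / 2)
    return top, left, height - 2 * top, width - 2 * left


def crop(image, top, left, crop_h, crop_w):
    """Keep only the pixels inside the box; everything else is discarded."""
    return [row[left:left + crop_w] for row in image[top:top + crop_h]]


def pad(image, top, left, target_h, target_w, fill=0):
    """Place the image at (top, left) on a canvas filled with `fill`."""
    h, w = len(image), len(image[0])
    if top + h > target_h or left + w > target_w:
        raise ValueError("target is smaller than image plus offset")
    out = [[fill] * target_w for _ in range(target_h)]
    for r in range(h):
        for c in range(w):
            out[top + r][left + c] = image[r][c]
    return out


def flip_left_right(image):
    """Mirror every row (horizontal flip)."""
    return [row[::-1] for row in image]


def flip_up_down(image):
    """Reverse the order of the rows (vertical flip)."""
    return image[::-1]


image = [[1, 2, 3, 4],
         [5, 6, 7, 8],
         [9, 10, 11, 12],
         [13, 14, 15, 16]]

box = central_crop_box(4, 4, 0.5)
center = crop(image, *box)                        # central 2x2 region
padded = pad(center, 1, 1, 4, 4)                  # back to 4x4, zero border
flipped = flip_up_down(flip_left_right(center))   # both flips combined

print(box, center, padded, flipped)
```

Cropping followed by padding restores the original size but not the original content: the discarded border comes back as zeros.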