
TensorFlow Object Detection: Generating Your Own TFRecord Training Dataset

Google's Object Detection API

Parts of this article draw on other authors' work. Since I have forgotten where that content came from, no reprint link is included; my apologies. If an original author sees this, please contact me and I will add one.

======== Please credit the source when reposting ==========

Place this Python file under the Object Detection API's dataset_tools directory.

How you generate your own training set depends mainly on the format of your annotation files. In my case, every image has its own annotation file; for example:

for image xxx.jpg, the annotation file is xxx.box.

Each line of the .box file has the form:

Xmin Ymin Xmax Ymax  label   (as shown below). If there are multiple labels, keep appending them, one per line:

Xmin Ymin Xmax Ymax  label \n

Xmin Ymin Xmax Ymax  label
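For instance, a hypothetical xxx.box describing two objects might look like the following (the class names person and car are placeholder examples, not from my dataset):

23 45 210 180 person
60 80 150 200 car

The full conversion script is listed below.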

 

from __future__ import absolute_import
from __future__ import division
from __future__ import print_function

import hashlib
import io
import os
import PIL.Image
import tensorflow as tf
import cv2
from functools import reduce
import operator
from object_detection.utils import dataset_util

flags = tf.app.flags
flags.DEFINE_string('train_imgs_dir', '/home/ai/Downloads/competition_change_box_img/img', 'Root directory to bc train dataset.')
flags.DEFINE_string('train_labels', '/home/ai/Downloads/competition_change_box_img/box',
                    '(Relative) path to annotations directory.')
flags.DEFINE_string('train_output', '../All_tf_record/competition_img_test.record', 'Path to output TFRecord')
FLAGS = flags.FLAGS

# Map each label-name string that appears in the .box files to an integer class id;
# the ids must be consistent with your label_map.pbtxt. The entry below is only a
# placeholder; replace it with your own classes.
classMap = {'your_label_name': 1}


def create_coordinate_info_of_content_list(image_dir,label_dir):
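    """For each .box annotation file in label_dir, read the matching .jpg in image_dir
    and build one flat list per image:
    [filename, height, width, depth, xmin, ymin, xmax, ymax, label, ...]
    with one (xmin, ymin, xmax, ymax, label) group appended per bounding box."""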
    content_list_all = []
    for file_name in os.listdir(label_dir):
        img = cv2.imread(os.path.join(image_dir,file_name.replace('.box','.jpg')))
        height = img.shape[0]
        width = img.shape[1]
        depth = img.shape[2]
        content_list = [[file_name.replace('.box', '.jpg'), height, width, depth]]
        with open(os.path.join(label_dir, file_name), 'r') as f:
            lines = f.readlines()
        for line in lines:
            # strip the trailing newline and split on whitespace so the label does not
            # keep a '\n' and repeated spaces do not produce empty fields
            new_line = line.strip().split()
            if len(new_line) < 5:    # skip blank or malformed lines
                continue
            content_one = new_line[:5]    # Xmin, Ymin, Xmax, Ymax, label
            content_list.append(content_one)
        # flatten [[filename, h, w, depth], [box1...], [box2...], ...] into one flat list
        flat_list = reduce(operator.add, content_list)
        content_list_all.append(flat_list)
   
    return content_list_all

def create_tf_example(content_list, imgs_dir):
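    """Turn one flattened annotation list (as produced above) into a tf.train.Example
    holding the fields the Object Detection API expects; box coordinates are
    normalised to [0, 1] by dividing by the image width and height."""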
    height = int(content_list[1])
    width = int(content_list[2])
    filename = content_list[0]
    img_path = os.path.join(imgs_dir, filename)
    with tf.gfile.GFile(img_path, 'rb') as fid:
        encoded_jpg = fid.read()
    encoded_jpg_io = io.BytesIO(encoded_jpg)
    image = PIL.Image.open(encoded_jpg_io)
    if image.format != 'JPEG':
        raise ValueError('Image format not JPEG')
    key = hashlib.sha256(encoded_jpg).hexdigest()

    xmin = []
    ymin = []
    xmax = []
    ymax = []
    classes = []
    classes_text = []
    box_num = int((len(content_list) - 4) / 5)    # one image may contain several labeled boxes
    for i in range(box_num):
        xmin.append(float(content_list[5 * i + 4 + 0]) / width)
        ymin.append(float(content_list[5 * i + 4 + 1]) / height)
        xmax.append(float(content_list[5 * i + 4 + 2]) / width)
        ymax.append(float(content_list[5 * i + 4 + 3]) / height)
        classes_text.append(content_list[5 * i + 4 + 4].encode('utf8'))
        classes.append(classMap[content_list[5 * i + 4 + 4]])
        print('the class id is {} '.format(classMap[content_list[5 * i + 4 + 4]]))
    example = tf.train.Example(features=tf.train.Features(feature={
        'image/height': dataset_util.int64_feature(height),
        'image/width': dataset_util.int64_feature(width),
        'image/filename': dataset_util.bytes_feature(
            filename.encode('utf8')),
        'image/source_id': dataset_util.bytes_feature(
            filename.encode('utf8')),
        'image/key/sha256': dataset_util.bytes_feature(key.encode('utf8')),
        'image/encoded': dataset_util.bytes_feature(encoded_jpg),
        'image/format': dataset_util.bytes_feature('jpeg'.encode('utf8')),
        'image/object/bbox/xmin': dataset_util.float_list_feature(xmin),
        'image/object/bbox/xmax': dataset_util.float_list_feature(xmax),
        'image/object/bbox/ymin': dataset_util.float_list_feature(ymin),
        'image/object/bbox/ymax': dataset_util.float_list_feature(ymax),
        'image/object/class/text': dataset_util.bytes_list_feature(classes_text),
        'image/object/class/label': dataset_util.int64_list_feature(classes),
    }))
    return example

def main(_):
    # train tfrecord generate
    print("Reading from {}".format(FLAGS.train_imgs_dir))
    writer = tf.python_io.TFRecordWriter(FLAGS.train_output)
    content_list_all = create_coordinate_info_of_content_list(FLAGS.train_imgs_dir, FLAGS.train_labels)
    for line in content_list_all:
        content_list = line
        tf_example = create_tf_example(content_list, FLAGS.train_imgs_dir)
        writer.write(tf_example.SerializeToString())
    writer.close()

if __name__ == '__main__':
    tf.app.run()
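
Once the script has run with the three flags above, you can quickly sanity-check the generated file. The following is a minimal sketch using the same TF 1.x API the script relies on; the record path shown is simply the default value of the train_output flag.

import tensorflow as tf

record_path = '../All_tf_record/competition_img_test.record'   # default value of the train_output flag
num_examples = 0
for record in tf.python_io.tf_record_iterator(record_path):
    example = tf.train.Example()
    example.ParseFromString(record)
    if num_examples == 0:
        # print the filename stored in the first example as a spot check
        print(example.features.feature['image/filename'].bytes_list.value[0])
    num_examples += 1
print('TFRecord contains {} examples'.format(num_examples))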