[Deep Learning] Recognizing any image with inception_v3 (code)

阿新 • Published: 2018-12-17

Use the inception_v3 network, already trained on ImageNet, to recognize all kinds of images:

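1. Download the pretrained Inception_v3 model from the web (message me privately if you need it). Unpacking it yields several files. Among them, imagenet_2012_challenge_label_map_proto.pbtxt and imagenet_synset_to_human_label_map.txt hold ImageNet's numeric class IDs and English labels, and classify_image_graph_def.pb is the model's Graph together with its weights.

2. The code below generates a tfevents file, which you can then open in TensorBoard to inspect the network structure.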

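import tensorflow as tf
import os

# Directory holding the unpacked pretrained model
inception_pretrain_model_dir = './inception_v3'
# Directory where the tfevents file will be written
log_dir = 'inception_v3_log'
if not os.path.exists(log_dir):
    os.makedirs(log_dir)

# Frozen graph (structure plus weights) shipped with the model
inception_graph_def_file = os.path.join(inception_pretrain_model_dir, 'classify_image_graph_def.pb')

with tf.Session() as sess:
    # Read the serialized GraphDef and import it into the default graph
    with tf.gfile.FastGFile(inception_graph_def_file, 'rb') as f:
        graph_def = tf.GraphDef()
        graph_def.ParseFromString(f.read())
        tf.import_graph_def(graph_def, name='')
    # Write the graph out; view it with: tensorboard --logdir inception_v3_log
    writer = tf.summary.FileWriter(log_dir, sess.graph)
    writer.close()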

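3. Next, load the model into a graph and store the labels in a new dictionary, so that numeric IDs can be converted to English names later. Then run recognition.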
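The label dictionary from step 3 can be tried in isolation, without TensorFlow. The sketch below is only an illustration: the two strings are tiny stand-ins for the real label files (the second entry's class number is made up), and `build_id_to_name` is a hypothetical helper name, not part of the original script. It uses a regular expression instead of string slicing to pull each `target_class` / `target_class_string` pair out of the pbtxt text.

```python
import re

# Tiny stand-ins for the two label files; the real files are much longer.
# The class number 450 is illustrative only.
SYNSET_TO_HUMAN = (
    "n01440764\ttench, Tinca tinca\n"
    "n01443537\tgoldfish, Carassius auratus\n"
)
LABEL_MAP_PBTXT = '''entry {
  target_class: 449
  target_class_string: "n01440764"
}
entry {
  target_class: 450
  target_class_string: "n01443537"
}
'''

# Matches one "target_class: N ... target_class_string: "nXXXXXXXX"" pair
ENTRY_PATTERN = re.compile(
    r'target_class:\s*(\d+)\s*target_class_string:\s*"([^"]+)"'
)


def build_id_to_name(label_map_text, synset_text):
    """Map numeric class IDs to English names by joining the two files."""
    # synset ID string -> human-readable name
    uid_to_human = {}
    for line in synset_text.splitlines():
        if not line.strip():
            continue
        uid, human = line.split('\t', 1)
        uid_to_human[uid] = human.strip()

    # numeric class ID -> synset ID string
    id_to_uid = {
        int(num): uid for num, uid in ENTRY_PATTERN.findall(label_map_text)
    }

    # numeric class ID -> human-readable name
    return {
        num: uid_to_human[uid]
        for num, uid in id_to_uid.items()
        if uid in uid_to_human
    }


names = build_id_to_name(LABEL_MAP_PBTXT, SYNSET_TO_HUMAN)
print(names[449])
```

The full script for this step follows.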

import tensorflow as tf 
import os
import numpy as np
import matplotlib.pyplot as plt
from PIL import Image
label_lookup_path = './imagenet_2012_challenge_label_map_proto.pbtxt'
id_lookup_path ='./imagenet_synset_to_human_label_map.txt'

class Nodelookup(object):
    def __init__(self,label_lookup_path, id_lookup_path):
        self.label_lookup_path = label_lookup_path
        self.id_lookup_path =id_lookup_path
        self.node_lookup = self.load(self.label_lookup_path, self.id_lookup_path)
        
    def load(self, label_lookup_path, id_lookup_path):
        # Synset ID string -> human-readable class name, e.g. (n00004475\torganism, being)
        human_label = tf.gfile.GFile(id_lookup_path).readlines()
        id_to_human = {}
        for line in human_label:
            line = line.strip('\n')
            parsed_item = line.split('\t')
            uid = parsed_item[0]
            human_string = parsed_item[1]
            id_to_human[uid] = human_string
         
        
        # Synset ID string and its numeric class ID
        '''entry {
                  target_class: 449
                  target_class_string: "n01440764"
                }'''
        
        label_to_id = tf.gfile.GFile(label_lookup_path).readlines()
        id_to_label = {}
        for line in label_to_id:
            if line.startswith('  target_class:'):
                target_class = int(line.split(': ')[1])
                
            if line.startswith('  target_class_string:'):
                target_class_string = line.split(': ')[1]
                # Drop the trailing newline and the surrounding quotes
                id_to_label[target_class] = target_class_string.strip().strip('"')
         
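        # Join the two dictionaries: each value of the second dictionary (a synset ID) is a key
        # into the first, and the English class name found there is stored under the second
        # dictionary's key, giving a direct mapping (e.g. 44 -> dog)
        id_to_name = {}
        for key, val in id_to_label.items():
            name = id_to_human[val]
            id_to_name[key] = name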
        return id_to_name
    
    
    # Given a numeric class ID, return the English name
    def id_to_string(self, node_id):
        if node_id not in self.node_lookup:
            return ' ***** '
        return self.node_lookup[node_id]

# Load the trained inception model into the default graph
with tf.gfile.FastGFile('./classify_image_graph_def.pb', 'rb') as f:
    graph_def = tf.GraphDef()
    graph_def.ParseFromString(f.read())
    tf.import_graph_def(graph_def, name='')
    

# Build the ID-to-name lookup once, before the loop
node_lookup = Nodelookup(label_lookup_path, id_lookup_path)

with tf.Session() as sess:
    # Output tensor of the final softmax layer
    softmax_tensor = sess.graph.get_tensor_by_name('softmax:0')

    for root, dirs, files in os.walk('image/'):
        for file in files:
            img_path = os.path.join(root, file)
            # Feed the raw JPEG bytes; the graph decodes them itself
            with tf.gfile.FastGFile(img_path, 'rb') as f:
                image_data = f.read()
            predictions = sess.run(softmax_tensor, {'DecodeJpeg/contents:0': image_data})
            predictions = np.squeeze(predictions)

            # Show the image being classified
            print(img_path)
            img = Image.open(img_path)
            plt.imshow(img)
            plt.axis('off')
            plt.show()

            # Keep only the single highest-scoring class
            top_k = predictions.argsort()[-1:]
            for node_id in top_k:
                result = node_lookup.id_to_string(node_id)
                score = predictions[node_id]
                print('Recognized as: %s | probability: %.4f' % (result, score))
            print('\n')
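The script above keeps only the single best class, since `argsort()[-1:]` returns just the index of the highest score. If you want several candidates, one possible variation is sketched below. It is an assumption-laden illustration rather than part of the original script: `top_k_predictions` is a hypothetical helper, and the scores are made-up numbers standing in for the squeezed softmax output. It uses the standard library's `heapq.nlargest`, which works on plain lists as well as one-dimensional NumPy arrays.

```python
import heapq


def top_k_predictions(scores, k=5):
    """Return (class_id, score) pairs for the k highest scores, best first."""
    k = min(k, len(scores))
    # Pick the k indices whose scores are largest, in descending score order
    best = heapq.nlargest(k, range(len(scores)), key=lambda i: scores[i])
    return [(i, float(scores[i])) for i in best]


# Made-up scores standing in for the softmax output of one image
example_scores = [0.05, 0.70, 0.10, 0.15]
for class_id, score in top_k_predictions(example_scores, k=3):
    print('class %d | probability: %.4f' % (class_id, score))
```

Each returned class ID could then be passed to `id_to_string` to get its English name.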

