pytorch: 準備、訓練和測試自己的圖片資料

阿新 • • 發佈：2018-12-29

大部分的pytorch入門教程，都是使用torchvision裡面的資料進行訓練和測試。如果我們是自己的圖片資料，又該怎麼做呢？

一、我的資料

我在學習的時候，使用的是fashion-mnist。這個資料比較小，我的電腦沒有GPU，還能吃得消。關於fashion-mnist資料，可以百度，也可以點此瞭解一下，資料就像這個樣子：

但是下載下來是一種二進位制檔案，並不是圖片，因此我先轉換成了圖片。

我先解壓gz檔案到e:/fashion_mnist/資料夾

然後執行程式碼：

import os
from skimage import io
import torchvision.datasets.mnist as mnist

root 
="E:/fashion_mnist/"
train_set = (
    mnist.read_image_file(os.path.join(root, 'train-images-idx3-ubyte')),
    mnist.read_label_file(os.path.join(root, 'train-labels-idx1-ubyte'))
        )
test_set = (
    mnist.read_image_file(os.path.join(root, 't10k-images-idx3-ubyte')),
    mnist.read_label_file(os.path.join(root,  
't10k-labels-idx1-ubyte'))
        )
print("training set :",train_set[0].size())
print("test set :",test_set[0].size())

def convert_to_img(train=True):
    if(train):
        f=open(root+'train.txt','w')
        data_path=root+'/train/'
        if(not os.path.exists(data_path)):
            os.makedirs(data_path)
         
for i, (img,label) in enumerate(zip(train_set[0],train_set[1])):
            img_path=data_path+str(i)+'.jpg'
            io.imsave(img_path,img.numpy())
            f.write(img_path+' '+str(label)+'\n')
        f.close()
    else:
        f = open(root + 'test.txt', 'w')
        data_path = root + '/test/'
        if (not os.path.exists(data_path)):
            os.makedirs(data_path)
        for i, (img,label) in enumerate(zip(test_set[0],test_set[1])):
            img_path = data_path+ str(i) + '.jpg'
            io.imsave(img_path, img.numpy())
            f.write(img_path + ' ' + str(label) + '\n')
        f.close()

convert_to_img(True)
convert_to_img(False)

這樣就會在e:/fashion_mnist/目錄下分別生成train和test資料夾，用於存放圖片。還在該目錄下生成了標籤檔案train.txt和test.txt.

二、進行CNN分類訓練和測試

先要將圖片讀取出來，準備成torch專用的dataset格式，再通過Dataloader進行分批次訓練。

程式碼如下：

import torch
from torch.autograd import Variable
from torchvision import transforms
from torch.utils.data import Dataset, DataLoader
from PIL import Image
root="E:/fashion_mnist/"

# -----------------ready the dataset--------------------------
def default_loader(path):
    return Image.open(path).convert('RGB')
class MyDataset(Dataset):
    def __init__(self, txt, transform=None, target_transform=None, loader=default_loader):
        fh = open(txt, 'r')
        imgs = []
        for line in fh:
            line = line.strip('\n')
            line = line.rstrip()
            words = line.split()
            imgs.append((words[0],int(words[1])))
        self.imgs = imgs
        self.transform = transform
        self.target_transform = target_transform
        self.loader = loader

    def __getitem__(self, index):
        fn, label = self.imgs[index]
        img = self.loader(fn)
        if self.transform is not None:
            img = self.transform(img)
        return img,label

    def __len__(self):
        return len(self.imgs)

train_data=MyDataset(txt=root+'train.txt', transform=transforms.ToTensor())
test_data=MyDataset(txt=root+'test.txt', transform=transforms.ToTensor())
train_loader = DataLoader(dataset=train_data, batch_size=64, shuffle=True)
test_loader = DataLoader(dataset=test_data, batch_size=64)


#-----------------create the Net and training------------------------

class Net(torch.nn.Module):
    def __init__(self):
        super(Net, self).__init__()
        self.conv1 = torch.nn.Sequential(
            torch.nn.Conv2d(3, 32, 3, 1, 1),
            torch.nn.ReLU(),
            torch.nn.MaxPool2d(2))
        self.conv2 = torch.nn.Sequential(
            torch.nn.Conv2d(32, 64, 3, 1, 1),
            torch.nn.ReLU(),
            torch.nn.MaxPool2d(2)
        )
        self.conv3 = torch.nn.Sequential(
            torch.nn.Conv2d(64, 64, 3, 1, 1),
            torch.nn.ReLU(),
            torch.nn.MaxPool2d(2)
        )
        self.dense = torch.nn.Sequential(
            torch.nn.Linear(64 * 3 * 3, 128),
            torch.nn.ReLU(),
            torch.nn.Linear(128, 10)
        )

    def forward(self, x):
        conv1_out = self.conv1(x)
        conv2_out = self.conv2(conv1_out)
        conv3_out = self.conv3(conv2_out)
        res = conv3_out.view(conv3_out.size(0), -1)
        out = self.dense(res)
        return out


model = Net()
print(model)

optimizer = torch.optim.Adam(model.parameters())
loss_func = torch.nn.CrossEntropyLoss()

for epoch in range(10):
    print('epoch {}'.format(epoch + 1))
    # training-----------------------------
    train_loss = 0.
    train_acc = 0.
    for batch_x, batch_y in train_loader:
        batch_x, batch_y = Variable(batch_x), Variable(batch_y)
        out = model(batch_x)
        loss = loss_func(out, batch_y)
        train_loss += loss.data[0]
        pred = torch.max(out, 1)[1]
        train_correct = (pred == batch_y).sum()
        train_acc += train_correct.data[0]
        optimizer.zero_grad()
        loss.backward()
        optimizer.step()
    print('Train Loss: {:.6f}, Acc: {:.6f}'.format(train_loss / (len(
        train_data)), train_acc / (len(train_data))))

    # evaluation--------------------------------
    model.eval()
    eval_loss = 0.
    eval_acc = 0.
    for batch_x, batch_y in test_loader:
        batch_x, batch_y = Variable(batch_x, volatile=True), Variable(batch_y, volatile=True)
        out = model(batch_x)
        loss = loss_func(out, batch_y)
        eval_loss += loss.data[0]
        pred = torch.max(out, 1)[1]
        num_correct = (pred == batch_y).sum()
        eval_acc += num_correct.data[0]
    print('Test Loss: {:.6f}, Acc: {:.6f}'.format(eval_loss / (len(
        test_data)), eval_acc / (len(test_data))))

打印出來的網路模型：

訓練和測試結果：

原文連結

https://www.cnblogs.com/denny402/p/7520063.html

pytorch: 準備、訓練和測試自己的圖片資料

大部分的pytorch入門教程，都是使用torchvision裡面的資料進行訓練和測試。如果我們是自己的圖片資料，又該怎麼做呢？一、我的資料我在學習的時候，使用的是fashion-mnist。這個資料比較小，我的電腦沒有GPU，還能吃得消。關於fashion-mnist資料，可以百度，也可以點此瞭解

Caffe上用SSD訓練和測試自己的資料

學習caffe第一天，用SSD上上手。我的根目錄$caffe_root為/home/gpu/ljy/caffe 一、執行SSD示例程式碼 1.到https://github.com/weiliu89/caffe.git下載caffe-ssd程式碼，是一個caffe資料夾 2.參考已經

Windows Caffe 學習筆記（三）在Caffe上訓練和測試自己的資料

本文是學習Caffe官方文件"ImageNet Tutorial"時做的，同樣由於是Windows版本的原因，很多shell指令碼不能直接使用，走了不少彎路，但是收穫也不少。比如：如何讓shell指令

YOLOv3+Faster R-CNN+SSD訓練和測試自己的資料

首先製作自己的資料集—VOC2007資料集製作，接下來就可以開始搞事情了.... 一：YOLOv3相關二：Faster R-CNN相關 Python原始碼：點選開啟連結三：SSD相關

faster-rcnn訓練和測試自己的資料（VGG/ResNet）以及遇到的問題

http://www.cnblogs.com/caffeaoto/p/6536482.html主要參照這個教程改的需要準備的檔案:Annotation檔案，圖片，用來訓練的圖片名稱list.txt訓練：需要改的檔案：lib/datasets/下的pascal_voc.py,f

【12】Caffe學習系列：訓練和測試自己的圖片

一、準備資料有條件的同學，可以去imagenet的官網http://www.image-net.org/download-images，下載imagenet圖片來訓練。驗證碼始終出不來需要翻牆（是google網站的驗證碼）。但是我沒有下載，原因是資料太大了。。。我去網上找了一些其它的圖片

Caffe9:訓練和測試自己的圖片

在深度學習的實際應用中，我們經常用到的原始資料是圖片檔案，如jpg,jpeg,png,tif等格式的，而且有可能圖片的大小還不一致。而在caffe中經常使用的資料型別是lmdb或leveldb，因此就產生了這樣的一個問題：如何從原始圖片檔案轉換成caffe中能夠執行的db（leveldb/lmdb)檔

caffe隨記（七）---訓練和測試自己的圖片

前面也介紹了tools工具，今天來試著自己跑一下影象分類的例項 1、下載資料我沒有用imagenet的資料，因為太大了不想下，而且反正也只是當作例程跑一下而已，所以我用的是另一位博主分享的網盤上的資料，共有500張圖片，分為大巴車、恐龍、大象、鮮花和馬五個類，每個類1

Caffe中檔案引數設定（九-1）：訓練和測試自己的圖片-linux版本

在深度學習的實際應用中，我們經常用到的原始資料是圖片檔案，如jpg,jpeg,png,tif等格式的，而且有可能圖片的大小還不一致。而在caffe中經常使用的資料型別是lmdb或leveldb，因此就產生了這樣的一個問題：如何從原始圖片檔案轉換成caffe中能夠執行的db（l

Caffe傻瓜系列(9)：訓練和測試自己的圖片

FCN製作自己的資料集、訓練和測試 caffe

花了兩三週的時間，在導師的催促下，把FCN的全部流程走了一遍，期間走了很多彎路，現在記錄一下。系統環境：ubuntu 16.04LTS 一、資料集的製作注：我的資料集是仿照VOC資料集進行製作的 1.resize 資料集我的GPU視訊記憶體4G，跑過大的圖片帶不動，需要resize圖片大小，放幾

Caffe上用SSD訓練和測試自己的數據

輸出 makefile b數 text play cal 上下 lba san 學習caffe第一天，用SSD上上手。我的根目錄$caffe_root為/home/gpu/ljy/caffe 一、運行SSD示例代碼 1.到https://github.com

caffe安裝，編譯（包括CUDA和cuDNN的安裝），並訓練，測試自己的資料（caffe使用教程）

caffe是一個非常清晰且高效的深度學習框架，目前有著不少的使用者，也漸漸的形成了自己的社群，社群上可以討論相關的問題。我從開始看深度學習的相關內容到能夠用caffe訓練測試自己的資料，看了不少網站，教程和部落格，也走了不少彎路，在此把整個流程梳理和總結一遍，以期望可以可

PyTorch(三)——使用訓練好的模型測試自己圖片

PyTorch的學習和使用（三）在上一篇文章中實現瞭如何增加一個自定義的Loss，以Siamese network為例。現在實現使用訓練好的該網路對自己手寫的數字圖片進行測試。首先需要對訓練時的權

PyTorch(七)——模型的訓練和測試、儲存和載入

PyTorch的學習和使用（七）模型的訓練和測試在訓練模型時會在前面加上： model.train() 在測試模型時在前面使用： model.eval() 同時發現，如果不寫這兩個程式也可以執行，這是因為這兩個方法是針對在網路訓練和測試時採用不同方式的

pytorch代碼中同時包含訓練和測試代碼時顯存爆炸

evaluate 表現驗證 tor lua 查看包含測試 mode 原因在於沒有使用torch.no_grad()函數。在查看驗證集和測試集表現時，應使用類似這樣的代碼 def evaluate(data_loader): with torch.no_grad

pytorch程式碼中同時包含訓練和測試程式碼時視訊記憶體爆炸

原因在於沒有使用torch.no_grad()函式。在檢視驗證集和測試集表現時，應使用類似這樣的程式碼 def evaluate(data_loader): with torch.no_grad(): mean_acc, mean_iou = 0, 0 for i,

FCN製作自己的資料集並訓練和測試

前言這篇部落格記錄的是如何製作自己的資料集，並使用FCN模型訓練資料，前提要搭建caffe框架，可以參考這篇部落格，我製作的資料集是仿照voc2012資料集來在做的製作影象標籤這一部分是最難的部分，在製作標籤之前要搞清楚你的影象共分為幾類調整影象尺寸

Caffe學習筆記1：linux下建立自己的資料庫訓練和測試caffe中已有網路

本文是基於薛開宇《學習筆記3：基於自己的資料訓練和測試“caffeNet”》基礎上，從頭到尾把實驗跑了一遍~對該文中不清楚的地方做了更正和說明。主要工作如下： 1、下載圖片建立資料庫 2、將圖片轉化為256*256的lmdb格式 3、計算影象均值 4、定義網路修改部分引

用自己的資料訓練和測試“caffenet”

本次實驗本來參考examples/imagenet下的readme.txt進行，但因為資料集過於龐大，所以模擬學習，參考薛開宇的學習方式，模仿搭建自己的資料庫。首先在caffe/data下新建資料夾myself，然後在網上下載貓、鳥、狗的訓練圖片各50張，測

pytorch: 準備、訓練和測試自己的圖片資料

相關推薦