
[Deep Learning][CIFAR Data Processing] Introduction to the CIFAR-10 and CIFAR-100 Datasets

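CIFAR-10 and CIFAR-100 are mainstream image classification datasets. This post covers how to download them and how to work with the Python version.
Dataset download page: http://www.cs.toronto.edu/~kriz/cifar.html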

Data overview

CIFAR-10 and CIFAR-100 are labeled subsets of the 80 Million Tiny Images dataset. They were collected by Alex Krizhevsky, Vinod Nair, and Geoffrey Hinton (three big names in the field).

The CIFAR-10 dataset

The CIFAR-10 dataset consists of 60000 32x32 color images in 10 classes ('airplane', 'automobile', 'bird', 'cat', 'deer', 'dog', 'frog', 'horse', 'ship', 'truck'), with 6000 images per class. It is split into 50000 training images and 10000 test images.


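The dataset is divided into five training batches (data_batch) and one test batch (test_batch), each with 10000 images. The test batch contains 1000 randomly selected images from each class. The training batches contain the remaining images in random order, so an individual training batch may contain more images from one class than from another. Taken together, the training batches contain 5000 images from each class.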
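The batch layout described above can be read in Python roughly as follows. This is a minimal sketch, not a definitive loader. It assumes the Python archive has been extracted to a directory containing files named `data_batch_1` to `data_batch_5` and `test_batch`, and that each file is a pickled dict with a `b'data'` entry (an N x 3072 uint8 array, with the red, green, and blue planes of each image stored one after another) and a `b'labels'` entry, as described on the dataset page. The function names `load_batch` and `load_cifar10` are made up for illustration.

```python
import os
import pickle

import numpy as np


def load_batch(path):
    """Load one CIFAR-10 batch file and return (images, labels).

    Assumed layout (from the dataset page): b'data' is an N x 3072 uint8
    array and b'labels' is a list of N integers in the range 0..9.
    """
    with open(path, "rb") as f:
        batch = pickle.load(f, encoding="bytes")
    data = np.asarray(batch[b"data"], dtype=np.uint8)
    # Each row holds 1024 red values, then 1024 green, then 1024 blue.
    # Reshape to N x 3 x 32 x 32, then move channels last: N x 32 x 32 x 3.
    images = data.reshape(-1, 3, 32, 32).transpose(0, 2, 3, 1)
    labels = np.asarray(batch[b"labels"], dtype=np.int64)
    return images, labels


def load_cifar10(root):
    """Load the five training batches and the test batch found under `root`."""
    train = [load_batch(os.path.join(root, f"data_batch_{i}")) for i in range(1, 6)]
    x_train = np.concatenate([x for x, _ in train])
    y_train = np.concatenate([y for _, y in train])
    x_test, y_test = load_batch(os.path.join(root, "test_batch"))
    return (x_train, y_train), (x_test, y_test)
```

Under these assumptions, calling `load_cifar10` on the extracted directory should return training arrays covering 50000 images and test arrays covering 10000 images, each image stored as a 32 x 32 x 3 array.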


Download

| Version | Size | md5sum |
| --- | --- | --- |
| CIFAR-10 python version | 163 MB | c58f30108f718f92721af3b95e74349a |
| CIFAR-10 Matlab version | 175 MB | 70270af85842c9e89bb428ec9976c926 |
| CIFAR-10 binary version (suitable for C programs) | 162 MB | c32a1d4ab5d03f1284b67883e8d87530 |
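
The md5sum values listed above can be used to check that a download finished without corruption. The following is a small sketch using only the standard library. The helper names `md5sum` and `verify` are hypothetical, and the archive file name in the usage note is an assumption about what the download is saved as.

```python
import hashlib


def md5sum(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as f:
        # Read in 1 MiB chunks so large archives are not loaded into memory at once.
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


# md5sum of the CIFAR-10 python version, copied from the listing above.
CIFAR10_PYTHON_MD5 = "c58f30108f718f92721af3b95e74349a"


def verify(path, expected_md5):
    """Return True if the file at `path` has the expected MD5 checksum."""
    return md5sum(path) == expected_md5.lower()
```

For example, if the archive was saved as `cifar-10-python.tar.gz` (an assumed name), `verify("cifar-10-python.tar.gz", CIFAR10_PYTHON_MD5)` should return `True` for an intact download.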

The CIFAR-100 dataset

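This dataset is similar to CIFAR-10, except that it has 100 classes containing 600 images each. Each class is split into 500 training images and 100 test images. The 100 classes are grouped into 20 superclasses. Each image comes with a "fine" label (the class it belongs to) and a "coarse" label (the superclass it belongs to).
The classes in CIFAR-100 are listed below: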

| Superclass | Classes |
| --- | --- |
| aquatic mammals | beaver, dolphin, otter, seal, whale |
| fish | aquarium fish, flatfish, ray, shark, trout |
| flowers | orchids, poppies, roses, sunflowers, tulips |
| food containers | bottles, bowls, cans, cups, plates |
| fruit and vegetables | apples, mushrooms, oranges, pears, sweet peppers |
| household electrical devices | clock, computer keyboard, lamp, telephone, television |
| household furniture | bed, chair, couch, table, wardrobe |
| insects | bee, beetle, butterfly, caterpillar, cockroach |
| large carnivores | bear, leopard, lion, tiger, wolf |
| large man-made outdoor things | bridge, castle, house, road, skyscraper |
| large natural outdoor scenes | cloud, forest, mountain, plain, sea |
| large omnivores and herbivores | camel, cattle, chimpanzee, elephant, kangaroo |
| medium-sized mammals | fox, porcupine, possum, raccoon, skunk |
| non-insect invertebrates | crab, lobster, snail, spider, worm |
| people | baby, boy, girl, man, woman |
| reptiles | crocodile, dinosaur, lizard, snake, turtle |
| small mammals | hamster, mouse, rabbit, shrew, squirrel |
| trees | maple, oak, palm, pine, willow |
| vehicles 1 | bicycle, bus, motorcycle, pickup truck, train |
| vehicles 2 | lawn-mower, rocket, streetcar, tank, tractor |
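
The relationship between fine and coarse labels in the listing above can be expressed as a simple lookup. The sketch below copies the names exactly as they are spelled in that listing. The label strings stored in the dataset's own metadata may be spelled differently, so treat the keys here as illustrative. The names `SUPERCLASSES`, `FINE_TO_COARSE`, and `coarse_label` are hypothetical.

```python
# Superclass -> classes, transcribed from the listing above.
SUPERCLASSES = {
    "aquatic mammals": ["beaver", "dolphin", "otter", "seal", "whale"],
    "fish": ["aquarium fish", "flatfish", "ray", "shark", "trout"],
    "flowers": ["orchids", "poppies", "roses", "sunflowers", "tulips"],
    "food containers": ["bottles", "bowls", "cans", "cups", "plates"],
    "fruit and vegetables": ["apples", "mushrooms", "oranges", "pears", "sweet peppers"],
    "household electrical devices": ["clock", "computer keyboard", "lamp", "telephone", "television"],
    "household furniture": ["bed", "chair", "couch", "table", "wardrobe"],
    "insects": ["bee", "beetle", "butterfly", "caterpillar", "cockroach"],
    "large carnivores": ["bear", "leopard", "lion", "tiger", "wolf"],
    "large man-made outdoor things": ["bridge", "castle", "house", "road", "skyscraper"],
    "large natural outdoor scenes": ["cloud", "forest", "mountain", "plain", "sea"],
    "large omnivores and herbivores": ["camel", "cattle", "chimpanzee", "elephant", "kangaroo"],
    "medium-sized mammals": ["fox", "porcupine", "possum", "raccoon", "skunk"],
    "non-insect invertebrates": ["crab", "lobster", "snail", "spider", "worm"],
    "people": ["baby", "boy", "girl", "man", "woman"],
    "reptiles": ["crocodile", "dinosaur", "lizard", "snake", "turtle"],
    "small mammals": ["hamster", "mouse", "rabbit", "shrew", "squirrel"],
    "trees": ["maple", "oak", "palm", "pine", "willow"],
    "vehicles 1": ["bicycle", "bus", "motorcycle", "pickup truck", "train"],
    "vehicles 2": ["lawn-mower", "rocket", "streetcar", "tank", "tractor"],
}

# Invert the mapping so each fine class points at its superclass.
FINE_TO_COARSE = {
    fine: coarse for coarse, fines in SUPERCLASSES.items() for fine in fines
}


def coarse_label(fine):
    """Return the superclass ("coarse" label) of a fine class name."""
    return FINE_TO_COARSE[fine]
```

With this lookup, `coarse_label("dolphin")` returns `"aquatic mammals"`, and the inverted mapping has one entry for each of the 100 fine classes.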

Download

| Version | Size | md5sum |
| --- | --- | --- |
| CIFAR-100 python version | 161 MB | eb9058c3a382ffc7106e4002c42a8d85 |
| CIFAR-100 Matlab version | 175 MB | 6a4bfa1dcd5c9453dda6bb54194911f4 |
| CIFAR-100 binary version (suitable for C programs) | 161 MB | 03b5dce01913d631647c71ecec9e9cb8 |