Commonly Used Open-Source Libraries for Computer Vision

阿新 • Published: 2019-01-20

– Implementation of a unified approach for face detection, pose estimation, and landmark localization (CVPR 2012).

Attributes and Semantic Features

– Modified implementation of RankSVM to train Relative Attributes (ICCV 2011).
– Implementation of object bank semantic features (NIPS 2010). See also
– Software for extracting high-level image descriptors (ECCV 2010, NIPS 2011, CVPR 2012).

Large-Scale Learning

– Source code for fast additive kernel SVM classifiers (PAMI 2013).
– Library for large-scale linear SVM classification.
– Implementation for Pegasos SVM and Homogeneous Kernel map.
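
As a rough illustration of the Pegasos idea mentioned above, the following is a minimal sketch of its stochastic sub-gradient update for a linear SVM. It is not the code of the listed library: the function names `pegasos_train` and `predict`, the default values of `lam` and `n_iters`, and the omission of a bias term and of the optional projection step are all assumptions made for this example.

```python
import random


def pegasos_train(samples, labels, lam=0.01, n_iters=2000, seed=0):
    """Train a linear SVM (no bias term) with Pegasos-style updates.

    samples: list of feature lists; labels: list of +1 / -1 values.
    lam is the regularisation strength (an illustrative default).
    """
    rng = random.Random(seed)
    w = [0.0] * len(samples[0])
    for t in range(1, n_iters + 1):
        # Pick one training example uniformly at random.
        i = rng.randrange(len(samples))
        x, y = samples[i], labels[i]
        eta = 1.0 / (lam * t)  # step size decays as 1 / (lam * t)
        margin = y * sum(wj * xj for wj, xj in zip(w, x))
        # Shrink the weights (gradient of the regulariser).
        w = [(1.0 - eta * lam) * wj for wj in w]
        # Add the hinge-loss sub-gradient only if the margin is violated.
        if margin < 1.0:
            w = [wj + eta * y * xj for wj, xj in zip(w, x)]
    return w


def predict(w, x):
    """Return +1 or -1 depending on the side of the hyperplane."""
    return 1 if sum(wj * xj for wj, xj in zip(w, x)) >= 0 else -1
```

Each iteration touches a single example, which is why this family of solvers scales to large training sets.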

Fast Indexing and Image Retrieval

FLANN – Library for performing fast approximate nearest neighbor search.
– Source code for Kernelized Locality-Sensitive Hashing (ICCV 2009).
– Code for generating small binary codes using Iterative Quantization and other baselines such as Locality-Sensitive Hashing (CVPR 2011).
– Efficient code for state-of-the-art large-scale image retrieval (CVPR 2011).
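
To give a feel for how binary codes speed up retrieval, here is a minimal sketch of plain random-hyperplane Locality-Sensitive Hashing. It is neither the kernelized variant nor Iterative Quantization from the packages listed above; the helper names `make_hyperplanes`, `hash_vector`, and `hamming` are hypothetical and exist only for this example.

```python
import random


def make_hyperplanes(dim, n_bits, seed=0):
    """Draw n_bits random Gaussian hyperplane normals of dimension dim."""
    rng = random.Random(seed)
    return [[rng.gauss(0.0, 1.0) for _ in range(dim)] for _ in range(n_bits)]


def hash_vector(vec, hyperplanes):
    """Return a tuple of 0/1 bits, one per hyperplane.

    Each bit records on which side of the hyperplane the vector falls.
    """
    return tuple(
        1 if sum(h * v for h, v in zip(plane, vec)) >= 0 else 0
        for plane in hyperplanes
    )


def hamming(code_a, code_b):
    """Count the positions where two binary codes differ."""
    return sum(a != b for a, b in zip(code_a, code_b))


planes = make_hyperplanes(dim=3, n_bits=16)
code = hash_vector([1.0, 2.0, 3.0], planes)
# Vectors separated by a small angle tend to share most bits, so a small
# Hamming distance between codes suggests a likely near neighbor.
```

Comparing short codes by Hamming distance is much cheaper than comparing the original high-dimensional descriptors, which is the motivation behind the hashing-based entries in this list.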

Object Detection

– Very fast and accurate pedestrian detector (CVPR 2012).
– Excellent resource for pedestrian detection, with various links for state-of-the-art implementations.
– Enhanced implementation of the Viola-Jones real-time object detector, with trained models for face detection.
– Source code for branch-and-bound optimization for efficient object localization (CVPR 2008).
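
The Viola-Jones detector mentioned above relies on integral images to evaluate rectangular (Haar-like) features in constant time. The following is a minimal sketch of that building block only, not of the listed implementation; the function names `integral_image`, `rect_sum`, and `two_rect_feature` are hypothetical.

```python
def integral_image(img):
    """Build a zero-padded summed-area table.

    img is a list of rows of numbers. The result has one extra row and
    column, and ii[r][c] is the sum of img[0:r][0:c].
    """
    h, w = len(img), len(img[0])
    ii = [[0] * (w + 1) for _ in range(h + 1)]
    for r in range(h):
        row_sum = 0
        for c in range(w):
            row_sum += img[r][c]
            ii[r + 1][c + 1] = ii[r][c + 1] + row_sum
    return ii


def rect_sum(ii, top, left, height, width):
    """Sum of the pixels in a rectangle, using four table lookups."""
    bottom, right = top + height, left + width
    return ii[bottom][right] - ii[top][right] - ii[bottom][left] + ii[top][left]


def two_rect_feature(ii, top, left, height, width):
    """Left half minus right half of a window (width should be even)."""
    half = width // 2
    left_part = rect_sum(ii, top, left, height, half)
    right_part = rect_sum(ii, top, left + half, height, half)
    return left_part - right_part
```

Because every rectangle sum costs four lookups regardless of its size, a detector can evaluate many such features per window at low cost.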

3D Recognition

– Library for 3D image and point cloud processing.

Action Recognition

– Source code for action recognition based on the ActionBank representation (CVPR 2012).
– Software for computing space-time interest point descriptors.
– C++ code for activity recognition using the velocity histories of tracked keypoints (ICCV 2009).

Datasets

Attributes

– 30,475 images of 50 animal classes with 6 pre-extracted feature representations for each image.
– Attribute annotations for images collected from Yahoo and Pascal VOC 2008.
– 15,000 faces annotated with 10 attributes and fiducial points.
– 58,797 face images of 200 people with 73 attribute classifier outputs.
– 13,233 face images of 5,749 people with 73 attribute classifier outputs.
– 8,000 people with annotated attributes. Check also this link for another dataset of human attributes.
– Large-scale scene attribute database with a taxonomy of 102 attributes.
– Variety of attribute labels for the ImageNet dataset.
– Data for OSR and a subset of PubFig datasets. Check also this link for the WhittleSearch data.
– Images of shopping categories associated with textual descriptions.

Fine-grained Visual Categorization

– Hundreds of bird categories with annotated parts and attributes.
– 20,000 images of 120 breeds of dogs from around the world.
– 37-category pet dataset with roughly 200 images for each class. Pixel-level trimap segmentation is included.
– 832 images of 10 species of butterflies.

Face Detection

– UMass face detection dataset and benchmark (5,000+ faces).
– Classical face detection dataset.

Face Recognition

– Large collection of face recognition datasets.
– UMass unconstrained face recognition dataset (13,000+ face images).
– Includes the Face Recognition Grand Challenge (FRGC), Face Recognition Vendor Tests (FRVT), and others.
– Contains more than 750,000 images of 337 people, with 15 different views and 19 lighting conditions.
FERET – Classical face recognition dataset.
– Easy to use if you want to play with simple face datasets, including Yale, ORL, PIE, and Extended Yale B.
– Low-resolution face dataset captured from surveillance cameras.

Handwritten Digits

MNIST – Large dataset containing a training set of 60,000 examples and a test set of 10,000 examples.

Pedestrian Detection

– 10 hours of video taken from a vehicle, with 350K bounding boxes for about 2.3K unique pedestrians.
– Currently one of the most popular pedestrian detection datasets.
– Urban dataset captured from a stereo rig mounted on a stroller.
– Dataset with image pairs recorded in a crowded urban setting with an onboard camera.
– One of 20 categories in PASCAL VOC detection challenges.
– Small dataset captured from surveillance cameras.

Generic Object Recognition

– Currently the largest visual recognition dataset in terms of number of categories and images.
– 80 million low-resolution (32x32) images.
– One of the most influential visual recognition datasets.
/ – Popular image datasets containing 101 and 256 object categories, respectively.
– Online annotation tool for building computer vision databases.

Scene Recognition

– MIT scene understanding dataset.

Feature Detection and Description

– Widely used dataset for measuring the performance of feature detection and description. An evaluation framework is also available.

Action Recognition

– CVPR 2012 tutorial covering various datasets for action recognition.

RGBD Recognition

– Dataset containing 300 common household objects.

