1. 程式人生 > >計算機視覺常用開源庫

計算機視覺常用開源庫

  •  – Implementation of a unified approach for face detection, pose estimation, and landmark localization (CVPR 2012).

    Attributes and Semantic Features

    •  – Modified implementation of RankSVM to train Relative Attributes (ICCV 2011).

    •  – Implementation of object bank semantic features (NIPS 2010). See also 

    •  – Software for extracting high-level image descriptors (ECCV 2010, NIPS 2011, CVPR 2012).


    Large-Scale Learning

    •  – Source code for fast additive kernel SVM classifiers (PAMI 2013).

    •  – Library for large-scale linear SVM classification.

    •  – Implementation for Pegasos SVM and Homogeneous Kernel map.


    Fast Indexing and Image Retrieval

    • FLANN – Library for performing fast approximate nearest neighbor.

    •  – Source code for Kernelized Locality-Sensitive Hashing (ICCV 2009).

    •  – Code for generation of small binary codes using Iterative Quantization and other baselines such as Locality-Sensitive-Hashing (CVPR 2011).

    •  – Efficient code for state-of-the-art large-scale image retrieval (CVPR 2011).


    Object Detection

    •  – Very fast and accurate pedestrian detector (CVPR 2012).

    •  – Excellent resource for pedestrian detection, with various links for state-of-the-art implementations.

    •  – Enhanced implementation of Viola&Jones real-time object detector, with trained models for face detection.

    •  – Source code for branch-and-bound optimization for efficient object localization (CVPR 2008).


    3D Recognition

    •  – Library for 3D image and point cloud processing.


    Action Recognition

    •  – Source code for action recognition based on the ActionBank representation (CVPR 2012).

    •  – software for computing space-time interest point descriptors

    •  - C++ code for activity recognition using the velocity histories of tracked keypoints (ICCV 2009)


    Datasets

    Attributes

    •  – 30,475 images of 50 animals classes with 6 pre-extracted feature representations for each image.

    •  – Attribute annotations for images collected from Yahoo and Pascal VOC 2008.

    •  – 15,000 faces annotated with 10 attributes and fiducial points.

    •  – 58,797 face images of 200 people with 73 attribute classifier outputs.

    •  – 13,233 face images of 5,749 people with 73 attribute classifier outputs.

    •  – 8,000 people with annotated attributes. Check also this link for another dataset of human attributes.

    •  – Large-scale scene attribute database with a taxonomy of 102 attributes.

    •  – Variety of attribute labels for the ImageNet dataset.

    •  – Data for OSR and a subset of PubFig datasets. Check also this link for the WhittleSearch data.

    •  – Images of shopping categories associated with textual descriptions.


    Fine-grained Visual Categorization

    •  – Hundreds of bird categories with annotated parts and attributes.

    •  – 20,000 images of 120 breeds of dogs from around the world.

    •  – 37 category pet dataset with roughly 200 images for each class. Pixel level trimap segmentation is included.

    •  – 832 images of 10 species of butterflies.

    Face Detection

    •  – UMass face detection dataset and benchmark (5,000+ faces)

    •  – Classical face detection dataset.


    Face Recognition

    •  – Large collection of face recognition datasets.

    •  – UMass unconstrained face recognition dataset (13,000+ face images).

    •  – includes face recognition grand challenge (FRGC), vendor tests (FRVT) and others.

    •  – contains more than 750,000 images of 337 people, with 15 different views and 19 lighting conditions.

    • FERET – Classical face recognition dataset.

    •  – Easy to use if you want play with simple face datasets including Yale, ORL, PIE, and Extended Yale B.

    •  – Low-resolution face dataset captured from surveillance cameras.


    Handwritten Digits

    • MNIST – large dataset containing a training set of 60,000 examples, and a test set of 10,000 examples.


    Pedestrian Detection

    •  – 10 hours of video taken from a vehicle,350K bounding boxes for about 2.3K unique pedestrians.

    •  – Currently one of the most popular pedestrian detection datasets.

    •  – Urban dataset captured from a stereo rig mounted on a stroller.

    •  – Dataset with image pairs recorded in an crowded urban setting with an onboard camera.

    •  – One of 20 categories in PASCAL VOC detection challenges.

    •  – Small dataset captured from surveillance cameras.


    Generic Object Recognition

    •  – Currently the largest visual recognition dataset in terms of number of categories and images.

    •  – 80 million 32x32 low resolution images.

    •  – One of the most influential visual recognition datasets.

    •  /  – Popular image datasets containing 101 and 256 object categories, respectively.

    •  – Online annotation tool for building computer vision databases.


    Scene Recognition

    •  – MIT scene understanding dataset.

    Feature Detection and Description

    •  – Widely used dataset for measuring performance of feature detection and description. Checkfor an evaluation framework.


    Action Recognition

    •  – CVPR 2012 tutorial covering various datasets for action recognition.


    RGBD Recognition

    •  – Dataset containing 300 common household objects


    Reference:


    特徵提取 機器視覺 綜合程式碼 主頁程式碼 行人檢測 視覺壁障 物體檢測演算法 人臉檢測 ICA獨立成分分析 濾波演算法 路面識別 分割演算法
    • MATLAB Normalized Cuts Segmentation Code:

  • 相關推薦

    計算機視覺常用開源

     – Implementation of a unified approach for face detection, pose estimation, and landmark localization (CVPR 2012). Attributes and Semantic Featur

    go語言常用開源整理

    red ces href app ptc github 時間插件 ova 單元 框架 https://github.com/go-martini/martini 圖形驗證碼 https://github.com/dchest/captcha ORM https://gith

    Android常用開源的使用——————————————LitePal的使用方法

    LitePal是一款開源Android資料庫框架,採用了物件關係對映的模式, 詳細的使用文件見LitePal專案的Github主頁 快速設定步驟: 1. Include library Edit your build.gradle file and add below dependency. If

    Android常用開源———————————OkHttp的使用方法

    OkHttp是一個優秀的網路通訊庫,GitHub專案主頁地址。   基本使用 在專案中中新增OkHttp庫的依賴。. implementation("com.squareup.okhttp3:okhttp:3.12.0")   OkHttp進行Get請求 //

    Android常用開源專案,種類齊全,覆蓋面廣

    宣告: 眾多開源庫出現的目的是為了不重複造輪子! 真正的高手切記,知其然,知其所以然!多花點時間,找主流的開源庫研究原始碼,成長更大!從中窺探掌握本質的技術和原理,萬變不離其宗! 推薦理由: 這份

    計算機視覺常用的評價標準

    計算機視覺中常用的評價標準 1 召回率 Recall,又稱“查全率”——還是查全率好記,也更能體現其實質意義。 2 準確率 Precision,又稱“精度”、“正確率”。 以檢索為例,可以把搜尋情況用下圖表示: 相關 不

    bootstrap常用開源,cdn加速

    http://open.bootcss.com/ 常用cdn加速 http://cdn.bootcss.com/bootstrap/3.2.0/css/bootstrap.min.css http://cdn.bootcss.com/bootstrap/3.2.0/js/b

    計算機視覺演算法開源實現 程式碼程式 Computer Vision Algorithm Implementations

    Participate in Reproducible Research General Image Processing (C/C++ code, BSD lic) Image manipulation, matrix manipulation, transforms

    Java常用開源

    Java的經久不衰,很大程度上得益於Java的生態好。在日常開發中,我們也會經常使用到各種開源庫和工具類,為了避免重複造輪子,本文將貼出工作及學習中會用到的部分開源庫和工具類。Java的生態實在太大,這裡只能列舉一部分。如果你對此感興趣,不妨去讀讀他們的原始碼。 vHTML解析器jsoup 1.1 介

    OpenCV 4.0.1 和 3.4.5 釋出,Intel 開源計算機視覺

       OpenCV 4.0.1 和 3.4.5 已釋出,OpenCV 是 Intel 開源的計算機視覺庫。它由一系列 C 函式和少量 C++ 類構成,實現了影象處理和計算機視覺方面的很多通用演算法。OpenCV 擁有包括 300 多個 C 函式的跨平臺的中、高層 API。 這兩

    計算機視覺、機器學習等開源網站連結

    持續跟新 場景識別: SegNet: A Deep Convolutional Encoder-Decoder Architecture for Robust Semantic Pixel-Wise Labelling https://gi

    『python』OpenCV3計算機視覺安裝

    targe cond 最新 cnblogs enc 2.0 sent lan oca 1.下載OpenCV: https://codeload.github.com/Itseez/opencv/zip/3.0.0 2.解壓下載相關依賴: sudo apt-get inst

    常用牛人主頁鏈接(計算機視覺、模式識別、機器學習相關方向,陸續更新。。。。)【轉】

    short psu works charles 貝葉斯 learning 數學 ocr 相關 轉自:http://blog.csdn.net/goodshot/article/details/53214935 目錄(?)[-] The Kalman

    『python』計算機視覺_OpenCV3安裝

    找不到 新建 imp size libs redis pac nload .py Anaconda打包安裝: conda install --channel https://conda.anaconda.org/menpo opencv3 驗證: >>&g

    常用牛人主頁鏈接(計算機視覺、模式識別、機器學習相關方向,陸續更新

    bject detail ebo abi err 技術 arch college eoj 牛人主頁(主頁有很多論文代碼) Serge Belongie at UC San Diego Antonio Torralba at MIT Alexei Ffros at CMU C

    常用Java開源(新手必看)

    調用 filters 服務器 布局 最小 輸出 處理 eclips parameter Jakarta common: Commons LoggingJakarta Commons Logging (JCL)提供的是一個日誌(

    轉:計算機視覺人臉相關開源專案總結

    原文:https://blog.csdn.net/chaipp0607/article/details/78885720   openface openface是一個基於深度神經網路的開源人臉識別系統。該系統基於谷歌的文章《FaceNet: A Unified Embeddin

    幾個常用的Excel開源

    之前試過 SpreadSheet,該類庫對Excel的操作是基於odbc的,而且還只能使用MFC來操作。當然,可以把SpreadSheet封裝成dll,然後給win32的程式來呼叫,我就是這麼幹的。 但是由於SpreadSheet是基於ODBC的,資料匯出的時候太慢了(實測:大概一秒鐘處理5

    2017 Android GitHub 常用開源

    原文地址 現在 GitHub 上流行的開源庫極大地節省了開發者從 0 開發的時間,很多公司和個人都在 GitHub 上開源自己的專案,今天我們就來整理一下 Android 開發中一些非常流行的庫,也是我們必須掌握的,這樣可以使我們在使用到時快速的查詢到,這裡的總結基本也都是自己在開發中用到的,也就

    深度學習常用資料集資源(計算機視覺領域)

    目錄   1、MNIST  2、ImageNet  4、COCO  5、PASCAL VOC 6、FDDB 1、MNIST  深度學習領域的入門資料集,當前主流的深度學習框架幾乎都將MNIST資料集的處理