#Reading Source Code + Papers# 3D Point Cloud Segmentation: Deep Learning Based Semantic Labelling of 3D Point Cloud in Visual SLAM

阿新 • Published: 2018-11-27

Notes from: Deep Learning Based Semantic Labelling of 3D Point Cloud in Visual SLAM

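A supervoxel method is used for pre-segmentation: the point cloud is grouped by similarity into surface patches, which reduces the computational complexity.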

The scene segmentation problem is then converted into a graph partitioning problem.
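To make the graph formulation concrete, here is a minimal sketch, assuming each surface patch has an integer id and that the pairs of adjacent patches are already known. The names `build_patch_graph`, `connected_components`, `adjacent_pairs` and `cut_edges` are hypothetical and are not taken from the paper. Each patch becomes a node, each adjacency becomes an edge, and once some edges are cut the remaining connected components are the segments.

```python
from collections import defaultdict


def build_patch_graph(adjacent_pairs):
    """Build an undirected graph: one node per patch, one edge per adjacent pair."""
    graph = defaultdict(set)
    for a, b in adjacent_pairs:
        graph[a].add(b)
        graph[b].add(a)
    return graph


def connected_components(graph, nodes, cut_edges=()):
    """Return the segments (sorted lists of patch ids) left after removing cut_edges."""
    cut = {frozenset(edge) for edge in cut_edges}
    seen, segments = set(), []
    for start in sorted(nodes):
        if start in seen:
            continue
        stack, segment = [start], []
        seen.add(start)
        while stack:
            node = stack.pop()
            segment.append(node)
            for neighbour in graph[node]:
                # Skip edges that the partitioning step decided to cut.
                if neighbour not in seen and frozenset((node, neighbour)) not in cut:
                    seen.add(neighbour)
                    stack.append(neighbour)
        segments.append(sorted(segment))
    return segments


# Five patches in a chain; cutting the edge between patch 2 and patch 3
# splits the scene into two segments.
pairs = [(0, 1), (1, 2), (2, 3), (3, 4)]
graph = build_patch_graph(pairs)
segments = connected_components(graph, range(5), cut_edges=[(2, 3)])
```

How the edges to cut are chosen is exactly what the two methods in these notes differ on; this sketch only shows the data structure.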

Method 1: use the mean-shift clustering algorithm, based on the distances between nodes

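A node is a single patch, and the line connecting two nodes is the shared edge between two adjacent patches;
the distance can be the Euclidean distance or the Mahalanobis distance;
for the mean-shift algorithm, see "A Brief Introduction and Python Implementation" or "Simple Machine Learning Algorithms: the Mean-shift Algorithm".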

Drawback: the computational cost is too high.
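As a rough illustration of Method 1, the following is a minimal flat-kernel mean-shift sketch in plain Python using the Euclidean distance. It assumes each node is described by a small feature vector such as the patch centroid; the function names, the `bandwidth` and `merge_radius` parameters, and the sample data are all made up for illustration and do not come from the paper.

```python
import math


def mean_shift(points, bandwidth, max_iter=50, tol=1e-6):
    """Flat-kernel mean shift: move each point to the mean of its neighbours until it settles."""
    modes = [list(p) for p in points]
    for _ in range(max_iter):
        moved = 0.0
        for i, m in enumerate(modes):
            # Neighbours are the original points within `bandwidth` (Euclidean distance).
            near = [p for p in points if math.dist(m, p) <= bandwidth]
            new_m = [sum(coord) / len(near) for coord in zip(*near)]
            moved = max(moved, math.dist(m, new_m))
            modes[i] = new_m
        if moved < tol:
            break
    return modes


def label_modes(modes, merge_radius):
    """Assign the same label to modes that converged to nearly the same location."""
    centers, labels = [], []
    for m in modes:
        for k, c in enumerate(centers):
            if math.dist(m, c) <= merge_radius:
                labels.append(k)
                break
        else:
            centers.append(m)
            labels.append(len(centers) - 1)
    return labels


# Two well-separated groups of (hypothetical) patch centroids.
points = [(0, 0), (0.1, 0), (0, 0.1), (5, 5), (5.1, 5), (5, 5.1)]
modes = mean_shift(points, bandwidth=1.0)
labels = label_modes(modes, merge_radius=0.5)
```

Every iteration of this naive version compares each mode with every point, so one pass costs O(n²) distance evaluations, which is consistent with the drawback noted above.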

Method 2: cluster using the normal vectors of the surface patches

Normal vectors can express local convexity information.

Drawback: reliability drops when there is too much noise.
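One common way to read convexity off the normals of two adjacent patches is the sign of (n1 - n2) · (c1 - c2), where c1, c2 are the patch centroids and n1, n2 are unit normals oriented consistently (for example towards the sensor). The sketch below assumes that criterion; whether the paper uses exactly this test is not stated in these notes, and the function name and the `tol` parameter are hypothetical.

```python
def is_convex_connection(c1, n1, c2, n2, tol=0.0):
    """Return True if two adjacent patches meet convexly.

    c1, c2: patch centroids; n1, n2: unit normals with a consistent orientation.
    The test is (n1 - n2) . (c1 - c2) > tol.
    """
    d = [a - b for a, b in zip(c1, c2)]
    dn = [a - b for a, b in zip(n1, n2)]
    return sum(x * y for x, y in zip(dn, d)) > tol


# Outer edge of a box: top face and side face, normals pointing outwards.
box_edge = is_convex_connection((0, 0, 1), (0, 0, 1), (1, 0, 0), (1, 0, 0))

# Inner corner of a room: floor and wall, normals pointing into the room.
room_corner = is_convex_connection((1, 0, 0), (0, 0, 1), (0, 0, 1), (1, 0, 0))
```

Because the decision depends only on the sign of a dot product, a small error in a noisy normal can flip the result for nearly coplanar patches, which is one way to understand the drawback above. A positive `tol` makes the test more conservative.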

In the end, Method 2 is combined with reliable planes to do the segmentation, and graph cut is used for the final partition.

On 2D Object Detection and Semantic Segmentation

An essential component to get semantic information is object detection, which can localize object instances in images. Girshick et al. [21] presented R-CNN, which proposed to apply CNN to object detection. Other similar methods have been proposed in recent years, like Fast R-CNN [22], Faster R-CNN [23], Mask R-CNN [24] and YOLO [25-26]. R-CNN uses selective search algorithm for generating region proposals, which runs very slow. Faster R-CNN replaces the slow selective search algorithm with a fast neural net. Mask R-CNN improves the region of interest (ROI) pooling layer and extends Faster R-CNN to pixel-level image segmentation.

Semantic segmentation is to understand an image at a pixel level, which can label each pixel with a class identity. Similar to object detection, state-of-the-art semantic segmentation approaches also rely on CNN. FCN [4] by Long et al. is the first end-to-end system, which popularizes CNN architecture for semantic segmentation. U-Net [5] is a popular encoder-decoder architecture which can make use of annotated samples more efficiently and have a higher accuracy. SegNet [6] is a similar encoder-decoder architecture. SegNet copies indices from max-pooling for up-sampling, which makes it more memory efficient. RefineNet [7] proposes a method called RefineNet block which fuses both high resolution and low resolution features. It solves the problem of significant decrease in image resolution when we repeat the sub-sampling operation. PSPNet [8] introduces a pyramid pooling method to aggregate the context. DeepLab [9-11] utilizes dilated convolutions to increase the field of view.
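The remark about SegNet reusing max-pooling indices can be illustrated with a tiny 1-D sketch in plain Python. This is not SegNet code: the function names and the sample signal are made up, and a real network would do this in 2-D on feature maps. It only shows why storing the positions of the maxima is enough to up-sample without keeping the full pre-pooling feature map.

```python
def max_pool_with_indices(x, size=2):
    """1-D max pooling that also records where each maximum came from."""
    pooled, indices = [], []
    for start in range(0, len(x) - size + 1, size):
        window = x[start:start + size]
        offset = max(range(size), key=window.__getitem__)
        pooled.append(window[offset])
        indices.append(start + offset)
    return pooled, indices


def max_unpool(pooled, indices, length):
    """Put each pooled value back at its recorded position; fill the rest with zeros."""
    out = [0] * length
    for value, idx in zip(pooled, indices):
        out[idx] = value
    return out


x = [1, 3, 2, 0, 5, 4]
pooled, indices = max_pool_with_indices(x)       # [3, 2, 5] and [1, 2, 4]
restored = max_unpool(pooled, indices, len(x))   # [0, 3, 2, 0, 5, 0]
```

The decoder only needs `indices` (one integer per pooled value) from the encoder, which is where the memory saving comes from.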
