1. 程式人生 > >Ubuntu16.04+GeForce GTX 1070Ti+CUDA8.0+cuDNN5.1+TensorFlow1.2+tf-faster-rcnn訓練

Ubuntu16.04+GeForce GTX 1070Ti+CUDA8.0+cuDNN5.1+TensorFlow1.2+tf-faster-rcnn訓練

1、下載CUDA8.0和CUDNN5.1

百度網盤下載地址(包含8.0和9.0):https://pan.baidu.com/s/1ir3rKhUtU1aIRE7n1BQ5mg

2、安裝CUDA8.0

安裝方式1(字尾為.deb的):

sudo dpkg -i cuda-repo-ubuntu1604-8-0-local-ga2_8.0.61-1_amd64.deb
sudo apt-get update
sudo apt-get install cuda

解除安裝CUDA方式:

sudo apt autoremove cuda

安裝方式2(字尾為.run的):

sudo sh cuda_8.0.44_linux.run

然後一直Enter到100%,並輸入accept

接著出現 Install NVIDIA Accelerated Graphics Driver for Linux-x86_64 361.62?

一定要選擇No,否則之前的驅動安裝就白安了

安裝完畢後會出現以下內容:

=========== 
= Summary = 
===========

Driver: Not Selected 
Toolkit: Installed in /usr/local/cuda-8.0 
Samples: Installed in /home/textminer

Please make sure that 
– PATH includes /usr/local/cuda-8.0/bin 
– LD_LIBRARY_PATH includes /usr/local/cuda-8.0/lib64, or, add /usr/local/cuda-8.0/lib64 to /etc/ld.so.conf and run ldconfig as root

To uninstall the CUDA Toolkit, run the uninstall script in /usr/local/cuda-8.0/bin

Please see CUDA_Installation_Guide_Linux.pdf in /usr/local/cuda-8.0/doc/pdf for detailed information on setting up CUDA.

***WARNING: Incomplete installation! This installation did not install the CUDA Driver. A driver of version at least 361.00 is required for CUDA 8.0 functionality to work. 
To install the driver using this installer, run the following command, replacing with the name of this run file: 
sudo .run -silent -driver

Logfile is /opt/temp//cuda_install_6583.log

接著新增環境變數:

sudo gedit /etc/profile

 

開啟“profile”檔案,在末尾處新增(注意不要有空格,不然會報錯):

export PATH=/usr/local/cuda-8.0/bin:$PATH
export LD_LIBRARY_PATH=/usr/local/cuda-8.0/lib64$LD_LIBRARY_PATH

測試CUDA是否安裝成功,輸入:

nvcc -V

3、安裝CUDNN V5.1

下載完成之後進入下載目錄(將下載的安裝包拷貝到home資料夾下),執行以下命令進行解壓:

sudo tar -zxvf ./cudnn-8.0-linux-x64-v5.1.tgz

解壓之後,得到一個 cudn 資料夾,該資料夾下include 和 lib64 兩個資料夾,命令列進入 cuda/include 路徑下,然後進行以下操作:

cd cuda/include
sudo cp cudnn.h /usr/local/cuda/include #複製標頭檔案

再將進入lib64目錄下的動態檔案進行復制和連結:

cd ..
cd lib64
sudo cp lib* /usr/local/cuda/lib64/ #複製動態連結庫
cd /usr/local/cuda/lib64/
sudo rm -rf libcudnn.so libcudnn.so.5 #刪除原有動態檔案
sudo ln -s libcudnn.so.5.1.10 libcudnn.so.5 #生成軟銜接
sudo ln -s libcudnn.so.5 libcudnn.so #生成軟連結
sudo ldconfig

驗證CUDNN是否安裝成功:

cat /usr/local/cuda/include/cudnn.h | grep CUDNN_MAJOR -A 2

4、安裝Tensorflow-gpu 1.2

根據CUDA和CUDNN版本選擇Tensorflow版本,即:tensorflow_gpu-1.2.0

(1)、安裝tensorflow虛擬環境(由於支援python3.6,因此我選擇3.6安裝)

conda create -n tensorflow1.2 python=3.6

(2)、安裝tensorflow

pip install tensorflow-gpu==1.2.0

解除安裝命令:pip uninstall tensorflow-gpu==1.2.0

 

(3)、安裝其它依賴包

pip install Cython
pip install easydict
pip install opencv_python==3.3.1.11
pip install matplotlib
pip install Pillow
pip install scipy
pip install easydict

5、檢測tensorflow是否使用gpu進行計算,在pycharm裡配好工程環境後新建一個py檔案,輸入以下程式碼:

import tensorflow as tf
ess = tf.Session(config=tf.ConfigProto(log_device_placement=True))

輸出:

6、訓練tf-faster-rcnn

在使用CPU訓練的時候,平均速度是訓練一次需要2s多一點,使用CPU加速後,每訓練一次在0.5s左右,速度確實有了非常明顯

的提升。