Installing CUDA on Ubuntu 16.0

Deep learning server setup (Ubuntu 14.04 + GTX 1080 + CUDA 8.0): fixing the desktop login loop

Original blog post: http://www.cnblogs.com/Qwells/p/6086773.html#undefined

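The problem started here: I could not log in to the desktop, and the screen kept returning to the password prompt.

Conclusion drawn in the blog post:
A virtual machine cannot call the physical graphics card directly for CUDA programming. Running CUDA inside a virtual machine needs hardware and software support working together, which is probably out of reach for ordinary users for now.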

Reference posts:

Deep learning host setup: Ubuntu16.04+Nvidia GTX 1080+CUDA8.0

Deep learning host setup: Ubuntu16.04+GeForce GTX 1080+TensorFlow

ubuntu14.04+cuda8.0(GTX1080)+caffe installation

How to build a deep learning server

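Press Ctrl+Alt+F1 to switch to a text console, then stop the graphical interface:

sudo service lightdm stop # required, otherwise the installation fails
sudo /etc/init.d/lightdm stop # equivalent command

sudo chmod 755 NVIDIA-Linux-x86_64-367.27.run # make the installer executable
sudo ./NVIDIA-Linux-x86_64-367.27.run # install the driver

Accept
Continue installation
After the installation finishes: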

sudo service lightdm start

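The graphical interface came up, but after a shutdown and restart it asked for the password over and over, and I could not log in.

From the blog author: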

$ sudo /etc/init.d/gdm stop
$ sudo nvidia-installer --update
$ sudo /etc/init.d/gdm start

Upgrading to version 375 did not help: the desktop still would not start, and the login screen kept looping.

Install driver 367

Uninstall previous nvidia drivers.

$ sudo apt-get purge nvidia-*

Stop lightdm (graphical interface)

$ sudo service lightdm stop

Go to a tty (CTRL+ALT+F1) and set the init state to 3 (text-only mode). This step is important. Write these commands down on paper first: I have sometimes found that the tty does not show up with the newest driver, in which case I SSH into the machine as a workaround.

$ sudo init 3

Log in on the tty and cd to the directory where you downloaded the driver.

$ sudo ./NVIDIA-Linux-x86_64-367.35.run

It will ask if you want to install 32-bit libraries, say no (assuming you do not have a 32-bit OS, hopefully. If you do have a 32-bit OS it is a good idea to upgrade…)

In a few minutes it is done….smooth. Reboot your PC

$ sudo reboot

After the update I still could not get into the graphical interface.

Uninstall previous nvidia drivers.

sudo apt-get purge nvidia-*
sudo apt-get autoremove
sudo apt-get --purge remove nvidia-*

After the removal,

nvidia-smi

still shows the GPU. Why?

Did the uninstall not work?
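One likely explanation, offered here as an assumption rather than something the original post confirms: a driver installed from the NVIDIA .run file is not an apt package, so apt-get purge never touches it and nvidia-smi keeps working. The sketch below lists the NVIDIA packages that dpkg actually knows about. apt_nvidia_packages is a hypothetical helper name.

```shell
#!/bin/sh
# apt_nvidia_packages (hypothetical helper): read `dpkg -l` output on stdin and
# print the names of installed packages (state "ii") whose name starts with "nvidia".
apt_nvidia_packages() {
    awk '$1 == "ii" && $2 ~ /^nvidia/ { print $2 }'
}

# Only query dpkg when it exists, so the snippet is harmless on other systems.
if command -v dpkg >/dev/null 2>&1; then
    pkgs=$(dpkg -l 2>/dev/null | apt_nvidia_packages)
    if [ -z "$pkgs" ]; then
        echo "apt manages no NVIDIA packages; the driver probably came from a .run file"
    else
        echo "$pkgs"
    fi
fi
```

If the list is empty while nvidia-smi still works, the .run installer's own uninstall mode (sudo ./NVIDIA-Linux-x86_64-367.27.run --uninstall, or the nvidia-uninstall program it installs) is the usual way to remove that driver.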

sudo apt-get install nvidia-prime

$ sudo /etc/init.d/lightdm stop
$ sudo nvidia-installer --update
$ sudo /etc/init.d/lightdm start

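Upgrading to version 375 did not help this time either: the desktop still would not start, and the login screen kept looping.

Some people say the GTX 1080 driver has to be installed before the desktop environment is installed. Judging from the solution verified below, the installer option (--no-opengl-files) is what actually matters.

Solution

Use sudo gedit /etc/modprobe.d/blacklist-nouveau.conf to create the file blacklist-nouveau.conf and enter the following lines: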

blacklist nouveau

blacklist lbm-nouveau

options nouveau modeset=0

alias nouveau off

alias lbm-nouveau off
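On a machine where gedit is not available (for example over SSH), the same file can be produced from the shell. This is only a sketch: write_nouveau_blacklist is a hypothetical helper, and it writes to a temporary file first so the content can be inspected before it is copied into place.

```shell
#!/bin/sh
# write_nouveau_blacklist FILE (hypothetical helper): write the five
# blacklist lines listed above into FILE, replacing any previous content.
write_nouveau_blacklist() {
    cat > "$1" <<'EOF'
blacklist nouveau
blacklist lbm-nouveau
options nouveau modeset=0
alias nouveau off
alias lbm-nouveau off
EOF
}

tmp=$(mktemp)
write_nouveau_blacklist "$tmp"
cat "$tmp"   # review the generated content

# To install it (assumption: Ubuntu, where the initramfs is typically rebuilt
# so that the blacklist also applies during early boot):
#   sudo install -m 644 "$tmp" /etc/modprobe.d/blacklist-nouveau.conf
#   sudo update-initramfs -u
```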

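Save and exit. This step disables nouveau, the open-source driver that ships with Ubuntu. Then reboot the system with sudo reboot. After the reboot, run this in a terminal: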

lsmod | grep nouveau

to check whether the nouveau module is loaded. If it prints nothing, continue with the next step.
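The same check can be scripted so that it gives an explicit answer instead of empty output. A minimal sketch, assuming the usual lsmod layout with the module name in the first column. nouveau_loaded is a hypothetical helper name, and unlike a plain grep it ignores lines that only mention nouveau as a dependency.

```shell
#!/bin/sh
# nouveau_loaded (hypothetical helper): read lsmod-style output on stdin and
# succeed only if a module named exactly "nouveau" is listed in column one.
nouveau_loaded() {
    awk '$1 == "nouveau" { found = 1 } END { exit !found }'
}

# Run the real check only where lsmod is available.
if command -v lsmod >/dev/null 2>&1; then
    if lsmod | nouveau_loaded; then
        echo "nouveau is still loaded; recheck the blacklist file and reboot"
    else
        echo "nouveau is not loaded; continue with the driver installation"
    fi
fi
```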

The root cause comes down to one installer option: --no-opengl-files

sudo /etc/init.d/lightdm stop
sudo ./NVIDIA-Linux-x86_64-375.20.run --no-opengl-files
sudo /etc/init.d/lightdm start

After that, the desktop login works normally!

Choices made during the installation:

Accept

Continue installation

register the kernel module sources with DKMS?

NO

Would you like to run the nvidia-xconfig utility to automatically update your X Configuration file so that the NVIDIA X driver will be used when you restart X?

NO

Install 32-Bit compatibility libraries?

NO

Installing CUDA 8.0

Run

sudo sh cuda_8.0.44_linux.run

The prompts and the answers given are as follows:

Description

This package includes over 100+ CUDA examples that demonstrate
various CUDA programming principles, and efficient CUDA
implementation of algorithms in specific application domains.
The NVIDIA CUDA Samples License Agreement is available in
Do you accept the previously read EULA?
accept/decline/quit: accept

Install NVIDIA Accelerated Graphics Driver for Linux-x86_64 367.48?
(y)es/(n)o/(q)uit: n

Install the CUDA 8.0 Toolkit?
(y)es/(n)o/(q)uit: y

Enter Toolkit Location
[ default is /usr/local/cuda-8.0 ]:

Do you want to install a symbolic link at /usr/local/cuda?
(y)es/(n)o/(q)uit: y

Install the CUDA 8.0 Samples?
(y)es/(n)o/(q)uit: y

Enter CUDA Samples Location
[ default is /home/c302 ]:

Installing the CUDA Toolkit in /usr/local/cuda-8.0
Installing the CUDA Samples in /home/c302 …
Copying samples to /home/c302/NVIDIA_CUDA-8.0_Samples now…
Finished copying samples.

===========
= Summary =

Driver: Not Selected
Toolkit: Installed in /usr/local/cuda-8.0
Samples: Installed in /home/c302

Please make sure that

  • PATH includes /usr/local/cuda-8.0/bin
  • LD_LIBRARY_PATH includes /usr/local/cuda-8.0/lib64, or, add /usr/local/cuda-8.0/lib64 to /etc/ld.so.conf and run ldconfig as root

To uninstall the CUDA Toolkit, run the uninstall script in /usr/local/cuda-8.0/bin

Please see CUDA_Installation_Guide_Linux.pdf in /usr/local/cuda-8.0/doc/pdf for detailed information on setting up CUDA.

***WARNING: Incomplete installation! This installation did not install the CUDA Driver. A driver of version at least 361.00 is required for CUDA 8.0 functionality to work.
To install the driver using this installer, run the following command, replacing <CudaInstaller> with the name of this run file:
sudo <CudaInstaller>.run -silent -driver

Logfile is /tmp/cuda_install_9045.log

Set the environment variables

export PATH=/usr/local/cuda-8.0/bin:$PATH
export LD_LIBRARY_PATH=/usr/local/cuda-8.0/lib64:$LD_LIBRARY_PATH

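To make the change permanent, add it to the system profile:
sudo vi /etc/profile

Append the two export lines above to the end of the file and save. Then apply the change to the current shell:

source /etc/profile # reload the environment variables now
sudo ldconfig # refresh the dynamic linker cache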
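A quick way to confirm that both variables took effect is sketched below. path_has is a hypothetical helper, and CUDA_HOME is just a local variable for this snippet, set to the toolkit location reported in the installer summary.

```shell
#!/bin/sh
# path_has DIR LIST (hypothetical helper): succeed if DIR is one of the
# colon-separated entries in LIST.
path_has() {
    case ":$2:" in
        *":$1:"*) return 0 ;;
        *) return 1 ;;
    esac
}

CUDA_HOME=/usr/local/cuda-8.0   # toolkit location from the installer summary

path_has "$CUDA_HOME/bin" "$PATH" \
    || echo "PATH is missing $CUDA_HOME/bin"
path_has "$CUDA_HOME/lib64" "${LD_LIBRARY_PATH:-}" \
    || echo "LD_LIBRARY_PATH is missing $CUDA_HOME/lib64"
```

The snippet prints nothing when both entries are present.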

Verify CUDA

c302@c302-dl:~/Downloads$ nvcc -V
nvcc: NVIDIA (R) Cuda compiler driver
Copyright (c) 2005-2016 NVIDIA Corporation
Built on Sun_Sep__4_22:14:01_CDT_2016
Cuda compilation tools, release 8.0, V8.0.44
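If the version needs to be checked from a script, the release number can be pulled out of that last line. A small sketch, assuming the "release X.Y" wording shown above. cuda_release is a hypothetical helper name.

```shell
#!/bin/sh
# cuda_release (hypothetical helper): read `nvcc -V` output on stdin and
# print the toolkit release number, for example 8.0.
cuda_release() {
    sed -n 's/.*release \([0-9][0-9]*\.[0-9][0-9]*\).*/\1/p'
}

# Query the real compiler only when nvcc is on the PATH.
if command -v nvcc >/dev/null 2>&1; then
    nvcc -V | cuda_release
fi
```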

Test the CUDA samples

cd /home/c302/NVIDIA_CUDA-8.0_Samples
make # this takes a while

The build ends with:

make[1]: Leaving directory `/home/c302/NVIDIA_CUDA-8.0_Samples/7_CUDALibraries/MersenneTwisterGP11213'

Finished building CUDA samples

cd 0_Simple/matrixMul

Running the test gives:

c302@c302-dl:~/NVIDIA_CUDA-8.0_Samples/0_Simple/matrixMul$ ./matrixMul
[Matrix Multiply Using CUDA] - Starting...
GPU Device 0: "GeForce GTX 1080" with compute capability 6.1

MatrixA(320,320), MatrixB(640,320)
Computing result using CUDA Kernel…
done
Performance= 1109.06 GFlop/s, Time= 0.118 msec, Size= 131072000 Ops, WorkgroupSize= 1024 threads/block
Checking computed result for correctness: Result = PASS

NOTE: The CUDA Samples are not meant for performance measurements. Results may vary when GPU Boost is enabled.
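For an automated check, the only line that matters in this output is the one ending in "Result = PASS". The sketch below tests for it. sample_passed is a hypothetical helper, and the snippet assumes it is run from the matrixMul directory.

```shell
#!/bin/sh
# sample_passed (hypothetical helper): read a CUDA sample's output on stdin
# and succeed only if it reports "Result = PASS".
sample_passed() {
    grep -q 'Result = PASS'
}

# Run the sample only if the binary has been built in the current directory.
if [ -x ./matrixMul ]; then
    if ./matrixMul | sample_passed; then
        echo "matrixMul passed"
    else
        echo "matrixMul did not report PASS"
    fi
fi
```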

The next post will cover installing cuDNN, TensorFlow, and other libraries.