GlusterFS 分散式儲存

阿新 • • 發佈：2019-01-30

一、GlusterFS概述；

概述：GlusterFS（Google File System）是一個開源的分散式檔案系統，Gluster 藉助 TCP/IP 網絡將儲存資源分散儲存在網路的不同節點，在通過匯聚為客戶端提供統一的資源訪問，在存儲方面具有很強大的橫向擴充套件能力，通過擴充套件不同的節點可以支援 PB 級別的儲存容量； Bit、Byte、KB、MB、GB、TB、PB、EB、ZB、YB、DB、NB

特點：

擴充套件性與高效能：通過 Scale-out 架構可以增加儲存節點的方式來提高容量和效能（磁碟、計算、I/O 資源都可以獨立增加），Gluster 彈性雜湊（Elastic Hash）解決了 Gluster 服務對元資料伺服器的依賴，Gluster 採用彈性雜湊演算法來確定資料在 chunk 節點中的分佈情況，無須元資料伺服器，實現了儲存的橫向擴充套件，改善了元資料伺服器節點的壓力以及單點故障；

高可用性：GlusterFS 通過配置不同型別的卷，可以對資料進行自動複製（類似於 RAID1），即使某節點故障，也不影響資料的訪問；

通用性：GlusterFS 沒有設定獨立的私有資料檔案系統，而是採用以往的 ext4、ext3 等，數據可以通過傳統的磁碟訪問方式被客戶端所訪問；

彈性卷管理：GlusterFS 通過將資料儲存在邏輯捲上，邏輯卷從邏輯儲存池進行獨立邏輯劃分，邏輯儲存池可以線上進行增加和刪除，不會導致業務中斷，邏輯卷的數量可以根據實際需求進行自行增加和縮減；

二、GlusterFS儲存架構；

專業術語：

Brick（儲存塊）：儲存池中節點對外提供儲存服務的目錄； Volume（邏輯卷）：一個邏輯卷時一組 Brick 的集合，卷是資料儲存的邏輯裝置，類似 LVM 中的邏輯卷，大部分 GlusterFS 管理操作都是在邏輯捲上進行的； FUSE（使用者空間檔案系統）：是一個核心模組，使用者自行建立掛載的的檔案系統； VFS（介面）：核心空間對使用者空間提供的訪問磁碟的介面； Glusterd（後臺管理程序）：在儲存叢集中的每個節點上都要執行；

三、GlusterFS工作原理；資料訪問程：

1. 首先是在客戶端，使用者通過 glusterfs 的 mount point 來讀寫資料，對於使用者來說，叢集系統的存在對使用者是完全透明的，使用者感覺不到是操作本地系統還是遠端的集群系統。

2. 使用者的這個操作被遞交給本地 linux 系統的 VFS 來處理。

3. VFS 將資料遞交給 FUSE 核心檔案系統：在啟動 glusterfs 客戶端以前，需要想系統註冊一個實際的檔案系統 FUSE，如上圖所示，該檔案系統與 ext3 在同一個層次上面，ext3 是對實際的磁碟進行處理，而 fuse 檔案系統則是將資料通過/dev/fuse 這個裝置檔案遞交給了 glusterfs client 端。所以我們可以將 fuse 檔案系統理解為一個代理。

4. 資料被 fuse 遞交給 Glusterfs client 後，client 對資料進行一些指定的處理（所謂的指定，是按照 client 配置檔案據來進行的一系列處理，我們在啟動 glusterfs client 時需要指定這個文件，其默認位置：/etc/glusterfs/client.vol）。

5. 在 glusterfs client 的處理末端，通過網路將資料遞交給 Glusterfs Server，並且將資料寫入到伺服器所控制的儲存裝置上。

四、GlusterFS卷的型別；分散式卷、條帶卷、複製卷、分散式條帶卷、分散式複製卷、條帶複製卷、分散式條帶復制卷；

1. 分散式卷

分散式卷是 GlusterFS 的預設卷，在建立卷時，預設選項是建立分散式卷。在該模式下，並沒有對檔案進行分塊處理，檔案直接儲存在某個 Server 節點上。由於使用本地檔案系統，所以存取效率並沒有提高，反而會因為網路通訊的原因而有所降低，另外支援超大型檔案也會有一定的難度，因為分散式卷不會對檔案進行分塊處理，一個檔案要麼在 Server1 上，要麼在 Serve2 上，不能分塊同時存放在 Sever1 和 Server2 上；

特點：

檔案分佈在不同的伺服器，不具備冗餘性；更容易且廉價地擴展卷的大小；單點故障會造成資料丟失；依賴底層的資料保護；

建立方法：

[[email protected] ~]# gluster volume create dis-volume server1:/dir1 server2:/dir2 Creation of dis -volume has been successful

Please start the volume to access data

2.條帶卷

Stripe 模式相當於 RAIDO，在該模式下，根據偏移量將檔案分成 N 塊，輪詢地儲存在每個 Brick Server 節點。節點把每個資料塊都作為普通檔案存入本地檔案系統中，通過擴充套件屬性記錄總塊數（Stripe-count）和每塊的序號（Stripe-index），在配置時指定的條帶數必須等於卷中 Brick 所包含的儲存伺服器數，在儲存大檔案時，效能尤為突出，但是不具備冗餘性；

特點：

資料被分割成更小塊分佈到塊伺服器群中的不同；分佈減少了負載且更小的檔案提高了存取速度；沒有資料冗餘；

建立方法：

[[email protected] ~]# gluster volume create stripe-volume stripe 2 transport tcp server1:/dir1 server2:/dir2

create of Stripe -volume has been successful please start the volume to access data

3.複製卷

也稱為 AFR（AutGilePepliatio）相當於 RAD1，即同一檔案儲存一份或多份副本。每個節點上儲存相同的內容和目錄結構。複製模式因為要儲存副本，所以磁碟利用率較低，複製卷時，複製數必須等於卷中 Brick 所包含的儲存伺服器數，複製卷具備冗餘性，即使一個節點損壞，也不影響資料的正常使用；

特點：

卷中所有的伺服器均儲存一個完整的副本；卷的副本數量可由客戶建立的時候決定；最少保證兩個塊伺服器或更多伺服器；具備冗餘效果；

建立方法：

[[email protected] ~]# gluster volume create rep-volume replica 2 transport tcp server1:/dir1 server2:/dir2

create of rep -volume has been successful please start the volume to access data

4.分散式條帶卷分散式條帶卷兼顧分散式卷和條帶卷的功能，可以理解成為大型的條帶卷，主要用於大檔案訪問處理，建立一個分散式條帶，卷最少需要 4 臺伺服器；

建立方法：

[[email protected] ~]# gluster volume create dis-stripe stripe 2 transport tcp server1:/dir1 server2:/dir2 server3:/dir3 server4:/dir4

create of dis-stripe has been successful please start the volume to access data

上述命令建立了一個名為 dis-stripe 的分散式條帶卷，配置分散式條帶卷時，卷中 Brick 所包

含的儲存伺服器必須是條帶數的倍數（大於等於 2 倍），如上述命令，Brick 的數量為 4，條帶數為 2；

5.分散式複製卷分散式複製卷兼顧分散式卷和複製卷的功能，可以理解成為大型的複製卷，主要用於冗餘的場景下，建立一個分散式複製卷，最少需要 4 塊 brick；

建立方法：

[[email protected] ~]# gluster volume create dis-rep replica 2 transport tcp server1:/dir1 server2:/dir2 server3:/dir3 server4:/dir4

create of dis-rep has been successful please start the volume to access data

6.條帶複製卷

條帶複製卷兼顧了條帶卷和複製卷兩者的優點，相當於 RADI 10，用於儲存效率高，備份冗餘的場景下，建立條帶複製卷，最少需要四個 brick；

建立方法：

[[email protected] ~]# gluster volume create test-volume stripe 2 replica 2 transport tcp server1:/dir1 server2:/dir2 server3:/dir3 server4:/dir4

create of test-volume has been successful please start the volume to access data

7.分散式條帶複製卷分散式條帶複製卷將分佈條帶資料在複製卷叢集。為了獲得最佳效果，可以選擇使用分佈在高併發的條帶複製卷環境下並行訪問非常大的檔案和效能是至關重要的；

五、案例：搭建Gluster分散式檔案系統；案例環境：

系統型別	IP 地址	主機名	所需軟體
Centos 7.4 1708 64bit	192.168.100.101	data1.linuxfan.cn	glusterfs glusterfs-server glusterfs-fuse glusterfs-rdma
Centos 7.4 1708 64bit	192.168.100.102	data2.linuxfan.cn	glusterfs glusterfs-server glusterfs-fuse glusterfs-rdma
Centos 7.4 1708 64bit	192.168.100.103	data3.linuxfan.cn	glusterfs glusterfs-server glusterfs-fuse glusterfs-rdma
Centos 7.4 1708 64bit	192.168.100.104	client.linuxfan.cn	glusterfs glusterfs-fuse

案例步驟：

Ø 配置主機之間的解析（在此所有主機配置相同，在此只列舉 data1 節點的配置）；

Ø 在所有 data 節點上安裝 GlusterFS（在此所有主機配置相同，在此只列舉 data1 節點的配置）；

Ø 在 data1 節點上進行建立叢集，其他節點會同步配置；

Ø 在多個 data 節點建立資料儲存的位置；

Ø 在 data1 節點建立資料儲存的卷（複製卷），其他節點會同步配置；

Ø 在 client 客戶端節點上安裝 gluster 客戶端工具並測試掛載；

Ø client 客戶端節點測試存放檔案；

Ø 擴充套件：Gluster 的管理命令；

Ø 配置主機之間的解析（在此所有主機配置相同，在此只列舉data1節點的配置）；

[[email protected] ~]# cat <<END >>/etc/hosts 192.168.100.101 data1.linuxfan.cn

1. 1. 1. data2.linuxfan.cn
    2. data3.linuxfan.cn
    3. client.linuxafn.cn END

[[email protected] ~]# ping data1.linuxfan.cn -c 2 ##ping 命令進行測試

PING data1.linuxfan.cn (192.168.100.101) 56(84) bytes of data.

64 bytes from data1.linuxfan.cn (192.168.100.101): icmp_seq=1 ttl=64 time=0.062 ms 64 bytes from data1.linuxfan.cn (192.168.100.101): icmp_seq=2 ttl=64 time=0.040 ms

Ø 在所有data節點上安裝GlusterFS（在此所有主機配置相同，在此只列舉data1節點的配置）；

[[email protected] ~]# yum -y install centos-release-gluster ## 安裝 gluster 包的

yum 源

[[email protected] ~]# yum -y install glusterfs glusterfs-server glusterfs-fuse glusterfs-rdma [[email protected] ~]# systemctl start glusterd

[[email protected] ~]# systemctl enable glusterd

Created symlink from /etc/systemd/system/multi-user.target.wants/glusterd.service to

/usr/lib/systemd/system/glusterd.service. [[email protected] ~]# netstat -utpln |grep glu

tcp 0 0 0.0.0.0:24007 0.0.0.0:* LISTEN

1313/glusterd

[[email protected] ~]# netstat -utpln |grep rpc

tcp	0	0	0.0.0.0:111	0.0.0.0:*	LISTEN
1311/rpcbind
udp	0	0	0.0.0.0:111	0.0.0.0:*
1311/rpcbind
udp	0	0	0.0.0.0:634	0.0.0.0:*
1311/rpcbind

Ø 在data1節點上進行建立叢集，其他節點會同步；

[[email protected] ~]# gluster peer probe data1.linuxfan.cn ##新增本機節點 peer probe: success. Probe on localhost not needed

[[email protected] ~]# gluster peer probe data2.linuxfan.cn ##新增 data2 節點

peer probe: success.

[[email protected] ~]# gluster peer probe data3.linuxfan.cn ##新增 data3 節點 peer probe: success.

[[email protected] ~]# gluster peer status ##檢視 gluster 集

群狀態

Number of Peers: 2

Hostname: data2.linuxfan.cn

Uuid: a452f7f4-7604-4d44-8b6a-f5178a41e308

State: Peer in Cluster (Connected)

Hostname: data3.linuxfan.cn

Uuid: b08f1b68-3f2c-4076-8121-1ab17d1517e1

State: Peer in Cluster (Connected)

Ø 在多個data節點建立資料儲存的位置；

[[email protected] ~]# mkdir /data [[email protected] ~]# gluster volume info No volumes present

Ø 在 data1節點建立資料儲存的卷（複製卷），其他節點會同步配置；

[[email protected] ~]# gluster volume create rep-volume replica 3 transport tcp data1.linuxfan.cn:/data data2.linuxfan.cn:/data data3.linuxfan.cn:/data force ##建立複製卷，名稱如上

volume create: rep-volume: success: please start the volume to access data [[email protected] ~]# gluster volume info

Volume Name: rep-volume Type: Replicate

Volume ID: ac59612b-e6ce-46ce-85a7-74262fb722b2 Status: Created

Snapshot Count: 0

Number of Bricks: 1 x 3 = 3 Transport-type: tcp

Bricks:

Brick1: data1.linuxfan.cn:/data Brick2: data2.linuxfan.cn:/data Brick3: data3.linuxfan.cn:/data Options Reconfigured: transport.address-family: inet nfs.disable: on performance.client-io-threads: off

[[email protected] ~]# gluster volume start rep-volume ##啟動該卷

volume start: rep-volume: success

Ø 在client客戶端節點上安裝gluster客戶端工具並測試掛載；

[[email protected] ~]# yum install -y glusterfs glusterfs-fuse

[[email protected] ~]# mount -t glusterfs data1.linuxfan.cn:rep-volume /mnt/ [[email protected] ~]# ls /mnt/

[[email protected] ~]# df -hT |tail -1

data1.linuxfan.cn:rep-volume fuse.glusterfs 19G 2.0G 17G 11% /mnt

Ø client 客戶端節點測試存放檔案；

[[email protected] ~]# touch /mnt/{1..10}.file

[[email protected] ~]# dd if=/dev/zero of=/mnt/1.txt bs=1G count=1 [[email protected] ~]# ls /mnt/

10.file 1.file 1.txt 2.file 3.file 4.file 5.file 6.file 7.file 8.file 9.file [[email protected] ~]# du -sh /mnt/1.txt

1.0G /mnt/1.txt

Ø 擴充套件：Gluster的管理命令；

Gluster Gluster Gluster

peer status

peer probe name peer detach name

##檢視所有的節點資訊

##新增節點

##刪除節點

Gluster Gluster

volume create xxx volume info

##建立卷

##檢視卷資訊

GlusterFS 分散式儲存

特點：

二、GlusterFS儲存架構；

三、GlusterFS工作原理；資料訪問程：

四、GlusterFS卷的型別；分散式卷、條帶卷、複製卷、分散式條帶卷、分散式複製卷、條帶複製卷、分散式條帶復制卷；

特點：

建立方法：

2.條帶卷

特點：

建立方法：

3.複製卷

特點：

建立方法：

建立方法：

建立方法：

6.條帶複製卷

建立方法：

五、案例：搭建Gluster分散式檔案系統；案例環境：

Ø 配置主機之間的解析（在此所有主機配置相同，在此只列舉data1節點的配置）；

Ø 在所有data節點上安裝GlusterFS（在此所有主機配置相同，在此只列舉data1節點的配置）；

Ø 在data1節點上進行建立叢集，其他節點會同步；

Ø 在多個data節點建立資料儲存的位置；

Ø 在 data1節點建立資料儲存的卷（複製卷），其他節點會同步配置；

Ø 在client客戶端節點上安裝gluster客戶端工具並測試掛載；

Ø client 客戶端節點測試存放檔案；

Ø 擴充套件：Gluster的管理命令；

相關推薦