
Oracle 11.2.0.4 RAC ASM startup failure


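On-site staff reported that a database failed to start and that the cluster resources also failed to start. They sent screenshots and logs for analysis.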

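Analysis of the logs showed that ASM had not started, so the OCR and the components that depend on it could not start either.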

Problem 1:

2018-05-08 11:59:39.980: [  OCRASM][1639659296]proprasmo: Error in open/create file in dg [OCRDG]
[  OCRASM][1639659296]SLOS : SLOS: cat=7, opn=kgfoAl06, dep=15077, loc=kgfokge

2018-05-08 11:59:39.980: [  OCRASM][1639659296]ASM Error Stack : ORA-15077: could not locate ASM instance serving a required diskgroup

2018-05-08 11:59:39.981: [  OCRASM][1639659296]proprasmo: kgfoCheckMount returned [7]
2018-05-08 11:59:39.981: [  OCRASM][1639659296]proprasmo: The ASM instance is down
2018-05-08 11:59:39.981: [  OCRRAW][1639659296]proprioo: Failed to open [+OCRDG]. Returned proprasmo() with [26]. Marking location as UNAVAILABLE.
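The signature in the excerpt above (ORA-15077 together with "The ASM instance is down") can be searched for mechanically. The following is a minimal sketch, not an Oracle-provided tool: the function name `asm_down_in_log` is hypothetical, and the log path is whatever copy of the log you have at hand.

```shell
# Hypothetical helper: succeeds (exit 0) if the given log file contains
# either marker of an unavailable ASM instance seen in the excerpt above.
asm_down_in_log() {
    # $1: path to a log file, for example a copy of the CRS daemon log
    grep -Eq 'ORA-15077|The ASM instance is down' "$1"
}

# Example use (the path is an assumption, adjust to your environment):
#   asm_down_in_log /tmp/crsd.log && echo "ASM was not available to the OCR"
```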


The second error suggests a likely problem with the heartbeat (private interconnect) network.

Problem 2:

2018-05-08 13:06:36.097: [    CSSD][3379095296]clssnmvDHBValidateNcopy: node 1, rac1, has a disk HB, but no network HB, DHB has rcfg 421249059, wrtcnt, 94952864, LATS 861554, lastSeqNo 94952861, uniqueness 1525754229, timestamp 1525755985/867374

A ping test confirmed that the private network itself was working.
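When many such CSSD messages scroll by, it helps to extract which node is being reported as having a disk heartbeat but no network heartbeat. The sketch below assumes the message layout shown in Problem 2; the function name `nodes_missing_network_hb` is hypothetical.

```shell
# Hypothetical helper: reads CSSD log lines on stdin and prints
# "<node number> <node name>" for every line that reports a disk
# heartbeat without a network heartbeat.
nodes_missing_network_hb() {
    sed -n 's/.*clssnmvDHBValidateNcopy: node \([0-9][0-9]*\), \([^,]*\), has a disk HB, but no network HB.*/\1 \2/p'
}

# Example use (the log path is an assumption):
#   nodes_missing_network_hb < /tmp/ocssd.log | sort -u
```

For the line quoted in Problem 2 this prints `1 rac1`, which is the node to check from the other node's side.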


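Problem 1 is straightforward: in an Oracle RAC deployment that uses ASM, the OCR is stored inside ASM, so the ASM instance must be running before the OCR can be opened.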

The ASM instance was not brought up after the server rebooted, so we had to start it manually.

sqlplus / as sysasm

startup
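Before moving on, it is worth confirming that the ASM instance really is up. One common check is to look for the ASM PMON background process, whose name begins with `asm_pmon_+ASM`. The sketch below is an illustration only; the function name `asm_pmon_present` is hypothetical.

```shell
# Hypothetical helper: reads a process listing on stdin and succeeds
# if it contains an ASM PMON process (e.g. asm_pmon_+ASM1).
# The [a] bracket keeps the pattern from matching grep's own command
# line when the input comes from a live `ps -ef`.
asm_pmon_present() {
    grep -q '[a]sm_pmon_+ASM'
}

# Example use on the cluster node:
#   ps -ef | asm_pmon_present && echo "ASM instance is running"
```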

With the ASM instance up, we started the cluster resources:

[root@rac1 ~]# crsctl start has
[root@rac1 ~]# crsctl start crs
[root@rac1 ~]# crsctl check crs

CRS-4638: Oracle High Availability Services is online
CRS-4537: Cluster Ready Services is online
CRS-4529: Cluster Synchronization Services is online
CRS-4533: Event Manager is online
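The four status lines above can also be checked in a script rather than by eye. The following is a minimal sketch that relies only on the message codes shown in the output above; the function name `crs_all_online` is hypothetical.

```shell
# Hypothetical helper: reads the output of `crsctl check crs` on stdin
# and succeeds only if all four expected services are reported online.
crs_all_online() {
    out=$(cat)
    for code in CRS-4638 CRS-4537 CRS-4529 CRS-4533; do
        # Each code must appear on a line that says "is online".
        printf '%s\n' "$out" | grep -q "^$code: .* is online" || return 1
    done
}

# Example use as root on the cluster node:
#   crsctl check crs | crs_all_online && echo "clusterware stack is up"
```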

Start the database instance resources and the rest of the cluster:

crsctl start cluster -all

We verified that logging in on node 1 worked normally.


Next, we dealt with the second node.


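Summary: the on-site staff were not familiar with the RAC startup sequence and mixed up commands that belong to the oracle, grid, and root users.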
