首页 > 编程语言 >11.2.0.1 Grid Infrastructure Installation Failed at Second Nodes While Running root.sh

11.2.0.1 Grid Infrastructure Installation Failed at Second Nodes While Running root.sh

时间:2023-06-25 18:06:18浏览次数:57  
标签:CRS 26 Infrastructure Installation 0.1 ctx 19 SKGXP 2010


 

11.2.0.1 Grid Infrastructure Installation Failed at Second Nodes While Running root.sh Due To ASM Crash Caused by lmon Timeout [ID 1239123.1]

--------------------------------------------------------------------------------
 
  修改时间 06-JAN-2011     类型 PROBLEM     状态 PUBLISHED  

In this Document
  Symptoms
  Cause
  Solution

 

--------------------------------------------------------------------------------

 

Applies to:
Oracle Server - Enterprise Edition - Version: 11.2.0.1 and later   [Release: 11.2 and later ]
Information in this document applies to any platform.

Symptoms
While installation Oracle Grid Infrastructure 11.2.0.1, root.sh has run successfully on the first node, but failed on the second node, indicating:  The OCR location in an ASM disk group is inaccessible.

alert_nodename.log:

2010-08-26 19:16:15.416
[cssd(17484)]CRS-1605:CSSD voting file is online: /db/app/oracle/ocr_vote_n01; details in /db/app/crs
/11.2_Grid_Home/log/rmodbd03/cssd/ocssd.log.
2010-08-26 19:16:17.432
[cssd(17484)]CRS-1601:CSSD Reconfiguration complete. Active nodes are d02 d03 .
2010-08-26 19:16:19.057
[ctssd(17512)]CRS-2403:The Cluster Time Synchronization Service on host d03 is in observer mode.
2010-08-26 19:16:19.063
[ctssd(17512)]CRS-2407:The new Cluster Time Synchronization Service reference node is host d02.
2010-08-26 19:16:19.961
[ctssd(17512)]CRS-2401:The Cluster Time Synchronization Service started on host d03.
2010-08-26 19:21:22.696
[ohasd(15890)]CRS-2757:Command 'Start' timed out waiting for response from the resource 'ora.asm'. Details at
(:CRSPE00111:) in /db/app/crs/11.2_Grid_Home/log/rmodbd03/ohasd/ohasd.log.
2010-08-26 19:21:24.798
[crsd(19090)]CRS-1013:The OCR location in an ASM disk group is inaccessible. Details in /db/app/crs/11.2_Grid_Home/log/rmodbd03/crsd/crsd.log.
2010-08-26 19:21:25.427
[ohasd(15890)]CRS-2765:Resource 'ora.crsd' has failed on server 'd03'.
2010-08-26 19:21:26.523
[crsd(19119)]CRS-1013:The OCR location in an ASM disk group is inaccessible. Details in /db/app/crs/11.2_Grid_Home/log/rmodbd03/crsd/crsd.log.

alert_+ASM2.log shows:

Thu Aug 26 19:16:25 2010
Reconfiguration started (old inc 0, new inc 4)
ASM instance
List of instances:
1 2 (myinst: 2)
Global Resource Directory frozen
* allocate domain 0, invalid = TRUE
Communication channels reestablished
Thu Aug 26 19:21:57 2010
IPC Send timeout detected. Sender: ospid 17593 [oracle@rmodbd03 (PING)]
Receiver: inst 1 binc 63701371 ospid 7549
Thu Aug 26 19:22:16 2010
Received an instance abort message from instance 1
Please check instance 1 alert and LMON trace files for detail.
LMS0 (ospid: 17603): terminating the instance due to error 481

 

The lmon trace shows:

SKGXP:[fffffd7ffcbecd28.6]:[ctx]: (ms) prev wait(ms) before
SKGXP:[fffffd7ffcbecd28.7]:[ctx]: --------- -------------- ----------- --------- -----------
SKGXP:[fffffd7ffcbecd28.8]:[ctx]: 88 0 0 NORMAL TIMEDOUT
SKGXP:[fffffd7ffcbecd28.9]:[ctx]: 80 0 0 NORMAL TIMEDOUT
SKGXP:[fffffd7ffcbecd28.10]:[ctx]: 88 0 0 NORMAL TIMEDOUT

SKGXP:[fffffd7ffcbecd28.35]:[ctx]: admno 0x3911544a admport:
SKGXP:[fffffd7ffcbecd28.36]:[ctx]: SSKGXPT 0xfcbee024 flags SSKGXPT_LOCAL_PORT sockno 10 IP 192.168.1.78 UDP 40467

SKGXP:[fffffd7ffcbecd28.70]:[ctx]: flags=8 nreqs=1100 free_rbufs=1100 msgsz=8192 min_frag_sz_ach=8192
SKGXP:[fffffd7ffcbecd28.71]:[ctx]: OS Level Port
SKGXP:[fffffd7ffcbecd28.72]:[ctx]: SSKGXPT 0xfca36a80 flags SSKGXPT_LOCAL_PORT sockno 25 IP 192.168.1.178 UDP 40469
SKGXP:[fffffd7ffcbecd28.73]:[ctx]: OS Level Port ID
SKGXP:[fffffd7ffcbecd28.74]:[ctx]: SKGXPGPID Internet address 192.168.1.78 UDP port number 40469
SKGXP:[fffffd7ffcbecd28.317]:[obj]: SSKGXPT 0xfca2352c flags SSKGXPT_WRITE sockno 10 IP 192.168.1.162 UDP 63320
SKGXP:[fffffd7ffcbecd28.318]:[obj]: Remote data port
SKGXP:[fffffd7ffcbecd28.319]:[obj]: SSKGXPT 0xfca23598 flags SSKGXPT_WRITE sockno 10 IP 192.168.1.162 UDP 63322
SKGXP:[fffffd7ffcbecd28.320]:[obj]: next seqno 32770 last ack 32765 credits 3 total credits 8 ertt 16 resends on con 116390

SKGXP:[fffffd7ffcbecd28.70]:[ctx]: flags=8 nreqs=1100 free_rbufs=1100 msgsz=8192 min_frag_sz_ach=8192
ICMP Time exceeded during reassembly from bd02 (192.168.1.78)

The package size is 8k, the timeout of which matches the ping err message:

So it's due to the package size 8k package cannot go through the network.  This can be caused by the fact that the MTU size setting at NIC is appropriate for using jumbo frames but the MTU size setting is not right at the switch.

Note that this was an issue in versions prior to 11gR2, it would show as CRS hang on the second node.  Since 11gR2 Grid Infrastructure includes ASM, the symptom shows as an ASM crash due to the lmon timeout.

Cause
In this case, the MTU size had been set to 9000 but the switch was not configured to be compatible with that MTU size, so the related database package could not be transferred to the remote node,  causing ASM to crash on the second node (due to lmon timeout), which in turn prevented CSS from reading the OCR (from ASM).

$ ping -s 192.168.1.78 8192
-- Use 8k package size to ping the remote Note. It's failed at customer side.

Solution
The switch setting needs to be modified to accommodate the 8k package size ping for MTU size 9000. Following this change, ping command as below should succeed. Then the Grid Infrastructure installation should complete successfully.

$ ping -s IP 8192  (Solaris)
$ ping IP -s 8192 (Linux)

 相关内容

 

--------------------------------------------------------------------------------
产品
--------------------------------------------------------------------------------

•Oracle Database Products > Oracle Database > Oracle Database > Oracle Server - Enterprise Edition
关键字
--------------------------------------------------------------------------------
TIME OUT; INFRASTRUCTURE; ASM; ROOT.SH; 11GR2; GRID; LMON
错误
--------------------------------------------------------------------------------
CRS-1013; CRS-2765; CRS-2401; CRS-2757; CRS-2407; CRS-1601; CRS-2403; CRS-1605; ERROR 481

 

 

标签:CRS,26,Infrastructure,Installation,0.1,ctx,19,SKGXP,2010
From: https://blog.51cto.com/prudentwoo/6547451

相关文章

  • PostgreSQL 10.12 安装系列 - Yum+rpm 安装
    Yum+rpm安装Yum仓库下载:https://download.postgresql.org/pub/repos/yum/reporpms/EL-7-x86_64/pgdg-redhat-repo-latest.noarch.rpm 安装yum仓库:[root@open_source~]#lsanaconda-ks.cfgpgdg-redhat-repo-latest.noarch.rpm[root@open_source~]#rpm-ivhpgdg-redhat-re......
  • vulnhun靶场:jangow-01-1.0.1
    kali:172.88.6.144靶场:172.88.6.64靶场地址:Jangow:1.0.1~VulnHub 访问ip  点击右上角的buscar   写入一句话:echo'<?phpeval($_POST['shell']);?>'>upfine.php  在wordpress里面有两个文件,分别是config.php、index.html 在/var/www/html/.back......
  • npm安装淘宝镜像cnpm报错:npm ERR! Windows_NT 10.0.19045
    ☺1、最终解决方法:镜像不需要下载,配置即可#设置代理为cnpmnpmconfigsetregistryhttps://registry.npm.taobao.org压根就不需要执行下载cnpm命令:npminstall-gcnpm--registry=https://registry.npm.taobao.org2、关于npm安装淘宝镜像的报错情况:和node版本有关......
  • macOS Catalina 10.15.7安装graphviz库
    参考:https://zhuanlan.zhihu.com/p/526601310说明:我通过参考链接里的【方法2:通过软件包管理工具MacPorts,进行间接安装graphviz库】,安装成功pipinstallgraphviz,这样安装的graphviz只是graphviz的调用接口,而并非graphviz程序;graphviz程序,通过sudoportinstallgraphviz在本......
  • AI文案撰写客户端 OpenAI ChatGPT v0.11.0
    本文转载自:AI文案撰写客户端OpenAIChatGPTv0.11.0更多内容请访问钻芒博客:https://www.zuanmang.net软件介绍ChatGPTv0.11.0是一款由 OpenAI 官方开发出品的深度学习技术的人工智能聊天机器人软件,它通过大量的语言训练,可以回答各种问题,如科技、历史、地理、数学等,并能生......
  • ubuntu10.10 下安装android 2.2开发环境
    一、安装Java6安装jre/jdksudoapt-getinstallsun-java6-binsun-java6-jresun-java6-jdk使用这个命令安装后,默认是安装在/usr/lib/jvm/ 目录下,以我为例,我这里是/usr/lib/jvm/java-6-sun-1.6.0.24目录,/usr/lib/jvm/下还有一个快捷方式java-6-sun有时候会安装多个jdk,......
  • ERROR: nginx-1.22.1 installation failed.
    libraries.Youcaneitherdonotenablethemoduleorinstallthelibraries.make:***Noruletomaketarget'build',neededby'default'.Stop.make:***Noruletomaketarget'install'.Stop.================================......
  • Ubuntu10.04升级10.10
    默认的升级是没有升级到10.10的选项的,可以采用下面步骤升级:1、在网络运行正常的情况下,Alt+F2打开运行update-manage-d,然后就会出现下面窗口,按照提示升级即可。......
  • apache绑定于127.0.1.1
    Ubuntu下装Apache后,有时候,会绑定的地址为127.0.1.1。即提示信息:apache2:Couldnotreliablydeterminetheserver'sfullyqualifieddomainname,using127.0.1.1forServerName其实只要是127.开头的都一样,都是回环地址。你随便访问一个127.*都访问到的是本机。不过看起来......
  • Windows 2008服务器多界面和IIS的安装教程 140.210.16.x
    当你在使用服务器时是否有遇到这样一个问题?当你正在服务器里进行工作时,突然一个小伙伴在没有告知你的情况下进入了服务器里,导致你服务器失去连接了,这种情况是非常常见的现象。主要原因就是因为服务器没有安装多界面,服务器多开界面是占用的同一台服务器的资源,服务器多开数量没有限制......