首页 > 其他分享 >gpu机器没有开启ipv6

gpu机器没有开启ipv6

时间:2024-04-27 18:22:19浏览次数:16  
标签:lid InfiniBand CA 开启 version ipv6 gpu GUID Port

 

 

参考:

https://blog.csdn.net/asdfaa/article/details/137884414

 

检查系统是否支持 IPv6,查看被禁用了


在启用 IPv6 之前,首先要确保您的系统支持 IPv6。要检查内核是否启用了 IPv6,可以运行以下命令:

cat /proc/sys/net/ipv6/conf/all/disable_ipv6

如果返回的结果为 0,则说明您的系统支持 IPv6。如果结果为 1,您需要启用 IPv6,然后重新检查。

 

检查下面两个服务。opensmd 没有启动,直接启动,然后用ibstat命令

systemctl status opensmd
systemctl status openibd

 

ibstat
cat /proc/sys/net/ipv6/conf/all/disable_ipv6
echo 1 >/proc/sys/net/ipv6/conf/all/disable_ipv6

 

一开始端口是down, 现在是active,然后就可以用了

 

root@gpumcw2:/home/mcw# ibstat
CA 'mlx5_0'
        CA type: MT4123
        Number of ports: 1
        Firmware version: 20.38.1002
        Hardware version: 0
        Node GUID: 0x946dae03009cc36c
        System image GUID: 0x946dae03009cc36c
        Port 1:
                State: Initializing
                Physical state: LinkUp
                Rate: 200
                Base lid: 65535
                LMC: 0
                SM lid: 0
                Capability mask: 0xa651e848
                Port GUID: 0x946dae03009cc36c
                Link layer: InfiniBand
CA 'mlx5_1'
        CA type: MT4123
        Number of ports: 1
        Firmware version: 20.38.1002
        Hardware version: 0
        Node GUID: 0x946dae03009cc364
        System image GUID: 0x946dae03009cc364
        Port 1:
                State: Initializing
                Physical state: LinkUp
                Rate: 200
                Base lid: 65535
                LMC: 0
                SM lid: 0
                Capability mask: 0xa651e848
                Port GUID: 0x946dae03009cc364
                Link layer: InfiniBand
CA 'mlx5_2'
        CA type: MT4123
        Number of ports: 1
        Firmware version: 20.38.1002
        Hardware version: 0
        Node GUID: 0x946dae0300b00278
        System image GUID: 0x946dae0300b00278
        Port 1:
                State: Down
                Physical state: Polling
                Rate: 10
                Base lid: 65535
                LMC: 0
                SM lid: 0
                Capability mask: 0xa651e848
                Port GUID: 0x946dae0300b00278
                Link layer: InfiniBand
CA 'mlx5_3'
        CA type: MT4123
        Number of ports: 1
        Firmware version: 20.38.1002
        Hardware version: 0
        Node GUID: 0x946dae0300b00279
        System image GUID: 0x946dae0300b00278
        Port 1:
                State: Down
                Physical state: Polling
                Rate: 10
                Base lid: 65535
                LMC: 0
                SM lid: 0
                Capability mask: 0xa651e848
                Port GUID: 0x946dae0300b00279
                Link layer: InfiniBand
CA 'mlx5_4'
        CA type: MT4123
        Number of ports: 1
        Firmware version: 20.38.1002
        Hardware version: 0
        Node GUID: 0x946dae03009cc374
        System image GUID: 0x946dae03009cc374
        Port 1:
                State: Initializing
                Physical state: LinkUp
                Rate: 200
                Base lid: 65535
                LMC: 0
                SM lid: 0
                Capability mask: 0xa651e848
                Port GUID: 0x946dae03009cc374
                Link layer: InfiniBand
CA 'mlx5_5'
        CA type: MT4123
        Number of ports: 1
        Firmware version: 20.38.1002
        Hardware version: 0
        Node GUID: 0x946dae03009cef04
        System image GUID: 0x946dae03009cef04
        Port 1:
                State: Initializing
                Physical state: LinkUp
                Rate: 200
                Base lid: 65535
                LMC: 0
                SM lid: 0
                Capability mask: 0xa651e848
                Port GUID: 0x946dae03009cef04
                Link layer: InfiniBand
root@gpumcw2:/home/mcw#

 

root@gpumcw2:/home/mcw# ibstat
CA 'mlx5_0'
        CA type: MT4123
        Number of ports: 1
        Firmware version: 20.38.1002
        Hardware version: 0
        Node GUID: 0x946dae03009cc36c
        System image GUID: 0x946dae03009cc36c
        Port 1:
                State: Active
                Physical state: LinkUp
                Rate: 200
                Base lid: 2
                LMC: 0
                SM lid: 2
                Capability mask: 0xa651e84a
                Port GUID: 0x946dae03009cc36c
                Link layer: InfiniBand
CA 'mlx5_1'
        CA type: MT4123
        Number of ports: 1
        Firmware version: 20.38.1002
        Hardware version: 0
        Node GUID: 0x946dae03009cc364
        System image GUID: 0x946dae03009cc364
        Port 1:
                State: Active
                Physical state: LinkUp
                Rate: 200
                Base lid: 16
                LMC: 0
                SM lid: 2
                Capability mask: 0xa651e848
                Port GUID: 0x946dae03009cc364
                Link layer: InfiniBand
CA 'mlx5_2'
        CA type: MT4123
        Number of ports: 1
        Firmware version: 20.38.1002
        Hardware version: 0
        Node GUID: 0x946dae0300b00278
        System image GUID: 0x946dae0300b00278
        Port 1:
                State: Down
                Physical state: Polling
                Rate: 10
                Base lid: 65535
                LMC: 0
                SM lid: 0
                Capability mask: 0xa651e848
                Port GUID: 0x946dae0300b00278
                Link layer: InfiniBand
CA 'mlx5_3'
        CA type: MT4123
        Number of ports: 1
        Firmware version: 20.38.1002
        Hardware version: 0
        Node GUID: 0x946dae0300b00279
        System image GUID: 0x946dae0300b00278
        Port 1:
                State: Down
                Physical state: Polling
                Rate: 10
                Base lid: 65535
                LMC: 0
                SM lid: 0
                Capability mask: 0xa651e848
                Port GUID: 0x946dae0300b00279
                Link layer: InfiniBand
CA 'mlx5_4'
        CA type: MT4123
        Number of ports: 1
        Firmware version: 20.38.1002
        Hardware version: 0
        Node GUID: 0x946dae03009cc374
        System image GUID: 0x946dae03009cc374
        Port 1:
                State: Active
                Physical state: LinkUp
                Rate: 200
                Base lid: 20
                LMC: 0
                SM lid: 2
                Capability mask: 0xa651e848
                Port GUID: 0x946dae03009cc374
                Link layer: InfiniBand
CA 'mlx5_5'
        CA type: MT4123
        Number of ports: 1
        Firmware version: 20.38.1002
        Hardware version: 0
        Node GUID: 0x946dae03009cef04
        System image GUID: 0x946dae03009cef04
        Port 1:
                State: Active
                Physical state: LinkUp
                Rate: 200
                Base lid: 5
                LMC: 0
                SM lid: 2
                Capability mask: 0xa651e848
                Port GUID: 0x946dae03009cef04
                Link layer: InfiniBand
root@gpumcw2:/home/mcw#

 

另外一个机器上指定接口去ping没有用ipv6的这个接口的IP

root@gpumcw2:/home/mcw# ifconfig ibs13
ibs13: flags=4163<UP,BROADCAST,RUNNING,MULTICAST>  mtu 2044
        inet 10.xx.xx.8  netmask 255.255.255.0  broadcast 10.110.10.255
        inet6 fe80::966d:ae03:9c:c36c  prefixlen 64  scopeid 0x20<link>
        unspec 00-00-08-0D-FE-80-00-00-00-00-00-00-00-00-00-00  txqueuelen 256  (UNSPEC)
        RX packets 2146  bytes 379613 (379.6 KB)
        RX errors 0  dropped 0  overruns 0  frame 0
        TX packets 2151  bytes 278364 (278.3 KB)
        TX errors 0  dropped 0 overruns 0  carrier 0  collisions 0

root@gpumcw2:/home/mcw# 

root@gpumcw2:/home/mcw# ping -I ibs13 10.xx.xx.8
PING 10.xx.xx.8 (10.xx.xx.8) from 10.110.10.1 ibs13: 56(84) bytes of data.
64 bytes from 10.xx.xx.8: icmp_seq=1 ttl=64 time=0.324 ms
64 bytes from 10.xx.xx.8: icmp_seq=2 ttl=64 time=0.103 ms
^C
--- 10.xx.xx.8 ping statistics ---
2 packets transmitted, 2 received, 0% packet loss, time 1032ms
rtt min/avg/max/mdev = 0.103/0.213/0.324/0.110 ms
root@gpumcw2:/home/mcw# 

没好之前lo状态

 好了之后状态

 

标签:lid,InfiniBand,CA,开启,version,ipv6,gpu,GUID,Port
From: https://www.cnblogs.com/machangwei-8/p/18162336

相关文章

  • kubernetes安装配置使用vGPU
    前言AI落地时,在某些场景下AI模型在训练或者是推理时,其算力要求不需要占用整卡的GPU,比如只需要0.5卡GPU即可满足需求。在这种情况下,可以使用GPU虚拟化技术来解决这个问题,将整卡的GPU虚拟化为两个0.5卡的GPU,这样就可以在一张卡上同时跑两个AI训练或者AI推理应用服......
  • springboot链接redis IPV6
    <dependency><groupId>org.springframework.boot</groupId><artifactId>spring-boot-starter-data-redis</artifactId><exclusions><exclusion>......
  • 学习笔记447—本地部署 Llama3 – 8B/70B 大模型!最简单的方法: 支持CPU /GPU运行 【3种
    本地部署Llama3–8B/70B大模型!最简单的方法:支持CPU/GPU运行【3种方案】目前在开源大模型领域,Llama3无疑是最强的!这次Meta不仅免费公布了8B和70B两个性能强悍的大模型,400B也即将发布,这是可以和GPT-4对打的存在!今天我们就来介绍3各本地部署方法,简单易懂,非常适合新手!1.G......
  • ipv6服务器如何访问ipv4的website
    环境AWSlightsailipv6onlywindowsinstance网络公网ipv6,可以访问internet内网ipv4,但不能访问internet故障每次用internetexplorer访问stackoverflow.com都无法打开,命令行解析这个网址只有ipv4的。我理解windows对ipv4的地址用ipv4访问。解决将ipv6的地址,dns设置为200......
  • Pyinstaller打包 openvino,但未带上 openvino的依赖,找不到CPU,GPU
    命令:pyinstaller--onefile--collect-submodulesopenvino--collect-binariesopenvino--collect-dataopenvinoserver.pyserver.spec(自动生成)#-*-mode:python;coding:utf-8-*-fromPyInstaller.utils.hooksimportcollect_data_filesfromPyInstaller.util......
  • nvidia官方AI框架软件的命令行操作接口 —— NVIDIA GPU Cloud (NGC) CLI
    NVIDIAGPUCloud(NGC)CLI安装介绍地址:https://org.ngc.nvidia.com/setup/installers/cli安装好后需要输入自己的NVIDIANGC的APIKEY,该信息在下面地址中生成:https://org.ngc.nvidia.com/setup/api-key......
  • UOS 开启 VisualStudio 远程调试 .NET 应用之旅
    本文记录的是在Windows系统里面,使用VisualStudio2022远程调试运行在UOS里面dotnet应用的配置方法本文写于2024.03.19如果你阅读本文的时间距离本文编写的时间过于长,那本文可能包含过期的知识我将以我的UOS虚拟机作为例子告诉大家如何在Windows系统里面,使用Visua......
  • WPF 已知问题 开启 IsManipulationEnabled 之后触摸长按 RepeatButton 不会触发连续的
    本文记录WPF的一个已知问题,在RepeatButton上开启IsManipulationEnabled漫游支持之后,将会导致触摸长按到RepeatButton之上时,不会收到源源不断的Click事件这是有个伙伴在WPF官方仓库报告的问题,详细请看https://github.com/dotnet/wpf/issues/8223原始的问题是他发现......
  • docker配置Nvidia环境,使用GPU
    前言需要nvdiadriver安装好,请参考UbuntuNvidiadriver驱动安装及卸载docker安装配置apt阿里云的镜像源sudocurl-fsSLhttps://mirrors.aliyun.com/docker-ce/linux/ubuntu/gpg|sudoapt-keyadd-sudoadd-apt-repository"deb[arch=amd64]http://mirrors.aliy......
  • 开启、关闭HDD读、写缓存状态
    sg3一、sg3查看缓存状态您可以使用sg_modes命令来查看SAS盘和SATA盘的缓存状态。例如,要查看/dev/sdb设备的缓存状态,您可以执行以下命令:sg_modes-p8,0/dev/sdb二、sg3关闭机械盘写缓存状态(仅适用于SAS盘)对于SAS盘,您可以按照以下步骤更改其读写缓存状态:1、编辑缓存状态......