Description
问题描述/What happened:
两台 DELL R420 均遇到,可稳定复现。通过 ocboot 初始化之后,默认会开启 iommu,启动会遇到错误:
DMAR: DRHD: hangling fault status req xxx...PTE Read access is not set
CPU:
grep -m 1 'model name' /proc/cpuinfo
model name : Intel(R) Xeon(R) CPU E5-2420 0 @ 1.90GHz
ocboot 对应代码片段,不确定边界条件在哪里,可能还需要再判断一下?:
https://github.com/yunionio/ocboot/blob/211440c0b65fdc49c59686119a5050911e797caf/onecloud/roles/utils/gpu-init/files/gpu_setup.sh#L33
机器就一个:06:00.0 VGA compatible controller: Matrox Electronics Systems Ltd. G200eR2
手工编辑 grub,临时关闭 iommu=off 后可正常启动使用。
linux16 /vmlinuz-5.4.130-1.yn20221208.el7.x86_64 root=/dev/mapper/centos-root ro crashkernel=auto rhgb quiet rdblacklist=nouveau mgag200.modeset=0 vfio_iommu_type1.allow_unsafe_interrupts=1 intel_iommu=off nouveau.modese
t=0 hugepagesz=1G default_hugepagesz=1G
环境/Environment:
- OS (e.g.
cat /etc/os-release
):CentOS7 - Kernel (e.g.
uname -a
): 5.4.130-1.yn20221208.el7.x86_64 scheduler: fix specified network not enough still pass. #1 SMP Thu Dec 8 12:09:17 UTC 2022 x86_64 x86_64 x86_64 GNU/Linux - Host: (e.g.
dmidecode | egrep -i 'manufacturer|product' |sort -u
)
Manufacturer:
Manufacturer: 00CE04B300CE
Manufacturer: Dell Inc.
Manufacturer: Intel
Product Name: 05M7VK
Product Name: PowerEdge R420
+---------------+--------------------------------------------+
| Field | Value |
+---------------+--------------------------------------------+
| ansible | release/3.11(3ffea07d3124042402) |
| apimap | release/3.11(3ffea07d3124042402) |
| cloudevent | release/3.11(3ffea07d3124042402) |
| cloudid | release/3.11(3ffea07d3124042402) |
| cloudmon | release/3.11(3ffea07d3124042402) |
| cloudproxy | release/3.11(3ffea07d3124042402) |
| compute_v2 | release/3.11(3ffea07d3124042402) |
| devtool | release/3.11(3ffea07d3124042402) |
| identity | release/3.11(3ffea07d3124042402) |
| image | release/3.11(3ffea07d3124042402) |
| k8s | heads/v3.11.3-20240422.2(e6c3e48724042402) |
| log | release/3.11(3ffea07d3124042402) |
| monitor | release/3.11(3ffea07d3124042402) |
| notify | release/3.11(3ffea07d3124042402) |
| scheduledtask | release/3.11(3ffea07d3124042402) |
| scheduler | release/3.11(3ffea07d3124042402) |
| vpcagent | release/3.11(3ffea07d3124042402) |
| webconsole | release/3.11(3ffea07d3124042402) |
| yunionconf | release/3.11(3ffea07d3124042402) |
+---------------+--------------------------------------------+