FT2000+ qemu kvm openEuer crash 分析 频繁设置CPU online及cgroup导致进程卡死、不调度故障
测试用例1 openEuler 20.03 默认内核
https://hknaruto.blog.csdn.net/article/details/130498823
内核版本信息
突然就坚挺起来,长时间稳定运行
待续
测试用例2:CentOS8 (16C16G:无法crash分析,放弃)
十几分钟后,终端已卡死 ,两个终端均无响应,但是gnome还活着,图形界面还能动
dump
virsh dump --memory-only --format=kdump-zlib centos8 centos8-tcti-0505.zlib
crash分析
重启虚拟机
kvm内,安装好crash分析环境
yum install -y crash nfs-utils
下载安装debuginfo包
http://debuginfo.centos.org/8/aarch64/Packages/kernel-debuginfo-4.18.0-240.el8.aarch64.rpm
http://debuginfo.centos.org/8/aarch64/Packages/kernel-debuginfo-common-aarch64-4.18.0-240.el8.aarch64.rpm
执行分析
virsh reset centos8
登录虚拟机,挂载共享目录后执行crash,居然报错不匹配:crash /usr/lib/debug/lib/modules/4.18.8-240.el8.aarch64/vmlinux and /mnt/hostshare/qemu-virtual-machine/centos8-tcti-0505.zlib not match
手动触发一次crash验证下(kdump安装时,默认开启了)
重启后,可以看到生成了vmcore
这又可以
故障分析
WARNING: kimage_voffset cannot be determined from the dumpfile. WARING: cannot read linux_banner string crash /usr/lib/debug/lib/modules/4.18.8-240.el8.aarch64/vmlinux and /mnt/hostshare/qemu-virtual-machine/centos8-tcti-0505.zlib not match
尝试手动编译crash
https://github.com/crash-utility/crash/archive/refs/tags/8.0.3.tar.gz
$ tar -xf crash-8.0.3.tar.gz
$ cd crash-8.0.3
$ make
报错,放弃。
参考
[Crash-utility] crash failing with CentOS 5 under VMware
[Crash-utility] [PATCH 1/1] arm64: Fix missing offset formodules_vaddr with aarch64 guest dump
=====================================================================
失败的分析记录:
dump
virsh dump --memory-only --format=kdump-zlib openEuler openEuler-20.03-linux5.10.0-tcti-0505.zlib
crash分析
重启虚拟机
kvm内,安装好crash分析环境,执行分析
virsh reset openEuler
登录虚拟机,挂载共享目录后执行
crash linux-5.10.38/vmlinux /mnt/hostos/qemu-virtual-machine/openEuler-20.03-linux5.10.0-tcti-0505.zlib
报错,放弃。
故障
crash: cannot determine VA_BITS_ACTUAL
放弃该版本内核,转向分析20.03自带内核。
参考
it cannot determine VA_BITS_ACTUAL on Linux 5.4 for ARM64 · Issue #52 · crash-utility/crash · GitHub
参考资料
https://www.cnblogs.com/organic/p/17321523.html