observer启动错误

【 使用环境 】测试环境
【 OB or 其他组件 】
【 使用版本 】4.2.1
【问题描述】执行重启命令启动不起来
【复现路径】obd cluster restart myoceanbase执行了这个命令,然后在observer restart一直转圈
【附件及日志】推荐使用OceanBase敏捷诊断工具obdiag收集诊断信息,详情参见链接(右键跳转查看):这个是observer .log错误日志

【SOP系列 22 】——故障诊断第一步(系统巡检和诊断信息收集)

报错信息:2024-01-25 14:37:31.423503] ERROR inner_aio (ob_io_manager.cpp:806) [971548][T1006_ReplaySrv][T1006][Y0-0000000000000000-0-0] [lt=0][errcode=-4392] disk is hung(msg=“data disk has fatal error”)
[2024-01-25 14:37:31.423506] ERROR inner_aio (ob_io_manager.cpp:806) [967166][T1_ReplaySrv19][T1][Y0-0000000000000000-0-0] [lt=2][errcode=-4392] disk is hung(msg=“data disk has fatal error”)
[2024-01-25 14:37:31.423512] ERROR inner_aio (ob_io_manager.cpp:806) [970188][T1004_WriteCkpt][T1004][YB42C0A81A2C-00060FBF4F812845-0-0] [lt=1][errcode=-4392] disk is hung(msg=“data disk has fatal error”)
[2024-01-25 14:37:31.423514] ERROR inner_aio (ob_io_manager.cpp:806) [968362][T1002_WriteCkpt][T1002][YB42C0A81A2C-00060FBF4AC12846-0-0] [lt=0][errcode=-4392] disk is hung(msg=“data disk has fatal error”)
[2024-01-25 14:37:31.423516] ERROR inner_aio (ob_io_manager.cpp:806) [967149][T1_ReplaySrv2][T1][Y0-0000000000000000-0-0] [lt=1][errcode=-4392] disk is hung(msg=“data disk has fatal error”)
[2024-01-25 14:37:31.423516] ERROR inner_aio (ob_io_manager.cpp:806) [967156][T1_ReplaySrv9][T1][Y0-0000000000000000-0-0] [lt=1][errcode=-4392] disk is hung(msg=“data disk has fatal error”)
[2024-01-25 14:37:31.423524] ERROR inner_aio (ob_io_manager.cpp:806) [970188][T1004_WriteCkpt][T1004][YB42C0A81A2C-00060FBF4F812845-0-0] [lt=1][errcode=-4392] disk is hung(msg=“data disk has fatal error”)
[2024-01-25 14:37:31.423529] ERROR inner_aio (ob_io_manager.cpp:806) [968362][T1002_WriteCkpt][T1002][YB42C0A81A2C-00060FBF4AC12846-0-0] [lt=0][errcode=-4392] disk is hung(msg=“data disk has fatal error”)
[2024-01-25 14:37:31.423540] ERROR inner_aio (ob_io_manager.cpp:806) [968362][T1002_WriteCkpt][T1002][YB42C0A81A2C-00060FBF4AC12846-0-0] [lt=1][errcode=-4392] disk is hung(msg=“data disk has fatal error”)
[2024-01-25 14:37:31.423520] ERROR inner_aio (ob_io_manager.cpp:806) [968414][T1002_ReplaySrv][T1002][Y0-0000000000000000-0-0] [lt=1][errcode=-4392] disk is hung(msg=“data disk has fatal error”)

看这个报错信息像是磁盘故障

我iotop查看也不是很高


怎么确定是磁盘故障呢老师

使用的什么方式部署的ob呢?。是ocp-express还是ocp

ocp部署的

这个方式试试 重启 OCP