obd cluster start hloceanbase 报错,一直卡在启动obagent上,互信已经配置

【 使用环境 】生产环境 or 测试环境
测试环境
【 OB or 其他组件 】
OB
【 使用版本 】
4.1.0

【问题描述】清晰明确描述问题
[2023-06-01 13:05:42.854] [DEBUG] – local execute: rsync -a -W -e “ssh -p 22” /tmp/tmpiye8v_q1 root@172.18.2.104:/root/hloceanbase/obagent/conf/agentctl.yaml
[2023-06-01 13:05:43.032] [DEBUG] – exited code 0
[2023-06-01 13:05:43.032] [DEBUG] – root@172.18.2.104 execute: chmod 755 /root/hloceanbase/obagent/conf/agentctl.yaml
[2023-06-01 13:05:43.060] [DEBUG] – exited code 0
[2023-06-01 13:05:43.060] [DEBUG] – root@172.18.2.104 execute: cd /root/hloceanbase/obagent;/root/hloceanbase/obagent/bin/ob_agentctl config -u agent.log.path=log/monagent.log,agent.http.basic.auth.username=admin,agent.http.basic.auth.password=root,ocp.agent.manager.http.port=8089,mgragent.log.maxsize.mb=30,ocp.agent.monitor.http.port=8088,monagent.ob.monitor.password=,monagent.ob.sql.port=2881,monagent.ob.rpc.port=2882,monagent.ob.cluster.name=hloceanbase,monagent.ob.cluster.id=1,monagent.ob.zone.name=zone1,monagent.log.level=info,monagent.pipeline.ob.status=active,ocp.agent.home.path=/root/hloceanbase/obagent,monagent.host.ip=172.18.2.104,ob.log.path=/root/hloceanbase/oceanbase/store,ob.data.path=/root/hloceanbase/oceanbase/store,ob.install.path=/root/hloceanbase/oceanbase,observer.log.path=/root/hloceanbase/oceanbase/log && touch /root/hloceanbase/obagent/.configured
[2023-06-01 13:05:43.186] [DEBUG] – exited code 0
[2023-06-01 13:05:43.186] [DEBUG] – root@172.18.2.104 execute: cd /root/hloceanbase/obagent;/root/hloceanbase/obagent/bin/ob_agentctl start
[2023-06-01 13:05:53.757] [DEBUG] – exited code 255, error output:
[2023-06-01 13:05:53.758] [DEBUG] {“successful”:false,“message”:null,“error”:"Module=agent, kind=DEADLINE_EXCEEDED, code=wait_for_ready_timeout; "}
[2023-06-01 13:05:53.758] [ERROR] failed to start 172.18.2.104 obagent.
[2023-06-01 13:05:53.758] [DEBUG] - sub start ref count to 0
[2023-06-01 13:05:53.758] [DEBUG] - export start
[2023-06-01 13:05:53.758] [ERROR] obagent start failed
[2023-06-01 13:05:53.759] [DEBUG] - Call oceanbase-ce-py_script_display-3.1.0 for oceanbase-ce-4.1.0.0-100000202023040520.el7-d598937b1cfb1df85e2c2231acf024e4994db533

obd cluster start hloceanbase 错误日志,一直卡在启动obagent上,互信已经配置
obd cluster stop hloceanbase 可以正常关闭各个节点组件包括obagent

【复现路径】问题出现前后相关操作
【问题现象及影响】

【附件】

1 个赞

已经解决

1 个赞

:call_me_hand::call_me_hand::call_me_hand:

1 个赞

请问可以描述一下解决方法吗,帮助社区内容沉淀

1.可以尝试手动去拉起obagent节点
/root/hloceanbase/obagent/bin/ob_agentctl start
再启动集群obd cluster start hloceanbase

2.如果上面方法不行,出现obd cluster start hloceanbase时候obagent起不来, 可以先obd cluster stop hloceanbase, 然后多次执行obd cluster start hloceanbase 命令