【 使用环境 】生产环境
【 OB or 其他组件 】
【 使用版本 】社区版4.3.5
【问题描述】通过OBD新部署的集群,安装没有问题,启动报错 obagent报错,具体如下:
2026-01-23 13:49:18.014] [DEBUG] – admin@10.29.201.2 execute: cd /home/admin/szst_oceanbase_xian/obagent;/home/admin/szst_oceanbase_xian/obagent/bin/ob_agentctl config -u agent.log.path=log/monagent.log,agent.http.basic.auth.username=admin,agent.http.basic.auth.password=,ocp.agent.manager.http.port=8089,mgragent.log.maxsize.mb=30,ocp.agent.monitor.http.port=8088,monagent.ob.monitor.password=,monagent.ob.monitor.user=ocp_monitor,monagent.ob.sql.port=2881,monagent.ob.rpc.port=2882,monagent.ob.cluster.name=szst_oceanbase_xian,monagent.ob.cluster.id=1769073513,monagent.ob.zone.name=zone1,monagent.log.level=info,monagent.pipeline.ob.status=active,obagent.home.path=/home/admin/szst_oceanbase_xian/obagent,monagent.host.ip=10.29.201.2,ob.log.path=/home/admin/szst_oceanbase_xian/oceanbase/store,ob.data.path=/home/admin/szst_oceanbase_xian/oceanbase/store,ob.install.path=/home/admin/szst_oceanbase_xian/oceanbase,observer.log.path=/home/admin/szst_oceanbase_xian/oceanbase/log && touch /home/admin/szst_oceanbase_xian/obagent/.configured
[2026-01-23 13:49:18.157] [ERROR] Traceback (most recent call last):
[2026-01-23 13:49:18.157] [ERROR] File “core.py”, line 2277, in start_cluster
[2026-01-23 13:49:18.157] [ERROR] File “core.py”, line 2348, in _start_cluster
[2026-01-23 13:49:18.157] [ERROR] File “core.py”, line 256, in run_workflow
[2026-01-23 13:49:18.157] [ERROR] File “core.py”, line 298, in run_plugin_template
[2026-01-23 13:49:18.157] [ERROR] File “core.py”, line 347, in call_plugin
[2026-01-23 13:49:18.157] [ERROR] File “_plugin.py”, line 348, in call
[2026-01-23 13:49:18.158] [ERROR] File “_plugin.py”, line 304, in _new_func
[2026-01-23 13:49:18.158] [ERROR] File “/home/admin/.obd/plugins/obagent/1.3.0/start.py”, line 101, in start
[2026-01-23 13:49:18.158] [ERROR] res = client.execute_command(cmd)
[2026-01-23 13:49:18.158] [ERROR] File “_plugin.py”, line 221, in new_method
[2026-01-23 13:49:18.158] [ERROR] File “_stdio.py”, line 1164, in func_wrapper
[2026-01-23 13:49:18.158] [ERROR] File “ssh.py”, line 521, in execute_command
[2026-01-23 13:49:18.158] [ERROR] File “_stdio.py”, line 1164, in func_wrapper
[2026-01-23 13:49:18.158] [ERROR] File “ssh.py”, line 486, in _execute_command
[2026-01-23 13:49:18.158] [ERROR] ValueError: invalid literal for int() with base 10: ‘\n126{“successful”:true,“message”:“success”,“error”:""}’
[2026-01-23 13:49:18.158] [ERROR]
[2026-01-23 13:49:18.159] [ERROR] [ERROR] failed to set config to 10.29.201.2 obagent.
[2026-01-23 13:49:18.160] [DEBUG] - sub start ref count to 0
[2026-01-23 13:49:18.160] [DEBUG] - export start
[2026-01-23 13:49:18.160] [WARNING] [WARN] obagent-py_script_start-1.3.0 has animation not been closed
这个中控机上还有其他集群通过OBD都正常,只有这个新部署的启动不了,节点机手动可以启动
【复现路径】问题出现前后相关操作
OceanBase Deploy: 3.6.0
REVISION: b36013bb09a84516e56db51dba78a9d9096735e7
BUILD_BRANCH: HEAD
BUILD_TIME: Sep 04 2025 10:51:01OURCE
Copyright (C) 2025 OceanBase
License Apache 2.0: Apache version 2 or later https://www.apache.org/licenses/LICENSE-2.0.
This is free software: you are free to change and redistribute it.
There is NO WARRANTY, to the extent permitted by law.
【备注】基于 LLM 和开源文档 RAG 的论坛小助手已开放测试,在发帖时输入 [@论坛小助手] 即可召唤小助手,欢迎试用!