社区版OCP创建集群问题

使用社区版OCP单独部署在一台机器,版本如下:
ocp-all-in-one-4.3.6-20250709105610.el7.x86_64.tar.gz
然后使用OCP白屏为其他三台机器创建集群会一直卡住最终导致失败(推倒重来试过几次都是一样失败),信息如下:


查看OCP工具手册,上面说可以通过ocp创建集群,但是询问其AI助手描述为不支持,何解?


因执行失败,已经执行回滚了,附上创建集群设置和日志如下:


日志如下:
log_task_102.zip (53.6 KB)

OCP白屏信息



6 个赞

日志发出来。看着像网络不通?

2 个赞

点击右上方的日志下载发下

创建集群 这个页面的配置发下

3 个赞

肯定不是主机防护墙问题,各机器的防火墙都停了

3 个赞

已上传日志

2 个赞

看看ob进程起来了没,感觉ob都没起来。

3 个赞

看起来是 6.102 的OB实例没起来,你看下这台机是否有observer.log 或者bootstrap.log 生成?

2025-07-25 14:20:36.643  INFO 9623 --- [manual-subtask-executor16,840818ceae3ab1e1,153bf570cdd7760a] c.o.o.e.internal.template.HttpTemplate   : POST request to agent, url:http://192.168.6.102:62888/api/v1/ob/observer/access, request body:AccessObServerProcessRequest(ip=192.168.6.102, port=2881, username=root), params:null
2025-07-25 14:20:36.649  WARN 9623 --- [manual-subtask-executor16,840818ceae3ab1e1,153bf570cdd7760a] c.o.o.s.t.b.c.helper.ObServerTaskHelper  : Failed to check observer accessible, reason:[AgentClient]:http request is failed, response:Unexpected error: dial tcp 127.0.0.1:2881: connect: connection refused, cause:null
2025-07-25 14:20:36.652  INFO 9623 --- [manual-subtask-executor16,840818ceae3ab1e1,153bf570cdd7760a] c.o.ocp.common.lang.pattern.Retry        : wait for 5 seconds
2025-07-25 14:20:41.703  INFO 9623 --- [manual-subtask-executor16,840818ceae3ab1e1,153bf570cdd7760a] c.o.o.c.agent.HostAgentServiceImpl       : Finding OCP agent: hostId=2
2025-07-25 14:20:41.707  INFO 9623 --- [manual-subtask-executor16,840818ceae3ab1e1,153bf570cdd7760a] c.o.o.c.a.p.HostAgentProcessServiceImpl  : Getting all OCP agent processes on host 2
2025-07-25 14:20:41.721  INFO 9623 --- [manual-subtask-executor16,840818ceae3ab1e1,153bf570cdd7760a] c.o.o.e.internal.template.HttpTemplate   : POST request to agent, url:http://192.168.6.102:62888/api/v1/ob/observer/access, request body:AccessObServerProcessRequest(ip=192.168.6.102, port=2881, username=root), params:null
2025-07-25 14:20:41.727  WARN 9623 --- [manual-subtask-executor16,840818ceae3ab1e1,153bf570cdd7760a] c.o.o.s.t.b.c.helper.ObServerTaskHelper  : Failed to check observer accessible, reason:[AgentClient]:http request is failed, response:Unexpected error: dial tcp 127.0.0.1:2881: connect: connection refused, cause:null
2025-07-25 14:20:41.729  INFO 9623 --- [manual-subtask-executor16,840818ceae3ab1e1,153bf570cdd7760a] c.o.ocp.common.lang.pattern.Retry        : wait for 5 seconds
2025-07-25 14:20:46.754  INFO 9623 --- [manual-subtask-executor16,840818ceae3ab1e1,153bf570cdd7760a] c.o.o.c.agent.HostAgentServiceImpl       : Finding OCP agent: hostId=2
2025-07-25 14:20:46.762  INFO 9623 --- [manual-subtask-executor16,840818ceae3ab1e1,153bf570cdd7760a] c.o.o.c.a.p.HostAgentProcessServiceImpl  : Getting all OCP agent processes on host 2
2025-07-25 14:20:46.817  INFO 9623 --- [manual-subtask-executor16,840818ceae3ab1e1,153bf570cdd7760a] c.o.o.e.internal.template.HttpTemplate   : POST request to agent, url:http://192.168.6.102:62888/api/v1/ob/observer/access, request body:AccessObServerProcessRequest(ip=192.168.6.102, port=2881, username=root), params:null
2025-07-25 14:20:46.824  WARN 9623 --- [manual-subtask-executor16,840818ceae3ab1e1,153bf570cdd7760a] c.o.o.s.t.b.c.helper.ObServerTaskHelper  : Failed to check observer accessible, reason:[AgentClient]:http request is failed, response:Unexpected error: dial tcp 127.0.0.1:2881: connect: connection refused, cause:null
2025-07-25 14:20:46.827  INFO 9623 --- [manual-subtask-executor16,840818ceae3ab1e1,153bf570cdd7760a] c.o.ocp.common.lang.pattern.Retry        : wait for 5 seconds
2025-07-25 14:20:51.863  INFO 9623 --- [manual-subtask-executor16,840818ceae3ab1e1,153bf570cdd7760a] c.o.o.c.agent.HostAgentServiceImpl       : Finding OCP agent: hostId=2
2025-07-25 14:20:51.870  INFO 9623 --- [manual-subtask-executor16,840818ceae3ab1e1,153bf570cdd7760a] c.o.o.c.a.p.HostAgentProcessServiceImpl  : Getting all OCP agent processes on host 2
2025-07-25 14:20:51.919  INFO 9623 --- [manual-subtask-executor16,840818ceae3ab1e1,153bf570cdd7760a] c.o.o.e.internal.template.HttpTemplate   : POST request to agent, url:http://192.168.6.102:62888/api/v1/ob/observer/access, request body:AccessObServerProcessRequest(ip=192.168.6.102, port=2881, username=root), params:null
2025-07-25 14:20:51.925  WARN 9623 --- [manual-subtask-executor16,840818ceae3ab1e1,153bf570cdd7760a] c.o.o.s.t.b.c.helper.ObServerTaskHelper  : Failed to check observer accessible, reason:[AgentClient]:http request is failed, response:Unexpected error: dial tcp 127.0.0.1:2881: connect: connection refused, cause:null
2025-07-25 14:20:51.928  INFO 9623 --- [manual-subtask-executor16,840818ceae3ab1e1,153bf570cdd7760a] c.o.ocp.common.lang.pattern.Retry        : wait for 5 seconds
2025-07-25 14:20:56.957  INFO 9623 --- [manual-subtask-executor16,840818ceae3ab1e1,153bf570cdd7760a] c.o.o.c.agent.HostAgentServiceImpl       : Finding OCP agent: hostId=2
2025-07-25 14:20:56.968  INFO 9623 --- [manual-subtask-executor16,840818ceae3ab1e1,153bf570cdd7760a] c.o.o.c.a.p.HostAgentProcessServiceImpl  : Getting all OCP agent processes on host 2
3 个赞

这两个日志都没有!

3 个赞

observer没启动,observer.log 或者bootstrap.log日志都没有!

3 个赞

另外两个节点有日志吗

3 个赞

都没有

2 个赞

你看在这几个目录是否存在?权限是否对?

installPath=/home/admin/oceanbase,
dataPath=/data/1,
logPath=/data/log1

1 个赞

f9a6872f-bf14-4f8c-98e0-680420bea876

1 个赞

进到这个目录 /home/admin/oceanbase 看下目录结构

1 个赞

这目录里面什么也没有!
[admin@node ~]$ pwd
/home/admin
[admin@node ~]$
[admin@node ~]$
[admin@node ~]$
[admin@node ~]$ cd oceanbase/
[admin@node oceanbase]$ ls -la
total 0
drwxr-xr-x 2 admin admin 6 Jul 25 14:49 .
drwx------ 8 admin admin 156 Jul 25 16:43 …
[admin@node oceanbase]$
[admin@node oceanbase]$
[admin@node oceanbase]$ pwd
/home/admin/oceanbase
[admin@node oceanbase]$

1 个赞

可以看下 /home/admin/ocp_agent/log/mgragent.log 搜索 “observer/start”关键字看看

2 个赞

目录是空的,3个节点都是这样吗?
OB的安装软件都没解压出来,你部署的哪个版本?

在 102 分别执行 (-o后面是另外两节点的IP)
clockdiff -o xx.xx.xx.xx

2 个赞

mgragent.log 发下看看

2 个赞

mgragent.log (8.7 MB)

1 个赞

日志信息如下mgragent.log (8.7 MB)

1 个赞