启动集群时 一直处于 Start obagent

【 使用环境 】生产环境
【 OB or 其他组件 】
【 使用版本 】4.1 CE
【问题描述】清晰明确描述问题
【复现路径】问题出现前后相关操作
【问题现象及影响】

启动集群时 一直处于 Start obagent

[oceanbase@localhost data]$ obd cluster start fund
Get local repositories ok
Search plugins ok
Open ssh connection ok
Load cluster param plugin ok
Check before start observer ok
[WARN] OBD-1011: (10.0.10.102) The recommended value of fs.aio-max-nr is 1048576 (Current value: 65536)
[WARN] OBD-1011: (10.0.10.101) The recommended value of fs.aio-max-nr is 1048576 (Current value: 65536)
[WARN] OBD-2000: (10.0.10.101) not enough memory. (Free: 453.8M, Need: 55.0G)
[WARN] OBD-1011: (10.0.10.103) The recommended value of fs.aio-max-nr is 1048576 (Current value: 65536)

Check before start obproxy ok
Check before start obagent ok
Check before start ocp-express ok
Start observer ok
observer program health check ok
Connect to observer ok
Start obproxy ok
obproxy program health check ok
Connect to obproxy ok
Initialize obproxy-ce ok
Start obagent |

monagent.log (7.6 MB)

【附件】

麻烦提供下yaml文件的内容,看看配置信息

白屏安装的, 没有找到yaml 在哪里. 下图, observer.conf.bin

我通过obd cluster start fund -s 10.0.10.103 启动了一台机器
连上去报 ERROR 8001 (08004): Server is initializing
我现在只想能启动一台就OK了, 把数据导出来. 谢谢!!

[oceanbase@localhost data]$ obd cluster start fund -s 10.0.10.103
Get local repositories ok
Search plugins ok
Open ssh connection ok
Load cluster param plugin ok
Check before start observer ok
[WARN] OBD-1011: (10.0.10.103) The recommended value of fs.aio-max-nr is 1048576 (Current value: 65536)

Check before start obagent ok
Start observer ok
observer program health check ok
Connect to observer ok
Start obagent x
[ERROR] failed to start 10.0.10.103 obagent.
[ERROR] obagent start failed
See https://www.oceanbase.com/product/ob-deployer/error-codes .
Trace ID: a1f2512a-f54c-11ed-922d-00155d1f7e06
If you want to view detailed obd logs, please run: obd display-trace a1f2512a-f54c-11ed-922d-00155d1f7e06

一共有几个observer,如果是3节点的话,单独起一个是不行的,如果只想起 ob 的话,可以加上 -c oceanbase-ce

太谢了, obd cluster start fund -s 10.0.10.103 -c oceanbase-ce 一下子就启成功了,
但是连上去还是报ERROR 8001 (08004): Server is initializing ,
一共三台服务器, 只要能启一台也行, 能连上去, 把数据导出来我就满足了, 帮忙指导指导, 谢谢!!!

[oceanbase@localhost data]$ obd cluster start fund -s 10.0.10.103 -c oceanbase-ce
Get local repositories ok
Search plugins ok
Open ssh connection ok
Load cluster param plugin ok
Check before start observer ok
[WARN] OBD-1011: (10.0.10.103) The recommended value of fs.aio-max-nr is 1048576 (Current value: 65536)

Start observer ok
observer program health check ok
Connect to observer ok
succeed
Trace ID: 5050e1f0-f552-11ed-82ba-00155d1f7e06
If you want to view detailed obd logs, please run: obd display-trace 5050e1f0-f552-11ed-82ba-00155d1f7e06

启动两台也成功了,
目前报错: ERROR 4012 (HY000): Timeout

[oceanbase@localhost data]$ obd cluster start fund -s 10.0.10.103,10.0.10.101 -c oceanbase-ce
Get local repositories ok
Search plugins ok
Open ssh connection ok
Load cluster param plugin ok
Check before start observer ok
[WARN] OBD-1011: (10.0.10.101) The recommended value of fs.aio-max-nr is 1048576 (Current value: 65536)
[WARN] OBD-2000: (10.0.10.101) not enough memory. (Free: 1.9G, Need: 55.0G)
[WARN] OBD-1011: (10.0.10.103) The recommended value of fs.aio-max-nr is 1048576 (Current value: 65536)

Start observer ok
observer program health check ok
Connect to observer ok
succeed
Trace ID: e2e7ed1e-f553-11ed-a02d-00155d1f7e06
If you want to view detailed obd logs, please run: obd display-trace e2e7ed1e-f553-11ed-a02d-00155d1f7e06

麻烦提供下连接的命令吧,截个图

observer刚部署起来,应该还没有创建fund租户吧。部署后自动建租户可以在init.sql里面设置的,但一般不会去改这个文件。
那么不妨试试:obclient -h10.0.10.103 -P2881 -uroot -Doceanbase -A

感谢您的帮忙, 集群特然就不能正常启动, 因为里有很多数据, 我只要启动一台或二台, 把数据导出就好. 不是刚部署的情况.

不客气,有任何问题随时交流 :smile: