【 使用环境 】 测试环境 redhat7.7
【 使用版本 】 4.2.0
【问题描述】 部署ocp卡在最后一步,总是部署失败,日志没有详细提示!
服务器信息:32G内存 512G存储
~/.ocp-installer/log/ocp 日志发一下呢
好的
~/.ocp-installer/log/ocp 日志:
[2023-11-27 13:57:44.765] [c939d4f6-8ce6-11ee-840d-3a33b54c4dee] [DEBUG] – 192.168.1.155 program health check
[2023-11-27 13:57:44.770] [c939d4f6-8ce6-11ee-840d-3a33b54c4dee] [DEBUG] – local execute: ls /proc/4893
[2023-11-27 13:57:44.793] [c939d4f6-8ce6-11ee-840d-3a33b54c4dee] [DEBUG] – exited code 0
[2023-11-27 13:57:44.798] [c939d4f6-8ce6-11ee-840d-3a33b54c4dee] [DEBUG] – local execute: bash -c ‘cat /proc/net/{tcp*,udp*}’ | awk -F’ ’ ‘{print $2,$10}’ | grep ‘00000000:1F90’ | awk -F’ ’ ‘{print $2}’ | uniq
[2023-11-27 13:57:44.834] [c939d4f6-8ce6-11ee-840d-3a33b54c4dee] [DEBUG] – exited code 0
[2023-11-27 13:57:44.836] [c939d4f6-8ce6-11ee-840d-3a33b54c4dee] [DEBUG] – failed to start 192.168.1.155 ocp-server, remaining retries: 1
[2023-11-27 13:57:47.839] [c939d4f6-8ce6-11ee-840d-3a33b54c4dee] [DEBUG] – 192.168.1.155 program health check
[2023-11-27 13:57:47.842] [c939d4f6-8ce6-11ee-840d-3a33b54c4dee] [DEBUG] – local execute: ls /proc/4893
[2023-11-27 13:57:47.859] [c939d4f6-8ce6-11ee-840d-3a33b54c4dee] [DEBUG] – exited code 0
[2023-11-27 13:57:47.863] [c939d4f6-8ce6-11ee-840d-3a33b54c4dee] [DEBUG] – local execute: bash -c ‘cat /proc/net/{tcp*,udp*}’ | awk -F’ ’ ‘{print $2,$10}’ | grep ‘00000000:1F90’ | awk -F’ ’ ‘{print $2}’ | uniq
[2023-11-27 13:57:47.894] [c939d4f6-8ce6-11ee-840d-3a33b54c4dee] [DEBUG] – exited code 0
[2023-11-27 13:57:47.901] [c939d4f6-8ce6-11ee-840d-3a33b54c4dee] [ERROR] failed to start 192.168.1.155 ocp-server
[2023-11-27 13:57:47.902] [c939d4f6-8ce6-11ee-840d-3a33b54c4dee] [ERROR] start ocp-server failed
[2023-11-27 13:57:47.903] [c939d4f6-8ce6-11ee-840d-3a33b54c4dee] [DEBUG] - sub start ref count to 0
[2023-11-27 13:57:47.903] [c939d4f6-8ce6-11ee-840d-3a33b54c4dee] [DEBUG] - export start
[2023-11-27 13:57:47.904] [c939d4f6-8ce6-11ee-840d-3a33b54c4dee] [ERROR] ocp-server start failed
需要看一下相关日志~/.ocp-installer/log/ocp
用得是root用户安装得 还是其他用户(是否有权限)
cd /home/admin/ocp; export JDBC_URL=jdbc:oceanbase://192.168.1.155:2881/meta_database; export JDBC_USERNAME=root@HJY_ORIGIN;export JDBC_PASSWORD="A1@bC_d777"; export JDBC_PUBLIC_KEY=;export OCP_INITIAL_ADMIN_PASSWORD="A1@bC_d777";java -jar -Xms512m -Xmx1g -Docp.iam.encrypted-system-password=oceanbase /home/admin/ocp/lib/ocp-server.jar --bootstrap --progress-log=/home/admin/ocp/log/bootstrap.log --with-property=logging.file.max-size:100MB --with-property=logging.file.total-size-cap:1GB --with-property=server.port:8080 --with-property=ocp.monitordb.host:192.168.1.155 --with-property=ocp.monitordb.username:root@HJY_MONITOR --with-property=ocp.monitordb.port:2881 --with-property=ocp.monitordb.password:A1@bC_d777 --with-property=ocp.monitordb.database:monitor_database --with-property=logging.file.name:/home/admin/ocp/log/ocp-server.log --with-property=ocp.site.url:http://192.168.1.155:8080 --with-property=obsdk.ob.connection.mode:direct
在日志中搜到启动 ocp 的命令是这个,之后一直检查不到 OCP 成功监听端口,可以先查看下 OCP 进程是否起来了,如果没起来可以手动执行一下这个命令看下输出的原因
看下这个日志呢,这是启动的时候进程还在,服务却没拉起 /home/admin/ocp/log/ocp-server.log