使用obd安装4.2.2,在Initialize oceanbase-ce 时卡住,observer日志有报错

【 使用环境 】测试环境
【 OB or 其他组件 】observer
【 使用版本 】4.2.2
【问题描述】使用obd命令进行deploy操作,observer日志中报错
[2024-03-18 23:42:27.281558] ERROR issue_dba_error (ob_log.cpp:1875) [2886454][T1_L0_G0][T1][YF2AAC12AAAC-000613F134B61F84-0-0] [lt=3][errcode=-4388] Unexpected internal error happen, please checkout the internal errcode(errcode=-4012, file=“ob_service.cpp”, line_no=1447, info=“failed to prepare boot strap”)
【复现路径】问题出现前后相关操作 安装失败,删除后重新尝试,仍然报此错误
【附件及日志】推荐使用OceanBase敏捷诊断工具obdiag收集诊断信息,详情参见链接(右键跳转查看):

【SOP系列 22 】——故障诊断第一步(自助诊断和诊断信息收集)
obstack2_172.18.170.172_20240318234930.zip (12.4 KB)

https://www.oceanbase.com/knowledge-base/oceanbase-database-20000000108?back=kb

麻烦发下obd日志呢。(~/.obd/log下的obd日志)

参考报错信息,对集群进行了重装。现在仍是在Initialize oceanbase-ce阶段卡住,obd日志是
[DEBUG] – execute sql: alter system bootstrap REGION “sys_region” ZONE “zone1” SERVER “172.18.170.xxx:3882”,REGION “sys_region” ZONE “zone2” SERVER “172.18.170.xxx:3882”,REGION “sys_region” ZONE “zone3” SERVER “172.18.170.xxx:3882”. args: None

配置文件内容
user:
username: admin
password: xxx
oceanbase-ce:
depends:
- ob-configserver
servers:
- name: server1
ip: 172.18.170.xxx
- name: server2
ip: 172.18.170.xxx
- name: server3
ip: 172.18.170.xxx
global:
devname: ens192
memory_limit: 12G # The maximum running memory for an observer
system_memory: 2G # The reserved system memory. system_memory is reserved for general tenants. The default value is 30G.
datafile_size: 40G
log_disk_size: 30G
cpu_count: 16
production_mode: false
enable_syslog_wf: true
enable_syslog_recycle: true
max_syslog_file_count: 4
appname: obclustercu2
root_password: xxx
proxyro_password: xxx
ocp_meta_tenant:
tenant_name: ocpcu2
max_cpu: 1
memory_size: 1G
log_disk_size: 7680M #
server1:
mysql_port: 3881
rpc_port: 3882
home_path: /app/oceanbase/observer
zone: zone1
server2:
mysql_port: 3881
rpc_port: 3882
home_path: /app/oceanbase/observer
zone: zone2
server3:
mysql_port: 3881
rpc_port: 3882
home_path: /app/oceanbase/observer
zone: zone3
obproxy-ce:
depends:
- oceanbase-ce
- ob-configserver
servers:
- 172.18.170.xxx
global:
listen_port: 3883
prometheus_listen_port: 2884
home_path: /app/oceanbase/obproxy
rs_list: 172.18.170.xxx:3881;172.18.170.xxx:3881;172.18.170.xxx:3881
enable_cluster_checkout: false
cluster_name: obclustercu2
skip_proxy_sys_private_check: true
enable_strict_kernel_release: false
obagent:
depends:
- oceanbase-ce
servers:
- name: server1
ip: 172.18.170.xxx
- name: server2
ip: 172.18.170.xxx
- name: server3
ip: 172.18.170.xxx
global:
home_path: /app/oceanbase/obagent
ocp-express:
depends:
- oceanbase-ce
- obproxy-ce
- obagent
servers:
- 172.18.170.xxx
global:
home_path: /app/oceanbase/ocp-express
memory_size: 1G
ob-configserver:
servers:
- 172.18.170.xxx
global:
listen_port: 8081
home_path: /home/admin/ob-configserver
prometheus:
……
grafana:
……

  1. 当前的部署方式是什么?obd web嘛?
  2. 麻烦提供下完整的obd日志把,

部署方式 命令行执行obd cluster start obtestcu2
obd.rar (17.1 KB)

命令行执行应该是obd cluster deploy obtestcu2 -c 配置文件 然后再obd cluster start obtestcu2
试下这种方式部署呢 OceanBase 数据库-OceanBase文档中心-分布式数据库使用文档

按组件安装仍然报错
obd cluster start obtestcu2 -c oceanbase-ce
在Wait for observer init -阶段:

[2024-03-20 22:27:06.274] [e7d37c8a-e6c5-11ee-b802-005056a77a4a] [INFO] Connect to observer
[2024-03-20 22:27:06.275] [e7d37c8a-e6c5-11ee-b802-005056a77a4a] [DEBUG] – connect 172.18.170.1XX
[2024-03-20 22:27:06.277] [e7d37c8a-e6c5-11ee-b802-005056a77a4a] [DEBUG] – connect 172.18.170.1XX
[2024-03-20 22:27:06.279] [e7d37c8a-e6c5-11ee-b802-005056a77a4a] [DEBUG] – connect 172.18.170.1XX
[2024-03-20 22:27:09.283] [e7d37c8a-e6c5-11ee-b802-005056a77a4a] [DEBUG] – connect 172.18.170.1XX
[2024-03-20 22:27:09.285] [e7d37c8a-e6c5-11ee-b802-005056a77a4a] [DEBUG] – execute sql: select 1. args: None
[2024-03-20 22:27:09.404] [e7d37c8a-e6c5-11ee-b802-005056a77a4a] [DEBUG] - sub connect ref count to 0
[2024-03-20 22:27:09.404] [e7d37c8a-e6c5-11ee-b802-005056a77a4a] [DEBUG] - export connect
[2024-03-20 22:27:09.404] [e7d37c8a-e6c5-11ee-b802-005056a77a4a] [DEBUG] - Call oceanbase-ce-py_script_display-3.1.0 for oceanbase-ce-4.2.2.0-100000192024011915.el7-aa3053da7370a6685a2ef457cd202d50e5ab75d3
[2024-03-20 22:27:09.405] [e7d37c8a-e6c5-11ee-b802-005056a77a4a] [DEBUG] - import display
[2024-03-20 22:27:09.405] [e7d37c8a-e6c5-11ee-b802-005056a77a4a] [DEBUG] - add display ref count to 1
[2024-03-20 22:27:09.406] [e7d37c8a-e6c5-11ee-b802-005056a77a4a] [INFO] Wait for observer init
[2024-03-20 22:27:09.407] [e7d37c8a-e6c5-11ee-b802-005056a77a4a] [DEBUG] – execute sql: select * from oceanbase.__all_server. args: None
[2024-03-20 22:27:09.407] [e7d37c8a-e6c5-11ee-b802-005056a77a4a] [DEBUG] – OBD-5000: select * from oceanbase.__all_server execute failed
[2024-03-20 22:27:09.409] [e7d37c8a-e6c5-11ee-b802-005056a77a4a] [ERROR] Traceback (most recent call last):
[2024-03-20 22:27:09.409] [e7d37c8a-e6c5-11ee-b802-005056a77a4a] [ERROR] File “core.py”, line 2018, in start_cluster
[2024-03-20 22:27:09.409] [e7d37c8a-e6c5-11ee-b802-005056a77a4a] [ERROR] File “core.py”, line 2142, in _start_cluster
[2024-03-20 22:27:09.409] [e7d37c8a-e6c5-11ee-b802-005056a77a4a] [ERROR] File “core.py”, line 186, in call_plugin
[2024-03-20 22:27:09.409] [e7d37c8a-e6c5-11ee-b802-005056a77a4a] [ERROR] File “_plugin.py”, line 346, in call
[2024-03-20 22:27:09.409] [e7d37c8a-e6c5-11ee-b802-005056a77a4a] [ERROR] File “_plugin.py”, line 304, in _new_func
[2024-03-20 22:27:09.409] [e7d37c8a-e6c5-11ee-b802-005056a77a4a] [ERROR] File “/home/admin/.obd/plugins/oceanbase-ce/3.1.0/display.py”, line 37, in display
[2024-03-20 22:27:09.409] [e7d37c8a-e6c5-11ee-b802-005056a77a4a] [ERROR] servers = cursor.fetchall(‘select * from oceanbase.__all_server’, raise_exception=True, exc_level=‘verbose’)
[2024-03-20 22:27:09.409] [e7d37c8a-e6c5-11ee-b802-005056a77a4a] [ERROR] File “_stdio.py”, line 886, in func_wrapper
[2024-03-20 22:27:09.409] [e7d37c8a-e6c5-11ee-b802-005056a77a4a] [ERROR] File “/home/admin/.obd/plugins/oceanbase-ce/4.2.2.0/connect.py”, line 511, in fetchall
[2024-03-20 22:27:09.409] [e7d37c8a-e6c5-11ee-b802-005056a77a4a] [ERROR] return self.execute(sql, args=args, execute_func=‘fetchall’, raise_exception=raise_exception, exc_level=exc_level, stdio=stdio)

需要看下/app/oceanbase/observer/log/observer.log日志。

后来改用obd web图形界面安装,没问题了