【 使用环境 】测试环境
【 OB or 其他组件 】OMS
【 使用版本 】4.2.13
安装步骤:
1、参考官方文档,Docker安装
https://www.oceanbase.com/docs/community-oms-cn-1000000005764080
2、docker load -i <OMS 社区版安装包>
3、docker load -i tsdb安装包
4、获取部署脚本
docker run --name oms-config-tool reg.docker.alibaba-inc.com/oceanbase/oms-ce:feature_4.2.13_ce_el8_x86 bash && sudo docker cp oms-config-tool:/root/docker_remote_deploy.sh . && sudo docker rm -f oms-config-tool
5、部署,根据提示生成配置文件
bash docker_remote_deploy.sh -o /data1/oms -c /data1/config.yaml -i 192.168.9.240 -d reg.docker.alibaba-inc.com/oceanbase/oms-ce:feature_4.2.13_ce_el8_x86
配置文件如下:
apsara_audit_enable: false
apsara_audit_sls_access_key:
apsara_audit_sls_access_secret:
apsara_audit_sls_endpoint:
apsara_audit_sls_ops_site_topic:
apsara_audit_sls_user_site_topic:
cm_is_default: !!bool true
cm_location: 0
cm_nodes:
- 192.168.9.240
cm_region: default
cm_region_cn: 默认地域
cm_server_port: 8088
cm_url: http://192.168.9.240:8088
drc_cm_db: _cm
drc_cm_heartbeat_db: _cm_hb
drc_rm_db: _rm
ghana_server_port: 8090
init_db: true
meta_ssl_ca:
meta_ssl_client_cert:
meta_ssl_client_key:
meta_ssl_enabled: !!bool false
nginx_server_port: 8089
oms_meta_host: 192.168.4.149
oms_meta_password: Lin@123456
oms_meta_port: 2881
oms_meta_user: root@ocp_meta
sshd_server_port: 2023
supervisor_server_port: 9000
tsdb_enabled: true
tsdb_password: 123
tsdb_service: INFLUXDB
tsdb_url: 192.168.9.240:14444
tsdb_username: root
6、进入web界面重置密码后,数据迁移页面右上角没有新建任务,查看查维监控->机器,状态一直是宕机,如截图
7、根据排查指南
https://www.oceanbase.com/docs/community-oms-cn-1000000005764185
7.1 查看配置文件似乎无问题,参考上面配置文件内容
7.2 查看服务状态,都是RUNNING
supervisorctl status
nginx RUNNING pid 12899, uptime 0:16:45
oms_console RUNNING pid 12902, uptime 0:16:35
oms_drc_cm RUNNING pid 12932, uptime 0:16:25
oms_drc_supervisor RUNNING pid 13197, uptime 0:16:15
sshd RUNNING pid 13619, uptime 0:16:05
7.3 进入容器查看本机IP和OMS_HOST_IP环境变量IP,这里不一致
[root@vm ~]# hostname -i
127.0.0.1
[root@vm ~]# env | grep OMS_HOST_IP
OMS_HOST_IP=192.168.9.240
[root@vm ~]# cat /etc/hosts
127.0.0.1 localhost
7.4 查看数据库状态,数据里的时间是UTC时间,不是北京时间,心跳表 gmt_modified 时间一直在发生变化
mysql> mysql> select * from _\G
*************************** 1. row ***************************
id: 1
ip: 192.168.9.240
resource_group_id: 1
store_limit: -1
store_usage: 0
store_book: 0
congo_limit: -1
congo_usage: 0
congo_book: 0
jdbc_writer_limit: -1
jdbc_writer_usage: 0
jdbc_writer_book: 0
metaq_writer_limit: -1
metaq_writer_usage: 0
metaq_writer_book: 0
full_check_limit: -1
full_check_usage: 0
full_check_book: 0
cpu_limit: -1
cpu_usage: 0
cpu_book: 0
gmt_modified: 2026-04-23 07:26:04
gmt_create: 2026-04-23 07:03:44
version: 0
host_status: ONLINE
ldc: NULL
host_pool_id: NULL
1 row in set (0.00 sec)
mysql> select * from _cm_hb.heart_beat where task_name='supervisor'\G
*************************** 1. row ***************************
id: 1
task_type: supervisor
task_name: supervisor
task_id: NULL
gmt_created: 2026-04-23 07:01:31
gmt_modified: 2026-04-23 07:27:01
host_ip: 192.168.9.240
version: NULL
1 row in set (0.01 sec)
辛苦各位大佬帮忙看下显示宕机原因。
