【 使用环境 】 测试环境
【 OCP Express 】
【问题描述】使用obd一键部署的ocp express还有集群,ocp宕掉以后不知道如何重启,百度查了命令是obd cluster start ,但是我怎么知道OCP name是啥?
~/.obd/cluster路径里面有对应集群名称
我不是要重启集群,我只是想单独把OCP express重新拉起来
obd cluster start name -c ocp express
可以通过 obd cluster list 查看集群名称 然后 使用obd cluster start OCP name -c ocp-express
老师,这是什么原因导致ocp起不来?
提供下obd日志(~/.obd/log/obd)
有改什么配置文件参数么?
df -h && free -h 看下
如果可以登陆数据库可以再执行下:select a.zone, a.SVR_IP,a.SVR_PORT, b.status,cpu_capacity,cpu_assigned_max,cpu_capacity-cpu_assigned_max as cpu_free,round(memory_limit /1024/1024/1024 ,2) as memory_total_gb,round((memory_limit-mem_capacity) /1024/1024/1024 ,2) as system_memory_gb,round(mem_assigned /1024/1024/1024 ,2) as mem_assigned_gb,round((mem_capacity-mem_assigned) /1024/1024/1024 ,2) as memory_free_gb,round(log_disk_capacity /1024/1024/1024 ,2) as log_disk_capacity_gb,round(log_disk_assigned /1024/1024/1024 ,2) as log_disk_assigned_gb,round((log_disk_capacity-log_disk_assigned) /1024/1024/1024 ,2) as log_disk_free_gb,round((data_disk_capacity /1024/1024/1024 ),2) as data_disk_gb,round((data_disk_in_use /1024/1024/1024 ),2) as data_disk_used_gb,round((data_disk_capacity-data_disk_in_use) /1024/1024/1024 ,2) as data_disk_free_gb from gv$ob_servers a join oceanbase.DBA_OB_SERVERS b on a.zone=b.zone\G;
*************************** 1. row ***************************
zone: zone3
SVR_IP: 10.48.1.51
SVR_PORT: 2882
status: ACTIVE
cpu_capacity: 16
cpu_assigned_max: 5
cpu_free: 11
memory_total_gb: 13.44
system_memory_gb: 3.00
mem_assigned_gb: 6.00
memory_free_gb: 4.44
log_disk_capacity_gb: 34.31
log_disk_assigned_gb: 15.00
log_disk_free_gb: 19.31
data_disk_gb: 4.38
data_disk_used_gb: 3.04
data_disk_free_gb: 1.33
*************************** 2. row ***************************
zone: zone2
SVR_IP: 10.48.1.48
SVR_PORT: 2882
status: ACTIVE
cpu_capacity: 16
cpu_assigned_max: 5
cpu_free: 11
memory_total_gb: 13.44
system_memory_gb: 3.00
mem_assigned_gb: 6.00
memory_free_gb: 4.44
log_disk_capacity_gb: 34.31
log_disk_assigned_gb: 15.00
log_disk_free_gb: 19.31
data_disk_gb: 193.00
data_disk_used_gb: 3.42
data_disk_free_gb: 189.58
*************************** 3. row ***************************
zone: zone1
SVR_IP: 10.48.1.47
SVR_PORT: 2882
status: ACTIVE
cpu_capacity: 16
cpu_assigned_max: 5
cpu_free: 11
memory_total_gb: 13.44
system_memory_gb: 3.00
mem_assigned_gb: 6.00
memory_free_gb: 4.44
log_disk_capacity_gb: 34.31
log_disk_assigned_gb: 15.00
log_disk_free_gb: 19.31
data_disk_gb: 193.00
data_disk_used_gb: 0.66
data_disk_free_gb: 192.34
老师,你觉得可能是51这个节点的磁盘空间不足导致46服务器的ocp express起不来吗?
有可能,可以提供下observer.log日志看看。
也可以使用obdiag进行下巡检和日志分析。
- obdiag check 巡检
- obdiag analyze log 日志分析
obdiag文档:OceanBase分布式数据库-海量数据 笔笔算数)