【 使用环境 】生产环境,64G内存,300G/obdata 200G/obredo 150G/obhome
【 OB or 其他组件 】
【 使用版本 】4.2.1
【问题描述】部署集群正常,启动集群时,到了OCP-express步骤失败
【复现路径】
在启动到 Start ocp-express时长时间等待后失败
失败的内容为:connect failed
使用obclient -h127.0.0.1 -P2881 -uroot -p’xxxx’ -Doceanbase -A 可以登录
使用obclient -h127.0.0.1 -P2881 -uroot@ocp_meta -p’xxxx’ -Doceanbase -A可以登录
使用obclient -h127.0.0.1 -P2883 -uroot@ocp_meta -p’xxx’ -Doceanbase -A无法登录
使用obclient -h127.0.0.1 -P2883 -uroot -p’xxxx’ -Doceanbase -A无法登录
以下是相关内容:
部署的命令:
[admin@iBSM-2Q-SER obsetup_file]$ obd cluster deploy ob_ibsm -c ibsm-single-with-obp-obt-opc.yaml
±-------------------------------------------------------------------------------------------+
| Packages |
±-------------±--------±-----------------------±-----------------------------------------+
| Repository | Version | Release | Md5 |
±-------------±--------±-----------------------±-----------------------------------------+
| oceanbase-ce | 4.2.1.9 | 109000032024092015.el7 | 2d2109a5da1b037403c50d6351649688eae62d7c |
| obproxy-ce | 4.3.1.0 | 4.el7 | 835f4803c1f4da186439323b66c51db4662678a3 |
| obagent | 4.2.2 | 100000042024011120.el7 | 19739a07a12eab736aff86ecf357b1ae660b554e |
| ocp-express | 4.2.2 | 100000022024011120.el7 | 09ffcf156d1df9318a78af52656f499d2315e3f7 |
±-------------±--------±-----------------------±-----------------------------------------+
Repository integrity check ok
Load param plugin ok
Open ssh connection ok
Parameter check ok
Cluster status check ok
Initializes observer work home ok
Initializes obproxy work home ok
Initializes obagent work home ok
Initializes ocp-express work home ok
Remote oceanbase-ce-4.2.1.9-109000032024092015.el7-2d2109a5da1b037403c50d6351649688eae62d7c repository install ok
Remote oceanbase-ce-4.2.1.9-109000032024092015.el7-2d2109a5da1b037403c50d6351649688eae62d7c repository lib check ok
Remote obproxy-ce-4.3.1.0-4.el7-835f4803c1f4da186439323b66c51db4662678a3 repository install ok
Remote obproxy-ce-4.3.1.0-4.el7-835f4803c1f4da186439323b66c51db4662678a3 repository lib check ok
Remote obagent-4.2.2-100000042024011120.el7-19739a07a12eab736aff86ecf357b1ae660b554e repository install ok
Remote obagent-4.2.2-100000042024011120.el7-19739a07a12eab736aff86ecf357b1ae660b554e repository lib check ok
Remote ocp-express-4.2.2-100000022024011120.el7-09ffcf156d1df9318a78af52656f499d2315e3f7 repository install ok
Remote ocp-express-4.2.2-100000022024011120.el7-09ffcf156d1df9318a78af52656f499d2315e3f7 repository lib check ok
ob_ibsm deployed
Please execute obd cluster start ob_ibsm
to start
Trace ID: e19ac4ea-aad8-11ef-ba05-fa163d8684dd
If you want to view detailed obd logs, please run: obd display-trace e19ac4ea-aad8-11ef-ba05-fa163d8684dd
======================================
启动:
[admin@iBSM-2Q-SER obsetup_file]$ obd cluster start ob_ibsm
Get local repositories ok
Search plugins ok
Load cluster param plugin ok
Open ssh connection ok
Check before start observer ok
[WARN] OBD-1011: (127.0.0.1) The recommended value of fs.aio-max-nr is 1048576 (Current value: 65536)
Check before start obproxy ok
[WARN] OBD-4521: The config observer_sys_password in obproxy-ce did not take effect, please config it in oceanbase-ce
Check before start obagent ok
Check before start ocp-express ok
Start observer ok
observer program health check ok
Connect to observer 127.0.0.1:2881 ok
Initialize oceanbase-ce ok
Create tenant ocp_meta ok
Exec oceanbase-ce-4.2.1.9-109000032024092015.el7-2d2109a5da1b037403c50d6351649688eae62d7c import_time_zone_info.py ok
Exec oceanbase-ce-4.2.1.9-109000032024092015.el7-2d2109a5da1b037403c50d6351649688eae62d7c import_srs_data.py ok
obclient -h127.0.0.1 -P2881 -uroot@ocp_meta -Doceanbase -A
Start obproxy ok
obproxy program health check ok
Connect to obproxy ok
Initialize obproxy-ce ok
Start obagent ok
obagent program health check ok
Connect to Obagent ok
Start ocp-express x
[ERROR] 127.0.0.1: failed to connect meta db
[ERROR] ocp-express start failed
Wait for observer init ok
±--------------------------------------------+
| oceanbase-ce |
±----------±--------±-----±------±-------+
| ip | version | port | zone | status |
±----------±--------±-----±------±-------+
| 127.0.0.1 | 4.2.1.9 | 2881 | zone1 | ACTIVE |
±----------±--------±-----±------±-------+
obclient -h127.0.0.1 -P2881 -uroot -p’Uinnova@2024’ -Doceanbase -A
cluster unique id: 289942bf-1f8a-59ee-9de3-c6164bf07086-193614134c1-09010204
±--------------------------------------------+
| obproxy-ce |
±----------±-----±----------------±-------+
| ip | port | prometheus_port | status |
±----------±-----±----------------±-------+
| 127.0.0.1 | 2883 | 2884 | active |
±----------±-----±----------------±-------+
obclient -h127.0.0.1 -P2883 -uroot@proxysys -p’VMeU4y0wwh’ -Doceanbase -A
±-------------------------------------------------------------+
| obagent |
±----------±-------------------±-------------------±-------+
| ip | mgragent_http_port | monagent_http_port | status |
±----------±-------------------±-------------------±-------+
| 127.0.0.1 | 8089 | 8088 | active |
±----------±-------------------±-------------------±-------+
See https://www.oceanbase.com/product/ob-deployer/error-codes .
Trace ID: f077c36e-aad8-11ef-bf54-fa163d8684dd
If you want to view detailed obd logs, please run: obd display-trace f077c36e-aad8-11ef-bf54-fa163d8684dd
======================================
yaml脚本
oceanbase-ce:
servers:
- 127.0.0.1
global:
home_path: /obhome/observer
data_dir: /obdata
redo_dir: /obredo
mysql_port: 2881
rpc_port: 2882
obshell_port: 2886
zone: zone1
cluster_id: 1
memory_limit: 25G
system_memory: 8G
datafile_size: 50G
datafile_next: 10G
datafile_maxsize: 225G
log_disk_size: 160G
cpu_count: 16
production_mode: false
enable_syslog_wf: true
max_syslog_file_count: 200
max_syslog_file_time: 7d
enable_syslog_cycle: true
appname: obcluster
root_password: Uinnova@2024
obproxy-ce:
depends:
- oceanbase-ce
servers:
- 127.0.0.1
global:
listen_port: 2883
prometheus_listen_port: 2884
home_path: /obhome/obporxy
enable_cluster_checkout: false
cluster_name: obcluster
skip_proxy_sys_private_check: true
enable_strict_kernel_release: false
observer_sys_password: Uinnova@2024
obagent:
depends:
- oceanbase-ce
servers:
- 127.0.0.1
global:
home_path: /obhome/obagent
ocp-express:
depends:
- oceanbase-ce
- obproxy-ce
- obagent
servers:
- 127.0.0.1
global:
home_path: /obhome/ocp-express
memory_size: 1G
报错:
[2024-11-25 09:57:15.510] [INFO] Start ocp-express
[2024-11-25 09:57:15.511] [DEBUG] – local execute: cat /obhome/ocp-server/run/ocp-express.pid
[2024-11-25 09:57:15.519] [DEBUG] – exited code 1, error output:
[2024-11-25 09:57:15.520] [DEBUG] cat: /obhome/ocp-server/run/ocp-express.pid: 没有那个文件或目录
[2024-11-25 09:57:15.520] [DEBUG]
[2024-11-25 09:57:15.520] [DEBUG] – local execute: ls /obhome/ocp-server/.bootstrapped
[2024-11-25 09:57:15.529] [DEBUG] – exited code 2, error output:
[2024-11-25 09:57:15.530] [DEBUG] ls: 无法访问 ‘/obhome/ocp-server/.bootstrapped’: 没有那个文件或目录
[2024-11-25 09:57:15.530] [DEBUG]
[2024-11-25 09:57:15.531] [DEBUG] – connect 127.0.0.1 -P2883 -umeta@ocp_meta -p******
[2024-11-25 09:57:17.533] [DEBUG] – connect 127.0.0.1 -P2883 -umeta@ocp -p******
[2024-11-25 09:57:19.536] [DEBUG] – connect 127.0.0.1 -P2883 -umeta@ocp_meta -p******
[2024-11-25 09:57:21.535] [DEBUG] – connect 127.0.0.1 -P2883 -umeta@ocp -p******
[2024-11-25 09:57:23.540] [DEBUG] – connect 127.0.0.1 -P2883 -umeta@ocp_meta -p******
[2024-11-25 09:57:25.542] [DEBUG] – connect 127.0.0.1 -P2883 -umeta@ocp -p******
[2024-11-25 09:57:27.552] [DEBUG] – connect 127.0.0.1 -P2883 -umeta@ocp_meta -p******
[2024-11-25 09:57:29.551] [DEBUG] – connect 127.0.0.1 -P2883 -umeta@ocp -p******
[2024-11-25 09:57:31.552] [DEBUG] – connect 127.0.0.1 -P2883 -umeta@ocp_meta -p******
[2024-11-25 09:57:33.552] [DEBUG] – connect 127.0.0.1 -P2883 -umeta@ocp -p******
…
[2024-11-25 10:07:16.088] [ERROR] 127.0.0.1: failed to connect meta db
[2024-11-25 10:07:16.093] [INFO] [ERROR] 127.0.0.1: failed to connect meta db
[2024-11-25 10:07:16.093] [INFO]
[2024-11-25 10:07:16.094] [DEBUG] - sub start ref count to 0
[2024-11-25 10:07:16.094] [DEBUG] - export start
[2024-11-25 10:07:16.094] [DEBUG] - plugin ocp-express-py_script_start-4.2.2 result: False
[2024-11-25 10:07:16.094] [ERROR] ocp-express start failed
【备注】基于 LLM 和开源文档 RAG 的论坛小助手已开放测试,在发帖时输入 [@论坛小助手] 即可召唤小助手,欢迎试用!