obd always hits the OBD-5000 error when deploying OCP

[ERROR] OBD-5000: create resource pool ocp_meta_pool unit='ocp_meta_unit', unit_num=1, zone_list=('zone1','zone2','zone3') execute failed

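Whether I put the three nodes in three zones or all three in a single zone, I get OBD-5000. I'm using the all-in-one package (v4.2.1_BP11).
I've tried both the GUI (obd web) and the command line and I'm out of ideas. Could some expert please help me out?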

The creation most likely failed because of insufficient resources. Please share the yaml file from this directory:
~/.obd/cluster/xxxx/
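
For anyone unsure which file is meant, here is a minimal sketch that lists the yaml files in a cluster's directory under `~/.obd/cluster/`. The function name `find_cluster_configs` and the `cluster_name` argument are hypothetical; the cluster name is whatever stands in for `xxxx` in your deployment, and the sketch assumes nothing about the directory beyond it containing `*.yaml` files.

```python
from pathlib import Path
from typing import List, Optional


def find_cluster_configs(cluster_name: str, obd_home: Optional[Path] = None) -> List[Path]:
    """Return the *.yaml files stored for one cluster under <obd_home>/cluster/<name>/.

    obd_home defaults to ~/.obd, the directory mentioned in the reply above.
    """
    base = obd_home if obd_home is not None else Path.home() / ".obd"
    cluster_dir = base / "cluster" / cluster_name
    if not cluster_dir.is_dir():
        return []
    return sorted(cluster_dir.glob("*.yaml"))


if __name__ == "__main__":
    # "xxxx" is the placeholder from the thread; replace it with your cluster name.
    for path in find_cluster_configs("xxxx"):
        print(path)
```

Remember to mask passwords before posting the file contents.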

Could it be that there aren't enough resources?

user:
  username: admin
  password:
  port: 22
oceanbase-ce:
  version: 4.2.1.8
  release: 108000022024072217.el7
  package_hash: 499b676f2ede5a16e0c07b2b15991d1160d972e8
  192.168.137.101:
    zone: zone1
  servers:
    - 192.168.137.101
  global:
    appname: ocp
    root_password: Xupeng996
    mysql_port: 2881
    rpc_port: 2882
    home_path: /home/admin/oceanbase
    data_dir: /home/admin/mdb/data
    redo_dir: /home/admin/mdb/log
    memory_limit: 8GB
    ocp_meta_tenant:
      tenant_name: ocp_meta
      max_cpu: 1.0
      memory_size: 1G
    ocp_meta_username: root
    ocp_meta_password: Xupeng996
    ocp_meta_db: meta_database
    ocp_monitor_tenant:
      tenant_name: ocp_monitor
      max_cpu: 1.0
      memory_size: 2G
    ocp_monitor_username: root
    ocp_monitor_password: Xupeng996
    ocp_monitor_db: monitor_database
    cluster_id: 1764556475
    proxyro_password: 6pek9kenax
    ocp_root_password: JQVruSl4zt
    ocp_meta_tenant_log_disk_size: 6G
    enable_syslog_wf: false
    max_syslog_file_count: 16
    production_mode: false
    datafile_size: 7G
    system_memory: 1G
    log_disk_size: 22G
    cpu_count: 8
    datafile_maxsize: 22G
    datafile_next: 2G
obproxy-ce:
  version: 4.3.4.0
  package_hash: fba87ccf12faba9ba599cd7b0ca4a8149d1abb0e
  release: 1.el7
  servers:
    - 192.168.137.101
  global:
    home_path: /home/admin/obproxy
    prometheus_listen_port: 2884
    listen_port: 2883
    enable_obproxy_rpc_service: false
    obproxy_sys_password: cjGeVYYB75
    skip_proxy_sys_private_check: true
    enable_strict_kernel_release: false
    enable_cluster_checkout: false
  depends:
    - oceanbase-ce
  192.168.137.101:
    proxy_id: 6283
    client_session_id_version: 2
ocp-server-ce:
  version: 4.3.6
  package_hash: 9cb90dad471bb9ff8fe94c67402ce9560cc0b6fe
  release: 20250709105610.el7
  servers:
    - 192.168.137.101
  global:
    home_path: /home/admin/ocp
    soft_dir: /home/admin/software
    log_dir: /home/admin/logs
    ocp_site_url: http://192.168.137.101:8080
    port: 8080
    admin_password: Xupeng996
    memory_size: 2G
    manage_info:
      machine: 3
  depends:
    - oceanbase-ce
    - obproxy-ce

Is the machine short on memory? The memory_limit you set is too small.

Then how much should I give it at minimum?

Give it 12G, and give each tenant at least 2G.
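
A rough way to sanity-check numbers like these before redeploying is to subtract the requested tenant sizes from what is left after `system_memory`. The sketch below is only a simplified model: the function names are hypothetical, the 1G reserved for the sys tenant is an assumed placeholder, and OceanBase applies further internal reservations and per-tenant log disk requirements that are not modelled here, so a positive result does not guarantee that the resource pool can be created.

```python
import re
from typing import Iterable

_UNITS = {"K": 1 << 10, "M": 1 << 20, "G": 1 << 30, "T": 1 << 40}


def parse_size(text: str) -> int:
    """Convert a size string such as '8GB', '2G' or '512M' to bytes."""
    match = re.fullmatch(r"\s*(\d+(?:\.\d+)?)\s*([KMGT])B?\s*", text, re.IGNORECASE)
    if match is None:
        raise ValueError(f"unrecognised size: {text!r}")
    return int(float(match.group(1)) * _UNITS[match.group(2).upper()])


def tenant_memory_headroom(
    memory_limit: str,
    system_memory: str,
    tenant_sizes: Iterable[str],
    sys_tenant: str = "1G",  # assumed placeholder, not an official figure
) -> int:
    """Bytes left after the sys tenant and the requested tenants are placed.

    Simplified model: budget = memory_limit - system_memory.
    A negative result means the request cannot fit even in this optimistic model.
    """
    budget = parse_size(memory_limit) - parse_size(system_memory)
    requested = parse_size(sys_tenant) + sum(parse_size(s) for s in tenant_sizes)
    return budget - requested


if __name__ == "__main__":
    gib = 1 << 30
    # Values suggested in the reply above: 12G limit, 2G per tenant.
    print(tenant_memory_headroom("12G", "1G", ["2G", "2G"]) / gib, "GiB of headroom")
```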

Create tenant ocp_meta ok
Create tenant ocp_monitor x
[ERROR] OBD-5000: create resource pool ocp_monitor_pool unit='ocp_monitor_unit', unit_num=1, zone_list=('zone1') execute failed
I gave the meta tenant 2G and the monitor tenant 4G, and it still reports OBD-5000.

2025-12-01 12:51:13,137 ERROR wrapper (task.py:135) [ba211cb7a46e43e2aa298bfd1eb8660a] task 1 got exception
Traceback (most recent call last):
  File "service/common/task.py", line 121, in wrapper
  File "concurrent/futures/_base.py", line 444, in result
  File "concurrent/futures/_base.py", line 389, in __get_result
  File "concurrent/futures/thread.py", line 57, in run
  File "service/handler/ocp_handler.py", line 837, in _do_install
  File "_stdio.py", line 1111, in wrapper
  File "_stdio.py", line 1098, in func_wrapper
  File "tool.py", line 783, in __init__
  File "tool.py", line 813, in _connect
  File "pymysql/connections.py", line 353, in __init__
  File "pymysql/connections.py", line 633, in connect
  File "pymysql/connections.py", line 907, in _request_authentication
  File "pymysql/connections.py", line 725, in _read_packet
  File "pymysql/protocol.py", line 221, in raise_for_error
  File "pymysql/err.py", line 143, in raise_mysql_exception
pymysql.err.OperationalError: (1045, "Access denied for user 'root'@'xxx.xxx.xxx.xxx' (using password: YES)")

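The first failure was creating ocp_meta_pool; now it is creating the monitor tenant's resource pool that fails. Try giving it 2G. I'd also suggest making the log disk larger.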

No luck, it still can't create the monitor tenant.

Did you write this yaml by hand? Try deploying through the obd web GUI instead.

obd web is exactly what I used.

How much memory have you given it now?
Expand the log disk as well.
ocp_meta_tenant_log_disk_size: 6G can be removed.
Set this parameter to 10:
manage_info:
  machine: 3
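
As a minimal sketch of the edits suggested above, assuming the cluster config has already been loaded into a Python dict (for example with a YAML parser), the helper below removes `ocp_meta_tenant_log_disk_size` and sets `manage_info.machine` to 10. The function name `apply_suggested_changes` is hypothetical. The reply gives no target value for the log disk, so `log_disk_size` is only changed if you pass one explicitly.

```python
import copy
from typing import Optional


def apply_suggested_changes(
    config: dict,
    machine: int = 10,
    log_disk_size: Optional[str] = None,
) -> dict:
    """Return a patched copy of an OBD cluster config dict; the input is not modified."""
    patched = copy.deepcopy(config)

    ob_global = patched.setdefault("oceanbase-ce", {}).setdefault("global", {})
    # Drop the explicit log disk size for the meta tenant, as suggested in the thread.
    ob_global.pop("ocp_meta_tenant_log_disk_size", None)
    # No value was given in the thread; only touch log_disk_size when the caller supplies one.
    if log_disk_size is not None:
        ob_global["log_disk_size"] = log_disk_size

    ocp_global = patched.setdefault("ocp-server-ce", {}).setdefault("global", {})
    ocp_global.setdefault("manage_info", {})["machine"] = machine
    return patched


if __name__ == "__main__":
    original = {
        "oceanbase-ce": {"global": {"ocp_meta_tenant_log_disk_size": "6G", "log_disk_size": "22G"}},
        "ocp-server-ce": {"global": {"manage_info": {"machine": 3}}},
    }
    print(apply_suggested_changes(original))
```

Working on a copy keeps the original dict available for comparison before the file is written back.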