OCP-4.3.2 安装报错

【 使用环境 】测试环境
【 OB or 其他组件 】OCP
【 使用版本 】ocp-server-ce-4.3.2
【问题描述】报连不上数据库
【复现路径】部署报错
网页显示:
install oceanbase-ce-4.2.1.8 for local ok
install obproxy-ce-4.3.1.0 for local ok
install ocp-server-ce-4.3.2 for local ok
±--------------------------------------------------------------------------------------------+
| Packages |
±--------------±--------±-----------------------±-----------------------------------------+
| Repository | Version | Release | Md5 |
±--------------±--------±-----------------------±-----------------------------------------+
| oceanbase-ce | 4.2.1.8 | 108000022024072217.el7 | 499b676f2ede5a16e0c07b2b15991d1160d972e8 |
| obproxy-ce | 4.3.1.0 | 4.el7 | 835f4803c1f4da186439323b66c51db4662678a3 |
| ocp-server-ce | 4.3.2 | 20241012145836.el7 | 610610e2daf63f6df08af686f9a88b6d8cefcc52 |
±--------------±--------±-----------------------±-----------------------------------------+
Repository integrity check ok
Load param plugin ok
Open ssh connection ok
Generate ocp server configuration ok
Parameter check ok
Cluster status check ok
Initializes observer work home ok
Initializes obproxy work home ok
Initializes ocp-server-ce work home ok
Remote oceanbase-ce-4.2.1.8-108000022024072217.el7-499b676f2ede5a16e0c07b2b15991d1160d972e8 repository install ok
Remote oceanbase-ce-4.2.1.8-108000022024072217.el7-499b676f2ede5a16e0c07b2b15991d1160d972e8 repository lib check !!
Remote obproxy-ce-4.3.1.0-4.el7-835f4803c1f4da186439323b66c51db4662678a3 repository install ok
Remote obproxy-ce-4.3.1.0-4.el7-835f4803c1f4da186439323b66c51db4662678a3 repository lib check ok
Remote ocp-server-ce-4.3.2-20241012145836.el7-610610e2daf63f6df08af686f9a88b6d8cefcc52 repository install ok
Remote ocp-server-ce-4.3.2-20241012145836.el7-610610e2daf63f6df08af686f9a88b6d8cefcc52 repository lib check ok
Try to get lib-repository
install oceanbase-ce-libs-4.2.1.8 for local ok
Remote oceanbase-ce-libs-4.2.1.8-108000022024072217.el7-d02f4bfd321370a02550424293beb1be31204038 repository install ok
Remote oceanbase-ce-4.2.1.8-108000022024072217.el7-499b676f2ede5a16e0c07b2b15991d1160d972e8 repository lib check ok
ocpmetadb deployed
Search plugins ok
Load cluster param plugin ok
Check before start observer ok
[WARN] OBD-1011: (10.106.21.31) The recommended value of fs.aio-max-nr is 1048576 (Current value: 65536)
[WARN] OBD-1012: (10.106.21.31) clog and data use the same disk (/data)

Start observer ok
observer program health check ok
Connect to observer ok
Initialize oceanbase-ce ok
Create tenant ocp_meta ok
Exec oceanbase-ce-4.2.1.8-108000022024072217.el7-499b676f2ede5a16e0c07b2b15991d1160d972e8 import_time_zone_info.py ok
Exec oceanbase-ce-4.2.1.8-108000022024072217.el7-499b676f2ede5a16e0c07b2b15991d1160d972e8 import_srs_data.py ok
obclient -h10.106.21.31 -P2881 -uroot@ocp_meta -Doceanbase -A

Create tenant ocp_monitor ok
[ERROR] OBD-5000: create resource pool ocp_monitor_pool unit=‘ocp_monitor_unit’, unit_num=1, zone_list=(‘zone1’) execute failed
[ERROR] create resource pool failed, you can try again by using SQL “drop resource pool ocp_monitor_pool” to delete the resource pool, if you are certain that the resource pool is not being used. error info: (4733, ‘zone ‘zone1’ resource not enough to hold 1 unit. You can check resource info by views: DBA_OB_UNITS, GV$OB_UNITS, GV$OB_SERVERS.\nserver ‘“10.106.21.31:2882”’ LOG_DISK resource not enough\n’)

Load cluster param plugin ok
Check before start obproxy ok
Start obproxy ok
obproxy program health check ok
Connect to obproxy ok
Initialize obproxy-ce ok
±-----------------------------------------------+
| obproxy-ce |
±-------------±-----±----------------±-------+
| ip | port | prometheus_port | status |
±-------------±-----±----------------±-------+
| 10.106.21.31 | 2883 | 2884 | active |
±-------------±-----±----------------±-------+
obclient -h10.106.21.31 -P2883 -uroot@proxysys -p’UATdb0101^’ -Doceanbase -A

succeed
Search plugins ok
Load cluster param plugin ok
Check before start ocp-server ok
[ERROR] meta tenant or monitor tenant connect failed
[ERROR] start ocp-server-ce failed
[ERROR] ocp-server-ce start failed

后台日志:
st request time
2025-03-11 17:39:05,221 INFO get_install_task_info (ocp_handler.py:862) [9c7f62d487ec48a2b9b1bfe19a943e64] get ocp install task info
2025-03-11 17:39:05,228 INFO dispatch (request_response_log.py:43) [9c7f62d487ec48a2b9b1bfe19a943e64] app send response, code: 200
2025-03-11 17:39:05,716 WARNING _do_install (ocp_handler.py:819) [None] failed to start component: ocp-server-ce
2025-03-11 17:39:05,716 INFO _do_install (ocp_handler.py:821) [None] end start ocp-server-ce
2025-03-11 17:39:05,721 ERROR wrapper (task.py:140) [1bfe43b132424e3987568a7dbe743e30] task 1 got exception
Traceback (most recent call last):
File “service/common/task.py”, line 126, in wrapper
File “concurrent/futures/_base.py”, line 444, in result
File “concurrent/futures/_base.py”, line 389, in __get_result
File “concurrent/futures/thread.py”, line 57, in run
File “service/handler/ocp_handler.py”, line 826, in _do_install
File “_stdio.py”, line 969, in wrapper
File “_stdio.py”, line 956, in func_wrapper
File “tool.py”, line 767, in init
File “tool.py”, line 797, in _connect
File “pymysql/connections.py”, line 353, in init
File “pymysql/connections.py”, line 633, in connect
File “pymysql/connections.py”, line 907, in _request_authentication
File “pymysql/connections.py”, line 725, in _read_packet
File “pymysql/protocol.py”, line 221, in raise_for_error
File “pymysql/err.py”, line 143, in raise_mysql_exception
pymysql.err.OperationalError: (1045, “Access denied for user ‘root’@‘xxx.xxx.xxx.xxx’ (using password: YES)”)
2025-03-11 17:39:05,723 INFO wrapper (task.py:143) [1bfe43b132424e3987568a7dbe743e30] task 1 finished failed
2025-03-11 17:39:06,985 INFO dispatch (request_response_log.py:40) [f67b28df8b174f359ff5bbdce8499eb6] app receive request, method: GET, url: http://10.106.21.31:8680/api/v1/ocp/deployments/1/install/2/log, query_params: , body: , from: 10.102.144.145:60883
2025-03-11 17:39:06,986 INFO dispatch (idle_shutdown.py:36) [f67b28df8b174f359ff5bbdce8499eb6] dispatch request and update last request time
2025-03-11 17:39:06,986 INFO get_install_task_info (ocp_handler.py:862) [f67b28df8b174f359ff5bbdce8499eb6] get ocp install task info

1 个赞

看上去是任务打包程序出异常了啊

1 个赞

需要重新下载安装包安装吗?

[ERROR] OBD-5000: create resource pool ocp_monitor_pool unit=‘ocp_monitor_unit’, unit_num=1, zone_list=(‘zone1’) execute failed
麻烦确认下集群的资源是否足够,一般这种错误都是因为内存不足导致。
提供一份yaml文件看看。~/.obd/cluster/xxxx/

1 个赞

user:
username: admin
password: XXX
port: 40022
oceanbase-ce:
version: 4.2.1.8
release: 108000022024072217.el7
package_hash: 499b676f2ede5a16e0c07b2b15991d1160d972e8
10.106.21.31:
zone: zone1
servers:

  • 10.106.21.31
    global:
    appname: ocpmetadb
    root_password: XX
    mysql_port: 2881
    rpc_port: 2882
    home_path: /home/admin/oceanbase
    data_dir: /data/1
    redo_dir: /data/log1
    datafile_size: 40GB
    datafile_maxsize: 100GB
    log_disk_size: 24GB
    memory_limit: 24GB
    cpu_count: 16
    ocp_meta_tenant:
    tenant_name: ocp_meta
    max_cpu: 2.0
    memory_size: 4G
    ocp_meta_username: root
    ocp_meta_password: XX
    ocp_meta_db: meta_database
    ocp_monitor_tenant:
    tenant_name: ocp_monitor
    max_cpu: 2.0
    memory_size: 8G
    ocp_monitor_username: root
    ocp_monitor_password: XX
    ocp_monitor_db: monitor_database
    cluster_id: 1741685623
    proxyro_password: XX
    ocp_root_password: XX
    ocp_meta_tenant_log_disk_size: 6G
    enable_syslog_wf: false
    max_syslog_file_count: 4
    system_memory: 6G
    obproxy-ce:
    version: 4.3.1.0
    package_hash: 835f4803c1f4da186439323b66c51db4662678a3
    release: 4.el7
    servers:
  • 10.106.21.31
    global:
    home_path: /home/admin/obproxy
    prometheus_listen_port: 2884
    listen_port: 2883
    enable_obproxy_rpc_service: false
    obproxy_sys_password: XXX
    skip_proxy_sys_private_check: true
    enable_strict_kernel_release: false
    enable_cluster_checkout: false
    depends:
  • oceanbase-ce
    10.106.21.31:
    proxy_id: 5047
    client_session_id_version: 2
    ocp-server-ce:
    version: 4.3.2
    package_hash: 610610e2daf63f6df08af686f9a88b6d8cefcc52
    release: 20241012145836.el7
    servers:
  • 10.106.21.31
    global:
    home_path: /home/admin/ocp
    soft_dir: /home/admin/software
    log_dir: /home/admin/logs
    ocp_site_url: http://10.106.21.31:8080
    port: 8080
    admin_password: XXX
    memory_size: 4G
    manage_info:
    machine: 10
    depends:
  • oceanbase-ce
  • obproxy-ce
1 个赞

log_disk_size: 24GB
这个设置的太小了,建议为memory_limit的3-4倍

是的,改成50G后,部署成功了,谢谢。

不是提示很明显了吗??资源不足,创建pool失败

预检查没报错,检查了个啥,点击部署还报错

问得好,预检查,检查了个啥,你不应该知道吗 :roll_eyes:

检查了个寂寞

1 个赞