使用obd在arm架构的机器上部署oceanbase时,报obshell failed

【 使用环境 】测试环境
【 OB or 其他组件 】obd
【 使用版本 】4.3.2.0
【问题描述】
使用oceanbase-all-in-one-4.3.2.0-100000092024072912.el7.aarch64.tar.gz在arm架构的机器上部署obceanbase,在执行obd cluster start命令启动集群时,报[ERROR] 172.100.40.20 obshell failed。
去172.100.40.20节点上查看,发现没有obshell这个二进制文件。
详细信息如下:

obd cluster start ob-xx

Get local repositories ok
Search plugins ok
Load cluster param plugin ok
Open ssh connection ok
Check before start observer ok
[WARN] OBD-1011: (172.100.40.20) The recommended value of fs.aio-max-nr is 1048576 (Current value: 65536)
[WARN] OBD-1007: (172.100.40.20) The recommended number of stack size is unlimited (Current value: 8192)
[WARN] OBD-1012: (172.100.40.20) clog and data use the same disk (/)

cluster scenario: express_oltp
Start observer ok
observer program health check ok
Connect to observer 172.100.40.20:2881 ok
Initialize oceanbase x
[ERROR] 172.100.40.20 obshell failed

[ERROR] Cluster init failed
See https://www.oceanbase.com/product/ob-deployer/error-codes .
Trace ID: d3482a7c-8d23-11ef-8b23-0242ac642801
If you want to view detailed obd logs, please run: obd display-trace d3482a7c-8d23-11ef-8b23-0242ac642801

obd display-trace d3482a7c-8d23-11ef-8b23-0242ac642801



[2024-10-18 07:38:48.318] [DEBUG] – start obshell: cd /data/ob-xx/oceanbase; /data/ob-xx/oceanbase/bin/obshell admin start --ip 172.100.40.20 --port 2886
[2024-10-18 07:38:48.318] [DEBUG] – root@172.100.40.20 execute: cd /data/ob-xx/oceanbase; /data/ob-xx/oceanbase/bin/obshell admin start --ip 172.100.40.20 --port 2886
[2024-10-18 07:38:48.378] [DEBUG] – exited code 127, error output:
[2024-10-18 07:38:48.378] [DEBUG] bash: /data/ob-xx/oceanbase/bin/obshell: No such file or directory
[2024-10-18 07:38:48.378] [DEBUG]
[2024-10-18 07:38:48.379] [ERROR] 172.100.40.20 obshell failed
[2024-10-18 07:38:48.379] [DEBUG] - sub bootstrap ref count to 0
[2024-10-18 07:38:48.379] [DEBUG] - export bootstrap
[2024-10-18 07:38:48.379] [DEBUG] - plugin oceanbase-py_script_bootstrap-4.2.2.0 result: False
[2024-10-18 07:38:48.422] [INFO] [ERROR] 172.100.40.20 obshell failed
[2024-10-18 07:38:48.422] [INFO]
[2024-10-18 07:38:48.422] [ERROR] Cluster init failed
[2024-10-18 07:38:48.424] [INFO] See https://www.oceanbase.com/product/ob-deployer/error-codes .
[2024-10-18 07:38:48.424] [INFO] Trace ID: d3482a7c-8d23-11ef-8b23-0242ac642801
[2024-10-18 07:38:48.424] [INFO] If you want to view detailed obd logs, please run: obd display-trace d3482a7c-8d23-11ef-8b23-0242ac642801
[2024-10-18 07:38:48.425] [DEBUG] - share lock /root/.obd/lock/mirror_and_repo release, count 0
[2024-10-18 07:38:48.425] [DEBUG] - unlock /root/.obd/lock/mirror_and_repo
[2024-10-18 07:38:48.425] [DEBUG] - exclusive lock /root/.obd/lock/deploy_ob-xx release, count 0
[2024-10-18 07:38:48.425] [DEBUG] - unlock /root/.obd/lock/deploy_ob-xx
[2024-10-18 07:38:48.425] [DEBUG] - share lock /root/.obd/lock/global release, count 0
[2024-10-18 07:38:48.425] [DEBUG] - unlock /root/.obd/lock/global

1 个赞

麻烦发下obd版本

1 个赞

obd --version

OceanBase Deploy: 2.9.2
REVISION: d1a445ca1311ab0a34f4ef5f6ff8f8f6b0aad623
BUILD_BRANCH: HEAD
BUILD_TIME: Jun 17 2024 14:16:12OURCE
Copyright (C) 2021 OceanBase
License GPLv3+: GNU GPL version 3 or later http://gnu.org/licenses/gpl.html.
This is free software: you are free to change and redistribute it.
There is NO WARRANTY, to the extent permitted by law.

发一下配置文件呢,好复现一下
顺便执行下
ls ~/.obd/repository/oceanbase-ce/4.3.2.0/c46ec9c8d48d2edae96a2a9ec3f4470a3d8a9483/bin/

麻烦发下yaml配置文件,操作系统版本,cpu架构信息 例如 lscpu

配置文件:
user:
username: root
password: 123456
port: 22
oceanbase:
servers:

  • 172.100.40.20
    global:
    appname: ob-xx
    root_password: ‘123456’
    mysql_port: 2881
    rpc_port: 2882
    home_path: /data/ob-xx/oceanbase
    datafile_size: 50G
    log_disk_size: 50G
    memory_limit: 25G
    system_memory: 6G
    cpu_count: ‘20’
    enable_syslog_recycle: true
    enable_syslog_wf: true
    max_syslog_file_count: 4
    devname: eth0
    scenario: express_oltp
    172.100.40.20:
    zone: zone1

ls bin

bench_io.sh dooba import_srs_data.py import_time_zone_info.py ob_admin oblogminer observer obstack

配置文件:
user:
username: root
password: 123456
port: 22
oceanbase:
servers:

  • 172.100.40.20
    global:
    appname: ob-xx
    root_password: ‘123456’
    mysql_port: 2881
    rpc_port: 2882
    home_path: /data/ob-xx/oceanbase
    datafile_size: 50G
    log_disk_size: 50G
    memory_limit: 25G
    system_memory: 6G
    cpu_count: ‘20’
    enable_syslog_recycle: true
    enable_syslog_wf: true
    max_syslog_file_count: 4
    devname: eth0
    scenario: express_oltp
    172.100.40.20:
    zone: zone1

cat /etc/os-release

NAME=“CentOS Linux”
VERSION=“7 (AltArch)”
ID=“centos”
ID_LIKE=“rhel fedora”
VERSION_ID=“7”
PRETTY_NAME=“CentOS Linux 7 (AltArch)”
ANSI_COLOR=“0;31”
CPE_NAME=“cpe:/o:centos:centos:7:server”
HOME_URL=“https://www.centos.org/
BUG_REPORT_URL=“https://bugs.centos.org/

CENTOS_MANTISBT_PROJECT=“CentOS-7”
CENTOS_MANTISBT_PROJECT_VERSION=“7”
REDHAT_SUPPORT_PRODUCT=“centos”
REDHAT_SUPPORT_PRODUCT_VERSION=“7”

lscpu

Architecture: aarch64
Byte Order: Little Endian
CPU(s): 64
On-line CPU(s) list: 0-63
Thread(s) per core: 1
Core(s) per socket: 32
Socket(s): 2
NUMA node(s): 2
Model: 0
Model name: HUAWEI,Kunpeng 920
CPU max MHz: 2600.0000
CPU min MHz: 200.0000
BogoMIPS: 200.00
L1d cache: 64K
L1i cache: 64K
L2 cache: 512K
L3 cache: 32768K
NUMA node0 CPU(s): 0-31
NUMA node1 CPU(s): 32-63
Flags: fp asimd evtstrm aes pmull sha1 sha2 crc32 atomics fphp asimdhp cpuid asimdrdm jscvt fcma dcpop asimddp asimdfhm ssbs

什么方式安装的all in one呢,bin下面的文件包含了一些非部署oceanbase组件的,还缺失obshell。
你可以卸载掉all in one ,然后重新解压,执行里面的install.sh进行安装,再部署

这个问题有进展了吗?