使用ocp安装observer时报错

测试环境
【 OB】
【 使用版本 】
最新版
【问题描述】
错误码CMP10002
错误信息操作 OCP-Agent 失败,错误信息:Execute shell command failed: failed to execute command: Command{user=root, program=sh, outputType=combined, cmd=cd /root/oceanbase; ulimit -s 10240 -u 655350 -n 655350 -c unlimited; LD_LIBRARY_PATH=/root/oceanbase/lib:$LD_LIBRARY_PATH LD_PRELOAD=’’ /root/oceanbase/bin/observer -I ip -p 2881 -P 2882 -n scxx -z zone1 -d /root/oceanbase/store/scxx -l info -oobconfig_url=‘http://ip:8080/services?Action=ObRootServiceInfo&User_ID=alibaba&UID=ocpmaster&ObRegion=scxx’,rootservice_list=ip:2882:2881,config_additional_dir=/root/data/log/scxx/etc2’;’/root/data/scxx/etc3,cluster_id=1745563462,enable_cgroup=true, timeout=10s}, exitCode: 132, output: sh: 1: ulimit: too many arguments Illegal instruction (core dumped) , error: exit status 132
错误原因在 OCP-Agent ip 上执行命令 /api/v1/ob/observer/start 失败,错误信息:Execute shell command failed: failed to execute command: Command{user=root, program=sh, outputType=combined, cmd=cd /root/oceanbase; ulimit -s 10240 -u 655350 -n 655350 -c unlimited; LD_LIBRARY_PATH=/root/oceanbase/lib:$LD_LIBRARY_PATH LD_PRELOAD=’’ /root/oceanbase/bin/observer -I ip -p 2881 -P 2882 -n scxx -z zone1 -d /root/oceanbase/store/scxx -l info -oobconfig_url=‘http://ip:8080/services?Action=ObRootServiceInfo&User_ID=alibaba&UID=ocpmaster&ObRegion=scxx’,rootservice_list=ip:2882:2881,config_additional_dir=/root/data/log/scxx/etc2’;’/root/data/scxx/etc3,cluster_id=1745563462,enable_cgroup=true, timeout=10s}, exitCode: 132, output: sh: 1: ulimit: too many arguments Illegal instruction (core dumped) , error: exit status 132
解决方案请联系技术支持

操作系统是ubuntu 24.04

OCP版本麻烦发下

lscpu截图发下

OCP版本:

[root@localhost data]# lscpu
Architecture: x86_64
CPU op-mode(s): 32-bit, 64-bit
Byte Order: Little Endian
CPU(s): 16
On-line CPU(s) list: 0-15
Thread(s) per core: 1
Core(s) per socket: 4
Socket(s): 4
NUMA node(s): 1
Vendor ID: GenuineIntel
CPU family: 15
Model: 6
Model name: Common KVM processor
Stepping: 1
CPU MHz: 2095.076
BogoMIPS: 4190.15
Hypervisor vendor: KVM
Virtualization type: full
L1d cache: 32K
L1i cache: 32K
L2 cache: 4096K
L3 cache: 16384K
NUMA node0 CPU(s): 0-15
Flags: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ht syscall nx lm constant_tsc nopl xt opology eagerfpu pni cx16 x2apic hypervisor lahf_lm

错误码CMP10002
错误信息操作 OCP-Agent 失败,错误信息:Execute shell command failed: failed to execute command: Command{user=root, program=sh, outputType=combined, cmd=cd /home/oceanbase/oceanbase; ulimit -s 10240 -u 655350 -n 655350 -c unlimited; LD_LIBRARY_PATH=/home/oceanbase/oceanbase/lib:$LD_LIBRARY_PATH LD_PRELOAD='' /home/oceanbase/oceanbase/bin/observer -I _my_observer_ip_ -p 2881 -P 2882 -n scxx -z zone1 -d /home/oceanbase/oceanbase/store/scxx -l info -oobconfig_url='http://_my_ip_:8080/services?Action=ObRootServiceInfo&User_ID=alibaba&UID=ocpmaster&ObRegion=scxx',rootservice_list=_my_observer_ip_:2882:2881,config_additional_dir=/home/data/log1/scxx/etc2';'/home/data/1/scxx/etc3,cluster_id=1745735323,large_query_threshold=600s,max_syslog_file_count=300,enable_syslog_recycle=1,enable_cgroup=true,enable_record_trace_log=false, timeout=10s}, exitCode: 132, output: sh: line 1: 19669 Illegal instruction (core dumped) LD_LIBRARY_PATH=/home/oceanbase/oceanbase/lib:$LD_LIBRARY_PATH LD_PRELOAD='' /home/oceanbase/oceanbase/bin/observer -I _my_observer_ip_ -p 2881 -P 2882 -n scxx -z zone1 -d /home/oceanbase/oceanbase/store/scxx -l info -oobconfig_url='http://_my_ip_:8080/services?Action=ObRootServiceInfo&User_ID=alibaba&UID=ocpmaster&ObRegion=scxx',rootservice_list=_my_observer_ip_:2882:2881,config_additional_dir=/home/data/log1/scxx/etc2';'/home/data/1/scxx/etc3,cluster_id=1745735323,large_query_threshold=600s,max_syslog_file_count=300,enable_syslog_recycle=1,enable_cgroup=true,enable_record_trace_log=false , error: exit status 132
错误原因在 OCP-Agent _my_observer_ip_ 上执行命令 /api/v1/ob/observer/start 失败,错误信息:Execute shell command failed: failed to execute command: Command{user=root, program=sh, outputType=combined, cmd=cd /home/oceanbase/oceanbase; ulimit -s 10240 -u 655350 -n 655350 -c unlimited; LD_LIBRARY_PATH=/home/oceanbase/oceanbase/lib:$LD_LIBRARY_PATH LD_PRELOAD='' /home/oceanbase/oceanbase/bin/observer -I _my_observer_ip_ -p 2881 -P 2882 -n scxx -z zone1 -d /home/oceanbase/oceanbase/store/scxx -l info -oobconfig_url='http://_my_ip_:8080/services?Action=ObRootServiceInfo&User_ID=alibaba&UID=ocpmaster&ObRegion=scxx',rootservice_list=_my_observer_ip_:2882:2881,config_additional_dir=/home/data/log1/scxx/etc2';'/home/data/1/scxx/etc3,cluster_id=1745735323,large_query_threshold=600s,max_syslog_file_count=300,enable_syslog_recycle=1,enable_cgroup=true,enable_record_trace_log=false, timeout=10s}, exitCode: 132, output: sh: line 1: 19669 Illegal instruction (core dumped) LD_LIBRARY_PATH=/home/oceanbase/oceanbase/lib:$LD_LIBRARY_PATH LD_PRELOAD='' /home/oceanbase/oceanbase/bin/observer -I _my_observer_ip_ -p 2881 -P 2882 -n scxx -z zone1 -d /home/oceanbase/oceanbase/store/scxx -l info -oobconfig_url='http://_my_ip_:8080/services?Action=ObRootServiceInfo&User_ID=alibaba&UID=ocpmaster&ObRegion=scxx',rootservice_list=_my_observer_ip_:2882:2881,config_additional_dir=/home/data/log1/scxx/etc2';'/home/data/1/scxx/etc3,cluster_id=1745735323,large_query_threshold=600s,max_syslog_file_count=300,enable_syslog_recycle=1,enable_cgroup=true,enable_record_trace_log=false , error: exit status 132

这个问题竟然还能重现,第一次我提交问题的时候是ubuntu,想着可能是操作系统不兼容的问题,后来这次我换到centos7.9,还是一样的问题。

我的OCP、OBServer是分别装的,用的是16C32G的两台虚机分别承担,OCP是用obd进行直接安装,没什么问题,然后OBServer是利用OCP的agant去安装的,每次都能看到这个问题。

万分感谢

是因为CPU不支持avx指令集,如果是虚拟机 并且cpu硬件支持 可以增加一下

1 个赞

最新版本要cpu支持avx指令集了,要么开启要么降版本。4.2.2.1

too many arguments Illegal instruction (core dumped),得看具体的dump文件内容

自己使用的是proxmox创建的vm,首先需要检查宿主机是否支持avx。
使用如下命令检查:

grep -o 'avx[^ ]*' /proc/cpuinfo

如果支持avx。
将proxmox的vm中cpu类型改成“host”

重启虚机,问题得以解决。

谢谢大家的帮助

1 个赞