oceanbase-all-in-one-4.3.4 cannot start the database on UnionTech UOS 1070 (ARM server)

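【 Environment 】Production
【 OB or other component 】
【 Version 】oceanbase-all-in-one-4.3.4
【 Problem description 】Describe the problem clearly and precisely
【 Steps to reproduce 】Operations performed before and after the problem appeared
【 Attachments and logs 】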
[2024-12-03 15:02:12.299] [DEBUG] - cmd: ['volador']
[2024-12-03 15:02:12.299] [DEBUG] - opts: {'servers': None, 'components': None, 'force_delete': None, 'strict_check': None, 'without_parameter': None}
[2024-12-03 15:02:12.300] [DEBUG] - mkdir /root/.obd/lock/
[2024-12-03 15:02:12.300] [DEBUG] - unknown lock mode
[2024-12-03 15:02:12.301] [DEBUG] - try to get share lock /root/.obd/lock/global
[2024-12-03 15:02:12.301] [DEBUG] - share lock /root/.obd/lock/global, count 1
[2024-12-03 15:02:12.301] [DEBUG] - Get Deploy by name
[2024-12-03 15:02:12.301] [DEBUG] - mkdir /root/.obd/cluster/
[2024-12-03 15:02:12.302] [DEBUG] - mkdir /root/.obd/config_parser/
[2024-12-03 15:02:12.302] [DEBUG] - try to get exclusive lock /root/.obd/lock/deploy_volador
[2024-12-03 15:02:12.303] [DEBUG] - exclusive lock /root/.obd/lock/deploy_volador, count 1
[2024-12-03 15:02:12.314] [DEBUG] - Deploy status judge
[2024-12-03 15:02:12.315] [INFO] Get local repositories
[2024-12-03 15:02:12.316] [DEBUG] - mkdir /root/.obd/repository
[2024-12-03 15:02:12.317] [DEBUG] - Get local repository oceanbase-ce-4.3.4.0-ba694e36888c0f06cee150d60d13da507f5c5d37
[2024-12-03 15:02:12.317] [DEBUG] - Search repository oceanbase-ce version: 4.3.4.0, tag: ba694e36888c0f06cee150d60d13da507f5c5d37, release: None, package_hash: None
[2024-12-03 15:02:12.317] [DEBUG] - try to get share lock /root/.obd/lock/mirror_and_repo
[2024-12-03 15:02:12.317] [DEBUG] - share lock /root/.obd/lock/mirror_and_repo, count 1
[2024-12-03 15:02:12.318] [DEBUG] - mkdir /root/.obd/repository/oceanbase-ce
[2024-12-03 15:02:12.323] [DEBUG] - Found repository oceanbase-ce-4.3.4.0-100000162024110717.el8-ba694e36888c0f06cee150d60d13da507f5c5d37
[2024-12-03 15:02:12.448] [DEBUG] - Get deploy config
[2024-12-03 15:02:12.479] [INFO] Search plugins
[2024-12-03 15:02:12.479] [DEBUG] - Searching start_check plugin for components …
[2024-12-03 15:02:12.480] [DEBUG] - Searching start_check plugin for oceanbase-ce-4.3.4.0-100000162024110717.el8-ba694e36888c0f06cee150d60d13da507f5c5d37
[2024-12-03 15:02:12.480] [DEBUG] - mkdir /root/.obd/plugins
[2024-12-03 15:02:12.482] [DEBUG] - Found for oceanbase-ce-py_script_start_check-4.3.0.0 for oceanbase-ce-4.3.4.0
[2024-12-03 15:02:12.482] [DEBUG] - Searching create_tenant plugin for components …
[2024-12-03 15:02:12.482] [DEBUG] - Searching create_tenant plugin for oceanbase-ce-4.3.4.0-100000162024110717.el8-ba694e36888c0f06cee150d60d13da507f5c5d37
[2024-12-03 15:02:12.484] [DEBUG] - Found for oceanbase-ce-py_script_create_tenant-4.3.0.0 for oceanbase-ce-4.3.4.0
[2024-12-03 15:02:12.484] [DEBUG] - Searching tenant_optimize plugin for components …
[2024-12-03 15:02:12.484] [DEBUG] - Searching tenant_optimize plugin for oceanbase-ce-4.3.4.0-100000162024110717.el8-ba694e36888c0f06cee150d60d13da507f5c5d37
[2024-12-03 15:02:12.485] [DEBUG] - Found for oceanbase-ce-py_script_tenant_optimize-4.3.0.0 for oceanbase-ce-4.3.4.0
[2024-12-03 15:02:12.485] [DEBUG] - Searching start plugin for components …
[2024-12-03 15:02:12.485] [DEBUG] - Searching start plugin for oceanbase-ce-4.3.4.0-100000162024110717.el8-ba694e36888c0f06cee150d60d13da507f5c5d37
[2024-12-03 15:02:12.486] [DEBUG] - Found for oceanbase-ce-py_script_start-4.3.0.0 for oceanbase-ce-4.3.4.0
[2024-12-03 15:02:12.486] [DEBUG] - Searching connect plugin for components …
[2024-12-03 15:02:12.487] [DEBUG] - Searching connect plugin for oceanbase-ce-4.3.4.0-100000162024110717.el8-ba694e36888c0f06cee150d60d13da507f5c5d37
[2024-12-03 15:02:12.488] [DEBUG] - Found for oceanbase-ce-py_script_connect-4.2.2.0 for oceanbase-ce-4.3.4.0
[2024-12-03 15:02:12.488] [DEBUG] - Searching bootstrap plugin for components …
[2024-12-03 15:02:12.488] [DEBUG] - Searching bootstrap plugin for oceanbase-ce-4.3.4.0-100000162024110717.el8-ba694e36888c0f06cee150d60d13da507f5c5d37
[2024-12-03 15:02:12.489] [DEBUG] - Found for oceanbase-ce-py_script_bootstrap-4.2.2.0 for oceanbase-ce-4.3.4.0
[2024-12-03 15:02:12.489] [DEBUG] - Searching display plugin for components …
[2024-12-03 15:02:12.489] [DEBUG] - Searching display plugin for oceanbase-ce-4.3.4.0-100000162024110717.el8-ba694e36888c0f06cee150d60d13da507f5c5d37
[2024-12-03 15:02:12.490] [DEBUG] - Found for oceanbase-ce-py_script_display-3.1.0 for oceanbase-ce-4.3.4.0
[2024-12-03 15:02:12.612] [INFO] Load cluster param plugin
[2024-12-03 15:02:12.613] [DEBUG] - Get local repository oceanbase-ce-4.3.4.0-ba694e36888c0f06cee150d60d13da507f5c5d37
[2024-12-03 15:02:12.614] [DEBUG] - Searching param plugin for components …
[2024-12-03 15:02:12.614] [DEBUG] - Search param plugin for oceanbase-ce
[2024-12-03 15:02:12.615] [DEBUG] - Found for oceanbase-ce-param-4.3.3.0 for oceanbase-ce-4.3.4.0
[2024-12-03 15:02:12.615] [DEBUG] - Applying oceanbase-ce-param-4.3.3.0 for oceanbase-ce-4.3.4.0-100000162024110717.el8-ba694e36888c0f06cee150d60d13da507f5c5d37
[2024-12-03 15:02:13.952] [INFO] Open ssh connection
[2024-12-03 15:02:14.093] [DEBUG] - Call oceanbase-ce-py_script_start_check-4.3.0.0 for oceanbase-ce-4.3.4.0-100000162024110717.el8-ba694e36888c0f06cee150d60d13da507f5c5d37
[2024-12-03 15:02:14.094] [DEBUG] - import start_check
[2024-12-03 15:02:14.106] [DEBUG] - add start_check ref count to 1
[2024-12-03 15:02:14.107] [INFO] Check before start observer
[2024-12-03 15:02:14.114] [DEBUG] – local execute: ls /home/observer/store/clog/tenant_1/
[2024-12-03 15:02:14.121] [DEBUG] – exited code 0
[2024-12-03 15:02:14.121] [DEBUG] – local execute: cat /home/observer/run/observer.pid
[2024-12-03 15:02:14.125] [DEBUG] – exited code 1, error output:
[2024-12-03 15:02:14.125] [DEBUG] cat: /home/observer/run/observer.pid: No such file or directory
[2024-12-03 15:02:14.125] [DEBUG]
[2024-12-03 15:02:14.125] [DEBUG] – 127.0.0.1 port check
[2024-12-03 15:02:14.125] [DEBUG] – local execute: bash -c 'cat /proc/net/{tcp*,udp*}' | awk -F' ' '{if($4=="0A") print $2,$4,$10}' | grep ':0B41' | awk -F' ' '{print $3}' | uniq
[2024-12-03 15:02:14.135] [DEBUG] – exited code 0
[2024-12-03 15:02:14.136] [DEBUG] – local execute: bash -c 'cat /proc/net/{tcp*,udp*}' | awk -F' ' '{if($4=="0A") print $2,$4,$10}' | grep ':0B42' | awk -F' ' '{print $3}' | uniq
[2024-12-03 15:02:14.144] [DEBUG] – exited code 0
[2024-12-03 15:02:14.144] [DEBUG] – local execute: bash -c 'cat /proc/net/{tcp*,udp*}' | awk -F' ' '{if($4=="0A") print $2,$4,$10}' | grep ':0B46' | awk -F' ' '{print $3}' | uniq
[2024-12-03 15:02:14.153] [DEBUG] – exited code 0
[2024-12-03 15:02:14.153] [DEBUG] – local execute: ls /home/observer/store/sstable/block_file
[2024-12-03 15:02:14.157] [DEBUG] – exited code 2, error output:
[2024-12-03 15:02:14.158] [DEBUG] ls: cannot access '/home/observer/store/sstable/block_file': No such file or directory
[2024-12-03 15:02:14.158] [DEBUG]
[2024-12-03 15:02:14.160] [DEBUG] – local execute: cat /proc/sys/fs/aio-max-nr /proc/sys/fs/aio-nr
[2024-12-03 15:02:14.164] [DEBUG] – exited code 0
[2024-12-03 15:02:14.165] [WARNING] OBD-1011: (127.0.0.1) The recommended value of fs.aio-max-nr is 1048576 (Current value: 65536)
[2024-12-03 15:02:14.165] [DEBUG] – local execute: ulimit -a
[2024-12-03 15:02:14.168] [DEBUG] – exited code 0
[2024-12-03 15:02:14.169] [WARNING] OBD-1007: (127.0.0.1) The recommended number of core file size is unlimited (Current value: 0)
[2024-12-03 15:02:14.169] [WARNING] OBD-1007: (127.0.0.1) The recommended number of stack size is unlimited (Current value: 20480)
[2024-12-03 15:02:14.169] [DEBUG] – local execute: sysctl -a
[2024-12-03 15:02:14.197] [DEBUG] – exited code 0
[2024-12-03 15:02:14.204] [WARNING] OBD-1017: (127.0.0.1) The value of the "vm.max_map_count" must be within [327600, 1310720] (Current value: 65530, Recommended value: 655360)
[2024-12-03 15:02:14.204] [DEBUG] – local execute: cat /proc/meminfo
[2024-12-03 15:02:14.208] [DEBUG] – exited code 0
[2024-12-03 15:02:14.209] [DEBUG] – local execute: df --block-size=1024
[2024-12-03 15:02:14.214] [DEBUG] – exited code 0
[2024-12-03 15:02:14.215] [DEBUG] – get disk info for path /dev, total: 7551844352 avail: 7551844352
[2024-12-03 15:02:14.215] [DEBUG] – get disk info for path /dev/shm, total: 7848198144 avail: 7848198144
[2024-12-03 15:02:14.215] [DEBUG] – get disk info for path /run, total: 7848198144 avail: 7807959040
[2024-12-03 15:02:14.215] [DEBUG] – get disk info for path /sys/fs/cgroup, total: 7848198144 avail: 7848198144
[2024-12-03 15:02:14.215] [DEBUG] – get disk info for path /, total: 75125227520 avail: 64769413120
[2024-12-03 15:02:14.215] [DEBUG] – get disk info for path /tmp, total: 7848198144 avail: 7787839488
[2024-12-03 15:02:14.215] [DEBUG] – get disk info for path /boot, total: 1063256064 avail: 795799552
[2024-12-03 15:02:14.215] [DEBUG] – get disk info for path /boot/efi, total: 627900416 avail: 609890304
[2024-12-03 15:02:14.216] [DEBUG] – get disk info for path /home, total: 172154994688 avail: 166702833664
[2024-12-03 15:02:14.216] [DEBUG] – get disk info for path /run/user/989, total: 1569587200 avail: 1569521664
[2024-12-03 15:02:14.216] [DEBUG] – get disk info for path /run/user/0, total: 1569587200 avail: 1569587200
[2024-12-03 15:02:14.216] [DEBUG] – local execute: df --block-size=1024 /home/observer/store
[2024-12-03 15:02:14.220] [DEBUG] – exited code 0
[2024-12-03 15:02:14.220] [DEBUG] – get disk info for path /home, total: 172154994688 avail: 166702833664
[2024-12-03 15:02:14.220] [DEBUG] – local execute: df --block-size=1024 /home/observer/store/clog
[2024-12-03 15:02:14.224] [DEBUG] – exited code 0
[2024-12-03 15:02:14.225] [DEBUG] – get disk info for path /home, total: 172154994688 avail: 166702833664
[2024-12-03 15:02:14.225] [DEBUG] – disk: {'/dev': {'total': 7551844352, 'avail': 7551844352, 'need': 0}, '/dev/shm': {'total': 7848198144, 'avail': 7848198144, 'need': 0}, '/run': {'total': 7848198144, 'avail': 7807959040, 'need': 0}, '/sys/fs/cgroup': {'total': 7848198144, 'avail': 7848198144, 'need': 0}, '/': {'total': 75125227520, 'avail': 64769413120, 'need': 0}, '/tmp': {'total': 7848198144, 'avail': 7787839488, 'need': 0}, '/boot': {'total': 1063256064, 'avail': 795799552, 'need': 0}, '/boot/efi': {'total': 627900416, 'avail': 609890304, 'need': 0}, '/home': {'total': 172154994688, 'avail': 166702833664, 'need': 0}, '/run/user/989': {'total': 1569587200, 'avail': 1569521664, 'need': 0}, '/run/user/0': {'total': 1569587200, 'avail': 1569587200, 'need': 0}}
[2024-12-03 15:02:14.225] [WARNING] OBD-1012: (127.0.0.1) clog and data use the same disk (/home)
[2024-12-03 15:02:14.226] [DEBUG] – local execute: ping -W 1 -c 1 127.0.0.1
[2024-12-03 15:02:14.233] [DEBUG] – exited code 0
[2024-12-03 15:02:14.234] [DEBUG] – local execute: date +%s%N
[2024-12-03 15:02:14.239] [DEBUG] – exited code 0
[2024-12-03 15:02:14.240] [DEBUG] – 127.0.0.1 time delta 1.84033203125
[2024-12-03 15:02:14.245] [INFO] [WARN] OBD-1011: (127.0.0.1) The recommended value of fs.aio-max-nr is 1048576 (Current value: 65536)
[2024-12-03 15:02:14.245] [INFO] [WARN] OBD-1007: (127.0.0.1) The recommended number of core file size is unlimited (Current value: 0)
[2024-12-03 15:02:14.245] [INFO] [WARN] OBD-1007: (127.0.0.1) The recommended number of stack size is unlimited (Current value: 20480)
[2024-12-03 15:02:14.245] [INFO] [WARN] OBD-1017: (127.0.0.1) The value of the "vm.max_map_count" must be within [327600, 1310720] (Current value: 65530, Recommended value: 655360)
[2024-12-03 15:02:14.245] [INFO] [WARN] OBD-1012: (127.0.0.1) clog and data use the same disk (/home)
[2024-12-03 15:02:14.245] [INFO]
[2024-12-03 15:02:14.246] [DEBUG] - sub start_check ref count to 0
[2024-12-03 15:02:14.246] [DEBUG] - export start_check
[2024-12-03 15:02:14.246] [DEBUG] - plugin oceanbase-ce-py_script_start_check-4.3.0.0 result: True
[2024-12-03 15:02:14.246] [DEBUG] - Call oceanbase-ce-py_script_start-4.3.0.0 for oceanbase-ce-4.3.4.0-100000162024110717.el8-ba694e36888c0f06cee150d60d13da507f5c5d37
[2024-12-03 15:02:14.246] [DEBUG] - import start
[2024-12-03 15:02:14.251] [DEBUG] - add start ref count to 1
[2024-12-03 15:02:14.253] [INFO] cluster scenario: htap
[2024-12-03 15:02:14.253] [INFO] Start observer
[2024-12-03 15:02:14.255] [DEBUG] – local execute: ls /home/observer/store/clog/tenant_1/
[2024-12-03 15:02:14.260] [DEBUG] – exited code 0
[2024-12-03 15:02:14.261] [DEBUG] – local execute: cat /home/observer/run/observer.pid
[2024-12-03 15:02:14.265] [DEBUG] – exited code 1, error output:
[2024-12-03 15:02:14.266] [DEBUG] cat: /home/observer/run/observer.pid: No such file or directory
[2024-12-03 15:02:14.266] [DEBUG]
[2024-12-03 15:02:14.266] [DEBUG] – 127.0.0.1 start command construction
[2024-12-03 15:02:14.266] [DEBUG] – update large_query_threshold to 600s because of scenario
[2024-12-03 15:02:14.266] [DEBUG] – update enable_record_trace_log to False because of scenario
[2024-12-03 15:02:14.266] [DEBUG] – update enable_syslog_recycle to 1 because of scenario
[2024-12-03 15:02:14.267] [DEBUG] – update max_syslog_file_count to 300 because of scenario
[2024-12-03 15:02:14.267] [DEBUG] – starting 127.0.0.1 observer
[2024-12-03 15:02:14.268] [DEBUG] – root@127.0.0.1 export LD_LIBRARY_PATH='/home/observer/lib:'
[2024-12-03 15:02:14.269] [DEBUG] – local execute: cd /home/observer; /home/observer/bin/observer -r '127.0.0.1:2882:2881' -p 2881 -P 2882 -z 'zone1' -n 'volador' -c 1 -d '/home/observer/store' -I '127.0.0.1' -o __min_full_resource_pool_memory=2147483648,memory_limit='6G',system_memory='1G',datafile_size='2G',datafile_next='2G',datafile_maxsize='20G',log_disk_size='14G',cpu_count=4,enable_syslog_wf=False,max_syslog_file_count=4,large_query_threshold='600s',enable_record_trace_log=False,enable_syslog_recycle=1
[2024-12-03 15:02:14.465] [DEBUG] – exited code -4, error output:
[2024-12-03 15:02:14.465] [DEBUG]
[2024-12-03 15:02:14.466] [DEBUG] – root@127.0.0.1 delete env LD_LIBRARY_PATH
[2024-12-03 15:02:14.516] [ERROR] OBD-2002: Failed to start 127.0.0.1 observer:
[2024-12-03 15:02:14.516] [DEBUG] - sub start ref count to 0
[2024-12-03 15:02:14.516] [DEBUG] - export start
[2024-12-03 15:02:14.517] [DEBUG] - plugin oceanbase-ce-py_script_start-4.3.0.0 result: None
[2024-12-03 15:02:14.517] [ERROR] oceanbase-ce start failed
[2024-12-03 15:02:14.519] [DEBUG] - share lock /root/.obd/lock/mirror_and_repo release, count 0
[2024-12-03 15:02:14.519] [DEBUG] - unlock /root/.obd/lock/mirror_and_repo
[2024-12-03 15:02:14.520] [DEBUG] - exclusive lock /root/.obd/lock/deploy_volador release, count 0
[2024-12-03 15:02:14.520] [DEBUG] - unlock /root/.obd/lock/deploy_volador
[2024-12-03 15:02:14.520] [DEBUG] - share lock /root/.obd/lock/global release, count 0
[2024-12-03 15:02:14.520] [DEBUG] - unlock /root/.obd/lock/global
[2024-12-03 15:02:14.520] [INFO] See
[2024-12-03 15:02:14.520] [INFO] Trace ID: 853c519c-b144-11ef-a221-525400ecfd2d
[2024-12-03 15:02:14.520] [INFO] If you want to view detailed obd logs, please run: obd display-trace 853c519c-b144-11ef-a221-525400ecfd2d


Please post the obd.log file and the exact installation steps you followed.

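Based on the log you provided, oceanbase-all-in-one-4.3.4 failed to start on UOS 1070 (ARM server). Possible causes and solutions are listed below:

Possible causes

  1. Insufficient system parameter settings

    • fs.aio-max-nr is currently 65536; the recommended value is 1048576.
    • The core file size is currently 0; the recommended value is unlimited.
    • The stack size is currently 20480; the recommended value is unlimited.
    • vm.max_map_count is currently 65530; the recommended value is 655360.
  2. Disk usage

    • The log reports that clog and data share the same disk (/home), which may cause performance problems.
  3. Other potential problems

    • The start command exited with code -4, which usually indicates termination by a signal or another system-level error.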
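
The exit code -4 in the log can be decoded into a signal name. The sketch below assumes that obd reports a negative code N when the child process was killed by signal -N (the convention used by Python's subprocess module); the thread does not confirm this, and `describe_exit` is a hypothetical helper, not part of obd.

```shell
#!/usr/bin/env bash
# Translate a process exit status into a readable description.
# Assumption: a negative status N means "killed by signal -N".
describe_exit() {
  local code=$1
  if [ "$code" -lt 0 ]; then
    # kill -l <number> prints the signal name without the SIG prefix.
    echo "killed by SIG$(kill -l $(( 0 - code )))"
  else
    echo "exited with status $code"
  fi
}

describe_exit -4
```

Under that assumption, -4 maps to signal 4, which is SIGILL (illegal instruction) on Linux. That points at the CPU or the binary build rather than at the kernel parameter warnings.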

Solutions

  1. Adjust the system parameters

    • Set fs.aio-max-nr:
      sudo sysctl -w fs.aio-max-nr=1048576

    • Set the core file size:
      ulimit -c unlimited

    • Set the stack size:
      ulimit -s unlimited

    • Set vm.max_map_count:
      sudo sysctl -w vm.max_map_count=655360

  2. Put clog and data on separate disks

    • If possible, mount the clog and data directories on different disks to improve performance and stability.
  3. Check the start command

    • Make sure the start command has no syntax errors or other problems. You can run it manually to see whether it prints a more detailed error:
      cd /home/observer; /home/observer/bin/observer -r '127.0.0.1:2882:2881' -p 2881 -P 2882 -z 'zone1' -n 'volador' -c 1 -d '/home/observer/store' -I '127.0.0.1' -o '__min_full_resource_pool_memory=2147483648,memory_limit='6G',system_memory='1G',datafile_size='2G',datafile_next='2G',datafile_maxsize='20G',log_disk_size='14G',cpu_count=4,enable_syslog_wf=False,max_syslog_file_count=4,large_query_threshold='600s',enable_record_trace_log=False,enable_syslog_recycle=1'

  4. Review the detailed log

    • Use the obd display-trace command to view the detailed trace log for further analysis:
      obd display-trace 853c519c-b144-11ef-a221-525400ecfd2d
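
Before changing anything, it can help to compare the live kernel values with the ones obd recommended. The following is a minimal sketch, not an obd feature: `compare_value` is a hypothetical helper, and the two recommended values are copied from the OBD-1011 and OBD-1017 warnings in the log.

```shell
#!/usr/bin/env bash
# Report whether a kernel parameter meets a recommended minimum.
# compare_value is a hypothetical helper, not part of obd.
compare_value() {
  local key=$1 have=$2 want=$3
  if [ "$have" -lt "$want" ]; then
    echo "$key: current $have is below $want"
  else
    echo "$key: current $have is ok"
  fi
}

# Recommended values as printed by obd in the log above.
for item in fs.aio-max-nr=1048576 vm.max_map_count=655360; do
  key=${item%%=*}
  want=${item#*=}
  # sysctl -n prints only the value; skip keys this kernel does not expose.
  have=$(sysctl -n "$key" 2>/dev/null) || continue
  compare_value "$key" "$have" "$want"
done
```

Values set with sysctl -w last only until the next reboot, and ulimit changes apply only to the current shell and its children, so the observer has to be started from the same session for them to take effect.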
      

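Summary

Adjusting the system parameters, putting clog and data on separate disks, checking the start command, and reviewing the detailed log should resolve the startup failure of oceanbase-all-in-one-4.3.4 on UOS 1070 (ARM server). If the problem persists, contact OceanBase technical support for further investigation.

For more information, see: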

  1. 0001 ~ 3999
  2. 4000 ~ 4499
  3. 7000 ~ 7999

[2024-12-03 15:16:53.526] [DEBUG] - cmd: ['volador']
[2024-12-03 15:16:53.526] [DEBUG] - opts: {'servers': None, 'components': None, 'force_delete': None, 'strict_check': None, 'without_parameter': None}
[2024-12-03 15:16:53.527] [DEBUG] - mkdir /root/.obd/lock/
[2024-12-03 15:16:53.527] [DEBUG] - unknown lock mode
[2024-12-03 15:16:53.528] [DEBUG] - try to get share lock /root/.obd/lock/global
[2024-12-03 15:16:53.528] [DEBUG] - share lock /root/.obd/lock/global, count 1
[2024-12-03 15:16:53.528] [DEBUG] - Get Deploy by name
[2024-12-03 15:16:53.529] [DEBUG] - mkdir /root/.obd/cluster/
[2024-12-03 15:16:53.529] [DEBUG] - mkdir /root/.obd/config_parser/
[2024-12-03 15:16:53.530] [DEBUG] - try to get exclusive lock /root/.obd/lock/deploy_volador
[2024-12-03 15:16:53.530] [DEBUG] - exclusive lock /root/.obd/lock/deploy_volador, count 1
[2024-12-03 15:16:53.544] [DEBUG] - Deploy status judge
[2024-12-03 15:16:53.545] [INFO] Get local repositories
[2024-12-03 15:16:53.547] [DEBUG] - mkdir /root/.obd/repository
[2024-12-03 15:16:53.548] [DEBUG] - Get local repository oceanbase-ce-4.3.4.0-ba694e36888c0f06cee150d60d13da507f5c5d37
[2024-12-03 15:16:53.548] [DEBUG] - Search repository oceanbase-ce version: 4.3.4.0, tag: ba694e36888c0f06cee150d60d13da507f5c5d37, release: None, package_hash: None
[2024-12-03 15:16:53.549] [DEBUG] - try to get share lock /root/.obd/lock/mirror_and_repo
[2024-12-03 15:16:53.549] [DEBUG] - share lock /root/.obd/lock/mirror_and_repo, count 1
[2024-12-03 15:16:53.549] [DEBUG] - mkdir /root/.obd/repository/oceanbase-ce
[2024-12-03 15:16:53.555] [DEBUG] - Found repository oceanbase-ce-4.3.4.0-100000162024110717.el8-ba694e36888c0f06cee150d60d13da507f5c5d37
[2024-12-03 15:16:53.680] [DEBUG] - Get deploy config
[2024-12-03 15:16:53.714] [INFO] Search plugins
[2024-12-03 15:16:53.715] [DEBUG] - Searching start_check plugin for components …
[2024-12-03 15:16:53.716] [DEBUG] - Searching start_check plugin for oceanbase-ce-4.3.4.0-100000162024110717.el8-ba694e36888c0f06cee150d60d13da507f5c5d37
[2024-12-03 15:16:53.716] [DEBUG] - mkdir /root/.obd/plugins
[2024-12-03 15:16:53.720] [DEBUG] - Found for oceanbase-ce-py_script_start_check-4.3.0.0 for oceanbase-ce-4.3.4.0
[2024-12-03 15:16:53.720] [DEBUG] - Searching create_tenant plugin for components …
[2024-12-03 15:16:53.720] [DEBUG] - Searching create_tenant plugin for oceanbase-ce-4.3.4.0-100000162024110717.el8-ba694e36888c0f06cee150d60d13da507f5c5d37
[2024-12-03 15:16:53.721] [DEBUG] - Found for oceanbase-ce-py_script_create_tenant-4.3.0.0 for oceanbase-ce-4.3.4.0
[2024-12-03 15:16:53.721] [DEBUG] - Searching tenant_optimize plugin for components …
[2024-12-03 15:16:53.722] [DEBUG] - Searching tenant_optimize plugin for oceanbase-ce-4.3.4.0-100000162024110717.el8-ba694e36888c0f06cee150d60d13da507f5c5d37
[2024-12-03 15:16:53.723] [DEBUG] - Found for oceanbase-ce-py_script_tenant_optimize-4.3.0.0 for oceanbase-ce-4.3.4.0
[2024-12-03 15:16:53.723] [DEBUG] - Searching start plugin for components …
[2024-12-03 15:16:53.723] [DEBUG] - Searching start plugin for oceanbase-ce-4.3.4.0-100000162024110717.el8-ba694e36888c0f06cee150d60d13da507f5c5d37
[2024-12-03 15:16:53.724] [DEBUG] - Found for oceanbase-ce-py_script_start-4.3.0.0 for oceanbase-ce-4.3.4.0
[2024-12-03 15:16:53.724] [DEBUG] - Searching connect plugin for components …
[2024-12-03 15:16:53.725] [DEBUG] - Searching connect plugin for oceanbase-ce-4.3.4.0-100000162024110717.el8-ba694e36888c0f06cee150d60d13da507f5c5d37
[2024-12-03 15:16:53.726] [DEBUG] - Found for oceanbase-ce-py_script_connect-4.2.2.0 for oceanbase-ce-4.3.4.0
[2024-12-03 15:16:53.726] [DEBUG] - Searching bootstrap plugin for components …
[2024-12-03 15:16:53.726] [DEBUG] - Searching bootstrap plugin for oceanbase-ce-4.3.4.0-100000162024110717.el8-ba694e36888c0f06cee150d60d13da507f5c5d37
[2024-12-03 15:16:53.727] [DEBUG] - Found for oceanbase-ce-py_script_bootstrap-4.2.2.0 for oceanbase-ce-4.3.4.0
[2024-12-03 15:16:53.727] [DEBUG] - Searching display plugin for components …
[2024-12-03 15:16:53.728] [DEBUG] - Searching display plugin for oceanbase-ce-4.3.4.0-100000162024110717.el8-ba694e36888c0f06cee150d60d13da507f5c5d37
[2024-12-03 15:16:53.729] [DEBUG] - Found for oceanbase-ce-py_script_display-3.1.0 for oceanbase-ce-4.3.4.0
[2024-12-03 15:16:53.850] [INFO] Load cluster param plugin
[2024-12-03 15:16:53.852] [DEBUG] - Get local repository oceanbase-ce-4.3.4.0-ba694e36888c0f06cee150d60d13da507f5c5d37
[2024-12-03 15:16:53.852] [DEBUG] - Searching param plugin for components …
[2024-12-03 15:16:53.852] [DEBUG] - Search param plugin for oceanbase-ce
[2024-12-03 15:16:53.854] [DEBUG] - Found for oceanbase-ce-param-4.3.3.0 for oceanbase-ce-4.3.4.0
[2024-12-03 15:16:53.854] [DEBUG] - Applying oceanbase-ce-param-4.3.3.0 for oceanbase-ce-4.3.4.0-100000162024110717.el8-ba694e36888c0f06cee150d60d13da507f5c5d37
[2024-12-03 15:16:55.330] [INFO] Open ssh connection
[2024-12-03 15:16:55.468] [DEBUG] - Call oceanbase-ce-py_script_start_check-4.3.0.0 for oceanbase-ce-4.3.4.0-100000162024110717.el8-ba694e36888c0f06cee150d60d13da507f5c5d37
[2024-12-03 15:16:55.469] [DEBUG] - import start_check
[2024-12-03 15:16:55.485] [DEBUG] - add start_check ref count to 1
[2024-12-03 15:16:55.486] [INFO] Check before start observer
[2024-12-03 15:16:55.494] [DEBUG] – local execute: ls /home/observer/store/clog/tenant_1/
[2024-12-03 15:16:55.504] [DEBUG] – exited code 0
[2024-12-03 15:16:55.504] [DEBUG] – local execute: cat /home/observer/run/observer.pid
[2024-12-03 15:16:55.510] [DEBUG] – exited code 1, error output:
[2024-12-03 15:16:55.510] [DEBUG] cat: /home/observer/run/observer.pid: No such file or directory
[2024-12-03 15:16:55.510] [DEBUG]
[2024-12-03 15:16:55.511] [DEBUG] – 127.0.0.1 port check
[2024-12-03 15:16:55.511] [DEBUG] – local execute: bash -c 'cat /proc/net/{tcp*,udp*}' | awk -F' ' '{if($4=="0A") print $2,$4,$10}' | grep ':0B41' | awk -F' ' '{print $3}' | uniq
[2024-12-03 15:16:55.523] [DEBUG] – exited code 0
[2024-12-03 15:16:55.524] [DEBUG] – local execute: bash -c 'cat /proc/net/{tcp*,udp*}' | awk -F' ' '{if($4=="0A") print $2,$4,$10}' | grep ':0B42' | awk -F' ' '{print $3}' | uniq
[2024-12-03 15:16:55.535] [DEBUG] – exited code 0
[2024-12-03 15:16:55.535] [DEBUG] – local execute: bash -c 'cat /proc/net/{tcp*,udp*}' | awk -F' ' '{if($4=="0A") print $2,$4,$10}' | grep ':0B46' | awk -F' ' '{print $3}' | uniq
[2024-12-03 15:16:55.546] [DEBUG] – exited code 0
[2024-12-03 15:16:55.547] [DEBUG] – local execute: ls /home/observer/store/sstable/block_file
[2024-12-03 15:16:55.554] [DEBUG] – exited code 2, error output:
[2024-12-03 15:16:55.554] [DEBUG] ls: cannot access '/home/observer/store/sstable/block_file': No such file or directory
[2024-12-03 15:16:55.554] [DEBUG]
[2024-12-03 15:16:55.557] [DEBUG] – local execute: cat /proc/sys/fs/aio-max-nr /proc/sys/fs/aio-nr
[2024-12-03 15:16:55.563] [DEBUG] – exited code 0
[2024-12-03 15:16:55.563] [DEBUG] – local execute: ulimit -a
[2024-12-03 15:16:55.568] [DEBUG] – exited code 0
[2024-12-03 15:16:55.570] [DEBUG] – local execute: sysctl -a
[2024-12-03 15:16:55.597] [DEBUG] – exited code 0
[2024-12-03 15:16:55.605] [DEBUG] – local execute: cat /proc/meminfo
[2024-12-03 15:16:55.612] [DEBUG] – exited code 0
[2024-12-03 15:16:55.614] [DEBUG] – local execute: df --block-size=1024
[2024-12-03 15:16:55.620] [DEBUG] – exited code 0
[2024-12-03 15:16:55.622] [DEBUG] – get disk info for path /dev, total: 7551844352 avail: 7551844352
[2024-12-03 15:16:55.622] [DEBUG] – get disk info for path /dev/shm, total: 7848198144 avail: 7848198144
[2024-12-03 15:16:55.622] [DEBUG] – get disk info for path /run, total: 7848198144 avail: 7798587392
[2024-12-03 15:16:55.622] [DEBUG] – get disk info for path /sys/fs/cgroup, total: 7848198144 avail: 7848198144
[2024-12-03 15:16:55.622] [DEBUG] – get disk info for path /, total: 75125227520 avail: 64757276672
[2024-12-03 15:16:55.622] [DEBUG] – get disk info for path /tmp, total: 7848198144 avail: 7787380736
[2024-12-03 15:16:55.623] [DEBUG] – get disk info for path /boot, total: 1063256064 avail: 795799552
[2024-12-03 15:16:55.623] [DEBUG] – get disk info for path /boot/efi, total: 627900416 avail: 609890304
[2024-12-03 15:16:55.623] [DEBUG] – get disk info for path /home, total: 172154994688 avail: 166702833664
[2024-12-03 15:16:55.623] [DEBUG] – get disk info for path /run/user/0, total: 1569587200 avail: 1569062912
[2024-12-03 15:16:55.623] [DEBUG] – local execute: df --block-size=1024 /home/observer/store
[2024-12-03 15:16:55.630] [DEBUG] – exited code 0
[2024-12-03 15:16:55.630] [DEBUG] – get disk info for path /home, total: 172154994688 avail: 166702833664
[2024-12-03 15:16:55.630] [DEBUG] – local execute: df --block-size=1024 /home/observer/store/clog
[2024-12-03 15:16:55.636] [DEBUG] – exited code 0
[2024-12-03 15:16:55.637] [DEBUG] – get disk info for path /home, total: 172154994688 avail: 166702833664
[2024-12-03 15:16:55.637] [DEBUG] – disk: {'/dev': {'total': 7551844352, 'avail': 7551844352, 'need': 0}, '/dev/shm': {'total': 7848198144, 'avail': 7848198144, 'need': 0}, '/run': {'total': 7848198144, 'avail': 7798587392, 'need': 0}, '/sys/fs/cgroup': {'total': 7848198144, 'avail': 7848198144, 'need': 0}, '/': {'total': 75125227520, 'avail': 64757276672, 'need': 0}, '/tmp': {'total': 7848198144, 'avail': 7787380736, 'need': 0}, '/boot': {'total': 1063256064, 'avail': 795799552, 'need': 0}, '/boot/efi': {'total': 627900416, 'avail': 609890304, 'need': 0}, '/home': {'total': 172154994688, 'avail': 166702833664, 'need': 0}, '/run/user/0': {'total': 1569587200, 'avail': 1569062912, 'need': 0}}
[2024-12-03 15:16:55.638] [WARNING] OBD-1012: (127.0.0.1) clog and data use the same disk (/home)
[2024-12-03 15:16:55.638] [DEBUG] – local execute: ping -W 1 -c 1 127.0.0.1
[2024-12-03 15:16:55.646] [DEBUG] – exited code 0
[2024-12-03 15:16:55.647] [DEBUG] – local execute: date +%s%N
[2024-12-03 15:16:55.654] [DEBUG] – exited code 0
[2024-12-03 15:16:55.654] [DEBUG] – 127.0.0.1 time delta 2.2841796875
[2024-12-03 15:16:55.757] [INFO] [WARN] OBD-1012: (127.0.0.1) clog and data use the same disk (/home)
[2024-12-03 15:16:55.758] [INFO]
[2024-12-03 15:16:55.759] [DEBUG] - sub start_check ref count to 0
[2024-12-03 15:16:55.759] [DEBUG] - export start_check
[2024-12-03 15:16:55.759] [DEBUG] - plugin oceanbase-ce-py_script_start_check-4.3.0.0 result: True
[2024-12-03 15:16:55.760] [DEBUG] - Call oceanbase-ce-py_script_start-4.3.0.0 for oceanbase-ce-4.3.4.0-100000162024110717.el8-ba694e36888c0f06cee150d60d13da507f5c5d37
[2024-12-03 15:16:55.760] [DEBUG] - import start
[2024-12-03 15:16:55.767] [DEBUG] - add start ref count to 1
[2024-12-03 15:16:55.768] [INFO] cluster scenario: htap
[2024-12-03 15:16:55.769] [INFO] Start observer
[2024-12-03 15:16:55.772] [DEBUG] – local execute: ls /home/observer/store/clog/tenant_1/
[2024-12-03 15:16:55.778] [DEBUG] – exited code 0
[2024-12-03 15:16:55.779] [DEBUG] – local execute: cat /home/observer/run/observer.pid
[2024-12-03 15:16:55.785] [DEBUG] – exited code 1, error output:
[2024-12-03 15:16:55.785] [DEBUG] cat: /home/observer/run/observer.pid: No such file or directory
[2024-12-03 15:16:55.785] [DEBUG]
[2024-12-03 15:16:55.785] [DEBUG] – 127.0.0.1 start command construction
[2024-12-03 15:16:55.785] [DEBUG] – update large_query_threshold to 600s because of scenario
[2024-12-03 15:16:55.785] [DEBUG] – update enable_record_trace_log to False because of scenario
[2024-12-03 15:16:55.785] [DEBUG] – update enable_syslog_recycle to 1 because of scenario
[2024-12-03 15:16:55.786] [DEBUG] – update max_syslog_file_count to 300 because of scenario
[2024-12-03 15:16:55.786] [DEBUG] – starting 127.0.0.1 observer
[2024-12-03 15:16:55.787] [DEBUG] – root@127.0.0.1 export LD_LIBRARY_PATH='/home/observer/lib:'
[2024-12-03 15:16:55.787] [DEBUG] – local execute: cd /home/observer; /home/observer/bin/observer -r '127.0.0.1:2882:2881' -p 2881 -P 2882 -z 'zone1' -n 'volador' -c 1 -d '/home/observer/store' -I '127.0.0.1' -o __min_full_resource_pool_memory=2147483648,memory_limit='6G',system_memory='1G',datafile_size='2G',datafile_next='2G',datafile_maxsize='20G',log_disk_size='14G',cpu_count=4,enable_syslog_wf=False,max_syslog_file_count=4,large_query_threshold='600s',enable_record_trace_log=False,enable_syslog_recycle=1
[2024-12-03 15:16:56.533] [DEBUG] – exited code -4, error output:
[2024-12-03 15:16:56.534] [DEBUG]
[2024-12-03 15:16:56.534] [DEBUG] – root@127.0.0.1 delete env LD_LIBRARY_PATH
[2024-12-03 15:16:56.555] [ERROR] OBD-2002: Failed to start 127.0.0.1 observer:
[2024-12-03 15:16:56.555] [DEBUG] - sub start ref count to 0
[2024-12-03 15:16:56.555] [DEBUG] - export start
[2024-12-03 15:16:56.556] [DEBUG] - plugin oceanbase-ce-py_script_start-4.3.0.0 result: None
[2024-12-03 15:16:56.556] [ERROR] oceanbase-ce start failed
[2024-12-03 15:16:56.559] [DEBUG] - share lock /root/.obd/lock/mirror_and_repo release, count 0
[2024-12-03 15:16:56.559] [DEBUG] - unlock /root/.obd/lock/mirror_and_repo
[2024-12-03 15:16:56.559] [DEBUG] - exclusive lock /root/.obd/lock/deploy_volador release, count 0
[2024-12-03 15:16:56.559] [DEBUG] - unlock /root/.obd/lock/deploy_volador
[2024-12-03 15:16:56.559] [DEBUG] - share lock /root/.obd/lock/global release, count 0
[2024-12-03 15:16:56.559] [DEBUG] - unlock /root/.obd/lock/global
[2024-12-03 15:16:56.560] [INFO] See https://www.oceanbase.com/product/ob-deployer/error-codes .
[2024-12-03 15:16:56.560] [INFO] Trace ID: 927ccdda-b146-11ef-9b95-525400ecfd2d
[2024-12-03 15:16:56.560] [INFO] If you want to view detailed obd logs, please run: obd display-trace 927ccdda-b146-11ef-9b95-525400ecfd2d

cd /home

obd cluster deploy volador -c mini-single-example.yaml

obd cluster start volador


[root@localhost observer]# cd /home/observer; /home/observer/bin/observer -r '127.0.0.1:2882:2881' -p 2881 -P 2882 -z 'zone1' -n 'volador444' -c 1 -d '/home/observer/store' -I '127.0.0.1' -o __min_full_resource_pool_memory=2147483648,memory_limit='6G',system_memory='1G',datafile_size='2G',datafile_next='2G',datafile_maxsize='20G',log_disk_size='14G',cpu_count=4,enable_syslog_wf=False,max_syslog_file_count=4,large_query_threshold='600s',enable_record_trace_log=False,enable_syslog_recycle=1
Illegal instruction (core dumped)

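Try deploying a version earlier than 4.2.1 BP3, or use the nonlse installation package.
For example, deploy a 4.2.1 BP1 build. The CPU may not support the newer build.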


OK, I'll give it a try.

Thanks. It works fine with 4.2.1.1.


I recommend using the latest nonlse package; that build is the one suited to ARM.
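
To judge in advance whether a machine needs the nonlse package, one option is to check the CPU feature flags. The sketch below assumes that "nonlse" refers to builds that avoid the ARMv8.1 LSE atomic instructions; the thread does not state which instruction caused the failure. On arm64 Linux, /proc/cpuinfo lists `atomics` in its Features line when LSE is available. `has_lse` is a hypothetical helper name.

```shell
#!/usr/bin/env bash
# Succeeds if the given cpuinfo-style file advertises the arm64 "atomics" (LSE) feature.
# $1: path to the file to inspect (defaults to /proc/cpuinfo).
has_lse() {
  grep -E '^Features' "${1:-/proc/cpuinfo}" 2>/dev/null | grep -qw 'atomics'
}

if has_lse; then
  echo "LSE atomics reported by the CPU"
else
  echo "LSE atomics not reported; consider the nonlse package"
fi
```

The result is only meaningful on arm64 hosts, since other architectures do not expose a Features line in this form.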