OAT添加安装ocp的服务器时,如何跳过cpu和内存检查

【 使用环境 】 测试环境
【 OB or 其他组件 】
OAT 4.1.0
OCP 3.3.4

【问题描述】
OAT添加安装ocp的服务器时,由于cpu<32 内存<128G,precheck失败,
只是做学习使用,如何跳过这两项检查

[2023-04-14T10:33:45.879+0800] INFO - Dependencies all met for <TaskInstance: init_server_with_tag.precheck manual__2023-04-14T02:33:00.888252+00:00 [queued]>
[2023-04-14T10:33:45.893+0800] INFO - Dependencies all met for <TaskInstance: init_server_with_tag.precheck manual__2023-04-14T02:33:00.888252+00:00 [queued]>
[2023-04-14T10:33:45.894+0800] INFO - 
--------------------------------------------------------------------------------
[2023-04-14T10:33:45.894+0800] INFO - Starting attempt 1 of 1
[2023-04-14T10:33:45.894+0800] INFO - 
--------------------------------------------------------------------------------
[2023-04-14T10:33:45.932+0800] INFO - Executing <Task(_PythonDecoratedOperator): precheck> on 2023-04-14 02:33:00.888252+00:00
[2023-04-14T10:33:45.937+0800] INFO - Started process 5393 to run task
[2023-04-14T10:33:45.946+0800] INFO - Running: ['airflow', 'tasks', 'run', 'init_server_with_tag', 'precheck', 'manual__2023-04-14T02:33:00.888252+00:00', '--job-id', '15', '--raw', '--subdir', 'DAGS_FOLDER/init_server_with_tag.py', '--cfg-path', '/tmp/tmp500it2u3']
[2023-04-14T10:33:45.951+0800] INFO - Job 15: Subtask precheck
[2023-04-14T10:33:46.203+0800] INFO - Running <TaskInstance: init_server_with_tag.precheck manual__2023-04-14T02:33:00.888252+00:00 [running]> on host oceanbase-test-7
[2023-04-14T10:33:46.340+0800] INFO - Exporting the following env vars:
AIRFLOW_CTX_DAG_OWNER=airflow
AIRFLOW_CTX_DAG_ID=init_server_with_tag
AIRFLOW_CTX_TASK_ID=precheck
AIRFLOW_CTX_EXECUTION_DATE=2023-04-14T02:33:00.888252+00:00
AIRFLOW_CTX_TRY_NUMBER=1
AIRFLOW_CTX_DAG_RUN_ID=manual__2023-04-14T02:33:00.888252+00:00
[2023-04-14T10:33:46.343+0800] INFO - Running statement: select oat_server.id, oat_credential.id as credential_id,  ip, ssh_port, username, password, auth_type, key_data, passphrase from oat_server, oat_credential where oat_server.credential_id=oat_credential.id and oat_server.id=%s, parameters: [1]
[2023-04-14T10:33:46.345+0800] INFO - Rows affected: 1
[2023-04-14T10:33:46.373+0800] INFO - Connected (version 2.0, client OpenSSH_7.4)
[2023-04-14T10:33:46.505+0800] INFO - Authentication (password) successful!
[2023-04-14T10:33:46.599+0800] INFO - execute command on 192.168.2.134:
/tmp/precheck.sh6rAM3Hyh -m obproduct
[2023-04-14T10:33:46.680+0800] INFO - Machine Role: obproduct
[2023-04-14T10:33:46.683+0800] INFO - Peer IP List:
[2023-04-14T10:33:46.688+0800] INFO - Machine Type: PHY
[2023-04-14T10:33:46.689+0800] INFO - Inspect Mode: FALSE
[2023-04-14T10:33:46.689+0800] INFO - 
[2023-04-14T10:33:46.704+0800] INFO - check CPU count: 8 < 32 ... EXPECT >= 32 ... FAIL
[2023-04-14T10:33:46.707+0800] INFO - TIPS: replace another machine with more CPU
[2023-04-14T10:33:46.713+0800] INFO - check total MEM: 15 GB < 128 GB ... EXPECT >= 128 GB ... FAIL
[2023-04-14T10:33:46.716+0800] INFO - TIPS: replace another machine with more MEM
[2023-04-14T10:33:46.719+0800] INFO - check SELinux status: Disabled ... PASS
[2023-04-14T10:33:46.721+0800] INFO - check /home/admin, exist ... PASS
[2023-04-14T10:33:46.726+0800] INFO - check /home/admin owner: admin ... PASS
[2023-04-14T10:33:46.735+0800] INFO - check /home/admin disk usage, total: 49G, used: 6.8G, use%: 14% < 50% ... PASS
[2023-04-14T10:33:46.746+0800] INFO - check account [admin] and home dir, exist ... PASS
[2023-04-14T10:33:46.810+0800] INFO - check clock sync service: chronyd, chrony server: 36.110.233.85 ... PASS
[2023-04-14T10:33:46.832+0800] INFO - check chrony clock offset: 0.504ms <= 50ms ... PASS
[2023-04-14T10:33:46.852+0800] INFO - sysctl /proc/sys/net/core/somaxconn = 2048, correct ... PASS
[2023-04-14T10:33:46.868+0800] INFO - sysctl /proc/sys/net/core/netdev_max_backlog = 10000, correct ... PASS
[2023-04-14T10:33:46.882+0800] INFO - sysctl /proc/sys/net/core/rmem_default = 16777216, correct ... PASS
[2023-04-14T10:33:46.897+0800] INFO - sysctl /proc/sys/net/core/wmem_default = 16777216, correct ... PASS
[2023-04-14T10:33:46.912+0800] INFO - sysctl /proc/sys/net/core/rmem_max = 16777216, correct ... PASS
[2023-04-14T10:33:46.928+0800] INFO - sysctl /proc/sys/net/core/wmem_max = 16777216, correct ... PASS
[2023-04-14T10:33:46.947+0800] INFO - sysctl /proc/sys/net/ipv4/conf/default/rp_filter = 1, correct ... PASS
[2023-04-14T10:33:46.965+0800] INFO - sysctl /proc/sys/net/ipv4/conf/default/accept_source_route = 0, correct ... PASS
[2023-04-14T10:33:46.981+0800] INFO - sysctl /proc/sys/net/ipv4/tcp_syncookies = 1, correct ... PASS
[2023-04-14T10:33:46.997+0800] INFO - sysctl /proc/sys/net/ipv4/tcp_rmem = 4096 87380 16777216, correct ... PASS
[2023-04-14T10:33:47.015+0800] INFO - sysctl /proc/sys/net/ipv4/tcp_wmem = 4096 65536 16777216, correct ... PASS
[2023-04-14T10:33:47.032+0800] INFO - sysctl /proc/sys/net/ipv4/tcp_max_syn_backlog = 16384, correct ... PASS
[2023-04-14T10:33:47.048+0800] INFO - sysctl /proc/sys/net/ipv4/tcp_fin_timeout = 15, correct ... PASS
[2023-04-14T10:33:47.065+0800] INFO - sysctl /proc/sys/net/ipv4/tcp_max_syn_backlog = 16384, correct ... PASS
[2023-04-14T10:33:47.083+0800] INFO - sysctl /proc/sys/net/ipv4/tcp_tw_reuse = 1, correct ... PASS
[2023-04-14T10:33:47.101+0800] INFO - sysctl /proc/sys/net/ipv4/tcp_slow_start_after_idle = 0, correct ... PASS
[2023-04-14T10:33:47.119+0800] INFO - sysctl /proc/sys/vm/swappiness = 0, correct ... PASS
[2023-04-14T10:33:47.138+0800] INFO - sysctl /proc/sys/kernel/core_pattern = /data/1/core-%e-%p-%t, correct ... PASS
[2023-04-14T10:33:47.156+0800] INFO - sysctl /proc/sys/vm/min_free_kbytes = 2097152, correct ... PASS
[2023-04-14T10:33:47.174+0800] INFO - sysctl /proc/sys/vm/max_map_count = 655360, correct ... PASS
[2023-04-14T10:33:47.190+0800] INFO - sysctl /proc/sys/fs/aio-max-nr = 1048576, correct ... PASS
[2023-04-14T10:33:47.207+0800] INFO - sysctl /proc/sys/vm/overcommit_memory = 0, correct ... PASS
[2023-04-14T10:33:47.225+0800] INFO - sysctl /proc/sys/vm/nr_hugepages = 0, correct ... PASS
[2023-04-14T10:33:47.245+0800] INFO - sysctl /proc/sys/net/ipv4/ip_forward = 1, correct ... PASS
[2023-04-14T10:33:47.257+0800] INFO - sysctl /proc/sys/net/ipv4/ip_local_port_range = 10000 65535, correct ... PASS
[2023-04-14T10:33:47.342+0800] INFO - check service [crond]: enabled ... PASS
[2023-04-14T10:33:47.522+0800] INFO - check service [sshd]: enabled ... PASS
[2023-04-14T10:33:47.600+0800] INFO - check service [firewalld]: inactive ... PASS
[2023-04-14T10:33:47.606+0800] INFO - check service [firewalld]: disabled ... PASS
[2023-04-14T10:33:47.611+0800] INFO - check sshd_config PubkeyAuthentication: yes ... PASS
[2023-04-14T10:33:47.615+0800] INFO - check sshd_config UseDNS: no ... PASS
[2023-04-14T10:33:47.619+0800] INFO - check sshd_config ClientAliveInterval: 60 ... PASS
[2023-04-14T10:33:47.623+0800] INFO - check sshd_config ClientAliveCountMax: 10 ... PASS
[2023-04-14T10:33:47.625+0800] INFO - check hugepage: disabled ... PASS
[2023-04-14T10:33:47.625+0800] INFO - check oceanbase_limits.conf, exist ... PASS
[2023-04-14T10:33:47.764+0800] INFO - check hard limit of new session open_files (ulimit -H -n): 655360 ... PASS
[2023-04-14T10:33:47.765+0800] INFO - check hard limit of open_files (ulimit -H -n): 655360 ... PASS
[2023-04-14T10:33:47.815+0800] INFO - check soft limit of new session open_files (ulimit -S -n): 655360 ... PASS
[2023-04-14T10:33:47.816+0800] INFO - check soft limit of open_files (ulimit -S -n): 655360 ... PASS
[2023-04-14T10:33:47.880+0800] INFO - check hard limit of new session max_user_processes (ulimit -H -u): 655360 ... PASS
[2023-04-14T10:33:47.881+0800] INFO - check hard limit of max_user_processes (ulimit -H -u): 655360 ... PASS
[2023-04-14T10:33:47.932+0800] INFO - check soft limit of new session max_user_processes (ulimit -S -u): 655360 ... PASS
[2023-04-14T10:33:47.934+0800] INFO - check soft limit of max_user_processes (ulimit -S -u): 655360 ... PASS
[2023-04-14T10:33:47.999+0800] INFO - check hard limit of new session stack_size (ulimit -H -s): 10240 ... PASS
[2023-04-14T10:33:48.000+0800] INFO - check hard limit of stack_size (ulimit -H -s): 10240 ... PASS
[2023-04-14T10:33:48.053+0800] INFO - check soft limit of new session stack_size (ulimit -S -s): 10240 ... PASS
[2023-04-14T10:33:48.055+0800] INFO - check soft limit of stack_size (ulimit -S -s): 10240 ... PASS
[2023-04-14T10:33:48.128+0800] INFO - check hard limit of new session core_file_size (ulimit -H -c): unlimited ... PASS
[2023-04-14T10:33:48.130+0800] INFO - check hard limit of core_file_size (ulimit -H -c): unlimited ... PASS
[2023-04-14T10:33:48.190+0800] INFO - check soft limit of new session core_file_size (ulimit -S -c): unlimited ... PASS
[2023-04-14T10:33:48.191+0800] INFO - check soft limit of core_file_size (ulimit -S -c): unlimited ... PASS
[2023-04-14T10:33:48.269+0800] INFO - check hard limit of new session cpu_time (ulimit -H -t): unlimited ... PASS
[2023-04-14T10:33:48.271+0800] INFO - check hard limit of cpu_time (ulimit -H -t): unlimited ... PASS
[2023-04-14T10:33:48.326+0800] INFO - check soft limit of new session cpu_time (ulimit -S -t): unlimited ... PASS
[2023-04-14T10:33:48.329+0800] INFO - check soft limit of cpu_time (ulimit -S -t): unlimited ... PASS
[2023-04-14T10:33:48.340+0800] INFO - check numa stat, pass ... PASS
[2023-04-14T10:33:48.344+0800] INFO - check elevator policy: deadline ... PASS
[2023-04-14T10:33:48.346+0800] INFO - check current_clocksource: tsc ... PASS
[2023-04-14T10:33:48.367+0800] INFO - check logical sector size of /dev/vda: 512 ... PASS
[2023-04-14T10:33:48.370+0800] INFO - check logical sector size of /dev/vdb: 512 ... PASS
[2023-04-14T10:33:49.051+0800] INFO - check RPM: mariadb-5.5.68-1.el7.x86_64 mariadb-libs-5.5.68-1.el7.x86_64 is installed ... PASS
[2023-04-14T10:33:49.713+0800] INFO - check RPM: python-devel-2.7.5-92.el7_9.x86_64 is installed ... PASS
[2023-04-14T10:33:50.324+0800] INFO - check RPM: net-tools-2.0-0.25.20131004git.el7.x86_64 is installed ... PASS
[2023-04-14T10:33:50.963+0800] INFO - check RPM: mtr-0.85-7.el7.x86_64 is installed ... PASS
[2023-04-14T10:33:51.607+0800] INFO - check RPM: selinux-policy-targeted-3.13.1-266.el7.noarch tar-1.26-35.el7.x86_64 is installed ... PASS
[2023-04-14T10:33:52.243+0800] INFO - check RPM: binutils-2.27-44.base.el7_9.1.x86_64 is installed ... PASS
[2023-04-14T10:33:52.837+0800] INFO - check RPM: bind-utils-9.11.4-26.P2.el7_9.13.x86_64 is installed ... PASS
[2023-04-14T10:33:53.520+0800] INFO - check RPM: libaio-0.3.109-13.el7.x86_64 is installed ... PASS
[2023-04-14T10:33:54.148+0800] INFO - check RPM: libcurl-7.29.0-59.el7_9.1.x86_64 curl-7.29.0-59.el7_9.1.x86_64 python-pycurl-7.19.0-19.el7.x86_64 is installed ... PASS
[2023-04-14T10:33:54.717+0800] INFO - check RPM: libatomic-4.8.5-44.el7.x86_64 is installed ... PASS
[2023-04-14T10:33:55.313+0800] INFO - check RPM: ncurses-base-5.9-14.20130511.el7_4.noarch irqbalance-1.0.7-12.el7.x86_64 vim-enhanced-7.4.629-8.el7_9.x86_64 qrencode-libs-3.4.1-3.el7.x86_64 ncurses-libs-5.9-14.20130511.el7_4.x86_64 perl-Encode-2.51-7.el7.x86_64 nmap-ncat-6.40-19.el7.x86_64 ncurses-5.9-14.20130511.el7_4.x86_64 is installed ... PASS
[2023-04-14T10:33:55.970+0800] INFO - check RPM: iproute-4.11.0-30.el7.x86_64 is installed ... PASS
[2023-04-14T10:33:55.994+0800] INFO - check mysql client, working ... PASS
[2023-04-14T10:33:56.007+0800] INFO - checking irq affinity ...
[2023-04-14T10:33:56.021+0800] INFO - checking eth0 ...
[2023-04-14T10:33:56.029+0800] INFO - check irq channels, NIC: eth0, Channel Combined: 1 ... PASS
[2023-04-14T10:33:56.050+0800] INFO - check irq affinity, NIC: eth0, smp_affinity count: 3 ... PASS
[2023-04-14T10:33:56.060+0800] INFO - check irqbalance status: unknown ... PASS
[2023-04-14T10:33:56.060+0800] INFO - check irqbalance service: disabled ... PASS
[2023-04-14T10:33:56.062+0800] INFO - df: ‘/data/1’: No such file or directory
[2023-04-14T10:33:56.102+0800] INFO - 
[2023-04-14T10:33:56.102+0800] INFO - 
[2023-04-14T10:33:56.103+0800] INFO - ### SUMMARY OF ISSUES IN PRE-CHECK ###
[2023-04-14T10:33:56.104+0800] INFO - check CPU count: 8 < 32 ... EXPECT >= 32 ... FAIL
[2023-04-14T10:33:56.104+0800] INFO - TIPS: replace another machine with more CPU
[2023-04-14T10:33:56.104+0800] INFO - check total MEM: 15 GB < 128 GB ... EXPECT >= 128 GB ... FAIL
[2023-04-14T10:33:56.105+0800] INFO - TIPS: replace another machine with more MEM
[2023-04-14T10:33:56.106+0800] INFO - execute command on 192.168.2.134:
rm -f /tmp/precheck.sh6rAM3Hyh
[2023-04-14T10:33:56.169+0800] ERROR - Task failed with exception
Traceback (most recent call last):
  File "/usr/local/lib/python3.9/site-packages/airflow/decorators/base.py", line 217, in execute
    return_value = super().execute(context)
  File "/usr/local/lib/python3.9/site-packages/airflow/operators/python.py", line 175, in execute
    return_value = self.execute_callable()
  File "/usr/local/lib/python3.9/site-packages/airflow/operators/python.py", line 192, in execute_callable
    return self.python_callable(*self.op_args, **self.op_kwargs)
  File "/oat/task_engine/dags/init_server_with_tag.py", line 79, in precheck
    common.server_precheck(ctx, logger=logger)
  File "/oat/task_engine/plugins/common.py", line 1542, in server_precheck
    raise RuntimeError('server precheck failed, please see the summary info above for details')
RuntimeError: server precheck failed, please see the summary info above for details
[2023-04-14T10:33:56.184+0800] INFO - Marking task as FAILED. dag_id=init_server_with_tag, task_id=precheck, execution_date=20230414T023300, start_date=20230414T023345, end_date=20230414T023356
[2023-04-14T10:33:56.186+0800] INFO - Running statement: update oat_audit set status='failed', update_time=utc_timestamp(), failed_reason=%s where id=%s, parameters: ['failed task instance is init_server_with_tag__precheck__20230414 and exception information is server precheck failed, please see the summary info above for details', 4]
[2023-04-14T10:33:56.187+0800] INFO - Rows affected: 1
[2023-04-14T10:33:56.229+0800] ERROR - Failed to execute job 15 for task precheck (server precheck failed, please see the summary info above for details; 5393)
[2023-04-14T10:33:56.261+0800] INFO - Task exited with return code 1
[2023-04-14T10:33:56.305+0800] INFO - 0 downstream tasks scheduled from follow-on schedule check
1 个赞

稍等,已经转发给相关负责人员

由于任务失败,点了放弃任务,提示说是会逆向清理服务器初始化时的内容,可是结束后,服务上的所有文件和安装的东西都没清理,比如创建的admin用户,安装的docker等。。。

oat是商业版相关的工具链,咨询下你们的支持吧

1 个赞


这样可以直接跳过去

3 个赞

牛!