【Environment】Production
【OB or other components】
【Version】ocp-4.0.0-ce-bp1-x86_64
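【Problem description】OCP installation fails with KeyError: 'options' raised in metadb_init.py (full log and traceback below).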
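【Steps to reproduce】
The OCP container is not running, so the logs inside the container cannot be viewed. The processes still alive are observer and obproxy.
Error details: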
2023-04-15 10:48:02 - INFO - 1 - [ob_install.py:100] - deploy obcluster command: obd cluster autodeploy obcluster -c /tmp/ocp_cluster.yaml
2023-04-15 10:49:01 - INFO - 1 - [ob_install.py:71] - create meta tenant with command obd cluster tenant create obcluster -n meta_tenant --max-cpu 2 --min-cpu 2 --max-memory 4294967296 --min-memory 4294967296
2023-04-15 10:49:05 - INFO - 1 - [ob_install.py:76] - create monitor tenant with command obd cluster tenant create obcluster -n monitor_tenant --max-cpu 4 --min-cpu 4 --max-memory 8589934592 --min-memory 8589934592
2023-04-15 10:49:07 - INFO - 1 - [ob_install.py:124] - create meta user
2023-04-15 10:49:07 - INFO - 1 - [ob_install.py:126] - grant privilege to meta user
2023-04-15 10:49:07 - INFO - 1 - [ob_install.py:128] - set meta user password
2023-04-15 10:49:07 - INFO - 1 - [ob_install.py:137] - create monitor user
2023-04-15 10:49:08 - INFO - 1 - [ob_install.py:139] - grant privilege to monitor user
2023-04-15 10:49:08 - INFO - 1 - [ob_install.py:141] - set monitor user password
2023-04-15 10:49:08 - INFO - 1 - [ocp_precheck.py:25] - ocp precheck using command: sudo /tmp/precheck-d9954562-a5d9-4a0d-892f-495a55032f24.sh -m ocp
2023-04-15 10:49:08 - INFO - 1 - [ocp_precheck.py:31] - precheck for ocp on host xx.xx.xx.10 success
2023-04-15 10:49:08 - INFO - 1 - [ocp_precheck.py:32] - ocp precheck result: Machine Role: ocp
Peer IP List:
Machine Type: PHY
Inspect Mode: FALSE
check CPU count: 8 > 8 ... PASS
check total MEM: 125 GB > 32 GB ... PASS
check SELinux status: Disabled ... PASS
check account [admin] and home dir, exist ... PASS
check service [firewalld]: inactive ... PASS
check service [firewalld]: disabled ... PASS
check docker version: 23.0.3 ... PASS
,
2023-04-15 10:49:08 - INFO - 1 - [image_load.py:21] - check image got result: 0 43808a7c5a36
2023-04-15 10:49:08 - INFO - 1 - [image_load.py:23] - image reg.docker.alibaba-inc.com/oceanbase/ocp-all-in-one:4.0.0-ce-bp1-20230119103000 already loaded
2023-04-15 10:49:08 - INFO - 1 - [metadb_init.py:13] - run with context: {'precheck_ignore': True, 'create_metadb_cluster': True, 'clean_metadb_cluster': False, 'ob_cluster': {'name': 'obcluster', 'home_path': '/home/admin/oceanbase', 'root_password': 'xxxx', 'data_path': '/data', 'redo_path': '/redo', 'sql_port': 2881, 'rpc_port': 2882, 'zones': [{'name': 'zone1', 'servers': ['xx.xx.xx.10']}], 'meta': {'tenant': 'meta_tenant', 'user': 'meta_user', 'password': 'xxxx', 'database': 'meta_database', 'cpu': 2, 'memory': 4}, 'monitor': {'tenant': 'monitor_tenant', 'user': 'monitor_user', 'password': 'xxxx', 'database': 'monitor_database', 'cpu': 4, 'memory': 8}}, 'obproxy': {'home_path': '/home/admin/obproxy', 'port': 2883, 'servers': ['xx.xx.xx.10'], 'vip': {'address': 'xx.xx.xx.10', 'port': 2883}}, 'ssh': {'port': 22, 'user': 'root', 'auth_method': 'password', 'timeout': 60, 'password': 'xxxx', 'key_file': '/root/keys/id_rsa'}, 'ocp': {'name': 'ocp', 'process': {'port': 8080, 'log_dir': '/home/admin/ocp/log'}, 'servers': ['xx.xx.xx.10'], 'resource': {'cpu': 4, 'memory': 8}, 'auth': {'user': 'admin', 'password': 'xxxx'}, 'metadb': {'host': 'xx.xx.xx.10', 'port': 2883, 'meta_user': 'meta_user@meta_tenant#obcluster', 'meta_password': 'xxxx', 'meta_database': 'meta_database', 'monitor_user': 'monitor_user@monitor_tenant#obcluster', 'monitor_password': 'xxxx', 'monitor_database': 'monitor_database'}, 'image': '/root/ocp.tar.gz', 'vip': {'address': 'xx.xx.xx.10', 'port': 8080}, 'image_tag': 'reg.docker.alibaba-inc.com/oceanbase/ocp-all-in-one:4.0.0-ce-bp1-20230119103000'}, 'operation': 'install'}
Traceback (most recent call last):
  File "pipeline/install_ocp.py", line 43, in <module>
    run(context)
  File "pipeline/install_ocp.py", line 33, in run
    install_ocp_pipeline.run()
  File "/root/installer/core/pipeline.py", line 10, in run
    task.run()
  File "/root/installer/task/metadb_init.py", line 31, in run
    ocp_options['ocp.web.tracking.enabled'] = self.context['ocp']['options']['enable_web_tracking']
KeyError: 'options'
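The traceback shows metadb_init.py reading self.context['ocp']['options']['enable_web_tracking'], while the context printed in the log has no 'options' key under 'ocp'. The following is a minimal Python sketch of that lookup, using a trimmed copy of the logged context. The function names and the fallback default of False are assumptions for illustration only; this is not the installer's code.

```python
# Trimmed copy of the 'ocp' part of the context printed in the log.
# Note that it contains no 'options' key.
context = {
    "ocp": {
        "name": "ocp",
        "process": {"port": 8080, "log_dir": "/home/admin/ocp/log"},
        "resource": {"cpu": 4, "memory": 8},
    }
}


def strict_lookup(ctx):
    """Same access pattern as the line shown in the traceback."""
    return ctx["ocp"]["options"]["enable_web_tracking"]


def tolerant_lookup(ctx, default=False):
    """Hypothetical variant that falls back to a default when 'options' is absent.

    The default value is an assumption, not documented installer behaviour.
    """
    return ctx["ocp"].get("options", {}).get("enable_web_tracking", default)


try:
    strict_lookup(context)
except KeyError as exc:
    # The missing key reported here is 'options', as in the traceback.
    print("KeyError:", exc)

print("tolerant lookup:", tolerant_lookup(context))
```

If the installer really expects this section, the lookup suggests an options block with an enable_web_tracking entry under ocp in the config file, but that should be confirmed against the config template shipped with this OCP version.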
Configuration:
# OCP deploy config
# Note:
# Do not use 127.0.0.1 or hostname as server address
# When a server has both public ip and private ip, if private ip is connectable, use private ip for faster connection
# If a vip is configured, it should be already created and bonded to the right server and port, the installation script won't do any work with vip maintenance, just use it to connect to the service
# Ignore precheck errors
# It's recommended to not ignore precheck errors
precheck_ignore: true
# Create an obcluster as OCP's metadb
create_metadb_cluster: true
# Clean OCP's metadb cluster when uninstall
clean_metadb_cluster: false
# Metadb cluster deploy config
ob_cluster:
  name: obcluster
  home_path: /home/admin/oceanbase
  root_password: 'xxxx'
  # The directory for data storage, it's recommended to use an independent path
  data_path: /data
  # The directory for clog, ilog, and slog, it's recommended to use an independent path.
  redo_path: /redo
  sql_port: 2881
  rpc_port: 2882
  zones:
    - name: zone1
      servers:
        - xx.xx.xx.10
  # Meta user info
  meta:
    tenant: meta_tenant
    user: meta_user
    password: 'xxxx'
    database: meta_database
    cpu: 2
    # Memory configs in GB, 4 means 4GB
    memory: 4
  # Monitor user info
  monitor:
    tenant: monitor_tenant
    user: monitor_user
    password: 'xxxx'
    database: monitor_database
    cpu: 4
    # Memory configs in GB, 8 means 8GB
    memory: 8
# Obproxy to connect metadb cluster
obproxy:
  home_path: /home/admin/obproxy
  port: 2883
  servers:
    - xx.xx.xx.10
  # Vip is optional, if vip is not configured, one of obproxy servers' address will be used
  # vip:
  #   address: 1.1.1.1
  #   port: 2883
# Ssh auth config
ssh:
  port: 22
  user: root
  # auth method, support password and pubkey
  auth_method: password
  timeout: 60
  password: 'xxxx'
# OCP config
ocp:
  # ocp container's name
  name: 'ocp'
  # OCP process listen port and log dir on host
  process:
    port: 8080
    log_dir: /home/admin/ocp/log
  servers:
    - xx.xx.xx.10
  # OCP container's resource
  resource:
    cpu: 4
    # Memory configs in GB, 8 means 8GB
    memory: 8
  # Vip is optional, if vip is not configured, one of ocp servers' address will be used
  # vip:
  #   address: 1.1.1.1
  #   port: 8080
  # OCP basic auth config, used when upgrade ocp
  auth:
    user: admin
    password: xxxx
  # OCP metadb config, for ocp installation, if "create_metadb_cluster" is configured true, this part will be replaced with the configuration of metadb cluster and obproxy
  metadb:
    host: xx.xx.xx.10
    port: 2883
    meta_user: meta_user@meta_tenant#obcluster
    meta_password: 'xxxx'
    meta_database: meta_database
    monitor_user: monitor_user@monitor_tenant#obcluster
    monitor_password: 'xxxx'
    monitor_database: monitor_database
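The config above has no options section under ocp, which matches the key reported in the error. As a rough pre-flight check, a sketch like the following could walk a nested config dict along a dotted key path and report where the lookup first fails. The helper name first_missing_key and the trimmed config dict are hypothetical, written only for illustration; loading the YAML file itself is left out.

```python
def first_missing_key(config, dotted_path):
    """Return the dotted prefix at which the lookup fails, or None if the path exists."""
    node = config
    walked = []
    for key in dotted_path.split("."):
        walked.append(key)
        if not isinstance(node, dict) or key not in node:
            return ".".join(walked)
        node = node[key]
    return None


# Trimmed stand-in for the parsed config in this post (assumption: same nesting).
config = {
    "ocp": {
        "name": "ocp",
        "auth": {"user": "admin"},
        "metadb": {"host": "xx.xx.xx.10", "port": 2883},
    }
}

for path in ("ocp.metadb.port", "ocp.options.enable_web_tracking"):
    missing = first_missing_key(config, path)
    print(path, "->", "ok" if missing is None else "missing at " + missing)
```

Run against the trimmed config, the second path is reported as missing at ocp.options, the same key named in the KeyError.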
【Symptoms and impact】
【Attachments】