oceanbase集群加入机器报错

oceanbase版本社区版4.2.1.3:
我在往集群里面添加机器的时候报错ALTER SYSTEM ADD SERVER ‘10.201.171.35:2882’ ZONE ‘zone2’;应该怎么处理,我新的机器没有写数据啊
ERROR 4179 (HY000): add non-empty server “10.201.171.35:2882” not allowed,
详细报错如下
] INFO [RS] add_server (ob_root_service.cpp:7213) [14580][T1_L0_G0][T1][YB420AC9A413-0006139BF212CED7-0-0] [lt=34] sys tenant data version >= 4.2, add_server(arg={servers:[“10.201.171.35:2882”], zone:“zone2”, force_stop:false, op:0}, timeout_ts=9999650)
[2024-03-14 21:19:04.709147] INFO [STORAGE] ~ObStorageTableGuard (ob_storage_table_guard.cpp:153) [14580][T1_L0_G0][T1][YB420AC9A413-0006139BF212CED7-0-0] [lt=43] throttle statics(need_speed_limit=false, last_throttle_status=false, last_print_log_time=1710421448384608, stat={total_throttle_time_us:0, total_skip_throttle_time_us:0, last_log_timestamp:1710422344708132, last_throttle_status:false, 0=0, 1=0, 2=0, 3=0})
[2024-03-14 21:19:04.709273] INFO [SHARE] fetch_new_max_id (ob_max_id_fetcher.cpp:274) [14580][T1_L0_G0][T1][YB420AC9A413-0006139BF212CED7-0-0] [lt=43] fetch_new_max_id(ret=0, ret=“OB_SUCCESS”, size=1, tenant_id=1, fetch_id=104, max_id_type=4, fetch_max_id_type=4, id=18446744073709551615, initial=18446744073709551615)
[2024-03-14 21:19:04.709799] INFO [SERVER] execute_write_inner (ob_inner_sql_connection.cpp:1546) [14580][T1_L0_G0][T1][YB420AC9A413-0006139BF212CED7-0-0] [lt=30] execute write sql(ret=0, tenant_id=1, affected_rows=1, sql=UPDATE __all_sys_stat SET VALUE = ‘105’, gmt_modified = now(6) WHERE ZONE = ‘’ AND NAME = ‘ob_max_used_server_id’ AND TENANT_ID = 0)
[2024-03-14 21:19:04.711836] WDIAG [RS] add_servers (ob_server_zone_op_service.cpp:151) [14580][T1_L0_G0][T1][YB420AC9A413-0006139BF212CED7-0-0] [lt=28][errcode=-4179] adding non-empty server is not allowed(ret=-4179, ret=“OB_OP_NOT_ALLOW”)
[2024-03-14 21:19:04.711867] WDIAG add_servers (ob_server_zone_op_service.cpp:152) [14580][T1_L0_G0][T1][YB420AC9A413-0006139BF212CED7-0-0] [lt=27][errcode=-4179] add non-empty server “10.201.171.35:2882” not allowed
[2024-03-14 21:19:04.711880] WDIAG [RS] add_server (ob_root_service.cpp:7215) [14580][T1_L0_G0][T1][YB420AC9A413-0006139BF212CED7-0-0] [lt=10][errcode=-4179] fail to add servers(ret=-4179, ret=“OB_OP_NOT_ALLOW”, arg={servers:[“10.201.171.35:2882”], zone:“zone2”, force_stop:false, op:0})
[2024-03-14 21:19:04.713356] INFO [RS] load_server_statuses (ob_server_manager.cpp:1463) [14580][T1_L0_G0][T1][YB420AC9A413-0006139BF212CED7-0-0] [lt=37] update server admin status, before update(server=“10.201.164.19:2882”, status={server:“10.201.164.19:2882”, id:2, zone:“zone2”, build_version:“4.2.1.3_103020042024020317-73d0496c8c63179a37214ed26dee718280569ac9(Feb 3 2024 17:21:33)”, sql_port:2881, register_time:0, last_hb_time:1710422344569015, block_migrate_in_time:0, stop_time:0, start_service_time:1710410160802529, last_offline_time:0, last_server_behind_time:0, last_round_trip_time:0, admin_status:“NORMAL”, hb_status:“lease_expired”, with_rootserver:true, with_partition:true, resource_info:{cpu_capacity:80, cpu_assigned:12, cpu_assigned_max:12, mem_capacity:“224GB”, mem_assigned:“34GB”, mem_in_use:0GB, log_disk_capacity:5120GB, log_disk_assigned:110GB, data_disk_capacity:5120GB, data_disk_in_use:0.53515625GB}, leader_cnt:-1, server_report_status:0, lease_expire_time:1710410189974648, ssl_key_expired_time:0, in_recovery_for_takenover_by_rs:false})
[2024-03-14 21:19:04.713448] INFO [RS] load_server_statuses (ob_server_manager.cpp:1474) [14580][T1_L0_G0][T1][YB420AC9A413-0006139BF212CED7-0-0] [lt=90] update server admin status, after update(server=“10.201.164.19:2882”, status={server:“10.201.164.19:2882”, id:2, zone:“zone2”, build_version:“4.2.1.3_103020042024020317-73d0496c8c63179a37214ed26dee718280569ac9(Feb 3 2024 17:21:33)”, sql_port:2881, register_time:0, last_hb_time:1710422344569015, block_migrate_in_time:0, stop_time:0, start_service_time:1710410160802529, last_offline_time:0, last_server_behind_time:0, last_round_trip_time:0, admin_status:“NORMAL”, hb_status:“lease_expired”, with_rootserver:true, with_partition:true, resource_info:{cpu_capacity:80, cpu_assigned:12, cpu_assigned_max:12, mem_capacity:“224GB”, mem_assigned:“34GB”, mem_in_use:0GB, log_disk_capacity:5120GB, log_disk_assigned:110GB, data_disk_capacity:5120GB, data_disk_in_use:0.53515625GB}, leader_cnt:-1, server_report_status:0, lease_expire_time:1710410189974648, ssl_key_expired_time:0, in_recovery_for_takenover_by_rs:false})
[2024-03-14 21:19:04.713493] INFO [RS] submit_update_all_server_task (ob_root_service.cpp:1368) [14580][T1_L0_G0][T1][YB420AC9A413-0006139BF212CED7-0-0] [lt=41] self is RS and self status change, submit update rslist task(server=“10.201.164.19:2882”)
[2024-03-14 21:19:04.713537] INFO [RS] submit_update_rslist_task (ob_root_service.cpp:1665) [14580][T1_L0_G0][T1][YB420AC9A413-0006139BF212CED7-0-0] [lt=12] added async task to update rslist(force_update=false)
[2024-03-14 21:19:04.713557] INFO [RS] on_server_status_change (ob_root_service.cpp:181) [14580][T1_L0_G0][T1][YB420AC9A413-0006139BF212CED7-0-0] [lt=18] on_server_status_change finish(ret=0, ret=“OB_SUCCESS”, server=“10.201.164.19:2882”)
[2024-03-14 21:19:04.713571] INFO [RS] load_server_statuses (ob_server_manager.cpp:1463) [14580][T1_L0_G0][T1][YB420AC9A413-0006139BF212CED7-0-0] [lt=13] update server admin status, before update(server=“10.201.172.14:2882”, status={server:“10.201.172.14:2882”, id:1, zone:“zone1”, build_version:“4.2.1.3_103020042024020317-73d0496c8c63179a37214ed26dee718280569ac9(Feb 3 2024 17:21:33)”, sql_port:2881, register_time:0, last_hb_time:1710422344330628, block_migrate_in_time:0, stop_time:0, start_service_time:1710410159083441, last_offline_time:0, last_server_behind_time:0, last_round_trip_time:0, admin_status:“NORMAL”, hb_status:“lease_expired”, with_rootserver:false, with_partition:true, resource_info:{cpu_capacity:80, cpu_assigned:12, cpu_assigned_max:12, mem_capacity:“224GB”, mem_assigned:“34GB”, mem_in_use:0GB, log_disk_capacity:5120GB, log_disk_assigned:110GB, data_disk_capacity:5120GB, data_disk_in_use:0.5390625GB}, leader_cnt:-1, server_report_status:0, lease_expire_time:1710410189974657, ssl_key_expired_time:0, in_recovery_for_takenover_by_rs:false})
[2024-03-14 21:19:04.713609] INFO [RS] load_server_statuses (ob_server_manager.cpp:1474) [14580][T1_L0_G0][T1][YB420AC9A413-0006139BF212CED7-0-0] [lt=37] update server admin status, after update(server=“10.201.172.14:2882”, status={server:“10.201.172.14:2882”, id:1, zone:“zone1”, build_version:“4.2.1.3_103020042024020317-73d0496c8c63179a37214ed26dee718280569ac9(Feb 3 2024 17:21:33)”, sql_port:2881, register_time:0, last_hb_time:1710422344330628, block_migrate_in_time:0, stop_time:0, start_service_time:1710410159083441, last_offline_time:0, last_server_behind_time:0, last_round_trip_time:0, admin_status:“NORMAL”, hb_status:“lease_expired”, with_rootserver:false, with_partition:true, resource_info:{cpu_capacity:80, cpu_assigned:12, cpu_assigned_max:12, mem_capacity:“224GB”, mem_assigned:“34GB”, mem_in_use:0GB, log_disk_capacity:5120GB, log_disk_assigned:110GB, data_disk_capacity:5120GB, data_disk_in_use:0.5390625GB}, leader_cnt:-1, server_report_status:0, lease_expire_time:1710410189974657, ssl_key_expired_time:0, in_recovery_for_takenover_by_rs:false})
[2024-03-14 21:19:04.713645] INFO [RS] submit_update_all_server_task (ob_root_service.cpp:1368) [14580][T1_L0_G0][T1][YB420AC9A413-0006139BF212CED7-0-0] [lt=35] self is RS and self status change, submit update rslist task(server=“10.201.172.14:2882”)
[2024-03-14 21:19:04.713655] INFO [RS] try_lock (ob_update_rs_list_task.cpp:54) [14580][T1_L0_G0][T1][YB420AC9A413-0006139BF212CED7-0-0] [lt=9] update rslist task exist, do not submit again(cnt=2)
[2024-03-14 21:19:04.713666] WDIAG [RS] submit_update_rslist_task (ob_root_service.cpp:1671) [14580][T1_L0_G0][T1][YB420AC9A413-0006139BF212CED7-0-0] [lt=10][errcode=0] fail to submit update rslist task, need retry(force_update=false)
[2024-03-14 21:19:04.713676] INFO [RS] on_server_status_change (ob_root_service.cpp:181) [14580][T1_L0_G0][T1][YB420AC9A413-0006139BF212CED7-0-0] [lt=10] on_server_status_change finish(ret=0, ret=“OB_SUCCESS”, server=“10.201.172.14:2882”)
[2024-03-14 21:19:04.713747] INFO [SHARE] add_event (ob_event_history_table_operator.h:290) [14580][T1_L0_G0][T1][YB420AC9A413-0006139BF212CED7-0-0] [lt=18] event table add task(ret=0, event_table_name="__all_rootservice_event_history", sql=INSERT INTO all_rootservice_event_history (gmt_create, module, event, name1, value1, name2, value2, rs_svr_ip, rs_svr_port) VALUES (usec_to_time(1710422344713696), ‘server’, ‘load_servers’, ‘ret’, 0, ‘has_build’, 1, ‘10.201.164.19’, 2882))
[2024-03-14 21:19:04.713768] INFO [RS] add_server (ob_root_service.cpp:7226) [14580][T1_L0_G0][T1][YB420AC9A413-0006139BF212CED7-0-0] [lt=18] add server(ret=-4179, ret=“OB_OP_NOT_ALLOW”, arg={servers:[“10.201.171.35:2882”], zone:“zone2”, force_stop:false, op:0})
[2024-03-14 21:19:04.713785] WDIAG [RS] process
(ob_rs_rpc_processor.h:212) [14580][T1_L0_G0][T1][YB420AC9A413-0006139BF212CED7-0-0] [lt=16][errcode=-4179] process failed(ret=-4179)
[2024-03-14 21:19:04.713796] INFO [RS] process
(ob_rs_rpc_processor.h:232) [14580][T1_L0_G0][T1][YB420AC9A413-0006139BF212CED7-0-0] [lt=10] [DDL] execute ddl like stmt(ret=-4179, cost=6520, ddl_arg=NULL)

https://www.oceanbase.com/docs/common-oceanbase-database-cn-1000000000507927
按这个文档操作应该没问题

那我直接用obd部署报错ERROR 4179 (HY000): add non-empty server “10.201.171.35:2882” not allowed有问题吗。两种方式都是报同样的错误

上面有observer 进程或者脏文件?

目前只有一个observer进程,脏文件的话你是指的在哪里,data目录还是redo目录还是home_path还是哪里呢,不排除你说的这种可能,因为我安装过好多次有可能导致这个问题,这次直接用obd安装的,但是我看进程和用observer启动一样的,那我应该如何查找脏文件呢,
我的observer.log报错如下;
WDIAG [RPC] send (ob_poc_rpc_proxy.h:161) [14551][T1_L0_G0][T1][YB420AC9A413-0006139BF472C67E-0-0] [lt=11][errcode=-4179] execute rpc fail(addr=“10.201.164.19:2882”, pcode=267, ret=-4179)
[2024-03-15 11:08:16.915739] WDIAG log_user_error_and_warn (ob_poc_rpc_proxy.cpp:228) [14551][T1_L0_G0][T1][YB420AC9A413-0006139BF472C67E-0-0] [lt=42][errcode=-4179] add non-empty server “10.201.171.35:2882” not allowed
[2024-03-15 11:08:16.915772] WDIAG [SQL.ENG] execute (ob_alter_system_executor.cpp:622) [14551][T1_L0_G0][T1][YB420AC9A413-0006139BF472C67E-0-0] [lt=15][errcode=-4179] common rpc proxy add server failed(arg={servers:[“10.201.171.35:2882”], zone:“zone2”, force_stop:false, op:0}, ret=-4179)
[2024-03-15 11:08:16.915910] INFO [SHARE] add_event (ob_event_history_table_operator.h:290) [14551][T1_L0_G0][T1][YB420AC9A413-0006139BF472C67E-0-0] [lt=30] event table add task(ret=0, event_table_name=“all_server_event_history", sql=INSERT INTO all_server_event_history (gmt_create, module, event, name1, value1, name2, value2, name3, value3, name4, value4, svr_ip, svr_port) VALUES (usec_to_time(1710472096915809), ‘sql’, ‘execute_cmd’, ‘cmd_type’, 98, ‘sql_text’, X’414C5445522053595354454D2041444420534552564552202731302E3230312E3137312E33353A3238383227205A4F4E4520277A6F6E653227’, ‘return_code’, -4179, ‘tenant_id’, 1, ‘10.201.164.19’, 2882))
[2024-03-15 11:08:16.915944] WDIAG [SQL] open_cmd (ob_result_set.cpp:98) [14551][T1_L0_G0][T1][YB420AC9A413-0006139BF472C67E-0-0] [lt=31][errcode=-4179] execute cmd failed(ret=-4179)
[2024-03-15 11:08:16.915964] WDIAG [SQL] open (ob_result_set.cpp:157) [14551][T1_L0_G0][T1][YB420AC9A413-0006139BF472C67E-0-0] [lt=18][errcode=-4179] execute plan failed(ret=-4179)
[2024-03-15 11:08:16.915978] WDIAG [SERVER] response_result (ob_sync_cmd_driver.cpp:145) [14551][T1_L0_G0][T1][YB420AC9A413-0006139BF472C67E-0-0] [lt=9][errcode=-4179] close result set fail(cret=-4179)
[2024-03-15 11:08:16.916005] WDIAG [SERVER] after_func (ob_query_retry_ctrl.cpp:947) [14551][T1_L0_G0][T1][YB420AC9A413-0006139BF472C67E-0-0] [lt=16][errcode=-4179] [RETRY] check if need retry(v={force_local_retry:false, stmt_retry_times:0, local_retry_times:0, err
:-4179, err
:“OB_OP_NOT_ALLOW”, retry_type:0, client_ret:-4179}, need_retry=false, THIS_WORKER.can_retry()=true, v.ctx
.multi_stmt_item
={is_part_of_multi_stmt:false, seq_num:0, sql:“ALTER SYSTEM ADD SERVER ‘10.201.171.35:2882’ ZONE ‘zone2’”, batched_queries:NULL, is_ps_mode:false, ab_cnt:0})
[2024-03-15 11:08:16.916049] WDIAG [SERVER] response_result (ob_sync_cmd_driver.cpp:151) [14551][T1_L0_G0][T1][YB420AC9A413-0006139BF472C67E-0-0] [lt=43][errcode=-4179] result set open failed, check if need retry(ret=-4179, cli_ret=-4179, retry_ctrl_.need_retry()=0)
[2024-03-15 11:08:16.916116] INFO [SERVER] send_error_packet (obmp_packet_sender.cpp:319) [14551][T1_L0_G0][T1][YB420AC9A413-0006139BF472C67E-0-0] [lt=14] sending error packet(ob_error=-4179, client error=-4179, extra_err_info=NULL, lbt()=“0x117ed57c 0x96de922 0x968b695 0x4c0cfa3 0x499af9a 0x49942fd 0x498a995 0x4985d2e 0x95a7f04 0x11ab9452 0x7f7e9d79de25 0x7f7e9d4c7bad”)
WDIAG [SERVER] do_process (obmp_query.cpp:808) [14551][T1_L0_G0][T1][YB420AC9A413-0006139BF472C67E-0-0] [lt=23][errcode=-4179] execute query fail(ret=-4179, timeout_timestamp=1710472106908444)
rootservice.log报错如下:
INFO [RS] add_server (ob_root_service.cpp:7213) [14578][T1_L0_G0][T1][YB420AC9A413-0006139BF472C67E-0-0] [lt=47] sys tenant data version >= 4.2, add_server(arg={servers:[“10.201.171.35:2882”], zone:“zone2”, force_stop:false, op:0}, timeout_ts=9999545)
[2024-03-15 11:08:16.910789] INFO [COMMON] advance_clock (ob_fifo_arena.cpp:374) [14578][T1_L0_G0][T1][YB420AC9A413-0006139BF472C67E-0-0] [lt=35] current clock is (clock_=0, max_seq_=0, mem_limit=4341943490, cur_mem_hold=167772160, attr_.tenant_id_=1)
[2024-03-15 11:08:16.910844] INFO [STORAGE] ~ObStorageTableGuard (ob_storage_table_guard.cpp:153) [14578][T1_L0_G0][T1][YB420AC9A413-0006139BF472C67E-0-0] [lt=38] throttle statics(need_speed_limit=false, last_throttle_status=false, last_print_log_time=1710470254554846, stat={total_throttle_time_us:0, total_skip_throttle_time_us:0, last_log_timestamp:1710472096910359, last_throttle_status:false, 0=0, 1=0, 2=0, 3=0})
[2024-03-15 11:08:16.910956] INFO [SHARE] fetch_new_max_id (ob_max_id_fetcher.cpp:274) [14578][T1_L0_G0][T1][YB420AC9A413-0006139BF472C67E-0-0] [lt=33] fetch_new_max_id(ret=0, ret=“OB_SUCCESS”, size=1, tenant_id=1, fetch_id=204, max_id_type=4, fetch_max_id_type=4, id=18446744073709551615, initial=18446744073709551615)
[2024-03-15 11:08:16.911405] INFO [SERVER] execute_write_inner (ob_inner_sql_connection.cpp:1546) [14578][T1_L0_G0][T1][YB420AC9A413-0006139BF472C67E-0-0] [lt=27] execute write sql(ret=0, tenant_id=1, affected_rows=1, sql=UPDATE __all_sys_stat SET VALUE = ‘205’, gmt_modified = now(6) WHERE ZONE = ‘’ AND NAME = ‘ob_max_used_server_id’ AND TENANT_ID = 0)
[2024-03-15 11:08:16.913651] WDIAG [RS] add_servers (ob_server_zone_op_service.cpp:151) [14578][T1_L0_G0][T1][YB420AC9A413-0006139BF472C67E-0-0] [lt=27][errcode=-4179] adding non-empty server is not allowed(ret=-4179, ret=“OB_OP_NOT_ALLOW”)
[2024-03-15 11:08:16.913681] WDIAG add_servers (ob_server_zone_op_service.cpp:152) [14578][T1_L0_G0][T1][YB420AC9A413-0006139BF472C67E-0-0] [lt=25][errcode=-4179] add non-empty server “10.201.171.35:2882” not allowed
[2024-03-15 11:08:16.913703] WDIAG [RS] add_server (ob_root_service.cpp:7215) [14578][T1_L0_G0][T1][YB420AC9A413-0006139BF472C67E-0-0] [lt=20][errcode=-4179] fail to add servers(ret=-4179, ret=“OB_OP_NOT_ALLOW”, arg={servers:[“10.201.171.35:2882”], zone:“zone2”, force_stop:false, op:0})
[2024-03-15 11:08:16.915136] INFO [RS] load_server_statuses (ob_server_manager.cpp:1463) [14578][T1_L0_G0][T1][YB420AC9A413-0006139BF472C67E-0-0] [lt=33] update server admin status, before update(server=“10.201.164.19:2882”, status={server:“10.201.164.19:2882”, id:2, zone:“zone2”, build_version:“4.2.1.3_103020042024020317-73d0496c8c63179a37214ed26dee718280569ac9(Feb 3 2024 17:21:33)”, sql_port:2881, register_time:0, last_hb_time:1710472095769205, block_migrate_in_time:0, stop_time:0, start_service_time:1710410160802529, last_offline_time:0, last_server_behind_time:0, last_round_trip_time:0, admin_status:“NORMAL”, hb_status:“lease_expired”, with_rootserver:true, with_partition:true, resource_info:{cpu_capacity:80, cpu_assigned:12, cpu_assigned_max:12, mem_capacity:“224GB”, mem_assigned:“34GB”, mem_in_use:0GB, log_disk_capacity:5120GB, log_disk_assigned:110GB, data_disk_capacity:5120GB, data_disk_in_use:0.6640625GB}, leader_cnt:-1, server_report_status:0, lease_expire_time:1710410189974648, ssl_key_expired_time:0, in_recovery_for_takenover_by_rs:false})
[2024-03-15 11:08:16.915211] INFO [RS] load_server_statuses (ob_server_manager.cpp:1474) [14578][T1_L0_G0][T1][YB420AC9A413-0006139BF472C67E-0-0] [lt=73] update server admin status, after update(server=“10.201.164.19:2882”, status={server:“10.201.164.19:2882”, id:2, zone:“zone2”, build_version:“4.2.1.3_103020042024020317-73d0496c8c63179a37214ed26dee718280569ac9(Feb 3 2024 17:21:33)”, sql_port:2881, register_time:0, last_hb_time:1710472095769205, block_migrate_in_time:0, stop_time:0, start_service_time:1710410160802529, last_offline_time:0, last_server_behind_time:0, last_round_trip_time:0, admin_status:“NORMAL”, hb_status:“lease_expired”, with_rootserver:true, with_partition:true, resource_info:{cpu_capacity:80, cpu_assigned:12, cpu_assigned_max:12, mem_capacity:“224GB”, mem_assigned:“34GB”, mem_in_use:0GB, log_disk_capacity:5120GB, log_disk_assigned:110GB, data_disk_capacity:5120GB, data_disk_in_use:0.6640625GB}, leader_cnt:-1, server_report_status:0, lease_expire_time:1710410189974648, ssl_key_expired_time:0, in_recovery_for_takenover_by_rs:false})
[2024-03-15 11:08:16.915254] INFO [RS] submit_update_all_server_task (ob_root_service.cpp:1368) [14578][T1_L0_G0][T1][YB420AC9A413-0006139BF472C67E-0-0] [lt=40] self is RS and self status change, submit update rslist task(server=“10.201.164.19:2882”)
[2024-03-15 11:08:16.915278] INFO [RS] submit_update_rslist_task (ob_root_service.cpp:1665) [14578][T1_L0_G0][T1][YB420AC9A413-0006139BF472C67E-0-0] [lt=12] added async task to update rslist(force_update=false)
[2024-03-15 11:08:16.915293] INFO [RS] on_server_status_change (ob_root_service.cpp:181) [14578][T1_L0_G0][T1][YB420AC9A413-0006139BF472C67E-0-0] [lt=14] on_server_status_change finish(ret=0, ret=“OB_SUCCESS”, server=“10.201.164.19:2882”)
[2024-03-15 11:08:16.915308] INFO [RS] load_server_statuses (ob_server_manager.cpp:1463) [14578][T1_L0_G0][T1][YB420AC9A413-0006139BF472C67E-0-0] [lt=13] update server admin status, before update(server=“10.201.172.14:2882”, status={server:“10.201.172.14:2882”, id:1, zone:“zone1”, build_version:“4.2.1.3_103020042024020317-73d0496c8c63179a37214ed26dee718280569ac9(Feb 3 2024 17:21:33)”, sql_port:2881, register_time:0, last_hb_time:1710472095447003, block_migrate_in_time:0, stop_time:0, start_service_time:1710410159083441, last_offline_time:0, last_server_behind_time:0, last_round_trip_time:0, admin_status:“NORMAL”, hb_status:“lease_expired”, with_rootserver:false, with_partition:true, resource_info:{cpu_capacity:80, cpu_assigned:12, cpu_assigned_max:12, mem_capacity:“224GB”, mem_assigned:“34GB”, mem_in_use:0GB, log_disk_capacity:5120GB, log_disk_assigned:110GB, data_disk_capacity:5120GB, data_disk_in_use:0.658203125GB}, leader_cnt:-1, server_report_status:0, lease_expire_time:1710410189974657, ssl_key_expired_time:0, in_recovery_for_takenover_by_rs:false})
[2024-03-15 11:08:16.915351] INFO [RS] load_server_statuses (ob_server_manager.cpp:1474) [14578][T1_L0_G0][T1][YB420AC9A413-0006139BF472C67E-0-0] [lt=41] update server admin status, after update(server=“10.201.172.14:2882”, status={server:“10.201.172.14:2882”, id:1, zone:“zone1”, build_version:“4.2.1.3_103020042024020317-73d0496c8c63179a37214ed26dee718280569ac9(Feb 3 2024 17:21:33)”, sql_port:2881, register_time:0, last_hb_time:1710472095447003, block_migrate_in_time:0, stop_time:0, start_service_time:1710410159083441, last_offline_time:0, last_server_behind_time:0, last_round_trip_time:0, admin_status:“NORMAL”, hb_status:“lease_expired”, with_rootserver:false, with_partition:true, resource_info:{cpu_capacity:80, cpu_assigned:12, cpu_assigned_max:12, mem_capacity:“224GB”, mem_assigned:“34GB”, mem_in_use:0GB, log_disk_capacity:5120GB, log_disk_assigned:110GB, data_disk_capacity:5120GB, data_disk_in_use:0.658203125GB}, leader_cnt:-1, server_report_status:0, lease_expire_time:1710410189974657, ssl_key_expired_time:0, in_recovery_for_takenover_by_rs:false})
[2024-03-15 11:08:16.915412] INFO [RS] submit_update_all_server_task (ob_root_service.cpp:1368) [14578][T1_L0_G0][T1][YB420AC9A413-0006139BF472C67E-0-0] [lt=60] self is RS and self status change, submit update rslist task(server=“10.201.172.14:2882”)
[2024-03-15 11:08:16.915422] INFO [RS] try_lock (ob_update_rs_list_task.cpp:54) [14578][T1_L0_G0][T1][YB420AC9A413-0006139BF472C67E-0-0] [lt=9] update rslist task exist, do not submit again(cnt=2)
[2024-03-15 11:08:16.915433] WDIAG [RS] submit_update_rslist_task (ob_root_service.cpp:1671) [14578][T1_L0_G0][T1][YB420AC9A413-0006139BF472C67E-0-0] [lt=10][errcode=0] fail to submit update rslist task, need retry(force_update=false)
[2024-03-15 11:08:16.915443] INFO [RS] on_server_status_change (ob_root_service.cpp:181) [14578][T1_L0_G0][T1][YB420AC9A413-0006139BF472C67E-0-0] [lt=9] on_server_status_change finish(ret=0, ret=“OB_SUCCESS”, server=“10.201.172.14:2882”)
[2024-03-15 11:08:16.915504] INFO [SHARE] add_event (ob_event_history_table_operator.h:290) [14578][T1_L0_G0][T1][YB420AC9A413-0006139BF472C67E-0-0] [lt=9] event table add task(ret=0, event_table_name=”__all_rootservice_event_history", sql=INSERT INTO all_rootservice_event_history (gmt_create, module, event, name1, value1, name2, value2, rs_svr_ip, rs_svr_port) VALUES (usec_to_time(1710472096915453), ‘server’, ‘load_servers’, ‘ret’, 0, ‘has_build’, 1, ‘10.201.164.19’, 2882))
[2024-03-15 11:08:16.915528] INFO [RS] add_server (ob_root_service.cpp:7226) [14578][T1_L0_G0][T1][YB420AC9A413-0006139BF472C67E-0-0] [lt=21] add server(ret=-4179, ret=“OB_OP_NOT_ALLOW”, arg={servers:[“10.201.171.35:2882”], zone:“zone2”, force_stop:false, op:0})
[2024-03-15 11:08:16.915545] WDIAG [RS] process
(ob_rs_rpc_processor.h:212) [14578][T1_L0_G0][T1][YB420AC9A413-0006139BF472C67E-0-0] [lt=16][errcode=-4179] process failed(ret=-4179)
[2024-03-15 11:08:16.915557] INFO [RS] process
(ob_rs_rpc_processor.h:232) [14578][T1_L0_G0][T1][YB420AC9A413-0006139BF472C67E-0-0] [lt=10] [DDL] execute ddl like stmt(ret=-4179, cost=6532, ddl_arg=NULL)

需要提供下完整的操作步骤,obd261已经支持了obd扩容,可以考虑工具方式执行扩容。

单纯看报错 怀疑新加的节点做了初始化,这种情况无法加入节点。

  好的,我的obd版本是2.6.2,我就用配置文件部署了一下,就是obd  cluster  deploy  ob -c a.yaml , obd cluster start ob 然后配置文件如下:配置里面的区域和集群名都是我想加入的区域和集群名:

user:
username: root
password:
port: 29873
oceanbase-ce:
version: 4.2.1.3
release: 103020042024020317.el7
package_hash: 54ff92fc9c0f638b9644672ec4887b2d6d9353ae
10.201.xxxx:
zone: zone2
servers:

  • 10.201xxxxx
    global:
    appname: obcluster
    root_password: xxxxxx
    mysql_port: 2881
    rpc_port: 2882
    home_path: /disk/nvme1n1/oceanbase
    data_dir: /disk/nvme2n1/
    redo_dir: /disk/nvme3n1/
    devname: bond0
    datafile_size: 5120GB
    datafile_maxsize: 5120GB
    log_disk_size: 5120GB
    enable_syslog_recycle: ‘True’
    max_syslog_file_count: ‘8’
    memory_limit: 256GB
    system_memory: 32GB
    cpu_count: ‘80’
    cluster_id: ‘1’
    proxyro_password: xxxxxx
    enable_syslog_wf: true
    enable_sql_audit: true

我理解你是先obd部署了一个单机,并执行start 完成初始化,可正常使用。

然后使用obd 再次部署一个配置如上图的单机,也执行了start 完成初始化,可正常使用。

然后 sql 方式 add server 添加节点 是吧?

如果是上述场景 部署完成 执行 start 是会直接初始化的。 第二次部署的单机可以不执行start 后台sql add server 。

或者参看 obd 命令进行扩容 OceanBase分布式数据库-海量数据 笔笔算数

我第一次部署了一个集群,你的意思是第二次部署的单机的只是deploy一下对吧,那我添加上以后再start一下吗

是的 但不需要再start ,deploy 后进程是已经启动的,start 是进行元数据初始化过程。

扩容新节点 是需要一个部署完成的进程 非完整的数据库。

可以参看这个纯手撸的部署+扩容文章 OceanBase 社区5sxnd3_gaODQzNDgzMjU4LjE2NjM1NzU0MjE._ga_T35KTM57DZ*MTcxMDQ4MjAwNy4xOTYzLjEuMTcxMDQ4MjM1Mi42MC4wLjA.

好像不对,我只是deploy后报错,而且我看deploy那台的机器没有启动observer进程啊:
ALTER SYSTEM ADD SERVER ‘10.201.171.35:2882’ ZONE ‘zone2’;
ERROR 4006 (HY000): The object is not initialized

不好意思 说错了。

deploy 后是安装完成服务,可以手动启动下进程。

cd 安装目录(不能 ./observer )
执行 ./bin/observer
如果报错找不到资源库

#没有将 OceanBase 数据库的 LIB 加到环境变量 LD_LIBRARY_PATH 中(按实际部署路径填写)
echo ‘export LD_LIBRARY_PATH=$LD_LIBRARY_PATH:/home/admin/oceanbase/lib/’ >> ~/.bash_profile
export LD_LIBRARY_PATH=$LD_LIBRARY_PATH:/home/admin/oceanbase/lib/

直接./bin/observer 不报错起不来,看rootservice.log报错
ERROR issue_dba_error (ob_log.cpp:1853) [58834][observer][T0][Y0-0000000000000000-0-0] [lt=29][errcode=-4388] Unexpected internal error happen, please checkout the internal errcode(errcode=-4147, file=“ob_server.cpp”, line_no=1884, info=“some config setting is not valid”)
[2024-03-15 15:36:41.713167] ERROR issue_dba_error (ob_log.cpp:1853) [58834][observer][T0][Y0-0000000000000000-0-0] [lt=214][errcode=-4388] Unexpected internal error happen, please checkout the internal errcode(errcode=-4147, file=“ob_server.cpp”, line_no=263, info=“init config failed”)
[2024-03-15 15:36:41.743065] ERROR issue_dba_error (ob_log.cpp:1853) [58834][observer][T0][Y0-0000000000000000-0-0] [lt=10][errcode=-4388] Unexpected internal error happen, please checkout the internal errcode(errcode=-4147, file=“ob_server.cpp”, line_no=506, info="[OBSERVER_NOTICE] fail to init observer")
[2024-03-15 15:36:41.743186] ERROR issue_dba_error (ob_log.cpp:1853) [58834][observer][T0][Y0-0000000000000000-0-0] [lt=25][errcode=-4388] Unexpected internal error happen, please checkout the internal errcode(errcode=-4147, file=“main.cpp”, line_no=585, info=“observer init fail”)

看报错是有些参数未识别,日志太少看不出信息。

可以带参数启动 可以参看上面发的手撸部署扩容的文章,配置按部署时配置文件中的参数值写。

我把所有启动参数都带了,可以了,感谢,就是/bin/observer 带着启动参数启动