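【Environment】 Test environment
【OB or other component】
【Version】 3.1.5 (single-node deployment, IP: 172.17.151.120)
【Problem description】
After the services are started with the script below, observer fails a few minutes after startup, while obproxy starts normally. The last log lines from several failed start attempts are at the bottom of this post. The startup method is as follows: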
[root@dev020 ~]# more start-ob.sh
#!/bin/bash
start_observer(){
cd /data/oceanbase/obproxy/
./bin/obproxy
sleep 10
cd /usr/local/oceanbase/
./bin/observer
sleep 5
netstat -tnpl|grep 288
}
start_observer
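To make it easier to tell whether each process actually came up, the fixed `sleep` calls in the script above could be replaced with a polling check. The following is only a sketch under assumptions: the helper names `wait_until`, `is_running` and `start_all`, the timeout values, and the process names `obproxy` and `observer` passed to `pgrep -x` are illustrative and not part of the original script; the paths are the ones shown above.

```shell
#!/bin/bash

# Hypothetical helper: retry a check command once per second until it
# succeeds, giving up after the number of seconds passed as the first argument.
wait_until() {
    local timeout=$1
    shift
    local waited=0
    until "$@"; do
        if [ "$waited" -ge "$timeout" ]; then
            return 1
        fi
        sleep 1
        waited=$((waited + 1))
    done
    return 0
}

# Succeeds if a process with exactly this name exists
# (assumes the process names match the binary names).
is_running() {
    pgrep -x "$1" >/dev/null
}

# Same start order as the original script, but each step is verified.
start_all() {
    cd /data/oceanbase/obproxy/ && ./bin/obproxy
    wait_until 10 is_running obproxy || { echo "obproxy is not running" >&2; return 1; }

    cd /usr/local/oceanbase/ && ./bin/observer
    wait_until 30 is_running observer || { echo "observer is not running" >&2; return 1; }

    netstat -tnpl | grep 288
}

# Only start the services when explicitly requested, e.g. RUN_START=1 ./start-ob.sh
if [ "${RUN_START:-0}" = "1" ]; then
    start_all
fi
```

Since observer exits only some minutes after starting, running `is_running observer` again later (for example from a loop or cron) would show roughly when the process disappears, which can then be matched against the timestamps in observer.log.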
observer.log at the time observer exited:
[2024-09-12 11:42:59.134867] ERROR [CLOG] notify_scan_finished_ (ob_log_scan_runnable.cpp:660) [11641][1203][Y0-0000000000000000] [lt=10] [dc=0] invalid scan_confirmed_log_cnt(ret=-4016, ret=“OB_ERR_UNEXPECTED”, scan_confirmed_log_cnt=586, next_ilog_id=8355, last_replay_log_id=7474, pkey={tid:1099511627983, partition_id:15, part_cnt:0}) BACKTRACE:0x9a98e9e 0x986d141 0x233b8f6 0x233b51b 0x233b1b7 0x781a30e 0x773dc1e 0x773b7b5 0x2ca95d4 0x2cabf02 0x9820da5 0x981f792 0x981c24f
[2024-09-12 11:42:59.134902] ERROR [CLOG] do_scan_log_ (ob_log_scan_runnable.cpp:216) [11641][1203][Y0-0000000000000000] [lt=32] [dc=0] notify_scan_finished_ failed(ret=-4016) BACKTRACE:0x9a98e9e 0x986d141 0x22a7774 0x22a725b 0x22a6fc1 0x22a5c58 0x773bca7 0x2ca95d4 0x2cabf02 0x9820da5 0x981f792 0x981c24f
[2024-09-12 11:42:59.134912] ERROR [CLOG] do_scan_log_ (ob_log_scan_runnable.cpp:223) [11641][1203][Y0-0000000000000000] [lt=9] [dc=0] log scan runnable exit error(ret=-4016) BACKTRACE:0x9a98e9e 0x986d141 0x22a7774 0x22a725b 0x22a6fc1 0x22a5c58 0x773b958 0x2ca95d4 0x2cabf02 0x9820da5 0x981f792 0x981c24f
[2024-09-12 11:42:59.134925] ERROR on_fatal_error (ob_log_define.h:684) [11641][1203][Y0-0000000000000000] [lt=7] [dc=0] ret = -4016 BACKTRACE:0x9a98e9e 0x74ee390 0x773b97a 0x2ca95d4 0x2cabf02 0x9820da5 0x981f792 0x981c24f BACKTRACE:0x9a98e9e 0x986d141 0x986dda6 0x74ee3d3 0x773b97a 0x2ca95d4 0x2cabf02 0x9820da5 0x981f792 0x981c24f
[2024-09-12 11:45:21.321456] WARN [CLOG] handle (ob_clog_history_reporter.cpp:315) [13298][370][Y0-0000000000000000] [lt=3] [dc=0] exec partition task fail(ret=-4023, ret=“OB_EAGAIN”, partition_task={pkey:{tid:1101710651081588, partition_id:388, part_cnt:0}, head:{partition op str:“ONLINE”, svr:“172.17.151.120:2882”, start_log_id:37, start_log_timestamp_:1708450868572323, end_log_id:18446744073709551615, end_log_timestamp:-1, next:null}, tail:{partition op str:“ONLINE”, svr:“172.17.151.120:2882”, start_log_id:37, start_log_timestamp_:1708450868572323, end_log_id:18446744073709551615, end_log_timestamp:-1, next:null}})
[2024-09-12 11:45:21.321733] INFO [SHARE.SCHEMA] ob_schema_getter_guard.cpp:901 [13298][370][YB42AC119778-000621E3E8F9B042] [lt=8] [dc=0] table not exist(fetch_tenant_id=1, tenant_id=1002, database_id=1101710651031553, session_id=18446744073709551615, table_name=__all_clog_history_info_v2, is_index=false, snapshot_version=1, is_schema_split=0, schema_version=1, schema_mgr_tenant_id=0)
[2024-09-12 11:45:21.321780] WARN [CLOG] handle_online_op_ (ob_clog_history_reporter.cpp:1251) [13298][370][Y0-0000000000000000] [lt=10] [dc=0] fail to insert a clog history info record(ret=-5019, ret=“OB_TABLE_NOT_EXIST”, pkey={tid:1101710651081605, partition_id:0, part_cnt:0}, sql="REPLACE INTO oceanbase.__all_clog_history_info_v2 (table_id, partition_idx, partition_cnt, start_log_id, start_log_timestamp, svr_ip, svr_port, end_log_id, end_log_timestamp) VALUES(1101710651081605, 0, 0, 7473, 1708450582334832, ‘172.17.151.120’, 2882, 18446744073709551615, -1) ", online_op={partition op str:“ONLINE”, svr:“172.17.151.120:2882”, start_log_id:7473, start_log_timestamp:1708450582334832, end_log_id:18446744073709551615, end_log_timestamp:-1, next:null})
[2024-09-12 11:47:53.042725] INFO [SHARE.SCHEMA] ob_schema_getter_guard.cpp:901 [15817][370][YB42AC119778-000621E3F27DB528] [lt=7] [dc=0] table not exist(fetch_tenant_id=1, tenant_id=1001, database_id=1100611139403777, session_id=18446744073709551615, table_name=__all_clog_history_info_v2, is_index=false, snapshot_version=1, is_schema_split=0, schema_version=1, schema_mgr_tenant_id=0)
[2024-09-12 11:47:53.042733] INFO [SQL.RESV] ob_dml_resolver.cpp:6674 [15817][370][YB42AC119778-000621E3F27DB528] [lt=7] [dc=0] table not exist(tenant_id=1001, database_id=1100611139403777, table_name=__all_clog_history_info_v2, ret=-5019)
[2024-09-12 11:47:53.042738] INFO [SHARE.SCHEMA] ob_synonym_mgr.cpp:462 [15817][370][YB42AC119778-000621E3F27DB528] [lt=3] [dc=0] synonym is not exist(tenant_id=1001, database_id=1100611139403777, name=__all_clog_history_info_v2)
[2024-09-12 11:47:53.042768] WARN [SERVER] query (ob_inner_sql_connection.cpp:861) [15817][370][YB42AC119778-000621E3F27DB528] [lt=3] [dc=0] failed to process record(executor={ObIExecutor:, sql:"REPLACE INTO oceanbase.__all_clog_history_info_v2 (table_id, partition_idx, partition_cnt, start_log_id, start_log_timestamp, svr_ip, svr_port, end_log_id, end_log_timestamp) VALUES(1100611139404027, 0, 0, 7474, 1708450585983966, ‘172.17.151.120’, 2882, 18446744073709551615, -1) "}, record_ret=-5019, ret=-5019)
[2024-09-12 11:47:53.043195] INFO [SHARE.SCHEMA] ob_schema_getter_guard.cpp:901 [15817][370][YB42AC119778-000621E3F27DB52B] [lt=3] [dc=0] table not exist(fetch_tenant_id=1, tenant_id=1, database_id=1099511627777, session_id=18446744073709551615, table_name=__all_clog_history_info_v2, is_index=false, snapshot_version=1, is_schema_split=0, schema_version=1, schema_mgr_tenant_id=0)
[2024-09-12 11:47:53.043201] INFO [SQL.RESV] ob_dml_resolver.cpp:6674 [15817][370][YB42AC119778-000621E3F27DB52B] [lt=5] [dc=0] table not exist(tenant_id=1, database_id=1099511627777, table_name=__all_clog_history_info_v2, ret=-5019)
[2024-09-12 11:47:53.043602] INFO [SHARE.SCHEMA] ob_schema_getter_guard.cpp:901 [15817][370][YB42AC119778-000621E3F27DB52E] [lt=2] [dc=1] table not exist(fetch_tenant_id=1, tenant_id=1002, database_id=1101710651031553, session_id=18446744073709551615, table_name=__all_clog_history_info_v2, is_index=false, snapshot_version=1, is_schema_split=0, schema_version=1, schema_mgr_tenant_id=0)
[2024-09-12 11:47:53.043608] INFO [SQL.RESV] ob_dml_resolver.cpp:6674 [15817][370][YB42AC119778-000621E3F27DB52E] [lt=4] [dc=0] table not exist(tenant_id=1002, database_id=1101710651031553, table_name=__all_clog_history_info_v2, ret=-5019)
[2024-09-12 11:47:53.043916] WARN [SERVER] get_master_root_server (ob_service.cpp:3375) [15866][466][YB42AC119778-000621E3F26D2A13] [lt=3] [dc=0] not master rootserver(ret=-4638, master_rs=“172.17.151.120:2882”)
[2024-09-12 11:47:53.044073] WARN [CLOG] handle (ob_clog_history_reporter.cpp:315) [15817][370][Y0-0000000000000000] [lt=2] [dc=0] exec partition task fail(ret=-4023, ret=“OB_EAGAIN”, partition_task={pkey:{tid:1101710651031814, partition_id:0, part_cnt:0}, head:{partition op str:“ONLINE”, svr:“172.17.151.120:2882”, start_log_id:7474, start_log_timestamp_:1708450578878666, end_log_id:18446744073709551615, end_log_timestamp:-1, next:null}, tail:{partition op str:“ONLINE”, svr:“172.17.151.120:2882”, start_log_id:7474, start_log_timestamp_:1708450578878666, end_log_id:18446744073709551615, end_log_timestamp:-1, next:null}})
[2024-09-12 11:47:53.044280] WARN log_user_error_and_warn (ob_rpc_proxy.cpp:300) [15629][0][Y0-0000000000000000] [lt=1] [dc=0]
[2024-09-12 11:47:53.044410] WARN [SERVER] get_master_root_server (ob_service.cpp:3375) [15866][466][YB42AC119778-000621E3F26D2A18] [lt=4] [dc=0] not master rootserver(ret=-4638, master_rs=“172.17.151.120:2882”)
[2024-09-12 11:47:53.044561] INFO [SHARE] ob_inner_config_root_addr.cpp:172 [15629][0][Y0-0000000000000000] [lt=4] [dc=0] fetch addr_list &readonly_addr_list(ret=0, addr_list=[{server:“172.17.151.120:2882”, role:2, sql_port:2881, replica_type:0, reserved:0, property:{memstore_percent_:100}}], readonly_addr_list=[], cluster_type=1)
[2024-09-12 11:47:53.044722] WARN [SERVER] inner_close (ob_inner_sql_result.cpp:152) [15817][370][YB42AC119778-000621E3F27DB537] [lt=5] [dc=0] result set close failed(ret=-5019, need_retry=false)
[2024-09-12 11:47:53.044929] WARN log_user_error_and_warn (ob_rpc_proxy.cpp:300) [15629][0][Y0-0000000000000000] [lt=4] [dc=0]
[2024-09-12 11:47:53.045064] WARN [SERVER] get_master_root_server (ob_service.cpp:3375) [15866][466][YB42AC119778-000621E3F26D2A1F] [lt=3] [dc=0] not master rootserver(ret=-4638, master_rs=“172.17.151.120:2882”)
[2024-09-12 11:47:53.045230] WARN [RPC.OBRPC] rpc_call (ob_rpc_proxy.ipp:567) [15629][0][Y0-0000000000000000] [lt=3] [dc=0] execute rpc fail(ret=-4638, dst=“172.17.151.120:2882”)
[2024-09-12 11:47:53.045601] WARN log_user_error_and_warn (ob_rpc_proxy.cpp:300) [15629][0][Y0-0000000000000000] [lt=5] [dc=0]
[2024-09-12 11:47:53.045723] WARN [RPC.OBRPC] rpc_call (ob_rpc_proxy.ipp:567) [15629][0][YB42AC119778-000621E3F26D2A26] [lt=3] [dc=0] execute rpc fail(ret=-4638, dst=“172.17.151.120:2882”)
[2024-09-12 11:47:53.045886] WARN [SERVER] get_master_root_server (ob_service.cpp:3375) [15866][466][YB42AC119778-000621E3F26D2A28] [lt=3] [dc=0] not master rootserver(ret=-4638, master_rs=“172.17.151.120:2882”)
[2024-09-12 11:47:53.046052] WARN [SERVER] get_master_root_server (ob_service.cpp:3375) [15866][466][YB42AC119778-000621E3F26D2A2A] [lt=3] [dc=0] not master rootserver(ret=-4638, master_rs=“172.17.151.120:2882”)
[2024-09-12 11:47:53.046184] WARN log_user_error_and_warn (ob_rpc_proxy.cpp:300) [15629][0][Y0-0000000000000000] [lt=4] [dc=0]
[2024-09-12 11:47:53.046389] INFO [STORAGE] ob_pg_sstable_garbage_collector.cpp:188 [15761][262][Y0-0000000000000000] [lt=8] [dc=0] do one gc free sstable by queue(ret=0, free sstable cnt=0)
[2024-09-12 11:47:53.046513] WARN [SQL.RESV] resolve_table_relation_recursively (ob_dml_resolver.cpp:6639) [15817][370][YB42AC119778-000621E3F27DB53A] [lt=4] [dc=0] synonym not exist(tenant_id=1, database_id=1099511627777, table_name=__all_clog_history_info_v2, ret=-5019)
[2024-09-12 11:47:53.046692] WARN [RPC.OBRPC] rpc_call (ob_rpc_proxy.ipp:567) [15629][0][YB42AC119778-000621E3F26D2A2F] [lt=3] [dc=0] execute rpc fail(ret=-4638, dst=“172.17.151.120:2882”)
[2024-09-12 11:47:53.046840] WARN [STORAGE] get_curr_leader_and_memberlist (ob_partition_service.cpp:4548) [15629][0][Y0-0000000000000000] [lt=4] [dc=0] fail to get leader from election(ret=-7002, pkey={tid:1099511627777, partition_id:0, part_cnt:1})
[2024-09-12 11:47:53.046933] WARN log_user_error_and_warn (ob_rpc_proxy.cpp:300) [15629][0][Y0-0000000000000000] [lt=3] [dc=0]
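The excerpt above mixes a few ERROR lines (ret=-4016, ending in `on_fatal_error`) with many WARN and INFO lines, so a quick filter may help separate them. This is a minimal sketch, assuming the log lines start with a bracketed timestamp followed by the level, as in the excerpt; the function names `error_lines` and `ret_code_counts` are hypothetical.

```shell
#!/bin/bash

# Print only the ERROR-level lines of the given log file.
# Assumes each line begins with "[YYYY-MM-DD hh:mm:ss.uuuuuu] LEVEL".
error_lines() {
    grep -E '^\[[0-9-]+ [0-9:.]+\] ERROR' "$1"
}

# Count how often each numeric return code (ret=-NNNN) occurs in the file,
# most frequent first, printed as "ret=-NNNN COUNT".
ret_code_counts() {
    grep -oE 'ret=-[0-9]+' "$1" | sort | uniq -c | sort -rn | awk '{print $2, $1}'
}
```

For example, `error_lines log/observer.log` followed by `ret_code_counts log/observer.log`, run from the observer home directory; the `log/observer.log` location is an assumption and should be adjusted to wherever the log actually is.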
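【Steps to reproduce】 Operations performed before and after the problem occurred
【Attachments and logs】 Using obdiag, the OceanBase agile diagnostic tool, is recommended for collecting diagnostic information; see the link for details: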