ob租户存在长事务

【 使用环境 】生产环境
【 OB or 其他组件 】ob
【 使用版本 】ob 4.3.5.1
【问题描述】集群:myoceanbase,租户:szsjy 存在长事务。会话ID:0,事务ID:22484164,事务类型:unknown,事务创建时间:2025-11-23T17:21:31.857+08:00,事务最长持续时长:15 小时 10 分 41.9 秒。

select * from __all_server;

gmt_create |gmt_modified |svr_ip |svr_port|id|zone |inner_port|with_rootserver|status|block_migrate_in_time|build_version |stop_time|start_service_time|first_sessid|with_partition|last_offline_time|
-----------------------±----------------------±------------±-------±-±----±---------±--------------±-----±--------------------±----------------------------------------------------------------------------------------±--------±-----------------±-----------±-------------±----------------+
2025-10-11 19:10:41.333|2025-11-05 20:12:27.331|172.14.202.52| 2882| 1|zone1| 2881| 1|ACTIVE| 0|4.3.5.1_101010042025042417-0c7ffd37c2904f4d8191fb2d056738a93cce6d1d(Apr 24 2025 17:58:09)| 0| 1760181256716062| 0| 1| 0|
2025-10-11 19:10:41.355|2025-10-11 19:14:18.411|172.14.202.53| 2882| 2|zone2| 2881| 0|ACTIVE| 0|4.3.5.1_101010042025042417-0c7ffd37c2904f4d8191fb2d056738a93cce6d1d(Apr 24 2025 17:58:09)| 0| 1760181257480116| 0| 1| 0|
2025-10-11 19:10:41.372|2025-11-05 20:12:27.331|172.14.202.54| 2882| 3|zone3| 2881| 0|ACTIVE| 0|4.3.5.1_101010042025042417-0c7ffd37c2904f4d8191fb2d056738a93cce6d1d(Apr 24 2025 17:58:09)| 0| 1760181253424523| 0| 1| 0|

SELECT *
FROM __all_virtual_trans_stat
WHERE is_exiting !=1 AND part_trans_action <= 2 AND ctx_create_time < DATE_SUB(NOW(), INTERVAL 1200 SECOND)
LIMIT 100;

tenant_id|svr_ip |svr_port|trans_type|trans_id|session_id|scheduler_addr |is_decided|ls_id|participants|ctx_create_time |expired_time |ref_cnt|last_op_sn|pending_write|state|part_trans_action|trans_ctx_addr|mem_ctx_id|pending_log_size|flushed_log_size|role|is_exiting|coordinator|last_request_time |gtrid|bqual|format_id|start_scn |end_scn |rec_scn |transfer_blocking|busy_cbs|replay_complete|serial_log_final_scn|callback_list_stats |
---------±------------±-------±---------±-------±---------±-------------------±---------±----±-----------±----------------------±----------------------±------±---------±------------±----±----------------±-------------±---------±---------------±---------------±—±---------±----------±----------------------±----±----±--------±------------------±-------------------±------------------±----------------±-------±--------------±-------------------±-----------------------------------------------------------------------------------------+
1002|172.14.202.52| 2882| 0|22484164|3222103621|“172.14.202.54:2882”| 0| 1| |2025-11-23 17:21:31.857|2025-11-24 17:21:31.850| 2| 4| 0| 10| 2|0xfffed41db550| -1| 0| 0| 0| 0| -1|2025-11-23 17:21:31.857| | | -1|1763920803576769002|18446744073709551615|1763920803576769002| 0| 0| 1| -1|[“id, length, logged, removed, branch_removed, sync_scn”, [0,1,1,0,0,1763920803576769002]]|
1002|172.14.202.54| 2882| 0|22484164| 0|“172.14.202.54:2882”| 0| 1| |2025-11-24 02:00:03.799| | 2| 0| 0| 10| 1|0xfffee5359d50| -1| 0| 0| 1| 0| -1|2025-11-24 02:00:03.799| | | -1|1763920803576769002|18446744073709551615|1763920803576769002| 0| 0| 1| -1|[“id, length, logged, removed, branch_removed, sync_scn”, [0,1,1,0,0,0]] |
1002|172.14.202.53| 2882| 0|22484164| 0|“172.14.202.54:2882”| 0| 1| |2025-11-24 02:00:03.714| | 2| 0| 0| 10| 1|0xfffee826df50| -1| 0| 0| 1| 0| -1|2025-11-24 02:00:03.714| | | -1|1763920803576769002|18446744073709551615|1763920803576769002| 0| 0| 1| -1|[“id, length, logged, removed, branch_removed, sync_scn”, [0,1,1,0,0,0]] |
【复现路径】问题出现前后相关操作
【附件及日志】推荐使用OceanBase敏捷诊断工具obdiag收集诊断信息,详情参见链接(右键跳转查看):

【SOP系列 22 】——故障诊断第一步(自助诊断和诊断信息收集)

【备注】基于 LLM 和开源文档 RAG 的论坛小助手已开放测试,在发帖时输入 [@论坛小助手] 即可召唤小助手,欢迎试用!

1 个赞

推荐使用 obdiag 工具进行日志采集,这将有助于我们了解长事务的具体情况,并进一步定位问题。

日志采集场景: 长事务
日志采集命令: obdiag gather scene run --scene=observer.long_transaction

请您执行上述命令并提供采集回来的日志压缩包。为了更准确地帮助您解决问题,还需要您提供以下信息:

  1. 该长事务涉及哪些表和操作?
  2. 系统当前的负载情况如何?
  3. 该事务是否有定时任务触发,还是由特定的操作引起?
  4. 系统资源(如 CPU、内存、IO)的使用情况如何?
  5. 该事务之前是否也有过类似的长时间运行的情况?

感谢您的配合,我们会根据提供的信息进一步分析问题。

附上敏捷诊断工具 obdiag 使用帮助链接

2 个赞

:rightwards_hand: :rightwards_hand: :rightwards_hand: :rightwards_hand: :rightwards_hand: :rightwards_hand: :rightwards_hand:

SHOW VARIABLES like ‘version_comment’; 具体的版本信息查看一下 可以查看一下长事务的信息
SELECT *
FROM GV$OB_TRANSACTION_PARTICIPANTS
WHERE CTX_CREATE_TIME < date_sub(now(), INTERVAL 600 SECOND)
AND ROLE = ‘LEADER’
AND ACTION = ‘START’;

SHOW VARIABLES like ‘version_comment’;

Variable_name |Value |
---------------±---------------------------------------------------------------------------------------------------------------+
version_comment|OceanBase_CE 4.3.5.1 (r101010042025042417-0c7ffd37c2904f4d8191fb2d056738a93cce6d1d) (Built Apr 24 2025 17:58:09)|

SELECT *
FROM GV$OB_TRANSACTION_PARTICIPANTS
WHERE CTX_CREATE_TIME < date_sub(now(), INTERVAL 600 SECOND)
AND ROLE = ‘LEADER’
AND ACTION = ‘START’;

TENANT_ID|SVR_IP |SVR_PORT|SESSION_ID|SCHEDULER_ADDR |TX_TYPE |TX_ID |LS_ID|PARTICIPANTS|CTX_CREATE_TIME |TX_EXPIRED_TIME |STATE |ACTION|PENDING_LOG_SIZE|FLUSHED_LOG_SIZE|ROLE |COORD|LAST_REQUEST_TIME |FORMATID|GLOBALID|BRANCHID|
---------±------------±-------±---------±-------------------±--------±-------±----±-----------±----------------------±----------------------±-----±-----±---------------±---------------±-----±----±----------------------±-------±-------±-------+
1002|172.14.202.52| 2882|3221539899|“172.14.202.52:2882”|UNDECIDED|29051501| 1| |2025-11-24 17:21:31.765|2025-11-25 17:21:31.761|ACTIVE|START | 0| 0|LEADER| -1|2025-11-24 17:21:31.765| -1| | |

昨天清掉了 今天又有新的

INSERT /*+ monitor enable_parallel_dml parallel(1) opt_param(‘ddl_execution_id’, 1) opt_param(‘ddl_task_id’, 36033726) opt_param(‘enable_newsort’, ‘false’) use_px /INTO wdyf_szsjy_fastgpt.__idx_501228_idx_1764062492773017_index_snapshot_data_table (__key_17_1761297681614082, __vid_1761297679523485, __vector_17_1761297680423739, __data_17_1761297681614480) SELECT /+ index(modeldata primary) ob_ddl_schema_version(modeldata, 1764062764008680) */ __key_17_1761297681614082 AS __key_17_1761297681614082, __vid_1761297679523485 AS __vid_1761297679523485, __vector_17_1761297680423739 AS __vector_17_1761297680423739, __data_17_1761297681614480 AS __data_17_1761297681614480 from wdyf_szsjy_fastgpt.modeldata as of snapshot 1764062821489761000 order by 1

这是内部生成的sql吗 每天固定时间执行十几个小时

CREATE TABLE modeldata (
id bigint(20) NOT NULL AUTO_INCREMENT,
vector VECTOR(4096) DEFAULT NULL,
team_id varchar(50) NOT NULL,
dataset_id varchar(50) NOT NULL,
collection_id varchar(50) NOT NULL,
createtime timestamp NULL DEFAULT CURRENT_TIMESTAMP,
PRIMARY KEY (id),
KEY team_dataset_collection_index (team_id, dataset_id, collection_id) BLOCK_SIZE 16384 LOCAL,
KEY create_time_index (createtime) BLOCK_SIZE 16384 LOCAL,
VECTOR KEY vector_index (vector) WITH (DISTANCE = INNER_PRODUCT,
TYPE = HNSW,
M = 32,
EF_CONSTRUCTION = 128, LIB=VSAG, EF_SEARCH=64) BLOCK_SIZE 16384,
VECTOR KEY idx_1764062492773017 (vector) WITH (DISTANCE = INNER_PRODUCT,
TYPE = HNSW,
M = 32,
EF_CONSTRUCTION = 128, LIB=VSAG, EF_SEARCH=64) BLOCK_SIZE 16384
) ORGANIZATION INDEX AUTO_INCREMENT = 916044 AUTO_INCREMENT_MODE = ‘ORDER’ DEFAULT CHARSET = utf8mb4 ROW_FORMAT = DYNAMIC COMPRESSION = ‘zstd_1.3.8’ REPLICA_NUM = 3 BLOCK_SIZE = 16384 USE_BLOOM_FILTER = FALSE ENABLE_MACRO_BLOCK_BLOOM_FILTER = FALSE TABLET_SIZE = 134217728 PCTFREE = 0;

idx_1764062492773017 是数据库生成的吗 业务说没建这个

你们查到的长事务的语句 是这个插入的语句么?


差不到是哪个sql 看SESSION表猜测的


告警的是3222180127会话 但是他没有sql

SELECT * FROM oceanbase.GV$OB_PROCESSLIST WHERE STATE = ‘ACTIVE’ AND ID IN (SELECT session_id FROM GV$OB_TRANSACTION_PARTICIPANTS
WHERE CTX_CREATE_TIME < date_sub(now(), INTERVAL 600 SECOND)
AND ROLE = ‘LEADER’
AND ACTION = ‘START’);
–可以这样查一下 看看 是哪个语句


空的