ob集群创建租户失败-显示资源不足

【 使用环境 】测试环境
【 OB or 其他组件 】

【 使用版本 】
【问题描述】

使用OCP 部署了 ob4.3.5版本集群,每个节点64GB内存。设置 内存时候 添加了如下参数

添加配置参数 (根据实际情况填写)

memory_limit 16G
system_memory 8G

集群起来了,目前要创建odc租户,发现 无论如何选 都实现资源不足,如果调大集群内存,应该去哪里调整,亦或有其他办法可以解决?


如图:

1 个赞

你好,从当前信息看出开,机器配置可能是16c64g的?当前集群配置不太合理
1、memory_limit一般设置为服务器的70%-80%,用于分配整个集群的内存资源,包括system_memory
2、除去sys租户使用的cpu和内存资源后,普通租户还有10c可以分配
3、log_disk需要分配为内存大小的3-4倍,在调整memory_limit之后也需要调整log_disk_size大小
4、调整后,再次创建租户

1 个赞

memory_limit 设置 48GB ,创建了 odc_unit (Unit 规格 10c/6G/40GB),然后创建 odc 租户 失败了

下面是全部日志:

2025-03-13 17:10:47.571 INFO 28699 — [manual-subtask-executor12,5fb6fca1fafa34b9,6e1b9c0527bf1e0b] c.o.ocp.service.iam.user.UserService : user 100 login with organization 10000000

2

3

2025-03-13 17:10:47.576 INFO 28699 — [manual-subtask-executor12,5fb6fca1fafa34b9,6e1b9c0527bf1e0b] c.o.o.c.t.e.runner.JavaSubtaskRunner : Run subtask, id=1002428, context=Context{parallelIdx=-1, stringMap={tenant_id=1000002, tenant_name=odc, ob_tenant_parameter_map=log_transport_compress_all=1, task_instance_id=1002357, task_operation=execute, whitelist=172.17.151.125, target_tenant_status=NORMAL, cluster_id=1000001, old_password=xxx new_password=xxx system_variable_map=, unknown_type_param_map=default_load_mode=DISABLED$direct_load_allow_fallback=1, create_tenant_param_json={“charset”:“utf8mb4”,“collation”:“utf8mb4_general_ci”,“enableArbitration”:false,“loadType”:“HTAP”,“mode”:“MYSQL”,“name”:“odc”,“parameters”:[{“name”:“default_load_mode”,“value”:“DISABLED”},{“name”:“direct_load_allow_fallback”,“value”:“1”},{“name”:“log_transport_compress_all”,“parameterType”:“OB_TENANT_PARAMETER”,“value”:“1”}],“primaryZone”:“zone2,zone3”,“rootPassword”:"******",“saveToCredential”:true,“skipImportTenantInfo”:false,“whitelist”:“172.17.151.125”,“zones”:[{“name”:“zone1”,“replicaType”:“FULL”,“resourcePool”:{“unitCount”:1,“unitSpecName”:“odc_unit”}},{“name”:“zone2”,“replicaType”:“FULL”,“resourcePool”:{“unitCount”:1,“unitSpecName”:“odc_unit”}},{“name”:“zone3”,“replicaType”:“FULL”,“resourcePool”:{“unitCount”:1,“unitSpecName”:“odc_unit”}}]}, latest_execution_start_time=2025-03-13T17:10:47.549+08:00, sub_task_instance_name=Create ob tenant, sub_task_instance_id=1002428}, listMap={}}, executor=172.17.151.121

4

5

2025-03-13 17:10:47.588 INFO 28699 — [manual-subtask-executor12,5fb6fca1fafa34b9,6e1b9c0527bf1e0b] c.o.o.o.i.tenant.task.CreateTenantTask : begin create tenant, param=CreateTenantParam(name=odc, mode=MYSQL, primaryZone=zone2,zone3, charset=utf8mb4, collation=utf8mb4_general_ci, description=null, whitelist=172.17.151.125, timeZone=null, rootPassword=******, saveToCredential=true, enableArbitration=false, skipImportTenantInfo=false, serviceName=null, zones=[CreateTenantParam.ZoneParam(name=zone1, replicaType=FULL, resourcePool=CreateTenantParam.PoolParam(unitSpecName=odc_unit, unitCount=1)), CreateTenantParam.ZoneParam(name=zone2, replicaType=FULL, resourcePool=CreateTenantParam.PoolParam(unitSpecName=odc_unit, unitCount=1)), CreateTenantParam.ZoneParam(name=zone3, replicaType=FULL, resourcePool=CreateTenantParam.PoolParam(unitSpecName=odc_unit, unitCount=1))], parameters=[TenantParameterParam(name=default_load_mode, value=DISABLED, parameterType=null), TenantParameterParam(name=direct_load_allow_fallback, value=1, parameterType=null), TenantParameterParam(name=log_transport_compress_all, value=1, parameterType=OB_TENANT_PARAMETER)], clientToken=null, loadType=HTAP)

6

7

2025-03-13 17:10:47.630 INFO 28699 — [manual-subtask-executor12,5fb6fca1fafa34b9,6e1b9c0527bf1e0b] c.o.ocp.obsdk.connector.ConnectTemplate : [obsdk] sql: set ob_query_timeout = ?, args: [10000000]

8

9

2025-03-13 17:10:47.645 INFO 28699 — [manual-subtask-executor12,5fb6fca1fafa34b9,6e1b9c0527bf1e0b] c.o.ocp.obsdk.connector.ConnectTemplate : [obsdk] sql: CREATE RESOURCE UNIT config_odc_zone2_odc_unit_wix MAX_CPU = ?, MIN_CPU = ?, MEMORY_SIZE = ?, LOG_DISK_SIZE = ?, args: [10.0, 10.0, 6442450944, 42949672960]

10

11

2025-03-13 17:10:48.069 INFO 28699 — [manual-subtask-executor12,5fb6fca1fafa34b9,6e1b9c0527bf1e0b] c.o.ocp.obsdk.connector.ConnectTemplate : [obsdk] sql: set ob_query_timeout = ?, args: [10000000]

12

13

2025-03-13 17:10:48.076 INFO 28699 — [manual-subtask-executor12,5fb6fca1fafa34b9,6e1b9c0527bf1e0b] c.o.ocp.obsdk.connector.ConnectTemplate : [obsdk] sql: SELECT UNIT_CONFIG_ID, NAME, MAX_CPU, MIN_CPU, MEMORY_SIZE AS MAX_MEMORY, MEMORY_SIZE AS MIN_MEMORY, LOG_DISK_SIZE, MAX_IOPS, MIN_IOPS, IOPS_WEIGHT FROM oceanbase.DBA_OB_UNIT_CONFIGS WHERE NAME = ?, args: [config_odc_zone2_odc_unit_wix]

14

15

2025-03-13 17:10:48.086 INFO 28699 — [manual-subtask-executor12,5fb6fca1fafa34b9,6e1b9c0527bf1e0b] c.o.ocp.obsdk.connector.ConnectTemplate : [obsdk] sql: set ob_query_timeout = ?, args: [10000000]

16

17

2025-03-13 17:10:48.095 INFO 28699 — [manual-subtask-executor12,5fb6fca1fafa34b9,6e1b9c0527bf1e0b] c.o.ocp.obsdk.connector.ConnectTemplate : [obsdk] sql: CREATE RESOURCE POOL pool_odc_zone2_wix UNIT = ?, UNIT_NUM = ?, ZONE_LIST=(‘zone2’), args: [config_odc_zone2_odc_unit_wix, 1]

18

19

2025-03-13 17:10:48.927 INFO 28699 — [manual-subtask-executor12,5fb6fca1fafa34b9,6e1b9c0527bf1e0b] c.o.ocp.obsdk.connector.ConnectTemplate : [obsdk] sql: set ob_query_timeout = ?, args: [10000000]

20

21

2025-03-13 17:10:49.263 INFO 28699 — [manual-subtask-executor12,5fb6fca1fafa34b9,6e1b9c0527bf1e0b] c.o.ocp.obsdk.connector.ConnectTemplate : [obsdk] sql: SELECT /*+ QUERY_TIMEOUT(60000000) */ time_to_usec(t1.MODIFY_TIME) AS UPDATE_TIME, t1.RESOURCE_POOL_ID, t1.NAME, t1.UNIT_COUNT, t1.UNIT_CONFIG_ID, t1.ZONE_LIST, t1.TENANT_ID, t1.REPLICA_TYPE, t2.NAME AS UNIT_CONFIG_NAME, t2.MAX_CPU, t2.MIN_CPU, t2.MEMORY_SIZE AS MAX_MEMORY, t2.MEMORY_SIZE AS MIN_MEMORY, t2.LOG_DISK_SIZE, t2.MAX_IOPS, t2.MIN_IOPS, t2.IOPS_WEIGHT FROM oceanbase.DBA_OB_RESOURCE_POOLS AS t1 JOIN oceanbase.DBA_OB_UNIT_CONFIGS AS t2 ON t1.UNIT_CONFIG_ID = t2.UNIT_CONFIG_ID WHERE t1.name = ?, args: [pool_odc_zone2_wix]

22

23

2025-03-13 17:10:49.654 INFO 28699 — [manual-subtask-executor12,5fb6fca1fafa34b9,6e1b9c0527bf1e0b] c.o.o.o.i.helper.OcpCacheHelperImpl : Ob cluster resource has been changed, cluster 1000001, tenant null, notify ocp cache

24

25

2025-03-13 17:10:49.854 INFO 28699 — [manual-subtask-executor12,5fb6fca1fafa34b9,6e1b9c0527bf1e0b] c.o.o.o.i.tenant.task.CreateTenantTask : create resource pool success, resourcePoolList=[ResourcePool(id=1002, name=pool_odc_zone1_wvw, unitCount=1, unitConfig=UnitConfig(maxCpuCoreCount=10.0, minCpuCoreCount=10.0, maxMemoryByte=6442450944, minMemoryByte=6442450944, logDiskSizeByte=42949672960, maxIops=0, minIops=0, iopsWeight=10, name=config_odc_zone1_odc_unit_wvw), zoneList=[zone1]), ResourcePool(id=1003, name=pool_odc_zone2_wix, unitCount=1, unitConfig=UnitConfig(maxCpuCoreCount=10.0, minCpuCoreCount=10.0, maxMemoryByte=6442450944, minMemoryByte=6442450944, logDiskSizeByte=42949672960, maxIops=0, minIops=0, iopsWeight=10, name=config_odc_zone2_odc_unit_wix), zoneList=[zone2]), ResourcePool(id=1001, name=pool_odc_zone3_hpj, unitCount=1, unitConfig=UnitConfig(maxCpuCoreCount=10.0, minCpuCoreCount=10.0, maxMemoryByte=6442450944, minMemoryByte=6442450944, logDiskSizeByte=42949672960, maxIops=0, minIops=0, iopsWeight=10, name=config_odc_zone3_odc_unit_hpj), zoneList=[zone3])]

26

27

2025-03-13 17:10:50.180 INFO 28699 — [manual-subtask-executor12,5fb6fca1fafa34b9,6e1b9c0527bf1e0b] c.o.o.s.o.o.f.ConnectPropertiesBuilder : get credential from obsdk context, clusterName=ob435, tenantName=sys, dbUser=root

28

29

2025-03-13 17:10:50.287 INFO 28699 — [manual-subtask-executor12,5fb6fca1fafa34b9,6e1b9c0527bf1e0b] c.o.ocp.obsdk.connector.ConnectTemplate : [obsdk] sql: set ob_query_timeout = ?, args: [10000000]

30

31

2025-03-13 17:10:50.456 INFO 28699 — [manual-subtask-executor12,5fb6fca1fafa34b9,6e1b9c0527bf1e0b] c.o.ocp.obsdk.connector.ConnectTemplate : [obsdk] sql: CREATE TENANT odc resource_pool_list=(‘pool_odc_zone1_wvw’,‘pool_odc_zone2_wix’,‘pool_odc_zone3_hpj’), LOCALITY = ?, PRIMARY_ZONE = ?, CHARSET = ?, COLLATE = ? SET ob_tcp_invited_nodes=’%’, ob_compatibility_mode = ?, args: [FULL@zone1,FULL@zone2,FULL@zone3, zone2,zone3, utf8mb4, utf8mb4_general_ci, mysql]

32

33

2025-03-13 17:10:53.393 WARN 28699 — [manual-subtask-executor12,5fb6fca1fafa34b9,6e1b9c0527bf1e0b] c.o.ocp.obsdk.connector.ConnectTemplate : [obsdk] update failed, sql:[CREATE TENANT odc resource_pool_list=(‘pool_odc_zone1_wvw’,‘pool_odc_zone2_wix’,‘pool_odc_zone3_hpj’), LOCALITY = ?, PRIMARY_ZONE = ?, CHARSET = ?, COLLATE = ? SET ob_tcp_invited_nodes=’%’, ob_compatibility_mode = ?], error message:[PreparedStatementCallback; SQL [CREATE TENANT odc resource_pool_list=(‘pool_odc_zone1_wvw’,‘pool_odc_zone2_wix’,‘pool_odc_zone3_hpj’), LOCALITY = ?, PRIMARY_ZONE = ?, CHARSET = ?, COLLATE = ? SET ob_tcp_invited_nodes=’%’, ob_compatibility_mode = ?]; (conn=3221660214) IO error; nested exception is java.sql.SQLTransientConnectionException: (conn=3221660214) IO error]

34

35

2025-03-13 17:10:53.468 INFO 28699 — [manual-subtask-executor12,5fb6fca1fafa34b9,6e1b9c0527bf1e0b] c.o.ocp.obsdk.connector.ConnectTemplate : Last Trace Info:[YB42AC11977C-00063022764A892C-0-0]

36

37

2025-03-13 17:10:53.587 INFO 28699 — [manual-subtask-executor12,5fb6fca1fafa34b9,6e1b9c0527bf1e0b] c.o.ocp.obsdk.connector.ConnectTemplate : [obsdk] slow query, durationMillis=3131, sql=CREATE TENANT odc resource_pool_list=(‘pool_odc_zone1_wvw’,‘pool_odc_zone2_wix’,‘pool_odc_zone3_hpj’), LOCALITY = ?, PRIMARY_ZONE = ?, CHARSET = ?, COLLATE = ? SET ob_tcp_invited_nodes=’%’, ob_compatibility_mode = ?

38

39

2025-03-13 17:10:53.730 ERROR 28699 — [manual-subtask-executor12,5fb6fca1fafa34b9,6e1b9c0527bf1e0b] c.o.o.c.t.e.c.w.subtask.SubtaskExecutor : IO error

40

41

java.sql.SQLException: IO error

42

at com.oceanbase.jdbc.internal.protocol.AbstractQueryProtocol.readErrorPacket(AbstractQueryProtocol.java:2366)

43

at com.oceanbase.jdbc.internal.protocol.AbstractQueryProtocol.readPacket(AbstractQueryProtocol.java:2229)

44

at com.oceanbase.jdbc.internal.protocol.AbstractQueryProtocol.getResult(AbstractQueryProtocol.java:2117)

45

at com.oceanbase.jdbc.internal.protocol.AbstractQueryProtocol.executeQuery(AbstractQueryProtocol.java:399)

46

at com.oceanbase.jdbc.JDBC4PreparedStatement.executeInternal(JDBC4PreparedStatement.java:248)

47

at com.oceanbase.jdbc.JDBC4PreparedStatement.execute(JDBC4PreparedStatement.java:171)

48

at com.oceanbase.jdbc.JDBC4PreparedStatement.executeUpdate(JDBC4PreparedStatement.java:205)

49

at com.alibaba.druid.pool.DruidPooledPreparedStatement.executeUpdate(DruidPooledPreparedStatement.java:255)

50

at org.springframework.jdbc.core.JdbcTemplate.lambda$update$2(JdbcTemplate.java:967)

51

at org.springframework.jdbc.core.JdbcTemplate.execute(JdbcTemplate.java:650)

52

at org.springframework.jdbc.core.JdbcTemplate.update(JdbcTemplate.java:962)

53

at org.springframework.jdbc.core.JdbcTemplate.update(JdbcTemplate.java:1017)

54

at org.springframework.jdbc.core.JdbcTemplate.update(JdbcTemplate.java:1027)

55

at com.oceanbase.ocp.obsdk.connector.ConnectTemplate.updateInner(ConnectTemplate.java:293)

56

at com.oceanbase.ocp.obsdk.connector.ConnectTemplate.update(ConnectTemplate.java:264)

57

at com.oceanbase.ocp.obsdk.operator.tenant.MysqlTenantOperator.createTenant(MysqlTenantOperator.java:258)

58

at com.oceanbase.ocp.obops.internal.tenant.TenantOperationServiceImpl.createTenantOnResourcePool(TenantOperationServiceImpl.java:586)

59

at com.oceanbase.ocp.obops.internal.tenant.TenantOperationServiceImpl.createTenantOnResourcePool(TenantOperationServiceImpl.java:556)

60

at com.oceanbase.ocp.obops.internal.tenant.TenantOperationServiceImpl$$FastClassBySpringCG

61

LIB$$44eef4f8.invoke()

62

at org.springframework.cglib.proxy.MethodProxy.invoke(MethodProxy.java:218)

63

at org.springframework.aop.framework.CglibAopProxy$CglibMethodInvocation.invokeJoinpoint(CglibAopProxy.java:792)

64

at org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:163)

65

at org.springframework.aop.framework.CglibAopProxy$CglibMethodInvocation.proceed(CglibAopProxy.java:762)

66

at org.springframework.aop.interceptor.ExposeInvocationInterceptor.invoke(ExposeInvocationInterceptor.java:97)

67

at org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:186)

68

at org.springframework.aop.framework.CglibAopProxy$CglibMethodInvocation.proceed(CglibAopProxy.java:762)

69

at org.springframework.aop.framework.CglibAopProxy$DynamicAdvisedInterceptor.intercept(CglibAopProxy.java:707)

70

at com.oceanbase.ocp.obops.internal.tenant.TenantOperationServiceImpl$$EnhancerBySpringCGLIB$$8c0a6072.createTenantOnResourcePool()

71

at com.oceanbase.ocp.obops.internal.tenant.task.CreateTenantTask.run(CreateTenantTask.java:70)

72

at com.oceanbase.ocp.core.task.engine.runner.JavaSubtaskRunner.execute(JavaSubtaskRunner.java:64)

73

at com.oceanbase.ocp.core.task.engine.runner.JavaSubtaskRunner.doRun(JavaSubtaskRunner.java:32)

74

at com.oceanbase.ocp.core.task.engine.runner.JavaSubtaskRunner.run(JavaSubtaskRunner.java:26)

75

at com.oceanbase.ocp.core.task.engine.runner.RunnerFactory.doRun(RunnerFactory.java:76)

76

at com.oceanbase.ocp.core.task.engine.coordinator.worker.subtask.SubtaskExecutor.doRun(SubtaskExecutor.java:207)

77

at com.oceanbase.ocp.core.task.engine.coordinator.worker.subtask.SubtaskExecutor.redirectConsoleOutput(SubtaskExecutor.java:201)

78

at com.oceanbase.ocp.core.task.engine.coordinator.worker.subtask.SubtaskExecutor.lambda$submit$2(SubtaskExecutor.java:137)

79

at java.util.concurrent.FutureTask.run(FutureTask.java:266)

80

at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)

81

at java.util.concurrent.ThreadPoo

82

lExecutor$Worker.run(ThreadPoolExecutor.java:624)

83

at java.lang.Thread.run(Thread.java:748)

84

85

86

Set state for subtask: 1002428, operation:EXECUTE, state: FAILED

87

任务失败报错为:io error说明集群所在服务器的磁盘异常,看下OCP告警中心是否有IO类的告警。
建议,ODC的元数据租户分配小些,访问量不大2c4g即可,更多资源留给业务租户。

有个节点挂了 我重启服务器后 任务重新执行 然后卡在最后一直重试。。。

日志如下:

2025-03-13 17:34:09.822 INFO 28699 — [manual-subtask-executor12,5fb6fca1fafa34b9,79da434cd70f882b] c.o.ocp.service.iam.user.UserService : user 100 login with organization 10000000

2

3

2025-03-13 17:34:09.826 INFO 28699 — [manual-subtask-executor12,5fb6fca1fafa34b9,79da434cd70f882b] c.o.o.c.t.e.runner.JavaSubtaskRunner : Run subtask, id=1002425, context=Context{parallelIdx=-1, stringMap={tenant_id=1000002, ob_tenant_id=1004, tenant_name=odc, ob_tenant_parameter_map=log_transport_compress_all=1, task_instance_id=1002357, task_operation=execute, whitelist=172.17.151.125, target_tenant_status=NORMAL, resource_pool_list_json=[{“id”:1002,“name”:“pool_odc_zone1_wvw”,“unitConfig”:{“iopsWeight”:10,“logDiskSize”:40,“logDiskSizeByte”:42949672960,“maxCpuCoreCount”:10.00,“maxIops”:0,“maxMemoryByte”:6442450944,“maxMemorySize”:6,“minCpuCoreCount”:10.00,“minIops”:0,“minMemoryByte”:6442450944,“minMemorySize”:6},“unitCount”:1,“zoneList”:[“zone1”]},{“id”:1003,“name”:“pool_odc_zone2_wix”,“unitConfig”:{“iopsWeight”:10,“logDiskSize”:40,“logDiskSizeByte”:42949672960,“maxCpuCoreCount”:10.00,“maxIops”:0,“maxMemoryByte”:6442450944,“maxMemorySize”:6,“minCpuCoreCount”:10.00,“minIops”:0,“minMemoryByte”:6442450944,“minMemorySize”:6},“unitCount”:1,“zoneList”:[“zone2”]},{“id”:1001,“name”:“pool_odc_zone3_hpj”,“unitConfig”:{“iopsWeight”:10,“logDiskSize”:40,“logDiskSizeByte”:42949672960,“maxCpuCoreCount”:10.00,“maxIops”:0,“maxMemoryByte”:6442450944,“maxMemorySize”:6,“minCpuCoreCount”:10.00,“minIops”:0,“minMemoryByte”:6442450944,“minMemorySize”:6},“unitCount”:1,“zoneList”:[“zone3”]}], cluster_id=1000001, old_password=xxx new_password=xxx system_variable_map=, unknown_type_param_map=default_load_mode=DISABLED$direct_load_allow_fallback=1, create_tenant_param_json={“charset”:“utf8mb4”,“collation”:“utf8mb4_general_ci”,“enableArbitration”:false,“loadType”:“HTAP”,“mode”:“MYSQL”,“name”:“odc”,“parameters”:[{“name”:“default_load_mode”,“value”:“DISABLED”},{“name”:“direct_load_allow_fallback”,“value”:“1”},{“name”:“log_transport_compress_all”,“parameterType”:“OB_TENANT_PARAMETER”,“value”:“1”}],“primaryZone”:“zone2,zone3”,“rootPassword”:"******",“saveToCredential”:true,“skipImportTenantInfo”:false,“whitelist”:“172.17.151.125”,“zones”

4

:[{“name”:“zone1”,“replicaType”:“FULL”,“resourcePool”:{“unitCount”:1,“unitSpecName”:“odc_unit”}},{“name”:“zone2”,“replicaType”:“FULL”,“resourcePool”:{“unitCount”:1,“unitSpecName”:“odc_unit”}},{“name”:“zone3”,“replicaType”:“FULL”,“resourcePool”:{“unitCount”:1,“unitSpecName”:“odc_unit”}}]}, latest_execution_start_time=2025-03-13T17:34:09.806+08:00, sub_task_instance_name=Import tenant time zone info, sub_task_instance_id=1002425}, listMap={}}, executor=172.17.151.121

5

6

2025-03-13 17:34:09.880 INFO 28699 — [manual-subtask-executor12,5fb6fca1fafa34b9,79da434cd70f882b] c.o.o.c.agent.HostAgentServiceImpl : Finding OCP agent: hostId=1000003

7

8

2025-03-13 17:34:09.888 INFO 28699 — [manual-subtask-executor12,5fb6fca1fafa34b9,79da434cd70f882b] c.o.o.c.a.p.HostAgentProcessServiceImpl : Getting all OCP agent processes on host 1000003

9

10

2025-03-13 17:34:09.947 INFO 28699 — [manual-subtask-executor12,5fb6fca1fafa34b9,79da434cd70f882b] c.o.o.e.internal.template.HttpTemplate : POST request to agent, url:http://172.17.151.124:62888/api/v1/ob/importTimeZoneInfo, request body:ImportTimeZoneInfoRequest(obPath=ObPath(installPath=/usr/local/oceanbase, dataPath=/data, logPath=/redo, diskPathStyle=DEFAULT, runPath=/usr/local/oceanbase), address=172.17.151.124, port=2881, tenantName=odc), params:null

11

12

2025-03-13 17:34:09.958 INFO 28699 — [manual-subtask-executor12,5fb6fca1fafa34b9,79da434cd70f882b] c.o.o.e.internal.template.HttpTemplate : POST request to agent, url:http://172.17.151.124:62888/api/v1/task/status, request body:GetTaskStatusRequest(taskToken=5b31eefb-d371-4f84-b2d6-41c1778d379b), params:null

13

14

2025-03-13 17:34:09.962 INFO 28699 — [manual-subtask-executor12,5fb6fca1fafa34b9,79da434cd70f882b] c.o.o.s.task.util.AgentAsyncTaskHelper : try to request task result(EXECUTE), result:false,null,

15

16

2025-03-13 17:34:09.966 INFO 28699 — [manual-subtask-executor12,5fb6fca1fafa34b9,79da434cd70f882b] c.o.ocp.common.lang.pattern.Retry : wait for 10 seconds

17

18

2025-03-13 17:34:19.969 INFO 28699 — [manual-subtask-executor12,5fb6fca1fafa34b9,79da434cd70f882b] c.o.o.e.internal.template.HttpTemplate : POST request to agent, url:http://172.17.151.124:62888/api/v1/task/status, request body:GetTaskStatusRequest(taskToken=5b31eefb-d371-4f84-b2d6-41c1778d379b), params:null

19

20

2025-03-13 17:34:19.981 INFO 28699 — [manual-subtask-executor12,5fb6fca1fafa34b9,79da434cd70f882b] c.o.o.s.task.util.AgentAsyncTaskHelper : try to request task result(EXECUTE), result:false,null,

21

22

2025-03-13 17:34:19.990 INFO 28699 — [manual-subtask-executor12,5fb6fca1fafa34b9,79da434cd70f882b] c.o.ocp.common.lang.pattern.Retry : wait for 10 seconds

23

24

2025-03-13 17:34:29.996 INFO 28699 — [manual-subtask-executor12,5fb6fca1fafa34b9,79da434cd70f882b] c.o.o.e.internal.template.HttpTemplate : POST request to agent, url:http://172.17.151.124:62888/api/v1/task/status, request body:GetTaskStatusRequest(taskToken=5b31eefb-d371-4f84-b2d6-41c1778d379b), params:null

25

26

2025-03-13 17:34:30.300 INFO 28699 — [manual-subtask-executor12,5fb6fca1fafa34b9,79da434cd70f882b] c.o.o.s.task.util.AgentAsyncTaskHelper : try to request task result(EXECUTE), result:false,null,

27

28

2025-03-13 17:34:30.304 INFO 28699 — [manual-subtask-executor12,5fb6fca1fafa34b9,79da434cd70f882b] c.o.ocp.common.lang.pattern.Retry : wait for 10 seconds

29

30

2025-03-13 17:34:40.375 INFO 28699 — [manual-subtask-executor12,5fb6fca1fafa34b9,79da434cd70f882b] c.o.o.e.internal.template.HttpTemplate : POST request to agent, url:http://172.17.151.124:62888/api/v1/task/status, request body:GetTaskStatusRequest(taskToken=5b31eefb-d371-4f84-b2d6-41c1778d379b), params:null

31

32

2025-03-13 17:34:40.381 INFO 28699 — [manual-subtask-executor12,5fb6fca1fafa34b9,79da434cd70f882b] c.o.o.s.task.util.AgentAsyncTaskHelper : try to request task result(EXECUTE), result:false,null,

33

34

2025-03-13 17:34:40.390 INFO 28699 — [manual-subtask-executor12,5fb6fca1fafa34b9,79da434cd70f882b] c.o.ocp.common.lang.pattern.Retry : wait for 10 seconds

35

这种现象常见于服务器性能较差或者存在坏盘的环境,最后一步任务可以重试几次,如果不成功可以跳过,理论上不影响正常使用。

这个问题解决了吗?

因为是测试环境 我参考 @try_again 的建议 我跳过了

好的,资源方面建议按照文档要求,如果有新问题可以发帖讨论。