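【Environment】Test environment
【OB or other component】OCP
【Version】4.3.5
【Problem description】After OCP was deployed successfully through the web console, adding a host fails at the host standardization check.
【Steps to reproduce】The first standardization check succeeded. After fixing the items flagged in the check report, running the check again fails.
【Attachments and logs】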
subtask_13000013.log (15.9 KB)
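It looks like OCP hit an exception while handling the maxUserProcesses parameter. Which operating system and version is this host running?
If you have already corrected everything manually, you can ignore this for now.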
2025-09-16 16:11:14.079 WARN 7573 --- [manual-subtask-executor11,62f6a359d9849a68,08adf1a58d0dead0] c.o.o.i.internal.InspectionServiceImpl : inspection error:
java.lang.RuntimeException: [limit of maxUserProcesses of os user(ocp.operation.default.os.user in System Parameters)] inspection failed.
at com.oceanbase.ocp.inspection.internal.InspectionRuntimeImpl.writeReport(InspectionRuntimeImpl.java:44)
at com.oceanbase.ocp.inspection.internal.InspectionExecutorImpl.lambda$writeReportAsync$0(InspectionExecutorImpl.java:54)
at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
at java.lang.Thread.run(Thread.java:750)
Caused by: com.oceanbase.ocp.executor.exception.HttpRequestFailedException: [AgentClient]:http request is failed, response:Handle process limit failed, reason: 0%!(EXTRA *errors.fundamental=get limit value failed)
at com.oceanbase.ocp.executor.internal.template.HttpTemplate.checkSuccess(HttpTemplate.java:479)
at com.oceanbase.ocp.executor.internal.template.HttpTemplate.doPost(HttpTemplate.java:289)
at com.oceanbase.ocp.executor.internal.template.HttpTemplate.post(HttpTemplate.java:117)
at com.oceanbase.ocp.inspection.internal.targ.Agent.api(Agent.java:33)
at jdk.nashorn.internal.scripts.Script$10$\^eval\_.:program(<eval>:13)
at jdk.nashorn.internal.runtime.ScriptFunctionData.invoke(ScriptFunctionData.java:637)
at jdk.nashorn.internal.runtime.ScriptFunction.invoke(ScriptFunction.java:494)
at jdk.nashorn.internal.runtime.ScriptRuntime.apply(ScriptRuntime.java:393)
at jdk.nashorn.api.scripting.NashornScriptEngine.evalImpl(NashornScriptEngine.java:421)
at jdk.nashorn.api.scripting.NashornScriptEngine.access$300(NashornScriptEngine.java:73)
at jdk.nashorn.api.scripting.NashornScriptEngine$3.eval(NashornScriptEngine.java:514)
at javax.script.CompiledScript.eval(CompiledScript.java:92)
at com.oceanbase.ocp.inspection.internal.script.nashorn.NashornCompiledScript.call(NashornCompiledScript.java:29)
at com.oceanbase.ocp.inspection.internal.InspectionRuntimeImpl.writeReport(InspectionRuntimeImpl.java:39)
... 4 common frames omitted
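Ubuntu 20.04.6 Desktop. I am not sure whether other problems remain after the fix, so I wanted to run the standardization check once more.
Once you understand how the check works and have confirmed that the setting has been restored, you can ignore this message.
Run the check again. If it fails, post the ocp-server.log that covers the time of the error.
Rechecking keeps failing the same way, so I uninstalled and reinstalled OCP. After the reinstall, the standardization check reports one additional error: can not check clockDiff. Following a community post, I ran setcap 'cap_sys_nice,cap_net_raw+ep' "$(which clockdiff)" as root, but the error persists.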
subtask_3000231.log (20.0 KB)
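If the setcap command was copied from a web page, typographic quotes may have been pasted in place of plain ASCII quotes, and the shell then passes them through as literal characters. Below is a minimal sketch, assuming setcap and getcap (from the libcap tools) are installed, that applies the two capabilities named in this thread and then checks that getcap reports them. The function names has_clockdiff_caps and apply_clockdiff_caps are hypothetical helpers, not part of OCP.

```shell
#!/bin/sh
# Sketch only: helper names are illustrative, not part of OCP or iputils.

# Return 0 if a line of getcap output lists both capabilities from the thread.
has_clockdiff_caps() {
    case "$1" in
        *cap_sys_nice*) ;;
        *) return 1 ;;
    esac
    case "$1" in
        *cap_net_raw*) return 0 ;;
        *) return 1 ;;
    esac
}

# Apply the capabilities (must run as root), then verify them with getcap.
apply_clockdiff_caps() {
    bin="$(command -v clockdiff)" || {
        echo "clockdiff not found in PATH" >&2
        return 1
    }
    # Plain ASCII quotes; typographic quotes would become part of the argument.
    setcap 'cap_sys_nice,cap_net_raw+ep' "$bin" || return 1
    has_clockdiff_caps "$(getcap "$bin")"
}
```

Source the file and call apply_clockdiff_caps as root on the machine that runs clockdiff. If the capabilities are present and the check still fails, the log message itself suggests trying another value of ocp.host.check.clock-diff.mode in System Parameters.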
Please post the ocp-server.log that covers the time of this error.
That file does not exist.
Is it under /home/admin/logs/?
Try searching for it:
sudo find / -name "ocp-server.log"
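Once the file is located, only the entries around the failure time are needed. A rough sketch of both steps follows; the helper names find_ocp_logs and log_window are hypothetical, and the trailing wildcard for rotated log files is an assumption about how the logs are named.

```shell
#!/bin/sh
# Sketch only: helper names and the rotated-file wildcard are assumptions.

# List ocp-server.log and any rotated copies under a starting directory.
find_ocp_logs() {
    find "${1:-/}" -type f -name 'ocp-server.log*' 2>/dev/null
}

# Print every line containing the timestamp prefix plus the 40 lines after it,
# so multi-line Java stack traces stay attached to their log entry.
log_window() {
    # $1 = log file, $2 = timestamp prefix, e.g. "2025-09-22 16:17"
    grep -F -A 40 -- "$2" "$1"
}
```

For example, log_window "$(find_ocp_logs /home/admin/logs | head -n 1)" '2025-09-22 16:17' would print the entries logged during that minute together with their stack traces.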
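Clock Offset, limit of maxUserProcesses of os user, SELinux config
All three of these checks failed. I will look into it further. Do you have a CentOS or Red Hat machine where you could check whether the same errors occur?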
2025-09-22 16:17:12.620 INFO 6371 --- [http-nio-0.0.0.0-8080-exec-4,db4ea5553bee6ee7,b27cc3b63fb2d11d] c.o.o.s.c.trace.RequestTracingAspect : API OK: [GET /api/v2/sensitiveDataKey client=192.168.10.92, traceId=db4ea5553bee6ee7, duration=1 ms]
2025-09-22 16:17:12.625 INFO 6371 --- [http-nio-0.0.0.0-8080-exec-8,,] c.o.o.s.c.trace.RequestTracingAspect : API: [GET /error?null, client=192.168.10.92, traceId=, method=ModelAndView com.oceanbase.ocp.server.common.controller.web.CustomErrorController.handleError(HttpServletRequest), args=]
2025-09-22 16:17:12.625 WARN 6371 --- [http-nio-0.0.0.0-8080-exec-8,,] c.o.o.s.c.c.web.CustomErrorController : Requested URL not found: [url=/api/v2/maz/state, method=192.168.10.92, client=GET, traceId=]
2025-09-22 16:17:12.625 INFO 6371 --- [http-nio-0.0.0.0-8080-exec-8,,] c.o.o.s.c.trace.RequestTracingAspect : API OK: [GET /error client=192.168.10.92, traceId=, duration=0 ms]
2025-09-22 16:17:12.798 WARN 6371 --- [ocp-async-6,d7c6c9e9aad53022,8fffc337fdf18c67] c.o.o.a.s.AlarmInMemoryServiceImpl : aggregator or allDistributors is empty.
2025-09-22 16:17:12.798 INFO 6371 --- [ocp-async-5,ceced49494b1da77,9032c5a403dc995b] c.o.o.s.common.DistributedLockAspect : Get distributed lock, lockKey=AlarmEventProcessForSelfcure, method=observerAlarmEvents
2025-09-22 16:17:12.850 WARN 6371 --- [inspection_task2,,] c.o.ocp.core.task.util.ClockDiff : Clock-diff failed. args=[192.168.10.86], exception={}. Please try other ocp.host.check.clock-diff.mode in System Parameters
org.apache.commons.exec.ExecuteException: Process exited with an error: 1 (Exit value: 1)
at org.apache.commons.exec.DefaultExecutor.executeInternal(DefaultExecutor.java:404)
at org.apache.commons.exec.DefaultExecutor.execute(DefaultExecutor.java:166)
at org.apache.commons.exec.DefaultExecutor.execute(DefaultExecutor.java:153)
at com.oceanbase.ocp.core.util.LocalCommandExecutor.exec(LocalCommandExecutor.java:50)
at com.oceanbase.ocp.core.task.util.ClockDiff.execute(ClockDiff.java:95)
at com.oceanbase.ocp.core.task.util.ClockDiff.diffWithIcmpTimestamp(ClockDiff.java:52)
at com.oceanbase.ocp.core.task.util.ClockDiff$ClockDiffMode.doRun(ClockDiff.java:172)
at com.oceanbase.ocp.inspection.internal.targ.Host.clockDiff(Host.java:120)
at jdk.nashorn.internal.scripts.Script$\^eval\_.:program(<eval>:1)
at jdk.nashorn.internal.runtime.ScriptFunctionData.invoke(ScriptFunctionData.java:637)
at jdk.nashorn.internal.runtime.ScriptFunction.invoke(ScriptFunction.java:494)
at jdk.nashorn.internal.runtime.ScriptRuntime.apply(ScriptRuntime.java:393)
at jdk.nashorn.api.scripting.NashornScriptEngine.evalImpl(NashornScriptEngine.java:421)
at jdk.nashorn.api.scripting.NashornScriptEngine.access$300(NashornScriptEngine.java:73)
at jdk.nashorn.api.scripting.NashornScriptEngine$3.eval(NashornScriptEngine.java:514)
at javax.script.CompiledScript.eval(CompiledScript.java:92)
at com.oceanbase.ocp.inspection.internal.script.nashorn.NashornCompiledScript.call(NashornCompiledScript.java:29)
at com.oceanbase.ocp.inspection.internal.InspectionRuntimeImpl.writeReport(InspectionRuntimeImpl.java:39)
at com.oceanbase.ocp.inspection.internal.InspectionExecutorImpl.lambda$writeReportAsync$0(InspectionExecutorImpl.java:54)
at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
at java.lang.Thread.run(Thread.java:750)
2025-09-22 16:17:12.854 WARN 6371 --- [manual-subtask-executor16,c7201c82a3a1a623,d6f3401db5961f6a] c.o.o.i.internal.InspectionServiceImpl : inspection error:
java.lang.RuntimeException: [Clock Offset] inspection failed.
at com.oceanbase.ocp.inspection.internal.InspectionRuntimeImpl.writeReport(InspectionRuntimeImpl.java:44)
at com.oceanbase.ocp.inspection.internal.InspectionExecutorImpl.lambda$writeReportAsync$0(InspectionExecutorImpl.java:54)
at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
at java.lang.Thread.run(Thread.java:750)
Caused by: java.lang.RuntimeException: can not check clockDiff of 192.168.10.86
at com.oceanbase.ocp.inspection.internal.targ.Host.clockDiff(Host.java:125)
at jdk.nashorn.internal.scripts.Script$\^eval\_.:program(<eval>:1)
at jdk.nashorn.internal.runtime.ScriptFunctionData.invoke(ScriptFunctionData.java:637)
at jdk.nashorn.internal.runtime.ScriptFunction.invoke(ScriptFunction.java:494)
at jdk.nashorn.internal.runtime.ScriptRuntime.apply(ScriptRuntime.java:393)
at jdk.nashorn.api.scripting.NashornScriptEngine.evalImpl(NashornScriptEngine.java:421)
at jdk.nashorn.api.scripting.NashornScriptEngine.access$300(NashornScriptEngine.java:73)
at jdk.nashorn.api.scripting.NashornScriptEngine$3.eval(NashornScriptEngine.java:514)
at javax.script.CompiledScript.eval(CompiledScript.java:92)
at com.oceanbase.ocp.inspection.internal.script.nashorn.NashornCompiledScript.call(NashornCompiledScript.java:29)
at com.oceanbase.ocp.inspection.internal.InspectionRuntimeImpl.writeReport(InspectionRuntimeImpl.java:39)
... 4 common frames omitted
Caused by: java.lang.RuntimeException: Can not query clock diff.
at com.oceanbase.ocp.inspection.internal.targ.Host.lambda$clockDiff$0(Host.java:123)
at java.util.Optional.orElseThrow(Optional.java:290)
at com.oceanbase.ocp.inspection.internal.targ.Host.clockDiff(Host.java:123)
... 14 common frames omitted
2025-09-22 16:17:12.856 WARN 6371 --- [manual-subtask-executor16,c7201c82a3a1a623,d6f3401db5961f6a] c.o.o.i.internal.InspectionServiceImpl : inspection error:
java.lang.RuntimeException: [limit of maxUserProcesses of os user(ocp.operation.default.os.user in System Parameters)] inspection failed.
at com.oceanbase.ocp.inspection.internal.InspectionRuntimeImpl.writeReport(InspectionRuntimeImpl.java:44)
at com.oceanbase.ocp.inspection.internal.InspectionExecutorImpl.lambda$writeReportAsync$0(InspectionExecutorImpl.java:54)
at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
at java.lang.Thread.run(Thread.java:750)
Caused by: com.oceanbase.ocp.executor.exception.HttpRequestFailedException: [AgentClient]:http request is failed, response:Handle process limit failed, reason: 0%!(EXTRA *errors.fundamental=get limit value failed)
at com.oceanbase.ocp.executor.internal.template.HttpTemplate.checkSuccess(HttpTemplate.java:479)
at com.oceanbase.ocp.executor.internal.template.HttpTemplate.doPost(HttpTemplate.java:289)
at com.oceanbase.ocp.executor.internal.template.HttpTemplate.post(HttpTemplate.java:117)
at com.oceanbase.ocp.inspection.internal.targ.Agent.api(Agent.java:33)
at jdk.nashorn.internal.scripts.Script$10$\^eval\_.:program(<eval>:13)
at jdk.nashorn.internal.runtime.ScriptFunctionData.invoke(ScriptFunctionData.java:637)
at jdk.nashorn.internal.runtime.ScriptFunction.invoke(ScriptFunction.java:494)
at jdk.nashorn.internal.runtime.ScriptRuntime.apply(ScriptRuntime.java:393)
at jdk.nashorn.api.scripting.NashornScriptEngine.evalImpl(NashornScriptEngine.java:421)
at jdk.nashorn.api.scripting.NashornScriptEngine.access$300(NashornScriptEngine.java:73)
at jdk.nashorn.api.scripting.NashornScriptEngine$3.eval(NashornScriptEngine.java:514)
at javax.script.CompiledScript.eval(CompiledScript.java:92)
at com.oceanbase.ocp.inspection.internal.script.nashorn.NashornCompiledScript.call(NashornCompiledScript.java:29)
at com.oceanbase.ocp.inspection.internal.InspectionRuntimeImpl.writeReport(InspectionRuntimeImpl.java:39)
... 4 common frames omitted
2025-09-22 16:17:12.858 WARN 6371 --- [manual-subtask-executor16,c7201c82a3a1a623,d6f3401db5961f6a] c.o.o.i.internal.InspectionServiceImpl : inspection error:
java.lang.RuntimeException: [SELinux config] inspection failed.
at com.oceanbase.ocp.inspection.internal.InspectionRuntimeImpl.writeReport(InspectionRuntimeImpl.java:44)
at com.oceanbase.ocp.inspection.internal.InspectionExecutorImpl.lambda$writeReportAsync$0(InspectionExecutorImpl.java:54)
at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
at java.lang.Thread.run(Thread.java:750)
Caused by: com.oceanbase.ocp.executor.exception.HttpRequestFailedException: [AgentClient]:http request is failed, response:Unexpected error: get selinux failed: failed to execute command: Command{user=root, program=sh, outputType=std, cmd=/usr/sbin/getenforce, timeout=10s}, exitCode: 127, output: , error: sh: 1: /usr/sbin/getenforce: not found
: exit status 127
at com.oceanbase.ocp.executor.internal.template.HttpTemplate.checkSuccess(HttpTemplate.java:479)
at com.oceanbase.ocp.executor.internal.template.HttpTemplate.doPost(HttpTemplate.java:289)
at com.oceanbase.ocp.executor.internal.template.HttpTemplate.post(HttpTemplate.java:117)
at com.oceanbase.ocp.inspection.internal.targ.Agent.api(Agent.java:33)
at jdk.nashorn.internal.scripts.Script$11$\^eval\_.:program(<eval>:2)
at jdk.nashorn.internal.runtime.ScriptFunctionData.invoke(ScriptFunctionData.java:637)
at jdk.nashorn.internal.runtime.ScriptFunction.invoke(ScriptFunction.java:494)
at jdk.nashorn.internal.runtime.ScriptRuntime.apply(ScriptRuntime.java:393)
at jdk.nashorn.api.scripting.NashornScriptEngine.evalImpl(NashornScriptEngine.java:421)
at jdk.nashorn.api.scripting.NashornScriptEngine.access$300(NashornScriptEngine.java:73)
at jdk.nashorn.api.scripting.NashornScriptEngine$3.eval(NashornScriptEngine.java:514)
at javax.script.CompiledScript.eval(CompiledScript.java:92)
at com.oceanbase.ocp.inspection.internal.script.nashorn.NashornCompiledScript.call(NashornCompiledScript.java:29)
at com.oceanbase.ocp.inspection.internal.InspectionRuntimeImpl.writeReport(InspectionRuntimeImpl.java:39)
... 4 common frames omitted
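I will install CentOS in a virtual machine and try it.
Run the following commands manually and check the results:
1. On the agent host, run clockdiff -o <address of the ocp-server machine>
2. On the agent host, run ulimit -a
3. On the agent host, run /usr/sbin/getenforce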
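The three manual checks can be collected in one pass. This is only a sketch: run_check and collect_agent_checks are hypothetical helper names, and the script reports a missing binary instead of failing, since the SELinux error in the log above is exit code 127 with "/usr/sbin/getenforce: not found".

```shell
#!/bin/sh
# Sketch only: helper names are illustrative, not part of OCP.

# Run one command under a labelled header and report its exit code.
# Prints a SKIPPED line and returns 127 when the command does not exist.
run_check() {
    label="$1"
    shift
    printf '== %s ==\n' "$label"
    if ! command -v "$1" >/dev/null 2>&1; then
        printf 'SKIPPED: %s not found\n' "$1"
        return 127
    fi
    rc=0
    "$@" 2>&1 || rc=$?
    printf 'exit code: %s\n' "$rc"
    return "$rc"
}

# Run the three checks from the thread on the agent host.
collect_agent_checks() {
    ocp_host="$1"   # address of the ocp-server machine
    run_check "clock offset" clockdiff -o "$ocp_host"
    run_check "process limits" ulimit -a
    run_check "SELinux mode" /usr/sbin/getenforce
}
```

Source the file on the agent host and call collect_agent_checks with the ocp-server address, then paste the full output into the thread. Each check runs even if the previous one failed.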