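【 Environment 】Production
【 OB or other component 】OBD
【 Version 】4.1.0-CE
【 Problem description 】
I deployed with the latest 4.1.0 release. All other components deployed normally, but after obproxy started, its health check failed and the deployment was aborted. On inspection, obproxy on the target server is already running and can connect to the OB cluster. However, the obd log suggests the check failed because the pid does not match. I don't know the cause and would appreciate help resolving it.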
Component versions:
+--------------------------------------------------------------------------------------------+
| Packages |
+--------------+---------+------------------------+------------------------------------------+
| Repository | Version | Release | Md5 |
+--------------+---------+------------------------+------------------------------------------+
| oceanbase-ce | 4.1.0.0 | 100000192023032010.el7 | d529f5881ecf9798c5f172d54579950f47a66f30 |
| obproxy-ce | 4.1.0.0 | 7.el7 | 71e70d54b36cb8e04afb306d59d1fc9c9aee706b |
| obagent | 1.3.0 | 22.el7 | cfcfe2aa9325e723c98c200f58470d4b043865c4 |
| ocp-express | 1.0.0 | 100000432023032015.el7 | b1e1d39c1f23e26c33a783f809cf4a63b0c9f1f1 |
+--------------+---------+------------------------+------------------------------------------+
Error log:
[2023-03-25 18:38:41.870] [DEBUG] -- failed to start 10.222.100.76 obproxy, remaining retries: 2
[2023-03-25 18:38:42.871] [DEBUG] -- 10.222.100.76 program health check
[2023-03-25 18:38:42.871] [DEBUG] -- root@10.222.100.76 execute: cat /data/gsy/obproxy/run/obproxy-10.222.100.76-2883.pid
[2023-03-25 18:38:42.922] [DEBUG] -- exited code 0
[2023-03-25 18:38:42.922] [DEBUG] -- root@10.222.100.76 execute: bash -c 'cat /proc/net/{tcp*,udp*}' | awk -F' ' '{print $2,$10}' | grep '00000000:0B43' | awk -F' ' '{print $2}' | uniq
[2023-03-25 18:38:43.059] [DEBUG] -- exited code 0
[2023-03-25 18:38:43.060] [DEBUG] -- failed to start 10.222.100.76 obproxy, remaining retries: 1
[2023-03-25 18:38:44.061] [DEBUG] -- 10.222.100.76 program health check
[2023-03-25 18:38:44.061] [DEBUG] -- root@10.222.100.76 execute: cat /data/gsy/obproxy/run/obproxy-10.222.100.76-2883.pid
[2023-03-25 18:38:44.111] [DEBUG] -- exited code 0
[2023-03-25 18:38:44.111] [DEBUG] -- root@10.222.100.76 execute: bash -c 'cat /proc/net/{tcp*,udp*}' | awk -F' ' '{print $2,$10}' | grep '00000000:0B43' | awk -F' ' '{print $2}' | uniq
[2023-03-25 18:38:44.249] [DEBUG] -- exited code 0
[2023-03-25 18:38:44.250] [DEBUG] -- failed to start 10.222.100.76 obproxy, remaining retries: 0
[2023-03-25 18:38:44.360] [WARNING] failed to start 10.222.100.76 obproxy
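
To help reproduce the check by hand, here is a minimal sketch of what the logged shell pipeline appears to compute: it scans `/proc/net/{tcp*,udp*}` for sockets whose local address matches `00000000:0B43` (hex `0B43` is port 2883) and prints column 10, the socket inode. The function name `listening_inodes` and the sample data are hypothetical and only approximate the pipeline; this is not taken from OBD's source, and how OBD uses the result afterwards is not shown in the log.

```python
# Hypothetical helper that mimics the logged pipeline:
#   cat /proc/net/{tcp*,udp*} | awk '{print $2,$10}' | grep '00000000:0B43' | awk '{print $2}' | uniq
def listening_inodes(proc_net_text: str, port: int) -> list[str]:
    """Return inodes of sockets whose local address contains 00000000:<PORT in hex>."""
    target = f"00000000:{port:04X}"  # 2883 -> "00000000:0B43"
    inodes: list[str] = []
    for line in proc_net_text.splitlines():
        fields = line.split()
        # Data rows start with an index like "0:"; the header row does not.
        if len(fields) < 10 or not fields[0].endswith(":"):
            continue
        local_address, inode = fields[1], fields[9]
        if target in local_address and inode not in inodes:
            inodes.append(inode)
    return inodes


# Made-up sample in /proc/net/tcp layout (inode values are invented):
# row 0 is bound to 0.0.0.0:2883, row 1 to 10.222.100.76:2883.
SAMPLE = """\
  sl  local_address rem_address   st tx_queue rx_queue tr tm->when retrnsmt   uid  timeout inode
   0: 00000000:0B43 00000000:0000 0A 00000000:00000000 00:00000000 00000000     0        0 12345 1 0000000000000000 100 0 0 10 0
   1: 4C64DE0A:0B43 00000000:0000 0A 00000000:00000000 00:00000000 00000000     0        0 67890 1 0000000000000000 100 0 0 10 0
"""

if __name__ == "__main__":
    print(listening_inodes(SAMPLE, 2883))
```

On a real host, the text can be read from `/proc/net/tcp` and the returned inodes compared against the `socket:[...]` links under `/proc/<pid>/fd/` for the pid stored in the `.pid` file. Note that, in this sketch, a socket bound to a specific address instead of `0.0.0.0` does not match the pattern; whether that applies to this deployment cannot be determined from the log alone.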