找回密码
 注册
查看: 618|回复: 0

HEALTH_WARN failed to probe daemons or devices

[复制链接]

1

主题

0

回帖

12

积分

管理员

积分
12
QQ
发表于 2023-1-8 21:16:01 | 显示全部楼层 |阅读模式
问题
  • After upgrading Red Hat Ceph Storage ( RHCS ) from version 5.0x to version 5.1, the following warning shows in the ceph health detail output:
    [size=0.75]Raw
    . P+ N( y+ v( m. F/ m5 B[root@node01 ~]# ceph health detailHEALTH_WARN failed to probe daemons or devices[WRN] CEPHADM_REFRESH_FAILED: failed to probe daemons or devices    host node01 `cephadm ceph-volume` failed: cephadm exited with an error code: 1, stderr:Inferring config /var/lib/ceph/<fsid>/mon.node01/configERROR: [Errno 2] No such file or directory: '/var/lib/ceph/<fsid>/mon.node01/config'
    3 u% }% E9 I8 r! T
    The above error points to a non existing directory /var/lib/ceph/<fsid>/mon.node01/config. This directory holds the configuration files for a Ceph mon service, but oddly, the node node01 is not a monitor.
  • Why is this message showing in a node that is not a monitor?
  • How to eliminate this warning?

    5 X/ I, x; k( `5 I6 b
决议
  • This issue has been reported in Red Hat internal BZ #2080242. It'll be fixed in RHCS 5.2. At the time of writing, this version is still not released.
  • As a workaround, remove any mon folders named /var/lib/ceph/mon in the affected node. The mentioned warning in the issue description will disappear after that.
    CAUTION
    Please, be aware that the folder /var/lib/ceph/<fsid> contains critical cluster information. Please, do not remove any content from this directory, or the directory itself. This might cause permanent damage to the Ceph cluster.
    " b% s) l9 e9 F% R8 f
根源
This issue is observed in deployments upgraded from RHCS 4.x to RHCS 5.0. The mentioned directory /var/lib/ceph/mon is a leftover from the ceph-ansible deployment that wasn't cleared as part of the upgrade process.
诊断步骤
  • Review the output of the cephadm ls command. It'll show a legacy mon service that shouldn't be present in a Ceph node that is not a monitor:
    [size=0.75]Raw1 _5 W2 `" C5 S2 h
        {            "style": "legacy",            "name": "mon.node01",            "fsid": "<fsid>",            "systemd_unit": "ceph-mon@node1",            "enabled": false,            "state": "stopped",            "host_version": "16.2.0-152.el8cp"        }! k5 x4 H6 ~6 e8 K0 s
  • The cephadm tool pulls the information about legacy services from /var/lib/ceph/<daemon-name> so in this case, it would be /var/lib/ceph/mon. Review the contents of this folder to confirm a mon directory is listed.

    ( U' S6 Z1 p: m" {/ i
# B9 k8 D# n8 N( Y5 O9 v1 S
您需要登录后才可以回帖 登录 | 注册

本版积分规则

返回首页|Archiver|手机版|小黑屋|易陆发现技术论坛 ( 蜀ICP备2026014127号-1 )

GMT+8, 2026-6-12 00:11 , Processed in 0.018062 second(s), 23 queries .

Powered by Discuz! X5.0

© 2001-2026 Discuz! Team.

快速回复 返回顶部 返回列表