hosts fail cephadm check ceph分布式存告警
health: HEALTH_WARN1 hosts fail cephadm check
CEPHADM_HOST_CHECK_FAILED: 1 hosts fail cephadm check
# ceph health detail
HEALTH_WARN 1 hosts fail cephadm check
CEPHADM_HOST_CHECK_FAILED: 1 hosts fail cephadm check
host gm268-1 failed check: Failed to connect to gm268-1 (gm268-1).
Please make sure that the host is reachable and accepts connections using the cephadm SSH key
To add the cephadm SSH key to the host:
> ceph cephadm get-pub-key > ~/ceph.pub
> ssh-copy-id -f -i ~/ceph.pub root@gm268-1
To check that the host is reachable:
> ceph cephadm get-ssh-config > ssh_config
> ceph config-key get mgr/cephadm/ssh_identity_key > ~/cephadm_private_key
> chmod 0600 ~/cephadm_private_key
> ssh -F ssh_config -i ~/cephadm_private_key root@gm268-1
# ceph cephadm check-host gm268-1
podman|docker (/usr/bin/podman) is present
systemctl is present
lvcreate is present
Unit chronyd.service is enabled and running
Hostname "gm268-1" matches what is expected.
Host looks OK
gm268-1 (None) ok
# ceph cephadm check-host gm268-2
podman|docker (/usr/bin/podman) is present
systemctl is present
lvcreate is present
Unit chronyd.service is enabled and running
Hostname "gm268-2" matches what is expected.
Host looks OK
gm268-2 (None) ok
# ceph cephadm check-host gm268-3
podman|docker (/usr/bin/podman) is present
systemctl is present
lvcreate is present
Unit chronyd.service is enabled and running
Hostname "gm268-3" matches what is expected.
Host looks OK
gm268-3 (None) ok
# ceph -s
cluster:
id: ce68aab8-8f46-11ed-88c0-ac1f6b3a30b9
health: HEALTH_OK
services:
mon: 3 daemons, quorum gm268-1,gm268-2,gm268-3 (age 11m)
mgr: gm268-2.zttohs(active, since 3w), standbys: gm268-3.sjagqo, gm268-1.jgdvxs
mds: cephfs:1 {0=cephfs.gm268-2.xdsdoz=up:active} 2 up:standby
osd: 41 osds: 41 up (since 10m), 41 in (since 2w); 1008 remapped pgs
data:
pools: 3 pools, 9217 pgs
objects: 10.14M objects, 14 TiB
usage: 28 TiB used, 569 TiB / 597 TiB avail
pgs: 1026683/20286478 objects misplaced (5.061%)
8207 active+clean
958active+remapped+backfill_wait
52 active+remapped+backfilling
io:
recovery: 735 MiB/s, 14 keys/s, 484 objects/s
# ceph cephadm check-host gm268-1
podman|docker (/usr/bin/podman) is present
systemctl is present
lvcreate is present
Unit chronyd.service is enabled and running
Hostname "gm268-1" matches what is expected.
Host looks OK
gm268-1 (None) ok
页:
[1]