
"failed to probe daemons or devices": error reported by ceph -s

Posted by an administrator on 2025-4-9 17:00:42
[WRN] CEPHADM_REFRESH_FAILED: failed to probe daemons or devices
host private-registry.example.com `cephadm ceph-volume` failed: cephadm exited with an error code: 1, stderr:Non-zero exit code 125 from /bin/podman run --rm --ipc=host --net=host --entrypoint stat --init -e CONTAINER_IMAGE=ceph5-2-1-registry.example.com:5000/rhceph/rhceph-5-rhel8@sha256:d42c0d99ddeaa001570dce4eb90b71699e0401fe449966b935f669ffad22bd01 -e NODE_NAME=private-registry.example.com -e CEPH_USE_RANDOM_NONCE=1 ceph5-2-1-registry.example.com:5000/rhceph/rhceph-5-rhel8@sha256:d42c0d99ddeaa001570dce4eb90b71699e0401fe449966b935f669ffad22bd01 -c %u %g /var/lib/ceph
stat: stderr Trying to pull ceph5-2-1-registry.example.com:5000/rhceph/rhceph-5-rhel8@sha256:d42c0d99ddeaa001570dce4eb90b71699e0401fe449966b935f669ffad22bd01...
stat: stderr Error: initializing source docker://ceph5-2-1-registry.example.com:5000/rhceph/rhceph-5-rhel8@sha256:d42c0d99ddeaa001570dce4eb90b71699e0401fe449966b935f669ffad22bd01: reading manifest sha256:d42c0d99ddeaa001570dce4eb90b71699e0401fe449966b935f669ffad22bd01 in ceph5-2-1-registry.example.com:5000/rhceph/rhceph-5-rhel8: unauthorized: authentication required
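When the warning lists many hosts, it can help to pull out just the host names and confirm that the cause really is a registry authentication failure. The following is a minimal sketch, assuming the health output keeps the line layout shown in the message above. The function names failed_probe_hosts and has_registry_auth_error are hypothetical helpers, not part of Ceph.

```shell
#!/bin/sh
# Hypothetical helpers; both read saved `ceph health detail` text on stdin.

# Print each host named in a "host <name> `cephadm ...` failed" line, once.
failed_probe_hosts() {
  sed -n 's/^[[:space:]]*host \([^[:space:]]*\) `cephadm [^`]*` failed.*/\1/p' | sort -u
}

# Succeed if the text contains the registry authentication error.
has_registry_auth_error() {
  grep -q 'unauthorized: authentication required'
}

# Example usage (not executed here):
#   detail=$(ceph health detail)
#   printf '%s\n' "$detail" | failed_probe_hosts
#   printf '%s\n' "$detail" | has_registry_auth_error && echo "registry login needed"
```

If has_registry_auth_error does not match, the probe is failing for some other reason and the registry login described in this post is unlikely to help.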
How to eliminate this warning?

Resolution
Log in to the cephadm shell on the lead monitor node.

# cephadm shell
Log in manually to the custom registry on all the new hosts simultaneously:

# ceph cephadm registry-login --registry-url <CUSTOM_REGISTRY_NAME> --registry_username <REGISTRY_USERNAME> --registry_password <PASSWORD>

Running this command creates a podman-auth.json file in the /etc/ceph directory that contains the custom registry's authentication details.
Wait 3-5 minutes to see if the daemon starts up.

# watch ceph orch ls
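If you would rather not sit in front of watch, the wait can be scripted. This is only a sketch, assuming the text output of ceph orch ps has one header line and shows "running" in the status of healthy daemons. The names count_not_running, wait_for_daemons and the ORCH_PS override are hypothetical and exist only for this example.

```shell
#!/bin/sh
# Count daemons in `ceph orch ps` text output (stdin) whose line lacks "running".
# Assumption: the first line is the column header.
count_not_running() {
  awk 'NR > 1 && NF > 0 && $0 !~ /running/ { n++ } END { print n + 0 }'
}

# Poll until every daemon reports running, or give up after a timeout.
#   $1 = timeout in seconds (default 300), $2 = poll interval (default 15)
# ORCH_PS is a hypothetical override for the listing command, useful for dry runs.
# Returns 0 when all daemons run, 1 on timeout, 2 if the listing command fails.
wait_for_daemons() {
  timeout="${1:-300}"
  interval="${2:-15}"
  waited=0
  while :; do
    out=$(${ORCH_PS:-ceph orch ps}) || return 2
    pending=$(printf '%s\n' "$out" | count_not_running)
    [ "$pending" -eq 0 ] && return 0
    [ "$waited" -ge "$timeout" ] && return 1
    sleep "$interval"
    waited=$((waited + interval))
  done
}

# Example usage (not executed here):
#   wait_for_daemons 300 15 || echo "some daemons are still not running"
```

The default of 300 seconds matches the upper end of the 3-5 minute wait suggested above.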
OPTIONAL: Restart the daemon if it is still not in a running state.

# ceph orch restart <DAEMON_NAME>
SPECIAL CASE:
For daemons like node-exporter, prometheus, alertmanager, grafana:

After logging into the custom registry, use the ceph config command to configure the custom container images:

# ceph config set mgr mgr/cephadm/OPTION_NAME CUSTOM_REGISTRY_NAME/CONTAINER_NAME

Use the following options for OPTION_NAME:

container_image_prometheus
container_image_grafana
container_image_alertmanager
container_image_node_exporter
Redeploy each daemon:

# ceph orch redeploy DAEMON_NAME
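The set-and-redeploy steps for the four monitoring daemons can be combined in one loop. The sketch below is an assumption-laden example: apply_monitoring_images and the CEPH override are hypothetical names, the image paths must be supplied by you, and it assumes ceph orch redeploy accepts the service names node-exporter, prometheus, alertmanager and grafana in place of DAEMON_NAME.

```shell
#!/bin/sh
# Hypothetical helper: reads "service image" pairs on stdin, maps each service to
# its container_image_* option from the list above, sets the image, then redeploys.
# CEPH is a hypothetical override; set CEPH="echo ceph" to preview the commands.
apply_monitoring_images() {
  while read -r service image; do
    [ -n "$service" ] || continue
    case "$service" in
      prometheus|grafana|alertmanager) option="container_image_$service" ;;
      node-exporter) option="container_image_node_exporter" ;;
      *) echo "unknown service: $service" >&2; return 1 ;;
    esac
    # </dev/null keeps the command from consuming the remaining input pairs.
    ${CEPH:-ceph} config set mgr "mgr/cephadm/$option" "$image" </dev/null || return 1
    ${CEPH:-ceph} orch redeploy "$service" </dev/null || return 1
  done
}

# Example usage (not executed here; the image path is a placeholder):
#   CEPH="echo ceph"
#   printf '%s\n' 'prometheus CUSTOM_REGISTRY_NAME/CONTAINER_NAME' | apply_monitoring_images
```

Running it first with CEPH="echo ceph" prints the exact commands, so they can be checked before anything is changed on the cluster.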
Root Cause
While adding Ceph daemons on newly added hosts, cephadm cannot find the custom registry credentials in the podman-auth.json file under the /etc/ceph directory, or the file itself is missing.

Behind the scenes, when the registry login command is run directly on a host, it logs in only the host where the command is run. As a result, when cephadm attempts to pull the image on other hosts, the pull still fails because those hosts are not logged in.

To avoid this, instead of running the command on the host itself, run the ceph cephadm registry-login command from inside the cephadm shell; it should log in all the hosts in the cluster.

Diagnostic Steps
Check if the podman-auth.json file is present under /etc/ceph/ on each node.

# ls -l /etc/ceph/
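A directory listing only shows that the file exists. The following sketch also checks whether the file mentions your registry. It assumes podman-auth.json follows the usual container auth file layout, where each registry appears as a quoted key. check_registry_auth is a hypothetical helper name.

```shell
#!/bin/sh
# Hypothetical helper: check that the auth file exists and has an entry for a registry.
#   $1 = registry host[:port], $2 = auth file (default /etc/ceph/podman-auth.json)
# Returns 0 if an entry is found, 1 if the file is missing, 2 if there is no entry.
check_registry_auth() {
  registry="$1"
  file="${2:-/etc/ceph/podman-auth.json}"
  if [ ! -f "$file" ]; then
    echo "missing: $file"
    return 1
  fi
  # Assumption: the registry is stored as a quoted JSON key, e.g. "host:port".
  if grep -qF "\"$registry\"" "$file"; then
    echo "ok: $registry found in $file"
  else
    echo "no entry for $registry in $file"
    return 2
  fi
}

# Example usage (not executed here):
#   check_registry_auth <CUSTOM_REGISTRY_NAME>
```

Run it on each node that reported the probe failure; a "missing" or "no entry" result on a node points to the root cause described above.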
Verify the daemon status:

# ceph orch ls
# ceph orch ps