Recovery procedure for when all mon nodes fail, or when a single mon node fails.

The ceph -s command never completes successfully on the cluster.

The Ceph distributed storage cluster reports: monclient(hunting): handle_auth_bad_method server allowed_methods [2] but i only support [2]

2024-10-17T22:33:47.295+0800 7f20fe7fc700 -1 monclient(hunting): handle_auth_bad_method server allowed_methods [2] but i only support [2]
2024-10-17T22:33:47.297+0800 7f20ff7fe700 -1 monclient(hunting): handle_auth_bad_method server allowed_methods [2] but i only support [2]

Only the gm268-3 node in this environment is still good, because its reboot failed and hung; gm268-1 and gm268-2 are both damaged. The fix therefore has to start from node 3.

Resolution steps:

1. On the gm268-3 node, generate the monmap file (here it is re-created with monmaptool --create):

$ monmaptool --create --clobber --fsid ce68aab8-8f46-11ed-88c0-ac1f6b3a30b9 --add gm268-3 10.12.3.2:6789 --add gm268-2 10.12.2.2:6789 --add gm268-1 10.12.1.2:6789 /tmp/monmap
monmaptool: monmap file /tmp/monmap
monmaptool: set fsid to ce68aab8-8f46-11ed-88c0-ac1f6b3a30b9
monmaptool: writing epoch 0 to /tmp/monmap (3 monitors)

When building the monmap, list the healthy node first, then add all of the damaged nodes after it.

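The ordering rule above (healthy monitor first, damaged ones after) can be captured in a small helper that only prints the monmaptool command for review. This is a minimal sketch: build_monmap_cmd is a hypothetical name, and the name=address argument form is an assumption made for illustration; the fsid, node names, and addresses are the ones used in this post.

```shell
# Sketch: assemble the monmaptool invocation from a list of monitors.
# build_monmap_cmd is a hypothetical helper, not part of Ceph.
# Usage: build_monmap_cmd <fsid> <output-file> <name=ip:port>...
build_monmap_cmd() (
    fsid="$1"; out="$2"; shift 2
    cmd="monmaptool --create --clobber --fsid $fsid"
    for pair in "$@"; do
        # Split "name=ip:port" into the two arguments that --add expects.
        cmd="$cmd --add ${pair%%=*} ${pair#*=}"
    done
    printf '%s %s\n' "$cmd" "$out"
)

# Healthy node (gm268-3) first, then the damaged ones.
# This only prints the command; run it by hand once it looks right.
build_monmap_cmd ce68aab8-8f46-11ed-88c0-ac1f6b3a30b9 /tmp/monmap \
    gm268-3=10.12.3.2:6789 gm268-2=10.12.2.2:6789 gm268-1=10.12.1.2:6789
```

Printing instead of executing keeps the sketch safe to try on any machine, including one without the Ceph tools installed.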
Inspect the generated file:

$ monmaptool --print /tmp/monmap
monmaptool: monmap file /tmp/monmap
epoch 0
fsid ce68aab8-8f46-11ed-88c0-ac1f6b3a30b9
last_changed 2024-10-18T13:17:03.645872+0800
created 2024-10-18T13:17:03.645872+0800
min_mon_release 0 (unknown)
0: v1:10.12.1.2:6789/0 mon.gm268-1
1: v1:10.12.2.2:6789/0 mon.gm268-2
2: v1:10.12.3.2:6789/0 mon.gm268-3

2. On the gm268-1 and gm268-2 nodes, locate the /var/lib/ceph/mon directory, back it up, then delete it. The files in it had been modified and were left in an abnormal state, which is what caused the authentication failure. Do not delete the existing /etc/ceph/ directory.

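One cautious way to do the "back up, then delete" part of step 2 is to move the damaged store aside under a timestamped name, so nothing is lost if the rebuild goes wrong. This is a sketch under assumptions: backup_mon_store is a hypothetical helper, the .bak.<timestamp> naming is made up for illustration, the mon daemon on that node is assumed to be stopped already, and re-creating an empty mon directory afterwards is a precaution rather than something the original steps state.

```shell
# Sketch: move the damaged mon store aside instead of deleting it outright.
# backup_mon_store is a hypothetical helper, not part of Ceph.
# Usage: backup_mon_store [mon-root]   (defaults to /var/lib/ceph/mon)
backup_mon_store() (
    mon_root="${1:-/var/lib/ceph/mon}"
    backup="${mon_root}.bak.$(date +%Y%m%d%H%M%S)"
    if [ ! -d "$mon_root" ]; then
        echo "no such directory: $mon_root" >&2
        exit 1
    fi
    # Keep the old store as the backup, then leave an empty directory
    # in its place (assumption: a clean parent directory for the rebuild).
    mv "$mon_root" "$backup" && mkdir -p "$mon_root" && echo "$backup"
)

# Example (run as root on gm268-1 and gm268-2 only, never on the healthy node):
#   backup_mon_store /var/lib/ceph/mon
```

The function prints the backup path on success, which makes it easy to record where the old data went. /etc/ceph/ is never touched.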
3. Copy the keyring from the healthy node and the generated monmap file to the other two nodes:

scp /var/lib/ceph/mon/ceph-gm268-3/keyring gm268-2:/tmp/
scp /var/lib/ceph/mon/ceph-gm268-3/keyring gm268-1:/tmp/

scp /tmp/monmap gm268-1:/tmp/
scp /tmp/monmap gm268-2:/tmp/

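Copying the same two files to each damaged node is easy to get wrong by hand (for example, sending the monmap to the same node twice), so a loop can help. The following is a minimal sketch: distribute_files and the DRY_RUN switch are hypothetical names introduced here; the paths and node names come from this post.

```shell
# Sketch: send the healthy node's keyring and the monmap to each damaged node.
# distribute_files and DRY_RUN are hypothetical; with DRY_RUN=1 the scp
# commands are only printed, not executed.
distribute_files() (
    keyring=/var/lib/ceph/mon/ceph-gm268-3/keyring
    monmap=/tmp/monmap
    for node in "$@"; do
        for f in "$keyring" "$monmap"; do
            if [ "${DRY_RUN:-0}" = 1 ]; then
                echo "scp $f $node:/tmp/"
            else
                scp "$f" "$node:/tmp/" || exit 1
            fi
        done
    done
)

# Preview what would be copied (run on gm268-3):
DRY_RUN=1 distribute_files gm268-1 gm268-2
```

Once the preview lists both files for both nodes, the same call without DRY_RUN=1 performs the copy.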
4. Rebuild the mon on the gm268-1 and gm268-2 nodes
ceph-mon --cluster ceph -i gm268-1 --mkfs --monmap /tmp/monmap --keyring /tmp/keyring -c /etc/ceph/ceph.conf
Change to the /var/lib/ceph directory (the parent of mon/)
and run:
chown -R ceph:ceph mon/
Start the mon service:
systemctl start ceph-mon@gm268-1.service

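Step 4 is identical on both damaged nodes apart from the monitor id, so the three commands can be generated per node. This is a sketch only: rebuild_mon_cmds is a hypothetical helper that prints the commands rather than running them, and it uses the absolute path /var/lib/ceph/mon for chown, which is equivalent to changing into /var/lib/ceph and running chown on mon/.

```shell
# Sketch: print the step-4 commands for one monitor id.
# rebuild_mon_cmds is a hypothetical helper, not part of Ceph.
rebuild_mon_cmds() (
    node="$1"
    cat <<EOF
ceph-mon --cluster ceph -i ${node} --mkfs --monmap /tmp/monmap --keyring /tmp/keyring -c /etc/ceph/ceph.conf
chown -R ceph:ceph /var/lib/ceph/mon
systemctl start ceph-mon@${node}.service
EOF
)

# Print the commands for each damaged node; run them on that node as root.
for n in gm268-1 gm268-2; do
    echo "# on $n"
    rebuild_mon_cmds "$n"
done
```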
Check the service status:

$ systemctl status ceph-mon@gm268-1.service
● ceph-mon@gm268-1.service - Ceph cluster monitor daemon
   Loaded: loaded (/usr/lib/systemd/system/ceph-mon@.service; enabled; vendor preset: disabled)
   Active: active (running) since Fri 2024-10-18 13:21:24 CST; 38min ago
 Main PID: 664542 (ceph-mon)
    Tasks: 27
   Memory: 286.0M
   CGroup: /system.slice/system-ceph\x2dmon.slice/ceph-mon@gm268-1.service
           └─664542 /usr/bin/ceph-mon -f --cluster ceph --id gm268-1 --setuser ceph --setgroup ceph

Oct 18 13:21:24 gm268-1 systemd[1]: Started Ceph cluster monitor daemon.
Oct 18 13:21:24 gm268-1 ceph-mon[664542]: 2024-10-18T13:21:24.793+0800 7fcc5f804700 -1 mon.gm268-1@0(probing) e11 stashing newest monmap 11 for next startup
Oct 18 13:21:24 gm268-1 ceph-mon[664542]: ignoring --setuser ceph since I am not root
Oct 18 13:21:24 gm268-1 ceph-mon[664542]: ignoring --setgroup ceph since I am not root

Node repair complete.
On node two (gm268-2):

ceph-mon --cluster ceph -i gm268-2 --mkfs --monmap /tmp/monmap --keyring /tmp/keyring -c /etc/ceph/ceph.conf

Change to the /var/lib/ceph directory (the parent of mon/)
and run:
chown -R ceph:ceph mon/

Start the mon service:
systemctl start ceph-mon@gm268-2.service

$ systemctl status ceph-mon@gm268-2.service
● ceph-mon@gm268-2.service - Ceph cluster monitor daemon
   Loaded: loaded (/usr/lib/systemd/system/ceph-mon@.service; enabled; vendor preset: disabled)
   Active: active (running) since Fri 2024-10-18 13:09:42 CST; 51min ago
 Main PID: 157382 (ceph-mon)
    Tasks: 27
   Memory: 587.1M
   CGroup: /system.slice/system-ceph\x2dmon.slice/ceph-mon@gm268-2.service
           └─157382 /usr/bin/ceph-mon -f --cluster ceph --id gm268-2 --setuser ceph --setgroup ceph
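A running service does not by itself show that the rebuilt monitors have rejoined the cluster, so a quorum check is a reasonable last step. The sketch below is an illustration, not part of the original procedure: quorum_has is a hypothetical helper that reads the JSON printed by ceph quorum_status -f json on standard input and checks its quorum_names list; it assumes the compact single-line JSON form and uses only sed, so it is a rough text match rather than a real JSON parser.

```shell
# Sketch: check that every expected monitor appears in "quorum_names".
# quorum_has is a hypothetical helper; it expects the output of
# `ceph quorum_status -f json` (single-line JSON) on stdin.
quorum_has() (
    # Pull out the contents of the quorum_names array, e.g. "a","b","c".
    names="$(sed -n 's/.*"quorum_names":[[:space:]]*\[\([^]]*\)\].*/\1/p')"
    for want in "$@"; do
        case ",$names," in
            *"\"$want\""*) ;;
            *) echo "not in quorum: $want" >&2; exit 1 ;;
        esac
    done
)

# Usage on any node with a working admin keyring:
#   ceph quorum_status -f json | quorum_has gm268-1 gm268-2 gm268-3 && echo "all mons in quorum"
#   ceph -s
```

If the check passes, ceph -s should respond again, which was the original symptom.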