- 积分
- 16843
在线时间 小时
最后登录1970-1-1
|
马上注册,结交更多好友,享用更多功能,让你轻松玩转社区。
您需要 登录 才可以下载或查看,没有账号?开始注册
x
当mon节点全部出现问题的时候或者单独一个节点出现问题时恢复过程
0 ^+ H& _: G$ F/ U
6 K' a) D( |+ E4 v1 Q; P L. }$ z2 Z7 f; }% S' _
ceph一直无法正常的执行ceph -s命令;
$ P$ P3 y& T0 J0 h. R# _9 b" @6 z9 t0 |
: a8 P/ F% `# b! @8 h A7 P! C
r) b) Q1 N, ~. X" ], c1 d
ceph分部署存储告警monclient(hunting): handle_auth_bad_method server allowed_methods [2] but i only support [2]- o9 G- q. Y8 C' g0 ^
2 R5 d" D; |( t/ B, Q8 {1 K2024-10-17T22:33:47.295+0800 7f20fe7fc700 -1 monclient(hunting): handle_auth_bad_method server allowed_methods [2] but i only support [2]5 C; G9 G4 d$ b* v+ u J2 a0 B" W6 h7 d
2024-10-17T22:33:47.297+0800 7f20ff7fe700 -1 monclient(hunting): handle_auth_bad_method server allowed_methods [2] but i only support [2]
1 ^# l$ e. k0 Y, g, K2 y5 w* l9 e
+ y; B3 g) I5 r* j+ N
+ s) R! s7 e/ M7 b/ _" A( j- b. N/ M- F9 d3 W
环境中也就只有gm268-3节点因重启失败夯住是好的,gm268-1和gm268-2都已经被损坏。只能想办法从3上入手解决。% j' U/ a9 D& V2 s9 G
3 g& D" G- s: P4 O! _
结果过程:
& o( @, W: I5 q, u
* T& ]' [2 @$ z9 C, e6 n2 }1、在gm268-3节点上导出monmap文件:
- S V6 n5 w" h/ A$ o. h# E2 R# w$ R3 l" w6 E
& y/ A1 g# N; }5 T. B3 ]' \* ?, }
$ monmaptool --create --clobber --fsid ce68aab8-8f46-11ed-88c0-ac1f6b3a30b9 --add gm268-3 10.12.3.2:6789 --add gm268-2 10.12.2.2:6789 --add gm268-1 10.12.1.2:6789 /tmp/monmap1 G+ ^/ Y: R6 t, L
monmaptool: monmap file /tmp/monmap+ Y3 d7 [. }) _* p- c
monmaptool: set fsid to ce68aab8-8f46-11ed-88c0-ac1f6b3a30b9
" O8 p, m: Q# j& \ g8 ?monmaptool: writing epoch 0 to /tmp/monmap (3 monitors)
4 v9 j6 R% X. U5 i2 F
, C7 t! E2 O" B9 o- }2 P* j! Y* [1 } f O3 }
导出monmap,好的节点写在前面,后面把所有的坏节点加上就可以了。- }. r, {* O" K4 L& a0 T3 s9 `& e
+ ?, W6 \$ a. G1 x查看下导出的文件信息:
9 d, G% r' F1 @2 ?4 B; Q+ O. S3 S* j4 V9 [% C+ X" k7 W; O$ Q
$ monmaptool --print /tmp/monmap 5 r1 T, s$ D9 o" O& p8 S' E
monmaptool: monmap file /tmp/monmap
5 `$ A8 M5 X& p3 {# h5 W* ]epoch 0, k3 b2 S* [0 v3 V$ N0 r
fsid ce68aab8-8f46-11ed-88c0-ac1f6b3a30b9
: `, }+ n2 n% a; v# ?last_changed 2024-10-18T13:17:03.645872+0800& ?! u1 E% T( Z, \! P( N
created 2024-10-18T13:17:03.645872+0800
: f M+ i+ F6 ~6 N( {% umin_mon_release 0 (unknown)
' e# g4 d G) P& ?. j2 I0: v1:10.12.1.2:6789/0 mon.gm268-1
! L+ X3 W: m+ O2 F1: v1:10.12.2.2:6789/0 mon.gm268-20 i; ~0 [1 U3 J0 r5 W$ u; t) U: m
2: v1:10.12.3.2:6789/0 mon.gm268-3' q: C$ V# ~8 o7 }- o
' ~2 v! H* l! ]/ d8 |4 g e
) Q3 T* u8 R- p+ p8 |& d- d/ x- D1 ^, y7 ~5 T( Y
2、去gm268-1和gm268-2的节点上找到/var/lib/ceph/mon 目录,备份下。删除掉。因为文件被修改了,导致文件有异常,没有导致认证出问题。原有的/etc/ceph/目录不能删除。, i) f, ` I4 R! j4 w
6 k% Y% W5 K* q4 Z. i8 L/ ?
2 l: _; A5 I, P0 J* H; \1 D5 ?3、将正常节点上keyring和导出的monmap文件传送到其他两个节点上:8 h f! G9 A. c; a! u& G
9 }; f% @+ E7 e6 D6 U! A
scp /var/lib/ceph/mon/ceph-gm268-3/keyring gm268-2:/tmp/( s% X: F) N2 Q3 |5 H9 Q3 Y0 S3 z
scp /var/lib/ceph/mon/ceph-gm268-3/keyring gm268-1:/tmp/9 w7 p3 E2 h* C! B2 z3 a
; S) h. P8 \' X- O" w% nscp /tmp/monmap gm268-1:/tmp/
" B( G5 m2 ]2 ] Y/ c& rscp /tmp/monmap gm268-1:/tmp/
8 d. f4 G1 t" D( u% [* O" `1 Q; K) E# U0 \7 l
" ]( P) N d0 L1 ]& G0 U7 f6 M
4、重做gm268-1和gm268-2 节点mon . d' u3 y. r5 Y: E; V1 q' N
ceph-mon --cluster ceph -i gm268-1 --mkfs --monmap /tmp/monmap --keyring /tmp/keyring -c /etc/ceph/ceph.conf
2 t7 S8 ~% X7 C; X4 Q. f- N: F a6 @+ Q2 f
切换到/var/lib/ceph/mon目录下
. j9 O# K8 o9 l% H/ A2 m执行:% g( m( J7 D# D- o1 B
chown -R ceph:ceph mon/6 Q: r8 Y! b/ r# d. e
) J7 T6 U* l9 I2 y启动mon服务:
9 G" h$ S3 R& j+ lsystemctl start ceph-mon@gm268-1.service
/ M+ ~. d7 w" {; e
' y/ T3 H/ V( ~$ D% y$ O8 g查看服务:1 I& z) L: _% W; r3 `! |
3 `1 E2 ` i7 [9 x7 \- [
$ systemctl status ceph-mon@gm268-1.service " ?( O0 L) ~! B9 Y: Y" O% p
● ceph-mon@gm268-1.service - Ceph cluster monitor daemon4 B$ t9 g- l1 t1 v6 l- K
Loaded: loaded (/usr/lib/systemd/system/ceph-mon@.service; enabled; vendor preset: disabled)$ I/ S4 Z: ~% T1 c$ \+ F" x& X. @6 ^# H
Active: active (running) since Fri 2024-10-18 13:21:24 CST; 38min ago
! L2 q2 l! r' D8 v3 x Main PID: 664542 (ceph-mon)6 T; a9 m9 W5 C+ `. c, `
Tasks: 276 w5 k# _, R2 y. H$ c* R9 P! Y
Memory: 286.0M5 D |+ q7 F c; r6 n
CGroup: /system.slice/system-ceph\x2dmon.slice/ceph-mon@gm268-1.service1 ~+ ^7 s+ p6 c- t
└─664542 /usr/bin/ceph-mon -f --cluster ceph --id gm268-1 --setuser ceph --setgroup ceph" p7 h# A/ e8 j5 s& W$ @0 x
4 {) j0 V8 u8 V& o1 qOct 18 13:21:24 gm268-1 systemd[1]: Started Ceph cluster monitor daemon.% i6 }6 R% @% n, `) G9 ~
Oct 18 13:21:24 gm268-1 ceph-mon[664542]: 2024-10-18T13:21:24.793+0800 7fcc5f804700 -1 mon.gm268-1@0(probing) e11 stashing newest monmap 11 for next startup4 ~0 b2 x4 C5 P* \3 F
Oct 18 13:21:24 gm268-1 ceph-mon[664542]: ignoring --setuser ceph since I am not root1 u* Q8 z5 y a0 d: H4 ]% j
Oct 18 13:21:24 gm268-1 ceph-mon[664542]: ignoring --setgroup ceph since I am not root
) o- P% r: t& \/ h2 t2 r% }! H6 ^
! V+ _+ S* O% i& ~; `3 y; A6 {( R, R, L- l7 ~1 M* W/ t
节点修复完成。$ e: k" d7 x# B |4 J
节点二上3 }, \# W% s! j( K3 r
" l: I+ P+ ^4 A
ceph-mon --cluster ceph -i gm268-2 --mkfs --monmap /tmp/monmap --keyring /tmp/keyring -c /etc/ceph/ceph.conf
" {1 D- d4 q) c8 R4 k6 Q7 X* Z( w! G4 H
切换到/var/lib/ceph/mon目录下
! t+ U) Z. z; K+ A1 ^6 h执行:6 W+ f* @* |% O, K
chown -R ceph:ceph mon/
) [, o9 P' T& a S0 G' e9 p4 C- j
5 l" R3 X, m6 R8 E8 p9 i b$ D2 M5 M. |启动mon服务:8 I$ J4 P" |& V+ P5 n
systemctl start ceph-mon@gm268-2.service
& |+ I+ [& c- `1 x1 d
( R5 y( y, R5 I0 A$ z
) \3 V6 ~! {- j4 M4 S. F4 V
' _& O9 n& X" G$ systemctl status ceph-mon@gm268-2.service
- g: A8 |/ C1 `: q9 |2 a) v● ceph-mon@gm268-2.service - Ceph cluster monitor daemon1 n6 v; p* k. S0 s2 T' Z, \
Loaded: loaded (/usr/lib/systemd/system/ceph-mon@.service; enabled; vendor preset: disabled)$ W% T5 U, ?* W( J! Q
Active: active (running) since Fri 2024-10-18 13:09:42 CST; 51min ago
6 D$ u7 Q' z1 P% @: k" { Main PID: 157382 (ceph-mon)
* f- @) M# C7 K i& N3 L Tasks: 27
8 b# U. |" N/ F% o' {6 _' }- h" g Memory: 587.1M
- F9 j* r/ o% H+ w3 Z CGroup: /system.slice/system-ceph\x2dmon.slice/ceph-mon@gm268-2.service5 E) e9 _7 c& F1 K
└─157382 /usr/bin/ceph-mon -f --cluster ceph --id gm268-2 --setuser ceph --setgroup ceph/ d# Z- E2 a: U; Q! L) H) L
$ ^# D: G. W0 R8 b7 ?4 `" S9 P/ U
1 o# g& m3 C8 D/ c- D
|
|