找回密码
 注册
查看: 6307|回复: 1

ceph删除mon节点

[复制链接]

1

主题

0

回帖

12

积分

管理员

积分
12
QQ
发表于 2018-9-28 12:09:53 | 显示全部楼层 |阅读模式
删除mon节点
6 n, _4 R/ c. P9 h        在当前环境中,拥有3个mon节点,删除其中2个* z) O- y2 g/ [+ s6 {3 O  E, E
[root@ceph-osd-1 ceph-cluster]# ceph-deploy mon destroy ceph-osd-2 ceph-osd-3                                                                                                                                                           
7 P) A1 W6 f: U' S! A[ceph_deploy.conf][DEBUG ] found configuration file at: /root/.cephdeploy.conf8 n5 l, `$ b8 R% p
[ceph_deploy.cli][INFO  ] Invoked (1.5.19): /usr/bin/ceph-deploy mon destroy ceph-osd-2 ceph-osd-3  ?4 u2 x. n% T. r) }2 k+ Z
[ceph_deploy.mon][DEBUG ] Removing mon from ceph-osd-2
, s  v# _% J$ d7 j# U[ceph-osd-2][DEBUG ] connected to host: ceph-osd-2
4 U; g" N9 c7 ?. g& G% M[ceph-osd-2][DEBUG ] detect platform information from remote host+ R3 a/ F  M) D% ^) u
[ceph-osd-2][DEBUG ] detect machine type$ F5 e. n& T7 @* g; h  @" w+ Y
[ceph-osd-2][DEBUG ] get remote short hostname7 ]& }+ E1 v, W# i' U
[ceph-osd-2][INFO  ] Running command: ceph --cluster=ceph -n mon. -k /var/lib/ceph/mon/ceph-ceph-osd-2/keyring mon remove ceph-osd-2
  o$ z0 [3 s- _  k" s6 C1 [3 _7 X[ceph-osd-2][WARNIN] removed mon.ceph-osd-2 at 10.10.200.164:6789/0, there are now 1 monitors  S# z% C- i2 I8 X6 a- _5 E/ d
[ceph-osd-2][INFO  ] polling the daemon to verify it stopped
* ~' _6 O9 H% [, s! L0 o; z8 I8 X[ceph-osd-2][INFO  ] Running command: service ceph status mon.ceph-osd-2
$ [' N# T! p9 n) O; J0 ]7 r! A[ceph-osd-2][INFO  ] Running command: mkdir -p /var/lib/ceph/mon-removed
$ J$ M0 f8 y( K& P7 |6 i. Y$ h[ceph-osd-2][DEBUG ] move old monitor data: ~4 n1 }+ w$ V* d2 C
[ceph_deploy.mon][DEBUG ] Removing mon from ceph-osd-3% N; }8 b7 ]6 S2 u
[ceph-osd-3][DEBUG ] connected to host: ceph-osd-3 ' t& `3 m- y' |" G6 }/ n. ^0 @
[ceph-osd-3][DEBUG ] detect platform information from remote host0 ], B, c( h6 k* ?, t, s, a
[ceph-osd-3][DEBUG ] detect machine type: w5 v) b6 l8 A  \
[ceph-osd-3][DEBUG ] get remote short hostname
1 q- b5 U- ]+ M& j1 L$ h( a
" M+ W  _7 k' w6 T3 n     查看删除2个mon节点后,ceph cluster的状态5 q( y5 [6 R7 Z$ f+ Y/ U) i- S: O; `
[root@ceph-osd-1 ceph-cluster]# ceph -s& l# b% v1 v; n2 Y' |4 c( K, F% U
    cluster 9d717e10-a708-482d-b91c-4bd21f4ae36c
; t+ x1 e' w9 c' I' p( C* D     health HEALTH_OK5 e+ {  T4 Y6 K3 K/ @
     monmap e5: 1 mons at {ceph-osd-1=10.10.200.163:6789/0}, election epoch 69, quorum 0 ceph-osd-1
- T1 @  J# ]6 u* u3 ^4 v     osdmap e220: 7 osds: 7 up, 7 in# ^2 i9 O6 I! F# V! v" H5 S
      pgmap v473: 256 pgs, 1 pools, 0 bytes data, 0 objects
7 f! i5 `5 |2 W- _            36109 MB used, 11870 GB / 11905 GB avail6 F% }* `. x) G
                 256 active+clean4 l2 {$ e) D0 g% B
6 j- N- R2 A6 b3 r6 e

1

主题

0

回帖

12

积分

管理员

积分
12
QQ
 楼主| 发表于 2018-9-28 12:35:27 | 显示全部楼层
添加mon节点$ }& M( q1 W- [/ Y$ y/ ]& p
      ceph monitor的个数是2n+1(n>=0)个,在线上至少3个,只要正常的节点数>=n+1,ceph的paxos算法能保证系统的正常运行。所以,对于3个节点,同时只能挂掉一个。
+ K. C1 t: u, D      当前ceph cluster中仅含有1个mon节点,将其扩展至3个mon节点。
$ D, Y( K; J( p& ]6 O+ p        查看当前ceph cluster状态0 e9 t% `, E  c/ _
    [root@ceph-osd-1 ceph-cluster]# ceph -s  @) m% P, c% u3 N' ?
    cluster 9d717e10-a708-482d-b91c-4bd21f4ae36c
9 v. K' G1 e8 v! J9 ]     health HEALTH_OK. C9 Z3 S' z$ C$ f0 s+ f
     monmap e5: 1 mons at {ceph-osd-1=10.10.200.163:6789/0}, election epoch 69, quorum 0 ceph-osd-1
2 l7 m+ |& t% }     osdmap e220: 7 osds: 7 up, 7 in
8 Z: {. x3 X. b! N      pgmap v473: 256 pgs, 1 pools, 0 bytes data, 0 objects
2 B$ b/ }3 L6 G8 Z) O            36109 MB used, 11870 GB / 11905 GB avail
2 v6 L& h' N+ v& t" M: t8 u                 256 active+clean
/ L2 ]  H! s, }+ \1 m " y5 }! ?  r+ k# |0 f; \1 V
         此时要向ceph cluster中添加两个节点分别为ceph-osd-2,ceph-osd-3
9 k5 \( l% S/ {; J$ m( P         首先修改配置文件如下,添加public_network0 B( z) E0 k% s
[global]/ O# G4 Z5 q( R0 W: D+ P6 x" Y
auth_service_required = cephx9 q1 p- I/ W$ _9 L
filestore_xattr_use_omap = true8 c  [: T7 h9 U. R8 F9 `5 {
auth_client_required = cephx1 \2 `5 g1 T: D( i0 P
auth_cluster_required = cephx
' t1 D. D; G" X. a, Q" Q) h$ g- m4 Nmon_host = 10.10.200.163, 10.10.200.164# p) F/ y, G- c: b. s( }9 Z0 Q. |; q
mon_initial_members = ceph-osd-1, ceph-osd-2* f, K  ?' P4 w0 a  ~1 A
fsid = 9d717e10-a708-482d-b91c-4bd21f4ae36c* v! ]7 z4 ^1 ?" O! u  Y
public_network = 10.10.200.0/24& U" w1 u/ n5 B; y5 o2 W9 ?
0 C8 J9 W# v2 Y' V! b/ q: e
       添加mon节点
) r6 i' h  \9 x: O7 _7 ~" I. c[root@ceph-osd-1 ceph-cluster]# ceph-deploy mon create ceph-osd-2 ceph-osd-3</span>2 ]+ x$ m; w0 o% ]* J* h. M4 U, M
       2 e) Z9 y$ K7 V4 ?
       查看添加mon节点后,查看mon quorum状态信息
( w, [* e2 J5 M6 @[root@ceph-osd-1 ceph-cluster]# ceph quorum_status --format json-pretty  A! ^0 R6 [; p5 ~. e- G( ]

5 F9 D! ?) X( }& Q/ B# V6 D{ "election_epoch": 72,4 r3 [( u! Y1 V0 a  Y4 T
  "quorum": [  H( J0 a) p( u% z
        0,
; s7 c6 K  F& i# c* I$ D- g) I: j        1,
, w9 \9 @0 K  I9 M1 t( i- ^        2],
7 y! a) ~1 T9 k$ @6 z9 \6 B  "quorum_names": [
/ F5 |% Q% n! A- j" I+ j        "ceph-osd-1",2 }+ B  N2 s4 F$ l
        "ceph-osd-2",2 K- L& _9 {5 ^% b
        "ceph-osd-3"],
" f; Y3 V# K: d& J0 m# H% d  "quorum_leader_name": "ceph-osd-1",
; G; n4 Y' S$ h. `  "monmap": { "epoch": 7,
( K0 [- G: n* g: \$ e      "fsid": "9d717e10-a708-482d-b91c-4bd21f4ae36c",
2 X1 b6 c4 M  s5 N+ }5 C      "modified": "2014-11-14 09:10:28.111133",! F) ^, m$ b+ \; b+ q: N
      "created": "0.000000",
) s' b; R& ^% ~. K      "mons": [- B- F' T4 y8 i9 N
            { "rank": 0,8 Z5 G) ~5 j) h. I/ u5 z& a
              "name": "ceph-osd-1",: j: t/ W9 J. S7 J# Y. H
              "addr": "10.10.200.163:6789\/0"},2 i+ z0 f4 z7 ]9 R/ k
            { "rank": 1,' f0 d+ f3 @& x  V; Q
              "name": "ceph-osd-2",4 D& L/ e. V# v6 F
              "addr": "10.10.200.164:6789\/0"},3 o. M5 d' Q# w# h  @! C
            { "rank": 2,/ u) c+ C' C* I, J
              "name": "ceph-osd-3",0 ^) q# n/ C% Q& ^+ g' r0 t
              "addr": "10.10.200.165:6789\/0"}]}}
  ^" M3 U6 u/ \% M# s7 B# Y9 R( ^, B$ o- `
        查看此时ceph cluster状态7 i& Q% S" i1 N
[root@ceph-osd-1 ceph-cluster]# ceph -s
+ b; t0 ]1 J5 B5 c7 l3 {% P    cluster 9d717e10-a708-482d-b91c-4bd21f4ae36c& y; }9 d1 m1 p9 _) v
     health HEALTH_WARN clock skew detected on mon.ceph-osd-3
3 D6 H0 z5 C5 ~" b2 `  e     monmap e7: 3 mons at {ceph-osd-1=10.10.200.163:6789/0,ceph-osd-2=10.10.200.164:6789/0,ceph-osd-3=10.10.200.165:6789/0}, election epoch 72, quorum 0,1,2 ceph-osd-1,ceph-osd-2,ceph-osd-3# e' c4 ]  E) N2 {
     osdmap e220: 7 osds: 7 up, 7 in
0 z" h# q( a/ {4 D' A      pgmap v475: 256 pgs, 1 pools, 0 bytes data, 0 objects
5 R, D% K* X9 B, ^6 {& `) H            36109 MB used, 11870 GB / 11905 GB avail9 `8 D# d/ X$ d! ?2 Y2 T
                 256 active+clean2 Y+ ~$ X8 i* Q# f
+ H6 w) L  @( ^! M! n
      可以发现mon.ceph-osd-3节点的时间与mon.ceph-osd-1的时间不同步,同步各mon节点的时间。
2 G& k) L9 l. `: U0 c      此时ceph mon节点已经添加完毕,模拟ceph-osd-1 mon节点故障,查看ceph cluster能否正常工作,查看此时ceph cluster信息
/ V2 y1 ?& F) R( v$ L* y[root@ceph-osd-2 ~]# ceph -s
5 g: e8 E/ g% {2 Y2014-11-14 09:27:28.582467 7f9cd8712700  0 -- :/1014338 >> 10.10.200.163:6789/0 pipe(0x7f9cd4024230 sd=3 :0 s=1 pgs=0 cs=0 l=1 c=0x7f9cd40244c0).fault  x& e0 @7 i: l' C
    cluster 9d717e10-a708-482d-b91c-4bd21f4ae36c
/ E6 G" `; M" v0 X$ j) F     health HEALTH_WARN 256 pgs degraded; 256 pgs stuck unclean; 256 pgs undersized; 1/7 in osds are down; 1 mons down, quorum 1,2 ceph-osd-2,ceph-osd-3" m. W8 f0 }. N: p: y: ]' I, _
     monmap e7: 3 mons at {ceph-osd-1=10.10.200.163:6789/0,ceph-osd-2=10.10.200.164:6789/0,ceph-osd-3=10.10.200.165:6789/0}, election epoch 88, quorum 1,2 ceph-osd-2,ceph-osd-32 M. S/ m- L. w+ j3 A/ w! q
     osdmap e263: 7 osds: 6 up, 7 in
% q" |* A" r& C/ z7 z% ^      pgmap v542: 256 pgs, 1 pools, 0 bytes data, 0 objects/ i8 @$ x' U2 k/ `: w- B# ]8 \
            36112 MB used, 11870 GB / 11905 GB avail
+ X3 u) ~! o8 w0 I7 g/ x                 256 active+undersized+degraded  S6 G1 W  Q6 Z, L' \

/ b6 p9 i5 Q( }$ [! L2 Y. ]4 p& p9 y* U      因为ceph-osd-1节点上面拥有1个mon节点以及1个osd节点,所以在osd cluster中,有个osd也处于down状态。
% E/ k: p: o9 ~( v8 K      本文的开头部分讲过,ceph mon规定在3个节点的状态下,只允许1个mon节点down,那么2个mon节点down会怎么样,继续down掉ceph-osd-2节点
- T; {+ ]$ d, Q. z' ]4 O      通过ceph -s查看此时ceph cluster状态信息5 c3 J; u; S* z4 K  @& c# ^
[root@ceph-osd-3 ~]# ceph -s
0 u( d5 H& `! o9 V* e# Y; o2014-11-14 09:30:23.483264 7f677c28b700  0 -- :/1014680 >> 10.10.200.163:6789/0 pipe(0x7f6778023290 sd=3 :0 s=1 pgs=0 cs=0 l=1 c=0x7f6778023520).fault
3 l% m& ^- C7 J4 [+ o5 M2014-11-14 09:30:26.483313 7f677c18a700  0 -- :/1014680 >> 10.10.200.164:6789/0 pipe(0x7f676c000c00 sd=4 :0 s=1 pgs=0 cs=0 l=1 c=0x7f676c000e90).fault$ W7 a5 j6 t4 K
2014-11-14 09:30:29.483664 7f677c28b700  0 -- :/1014680 >> 10.10.200.163:6789/0 pipe(0x7f676c0030e0 sd=4 :0 s=1 pgs=0 cs=0 l=1 c=0x7f676c003370).fault
3 [4 S* X1 X+ p( s. U' f2014-11-14 09:30:32.483904 7f677c18a700  0 -- :/1014680 >> 10.10.200.164:6789/0 pipe(0x7f676c003a00 sd=4 :0 s=1 pgs=0 cs=0 l=1 c=0x7f676c003c90).fault0 {7 V$ h7 }) N$ Z5 Q
2014-11-14 09:30:35.484221 7f677c28b700  0 -- :/1014680 >> 10.10.200.163:6789/0 pipe(0x7f676c0031b0 sd=3 :0 s=1 pgs=0 cs=0 l=1 c=0x7f676c002570).fault
& k( J6 f% I4 X! ^) C5 O1 ]2014-11-14 09:30:38.484476 7f677c18a700  0 -- :/1014680 >> 10.10.200.164:6789/0 pipe(0x7f676c002a60 sd=3 :0 s=1 pgs=0 cs=0 l=1 c=0x7f676c002cf0).fault
1 w) d7 p. V/ A$ r: c5 ~' s3 S7 l. T6 Y; j6 u( a
     通过以上信息,ceph cluster已经无法正常工作。所以在3节点的mon cluster中,仅允许1个mon节点down掉。
6 ~+ k/ A7 G  r+ h0 D, E6 C, y- \* F9 b
您需要登录后才可以回帖 登录 | 注册

本版积分规则

返回首页|Archiver|手机版|小黑屋|易陆发现技术论坛 ( 蜀ICP备2026014127号-1 )

GMT+8, 2026-6-12 01:01 , Processed in 0.016015 second(s), 22 queries .

Powered by Discuz! X5.0

© 2001-2026 Discuz! Team.

快速回复 返回顶部 返回列表