找回密码
 注册
查看: 635|回复: 2

ceph fs status报错解决 AttributeError: ‘NoneType’ object has no attribute ‘get’

[复制链接]

1

主题

0

回帖

12

积分

管理员

积分
12
QQ
发表于 2022-8-2 13:27:42 | 显示全部楼层 |阅读模式
问题描述:
5 f3 T9 P% |! E% M$ B7 P5 }7 y[root@mon1 ~]# ceph fs status
9 v& q4 K: j: o! ?Error EINVAL: Traceback (most recent call last):
0 A4 N! i8 N( d0 X4 Y5 z. ZFile “/usr/lib64/ceph/mgr/status/module.py”, line 310, in handle_command
0 Q6 X0 Y2 g9 w( {- _# a! b) E4 Ereturn self.handle_fs_status(cmd)& ~( j. x, O. D$ }' J
File “/usr/lib64/ceph/mgr/status/module.py”, line 176, in handle_fs_status7 t0 Q5 R; ^6 ]
mds_versions[metadata.get(‘ceph_version’, “unknown”)].append(info[‘name’]): \3 M* h  N4 w5 G- V, H1 _+ j
AttributeError: ‘NoneType’ object has no attribute ‘get’( z7 h* A+ ]9 K3 y' \! C7 o& O
解决:
7 S. o9 R7 V$ ~5 b# z报错的原因可能176行是获取的 metadata 为空;
& p4 y# e: S& j; W[root@mon1 ~]# ceph mds metadata
* d% t! K8 H% H5 ]- B/ L* F[
2 E7 U- h" ^% q4 T* g% j{
; c8 j! L2 u& X5 @% z6 v“name”: “mon1”,
) {% C6 B6 S* G( j2 B, u“addr”: “12.10.1.24:6800/583896531”,; S3 z4 H: \5 M- @" j
“arch”: “x86_64”,1 k5 ?; Q# ?  r# j4 e9 n3 E7 L* c! b
“ceph_version”: “ceph version 12.2.5 (cad919881333ac92274171586c827e01f554a70a) luminous (stable)”,
8 k# M' N7 q$ ?1 Z" z  L“cpu”: “Intel® Xeon® CPU E5-2670 v2 @ 2.50GHz”,
  z1 U) _% M5 s2 {9 p# `6 ]“distro”: “centos”,
. `1 K/ s$ G% }- V( o& e, A1 ]“distro_description”: “Redhat Linux 7 (Core)”,( D& R6 n; e* w! n* M7 p9 ]! ~
“distro_version”: “7”,$ v+ }2 u; N6 e+ p$ k( a
“hostname”: “mon1”,0 [# p3 O8 K4 g3 ?0 C! a; b2 ?
“kernel_description”: “#1 SMP Thu Nov 30 09:35:20 EST 2017”,
7 e) m* ?: R- F4 U1 B  Z( P3 e“kernel_version”: “4.14.3-1.el7.elrepo.x86_64”,' c+ J6 E$ e" Q
“mem_swap_kb”: “20971516”," W4 I; v" e" o( `7 }; Y: U
“mem_total_kb”: “65923428”,
0 c- W" d/ J* V2 o. W* o# Z“os”: “Linux”
. N' `! t' P" Y},
4 ]* z# Y4 ~( y' S6 Y9 j{
0 C+ _" ~6 k1 c9 q8 K$ l"name": "mon2"
+ j& h* S: i+ o! v' n( g6 c- i6 D; A},
( Y) S# x/ k2 m{
! ~9 G4 f, R/ z3 g“name”: “mon3”,
4 M2 k1 I2 p+ z9 i+ x7 H“addr”: “12.10.1.25:6800/3412998711”,
6 A4 e6 s, z' c: A“arch”: “x86_64”,6 ^1 j8 k/ B0 _& l( F( |2 T" Y
“ceph_version”: “ceph version 12.2.5 (cad919881333ac92274171586c827e01f554a70a) luminous (stable)”,
4 K9 [! \% k% n4 P8 a“cpu”: “Intel® Xeon® CPU E5-2697 v4 @ 2.30GHz”,. `7 r7 w# v2 ]2 [* g, u7 X
“distro”: “centos”,
+ h5 {' z$ v( }  r* O“distro_description”: “Redhat Linux 7 (Core)”,, }6 m7 C6 x# o7 v8 t/ N
“distro_version”: “7”,$ R& y7 n6 a7 H9 \+ R
“hostname”: “mon3”," |' V) h# e" S3 ~. }
“kernel_description”: “#1 SMP Wed May 9 18:05:47 UTC 2018”,* t6 A  @( a5 y6 [
“kernel_version”: “3.10.0-862.2.3.el7.x86_64”,
3 Q% q) ^0 l  F3 z/ O“mem_swap_kb”: “20479996”,5 f* j4 L# o! N' g! J+ Q
“mem_total_kb”: “197975904”,
" T* F5 X9 B$ E, T9 y“os”: “Linux”
: W( S& q' l7 m7 G* b# \. E}
# ^( u% \7 p. @- ~]& m% b  i4 X* J0 g# y* f& g* o2 I1 B

# U. f) }. L' Z  b' y0 ~2 S查询发现有的mds只有name,8 q  G% X4 d& M

9 p6 f( g  D$ W8 h: V这个时候只要把有问题的mds服务重启下即可
( Z9 X+ P& h% b: F0 B! W( m  ysystemctl restart ceph-mds@mon2.service , Q% q' L, t0 T" t
6 R1 g1 N! e5 M7 F' D9 ]+ d
,问题解决。6 l! M/ E/ y( K0 E& s

  b! w9 W5 S) _' o& q1 i- `( ?- C# J) @' [8 ~+ X9 J/ u

1

主题

0

回帖

12

积分

管理员

积分
12
QQ
 楼主| 发表于 2022-8-3 10:12:55 | 显示全部楼层
[root@ceph-2 ~]# ceph mds metadata 7 v; J* g! H! G1 p( m
[1 \2 |, j8 z0 U5 x  }$ t
    {4 H: W1 s  z' ^
        "name": "ceph-3",% Z. [/ p; z( g% X2 i- ^
        "addr": "192.168.120.31:6800/2291079756",
/ M7 l/ l0 S! @        "arch": "x86_64",: w, @8 q- n$ e) B$ s
        "ceph_version": "ceph version 12.2.8-52.el7 (3af3ca15b68572a357593c261f95038d02f46201) luminous (stable)",
$ O+ l8 v: ^1 ?        "cpu": "Intel Xeon E3-12xx v2 (Ivy Bridge, IBRS)",
" a$ h# I4 z+ H- A% L7 O1 d        "distro": "rhel",
5 S5 w2 Z, ~0 a* Z        "distro_description": "Red Hat Enterprise Linux Server 7.9 (Maipo)",
( g6 d: I$ |0 f        "distro_version": "7.9",
& b4 X$ F. A) K! N7 Q; G        "hostname": "ceph-3.novalocal",
. F! h' p# o" f  H) w2 l' q        "kernel_description": "#1 SMP Tue Aug 18 14:50:17 EDT 2020",  o" ~. P8 V) n
        "kernel_version": "3.10.0-1160.el7.x86_64",
' m, z& J! ]( O, l0 F        "mem_swap_kb": "0",
6 y: h: G, b! A/ ?% A        "mem_total_kb": "3880160",
( S# |" _0 J8 v  y        "os": "Linux"
, Q3 T( _& i3 w# s7 Y/ J8 Y    },, S8 t) |5 O# }% y# d# x
    {
3 k% d" `5 S, r, H9 M# _! W$ h        "name": "ceph-2",+ B6 _: h6 x  C
        "addr": "192.168.120.16:6800/463439748",. b( {) x& ^' S1 e* x
        "arch": "x86_64",
7 S( I/ l7 a& l- c        "ceph_version": "ceph version 12.2.8-52.el7 (3af3ca15b68572a357593c261f95038d02f46201) luminous (stable)",$ s2 r) B' m  x/ b* C
        "cpu": "Intel Xeon E3-12xx v2 (Ivy Bridge, IBRS)",$ q6 u9 F" R# N7 S+ @  J$ Z% s  E8 p
        "distro": "rhel",
# ?& b4 i7 F$ b4 T2 K8 f1 s% R        "distro_description": "Red Hat Enterprise Linux Server 7.9 (Maipo)",
  O2 I: F7 e& W        "distro_version": "7.9",
8 k9 A$ d4 h" |# V  i9 F  O        "hostname": "ceph-2.novalocal",
3 C3 f' d1 `& g        "kernel_description": "#1 SMP Tue Aug 18 14:50:17 EDT 2020",; K& ]1 h% X4 }/ h
        "kernel_version": "3.10.0-1160.el7.x86_64",. o, v2 `2 v+ \( T/ i( v
        "mem_swap_kb": "0",8 L! \0 s8 ]+ @: z  H
        "mem_total_kb": "3880160",) K4 [. m0 f" T* a2 C) v5 q
        "os": "Linux"
- b. l: {& g, Y7 r    },
* B' ~" p/ }0 ^& V* f    {
, r4 J! N2 l  z9 Z3 j; [        "name": "ceph-1",
  m6 N% s4 c0 N# ^+ r* W        "addr": "192.168.120.23:6800/4181220800",$ j: I4 b: g! H( x/ T
        "arch": "x86_64",
8 r- N' g; c& M6 |* q) }        "ceph_version": "ceph version 12.2.8-52.el7 (3af3ca15b68572a357593c261f95038d02f46201) luminous (stable)",2 m/ X' @; i& p1 C
        "cpu": "Intel Xeon E3-12xx v2 (Ivy Bridge, IBRS)",  Q* B7 y' P6 E" d" X# c, D7 h
        "distro": "rhel",
! w1 @( }" T+ W2 i  b        "distro_description": "Red Hat Enterprise Linux Server 7.9 (Maipo)",
: u. b1 I9 ]2 H        "distro_version": "7.9",1 T" L0 s5 v/ J' K+ R
        "hostname": "ceph-1.novalocal",4 @8 y8 `7 l- G/ t
        "kernel_description": "#1 SMP Tue Aug 18 14:50:17 EDT 2020",
/ \( P' a+ c9 @8 E8 ]( w0 u% ]7 `        "kernel_version": "3.10.0-1160.el7.x86_64",
: [" L: j/ W( I  l3 z        "mem_swap_kb": "0",
7 t$ C7 E% W' W% G- f        "mem_total_kb": "3880168",( q6 \5 b* n, h" L; S
        "os": "Linux"* p) D" U! F- P0 H8 e
    }# t4 X. K- ?6 @. ?4 y( F9 s* b' g- l
]

1

主题

0

回帖

12

积分

管理员

积分
12
QQ
 楼主| 发表于 2022-8-3 10:14:30 | 显示全部楼层
ceph集群提供元数据服务,则客户端可以提挂载ceph文件系统。客户端访问数据时,则在客户端和元数据服务器中都缓存相应的数据。元数据服务器会和客户端inode占用情况来消减缓存。当客户端响应太慢,则会报错“failing to respond to cache pressure” or MDS_HEALTH_CLIENT_RECALL。若确实是客户端负荷较大,是正常读写操作,可以考虑增大mds_recall_warning_decay_rate参数的值(默认为60s),从而消除警告。( T4 T$ J; `: m. i$ C( z5 P
4 R+ }/ G7 A) |$ R! Q1 d
可以查询ceph客户端的ID号及其使用inode数(num_caps的值)。  }3 f% p! c4 D3 l$ {  M
* S$ E1 p. r* Z& |2 D' [
ceph tell mds.0 session ls
8 Q4 I7 d+ j# A# V6 y/ x. v8 M谨慎使用如下命令踢出目标客户端或全部客户端。
" \& `8 n  C" |. m$ l+ Y2 g* S8 H4 p
ceph tell mds.0 session evict id=11134635: ~  e2 X* z$ z1 j. A6 F
ceph tell mds.0 session evict
3 _( j0 i: I" l/ m6 h$ {踢出客户端是将客户端加入了黑名单,可以使用如下命令查看黑名单信息或移出黑名单。虽然移出黑名单,可能还不能让客户端正常挂载ceph文件系统,因此需要谨慎处理。
, R8 i  f' G' e5 [! p" q2 x- }3 ^2 Z8 C9 s5 g  R% U  F
ceph osd blacklist ls
9 _$ h3 {1 a; o8 @* j9 gceph osd blacklist rm 192.168.20.1:0/1498586492
& `6 {8 U  l5 T) ?3 q7 B5 \ceph osd blacklist clear
# }+ @& T) E/ H) Z) p* F% g6. Reduced data availability: 4 pgs inactive, 4 pgs incomplete" g" T  L- k1 b7 w7 j
当有pgs出现incomplete时,表明pgs对应的OSDs存活数量少于最小副本数。因此,其对应的数据无法读写,处于reduced状态,会导致MDS服务出问题,提示如下报错信息,示例:
- \* V/ k+ Y# K) a3 d9 N$ c, C) Z/ h  H
3 MDSs report slow metadata IOs9 o6 _0 N* a1 G; l" d
2 MDSs report slow requests
+ p% C) G; P& J! O; C2 MDSs behind on trimming- ]5 h6 v" d" \
Reduced data availability: 4 pgs inactive, 4 pgs incomplete
& P8 f7 V' [, j6 P0 I) b$ `+ S( T( e7 o5 d' T
pg 5.6de is incomplete, acting [254,356,222,352,111,247,100,133,351,206] (reducing pool cephfs_data min_size from 8 may help; search ceph.com/docs for 'incomplete')
/ N/ g0 p& K, |0 w  gpg 5.6e9 is incomplete, acting [276,244,357,358,221,321,311,229,314,351] (reducing pool cephfs_data min_size from 8 may help; search ceph.com/docs for 'incomplete')
7 L6 w& y  i* _4 n7 S. S& lpg 5.73b is incomplete, acting [186,279,351,247,293,354,359,220,181,283] (reducing pool cephfs_data min_size from 8 may help; search ceph.com/docs for 'incomplete')' s9 V* q+ e* _/ F2 t
pg 5.eda is incomplete, acting [164,157,120,227,353,351,295,269,95,354] (reducing pool cephfs_data min_size from 8 may help; search ceph.com/docs for 'incomplete')
  K/ N: c+ C+ |此时,需要修复pgs。( r8 i' d- u: ~

% q; d% J+ G# z+ @5 P, Q0 [# 查询pg信息(pg id 为 5.6de)
( x: h6 c  k; Z- cceph pg 5.6de query! _/ B0 z- Z- f$ _3 \9 D7 `& Q
. L$ \) H/ C0 g0 }! h
# 强行重建pg1 U% |- x- j2 f& b/ P. b# Y
ceph osd force-create-pg 5.6de --yes-i-really-mean-it
5 ]: w5 A" k$ L
您需要登录后才可以回帖 登录 | 注册

本版积分规则

返回首页|Archiver|手机版|小黑屋|易陆发现技术论坛 ( 蜀ICP备2026014127号-1 )

GMT+8, 2026-6-12 00:10 , Processed in 0.017839 second(s), 22 queries .

Powered by Discuz! X5.0

© 2001-2026 Discuz! Team.

快速回复 返回顶部 返回列表