Blame gfs2/man/glocktop.8

Packit 6ef888
.TH glocktop 8
Packit 6ef888
Packit 6ef888
.SH NAME
Packit 6ef888
glocktop - Display or print active GFS2 locks.
Packit 6ef888
Packit 6ef888
.SH SYNOPSIS
Packit 6ef888
.B glocktop
Packit 6ef888
[\fIOPTIONS\fR]
Packit 6ef888
Packit 6ef888
.SH DESCRIPTION
Packit 6ef888
The glocktop tool is used to display active GFS2 inter-node locks,
Packit 6ef888
also known as glocks. Simply put, it's a tool to filter and interpret the
Packit 6ef888
contents of the glocks debugfs file. The glocks debugfs file shows
Packit 6ef888
all glocks known to GFS2, their holders, and technical data such as flags.
Packit 6ef888
The glocktop tool will only show the glocks that are important: glocks that
Packit 6ef888
are being held or for which there are waiters. It also interprets the debugfs
Packit 6ef888
file of DLM (Distributed Lock Manager).
Packit 6ef888
Packit 6ef888
.SH OPTIONS
Packit 6ef888
.TP
Packit 6ef888
\fB-d\fP \fI<delay>\fP
Packit 6ef888
Specify a time delay (in seconds) between reports. (Default is 30 seconds)
Packit 6ef888
.TP
Packit 6ef888
\fB-h\fP
Packit 6ef888
Print help information.
Packit 6ef888
.TP
Packit 6ef888
\fB-i\fP
Packit 6ef888
Interactive mode. In this mode, glocktop acts more like the top command.
Packit 6ef888
It shows the pertinent glocks on the terminal session (as many as it can
Packit 6ef888
fit). The advantage is that it uses different colors to draw attention to
Packit 6ef888
what's important. The disadvantage is that it's limited by the size of
Packit 6ef888
your display, so you may not see all the glocks.
Packit 6ef888
.TP
Packit 6ef888
\fB-D\fP
Packit 6ef888
Omit DLM status. This may be used to reduce the amount of output for
Packit 6ef888
interactive mode.
Packit 6ef888
.TP
Packit 6ef888
\fB-n\fP \fI<iterations>\fP
Packit 6ef888
End the program after the specified number of iterations (reports). The
Packit 6ef888
default is to keep running until interrupted.
Packit 6ef888
.TP
Packit 6ef888
\fB-r\fP
Packit 6ef888
Show resource group reservation information. Normally, glocktop omits
Packit 6ef888
resource group reservation information to condense the output. This
Packit 6ef888
information is only important when debugging information related to the
Packit 6ef888
GFS2 block allocator and file system fragmentation.
Packit 6ef888
.TP
Packit 6ef888
\fB-s\fP \fI<freq>\fR
Packit 6ef888
Print glock summary information every \fI<freq>\fR reports.
Packit 6ef888
The glock summary information is bulky and often not needed, so it's
Packit 6ef888
only printed once every 10 reports. You can eliminate it entirely from
Packit 6ef888
the output by specifying a value of 0. If you want the statistics to
Packit 6ef888
print after every report, specify freq as 1.
Packit 6ef888
.TP
Packit 6ef888
\fB-t\fP
Packit 6ef888
Trace directory path. A lot of GFS2 glock performance problems are caused
Packit 6ef888
by an application's contention for one or two directories. These show up
Packit 6ef888
as regular inodes in the output, but there's no good way to tell from the
Packit 6ef888
output which directory is contended. Ordinarily, glocktop won't try to
Packit 6ef888
look up the full pathname of a contended directory because it's slow,
Packit 6ef888
especially if there are millions of glocks. This option instructs glocktop
Packit 6ef888
to try to determine the full directory path names when it can, so you can
Packit 6ef888
tell the full path (within the mount point) of contended directories.
Packit 6ef888
.TP
Packit 6ef888
\fB-H\fP
Packit 6ef888
Don't show Held glocks, unless there are also waiters for the lock.
Packit 6ef888
Ordinarily, glocktop will show glocks that are held (but not iopen
Packit 6ef888
glocks which are almost always held by the thousands) as well as glocks
Packit 6ef888
for which there are waiters. If it only showed glocks with waiters, you
Packit 6ef888
could see, for example, that a glock is being blocked on one node,
Packit 6ef888
but you couldn't see the information for a different node currently
Packit 6ef888
holding the lock and thus, blocking the waiter. This option forces glocktop to
Packit 6ef888
stop printing information for glocks with no waiters (on that node).
Packit 6ef888
The advantage is that the output is smaller and easier to look at.
Packit 6ef888
The disadvantage is that you can't see information from the node that's
Packit 6ef888
blocking the waiter, unless both waiter and holder are on the same node.
Packit 6ef888
.SH OUTPUT LINES
Packit 6ef888
.TP
Packit 6ef888
\fB@ name\fP
Packit 6ef888
This is the GFS2 file system name for which the information is printed. It
Packit 6ef888
also gives the time stamp of the report, and the cluster node name.
Packit 6ef888
.TP
Packit 6ef888
\fBG:\fP
Packit 6ef888
This line represents a glock (internode GFS2 lock).
Packit 6ef888
 G:  s:UN n:2/609b4 f:lIqob t:EX d:EX/0 a:0 v:0 r:3 m:200 (inode)
Packit 6ef888
.TP
Packit 6ef888
\fBD:\fP
Packit 6ef888
This line gives you glocktop's interpretation of the glock's state as
Packit 6ef888
far as DLM (distributed lock manager) is concerned.
Packit 6ef888
  D: Granted PR on node 2 to pid 17511 [python]
Packit 6ef888
.TP
Packit 6ef888
\fBH:\fP
Packit 6ef888
This line represents a glock holder: a process that's either holding the
Packit 6ef888
glock, or is waiting to hold it. The value after S: represents how this
Packit 6ef888
holder needs the lock: EX (Exclusive), SH (Shared), PR (Protected Read),
Packit 6ef888
or UN (Unlocked). The value after F: indicates the holder flags: a W
Packit 6ef888
indicates the holder is Waiting for the lock to be granted. An H indicates
Packit 6ef888
the holder is currently holding the lock.
Packit 6ef888
  H: s:EX f:W e:0 p:17511 [python] gfs2_unlink+0x7e/0x250 [gfs2]
Packit 6ef888
.TP
Packit 6ef888
\fBU:\fP
Packit 6ef888
These lines represent glocktop's user interpretation of the data, both glock
Packit 6ef888
and holder. Lines that begin with (N/A:...) can probably be ignored because
Packit 6ef888
they ought to be unimportant: system files such as journals, etc.
Packit 6ef888
  U:  W inode      183f5     Is:Shared, Want:Exclusive   [Demote pending, Reply pending, Queued, Blocking]
Packit 6ef888
  U:  W  --->  waiting pid 17511 [python]  (Granted PR on node 2 to pid 17511 [python])
Packit 6ef888
.TP
Packit 6ef888
\fBC:\fP
Packit 6ef888
These lines give you the call trace (call stack) of the process that's
Packit 6ef888
either holding or waiting to hold the glock.
Packit 6ef888
.TP
Packit 6ef888
\fBS\fP
Packit 6ef888
These lines give you the summary of all glocks for this file system: How many of
Packit 6ef888
each category are unlocked, locked, how many are held in EX, SH, and DF, and how
Packit 6ef888
many are waiting. G Waiting is how many glocks have waiters. P Waiting is
Packit 6ef888
how many processes are waiting. Thus, you could have one glock that's got
Packit 6ef888
ten processes waiting, or ten glocks that have ten processes waiting.
Packit 6ef888
.SH EXAMPLE OUTPUT
Packit 6ef888
.nf
Packit 6ef888
.RS
Packit 6ef888
# glocktop
Packit 6ef888
.PP
Packit 6ef888
@ nate_bob1       Wed Jan 27 07:24:14 2016  @host-050
Packit 6ef888
 G:  s:EX n:9/1 f:Iqb t:EX d:EX/0 a:0 v:0 r:2 m:200 (journal)
Packit 6ef888
  D: Granted EX on node 2 to pid 17468 [ended]
Packit 6ef888
  H: s:EX f:eH e:0 p:17468 [(ended)] gfs2_glock_nq_num+0x5b/0xa0 [gfs2]
Packit 6ef888
  U: (N/A:Journl)  H journal    1         Held:Exclusive   [Queued, Blocking]
Packit 6ef888
  U: (N/A:Journl)  H  --->  held by pid 17468 [(ended)]  (Granted EX on node 2 to pid 17468 [ended])
Packit 6ef888
 G:  s:SH n:1/1 f:Iqb t:SH d:EX/0 a:0 v:0 r:2 m:200 (non-disk)
Packit 6ef888
  D: Granted PR on node 2 to pid 17468 [ended]
Packit 6ef888
  H: s:SH f:eEH e:0 p:17468 [(ended)] gfs2_glock_nq_num+0x5b/0xa0 [gfs2]
Packit 6ef888
  U: (N/A:Not EX)  H non-disk   1         Held:Shared   [Queued, Blocking]
Packit 6ef888
  U: (N/A:Not EX)  H  --->  held by pid 17468 [(ended)]  (Granted PR on node 2 to pid 17468 [ended])
Packit 6ef888
 G:  s:EX n:2/181ec f:yIqob t:EX d:EX/0 a:1 v:0 r:3 m:200 (inode)
Packit 6ef888
  D: Granted EX on this node to pid 17468 [ended]
Packit 6ef888
  H: s:EX f:H e:0 p:17468 [(ended)] init_per_node+0x17d/0x280 [gfs2]
Packit 6ef888
  I: n:12/98796 t:8 f:0x00 d:0x00000201 s:24
Packit 6ef888
  U: (N/A:System)  H inode      181ec     Held:Exclusive   [Dirty, Queued, Blocking]
Packit 6ef888
  U: (N/A:System)  H  --->  held by pid 17468 [(ended)]  (Granted EX on this node to pid 17468 [ended])
Packit 6ef888
 G:  s:EX n:2/181ed f:Iqob t:EX d:EX/0 a:0 v:0 r:3 m:200 (inode)
Packit 6ef888
  D: Granted EX on this node to pid 17468 [ended]
Packit 6ef888
  H: s:EX f:H e:0 p:17468 [(ended)] init_per_node+0x1b0/0x280 [gfs2]
Packit 6ef888
  I: n:13/98797 t:8 f:0x00 d:0x00000200 s:1048576
Packit 6ef888
  U: (N/A:System)  H inode      181ed     Held:Exclusive   [Queued, Blocking]
Packit 6ef888
  U: (N/A:System)  H  --->  held by pid 17468 [(ended)]  (Granted EX on this node to pid 17468 [ended])
Packit 6ef888
 G:  s:SH n:2/183f5 f:ldrIqob t:EX d:UN/0 a:0 v:0 r:5 m:10 (inode)
Packit 6ef888
  D: Granted PR on node 2 to pid 17511 [python]
Packit 6ef888
  H: s:EX f:W e:0 p:17511 [python] gfs2_unlink+0x7e/0x250 [gfs2]
Packit 6ef888
  I: n:1/99317 t:4 f:0x00 d:0x00000003 s:2048
Packit 6ef888
  U:  W inode      183f5     Is:Shared, Want:Exclusive   [Demote pending, Reply pending, Queued, Blocking]
Packit 6ef888
  U:  W  --->  waiting pid 17511 [python]  (Granted PR on node 2 to pid 17511 [python])
Packit 6ef888
  C:              gfs2_unlink+0xdc/0x250 [gfs2]
Packit 6ef888
  C:              vfs_unlink+0xa0/0xf0
Packit 6ef888
  C:              do_unlinkat+0x163/0x260
Packit 6ef888
  C:              sys_unlink+0x16/0x20
Packit 6ef888
 G:  s:SH n:2/805b f:Iqob t:SH d:EX/0 a:0 v:0 r:3 m:200 (inode)
Packit 6ef888
  D: Granted PR on node 2 to pid 17468 [ended]
Packit 6ef888
  H: s:SH f:eEcH e:0 p:17468 [(ended)] init_journal+0x185/0x500 [gfs2]
Packit 6ef888
  I: n:5/32859 t:8 f:0x01 d:0x00000200 s:134217728
Packit 6ef888
  U: (N/A:Not EX)  H inode      805b      Held:Shared   [Queued, Blocking]
Packit 6ef888
  U: (N/A:Not EX)  H  --->  held by pid 17468 [(ended)]  (Granted PR on node 2 to pid 17468 [ended])
Packit 6ef888
S    glocks  nondisk    inode    rgrp   iopen   flock quota jrnl    Total
Packit 6ef888
S  --------- ------- -------- ------- ------- ------- ----- ---- --------
Packit 6ef888
S  Unlocked:       1        5       4       0       0     0    0       10
Packit 6ef888
S    Locked:       2      245       6      58       0     0    1      313
Packit 6ef888
S     Total:       3      250      10      58       0     0    1      323
Packit 6ef888
S
Packit 6ef888
S   Held EX:       0        2       0       0       0     0    1        3
Packit 6ef888
S   Held SH:       1        1       0      57       0     0    0       59
Packit 6ef888
S   Held DF:       0        0       0       0       0     0    0        0
Packit 6ef888
S G Waiting:       0        1       0       0       0     0    0        1
Packit 6ef888
S P Waiting:       0        1       0       0       0     0    0        1
Packit 6ef888
S  DLM wait:       0
Packit 6ef888
Packit 6ef888
@ nate_bob0       Wed Jan 27 07:24:14 2016  @host-050
Packit 6ef888
 G:  s:EX n:2/180e9 f:yIqob t:EX d:EX/0 a:1 v:0 r:3 m:200 (inode)
Packit 6ef888
  D: Granted EX on this node to pid 17465 [ended]
Packit 6ef888
  H: s:EX f:H e:0 p:17465 [(ended)] init_per_node+0x17d/0x280 [gfs2]
Packit 6ef888
  I: n:9/98537 t:8 f:0x00 d:0x00000201 s:24
Packit 6ef888
  U: (N/A:System)  H inode      180e9     Held:Exclusive   [Dirty, Queued, Blocking]
Packit 6ef888
  U: (N/A:System)  H  --->  held by pid 17465 [(ended)]  (Granted EX on this node to pid 17465 [ended])
Packit 6ef888
 G:  s:UN n:2/609b4 f:lIqob t:EX d:EX/0 a:0 v:0 r:3 m:200 (inode)
Packit 6ef888
  D: Granted EX on node 2 to pid 14367 [ended]
Packit 6ef888
  H: s:EX f:W e:0 p:16297 [delete_workqueu] gfs2_delete_inode+0x9d/0x450 [gfs2]
Packit 6ef888
  U:  W inode      609b4     Is:Unlocked, Want:Exclusive   [Queued, Blocking]
Packit 6ef888
  U:  W  --->  waiting pid 16297 [delete_workqueu]  (Granted EX on node 2 to pid 14367 [ended])
Packit 6ef888
  C:              gfs2_delete_inode+0xa5/0x450 [gfs2]
Packit 6ef888
  C:              generic_delete_inode+0xde/0x1d0
Packit 6ef888
  C:              generic_drop_inode+0x65/0x80
Packit 6ef888
  C:              gfs2_drop_inode+0x37/0x40 [gfs2]
Packit 6ef888
 G:  s:SH n:2/19 f:Iqob t:SH d:EX/0 a:0 v:0 r:3 m:200 (inode)
Packit 6ef888
  D: Granted PR on this node to pid 17465 [ended]
Packit 6ef888
  H: s:SH f:eEcH e:0 p:17465 [(ended)] init_journal+0x185/0x500 [gfs2]
Packit 6ef888
  I: n:4/25 t:8 f:0x01 d:0x00000200 s:134217728
Packit 6ef888
  U: (N/A:Not EX)  H inode      19        Held:Shared   [Queued, Blocking]
Packit 6ef888
  U: (N/A:Not EX)  H  --->  held by pid 17465 [(ended)]  (Granted PR on this node to pid 17465 [ended])
Packit 6ef888
 G:  s:EX n:2/180ea f:Iqob t:EX d:EX/0 a:0 v:0 r:3 m:200 (inode)
Packit 6ef888
  D: Granted EX on this node to pid 17465 [ended]
Packit 6ef888
  H: s:EX f:H e:0 p:17465 [(ended)] init_per_node+0x1b0/0x280 [gfs2]
Packit 6ef888
  I: n:10/98538 t:8 f:0x00 d:0x00000200 s:1048576
Packit 6ef888
  U: (N/A:System)  H inode      180ea     Held:Exclusive   [Queued, Blocking]
Packit 6ef888
  U: (N/A:System)  H  --->  held by pid 17465 [(ended)]  (Granted EX on this node to pid 17465 [ended])
Packit 6ef888
 G:  s:EX n:9/0 f:Iqb t:EX d:EX/0 a:0 v:0 r:2 m:200 (journal)
Packit 6ef888
  D: Granted EX on this node to pid 17465 [ended]
Packit 6ef888
  H: s:EX f:eH e:0 p:17465 [(ended)] gfs2_glock_nq_num+0x5b/0xa0 [gfs2]
Packit 6ef888
  U: (N/A:Journl)  H journal    0         Held:Exclusive   [Queued, Blocking]
Packit 6ef888
  U: (N/A:Journl)  H  --->  held by pid 17465 [(ended)]  (Granted EX on this node to pid 17465 [ended])
Packit 6ef888
 G:  s:UN n:2/4fe12 f:ldIqob t:EX d:UN/0 a:0 v:0 r:4 m:10 (inode)
Packit 6ef888
  H: s:EX f:W e:0 p:17523 [python] gfs2_rename+0x344/0x8b0 [gfs2]
Packit 6ef888
  H: s:SH f:AW e:0 p:17527 [python] gfs2_permission+0x176/0x210 [gfs2]
Packit 6ef888
  U:  W inode      4fe12     Is:Unlocked, Want:Exclusive   [Demote pending, Queued, Blocking]
Packit 6ef888
  U:  W  --->  waiting pid 17523 [python]
Packit 6ef888
  C:              gfs2_permission+0x17f/0x210 [gfs2]
Packit 6ef888
  C:              __link_path_walk+0xb3/0x1000
Packit 6ef888
  C:              path_walk+0x6a/0xe0
Packit 6ef888
  C:              filename_lookup+0x6b/0xc0
Packit 6ef888
  U:  W  --->  waiting pid 17527 [python]
Packit 6ef888
  C:              do_unlinkat+0x107/0x260
Packit 6ef888
  C:              sys_unlink+0x16/0x20
Packit 6ef888
  C:              system_call_fastpath+0x16/0x1b
Packit 6ef888
  C:              0xffffffffffffffff
Packit 6ef888
 G:  s:SH n:1/1 f:Iqb t:SH d:EX/0 a:0 v:0 r:2 m:200 (non-disk)
Packit 6ef888
  D: Granted PR on node 2 to pid 14285 [ended]
Packit 6ef888
  D: Granted PR on this node to pid 17465 [ended]
Packit 6ef888
  H: s:SH f:eEH e:0 p:17465 [(ended)] gfs2_glock_nq_num+0x5b/0xa0 [gfs2]
Packit 6ef888
  U: (N/A:Not EX)  H non-disk   1         Held:Shared   [Queued, Blocking]
Packit 6ef888
  U: (N/A:Not EX)  H  --->  held by pid 17465 [(ended)]  (Granted PR on node 2 to pid 14285 [ended]) (Granted PR on this node to pid 17465 [ended])
Packit 6ef888
S    glocks  nondisk    inode    rgrp   iopen   flock quota jrnl    Total
Packit 6ef888
S  --------- ------- -------- ------- ------- ------- ----- ---- --------
Packit 6ef888
S  Unlocked:       1        8       7       0       0     0    0       16
Packit 6ef888
S    Locked:       2      208       3      41       0     0    1      256
Packit 6ef888
S     Total:       3      216      10      41       0     0    1      272
Packit 6ef888
S
Packit 6ef888
S   Held EX:       0        2       0       0       0     0    1        3
Packit 6ef888
S   Held SH:       1        1       0      41       0     0    0       43
Packit 6ef888
S   Held DF:       0        0       0       0       0     0    0        0
Packit 6ef888
S G Waiting:       0        2       0       0       0     0    0        2
Packit 6ef888
S P Waiting:       0        3       0       0       0     0    0        3
Packit 6ef888
S  DLM wait:       0
Packit 6ef888
.RE
Packit 6ef888
.fi
Packit 6ef888
.PP
Packit 6ef888
From this example output, we can see there are two GFS2 file systems
Packit 6ef888
mounted on system host-050: nate_bob1 and nate_bob0. In nate_bob1, we can
Packit 6ef888
see six glocks, but we can ignore all of them marked (N/A:...) because they
Packit 6ef888
are system files or held in SHared mode, and therefore other nodes should
Packit 6ef888
be able to hold the lock in SHared as well.
Packit 6ef888
.PP
Packit 6ef888
There is one glock, for inode 183f5, which is has a process waiting to
Packit 6ef888
hold it. The lock is currently in SHared mode (s:SH on the G: line) but
Packit 6ef888
process 17511 (python) wants to hold the lock in EXclusive mode (S:EX
Packit 6ef888
on the H: line). That process has a call stack that indicates it is trying
Packit 6ef888
to hold the glock from gfs2_unlink. The DLM says the lock is currently
Packit 6ef888
granted on node 2 in PR (Protected Read) mode.
Packit 6ef888
.PP
Packit 6ef888
For file system nate_bob0, there are 7 glocks listed. All but two are
Packit 6ef888
uninteresting. Locks 2/609b4 and 2/4fe12 have processes waiting to
Packit 6ef888
hold them.
Packit 6ef888
.PP
Packit 6ef888
In the summary data for nate_bob0, you can see there are 3 processes waiting
Packit 6ef888
for 2 inode glocks (so one of those glocks has multiple processes waiting).
Packit 6ef888
.PP
Packit 6ef888
Since DLM wait is 0 in the summary data for both GFS2 mount points,
Packit 6ef888
nobody is waiting for DLM to grant the lock.
Packit 6ef888
Packit 6ef888
.SH KNOWN BUGS AND LIMITATIONS
Packit 6ef888
.PP
Packit 6ef888
Since the GFS2 debugfs files are completely separate from the DLM debugfs
Packit 6ef888
files, and locks can change status in a few nanoseconds time, there will
Packit 6ef888
always be a lag between the GFS2 view of a lock and the DLM view of a lock.
Packit 6ef888
If there is some kind of long-term hang, they are more likely to match.
Packit 6ef888
However, under ordinary conditions, by the time glocktop gets around to
Packit 6ef888
fetching the DLM status of a lock, the information has changed. Therefore,
Packit 6ef888
don't be surprised if the DLM's view of a lock is at odds with its glock.
Packit 6ef888
.PP
Packit 6ef888
Since iopen glocks are held by the thousands, glocktop skips most of the
Packit 6ef888
information related to them unless there's a waiter. For that reason,
Packit 6ef888
iopen lock problems may be difficult to debug with glocktop.
Packit 6ef888
.PP
Packit 6ef888
It doesn't handle very large numbers (millions) of glocks.
Packit 6ef888