rocm-smi - rocm-smi - a tool to monitor AMD accelerators and
GPUs
usage: rocm-smi [-h] [-V] [-d DEVICE [DEVICE ...]] [--alldevices]
[--showhw] [-a] [-i] [-v] [-e [EVENT ...]]
- [--showdriverversion] [--showtempgraph] [--showfwinfo [BLOCK ...]]
[--showmclkrange] [--showmemvendor] [--showsclkrange] [--showproductname]
[--showserial] [--showuniqueid] [--showvoltagerange] [--showbus]
[--showpagesinfo] [--showpendingpages] [--showretiredpages]
[--showunreservablepages] [-f] [-P] [-t] [-u] [--showmemuse]
[--showvoltage] [-b] [-c] [-g] [-l] [-M] [-m] [-o] [-p] [-S] [-s]
[--showmeminfo TYPE [TYPE ...]] [--showpids [VERBOSE]] [--showpidgpus
[SHOWPIDGPUS ...]] [--showreplaycount] [--showrasinfo [SHOWRASINFO ...]]
[--showvc] [--showxgmierr] [--showtopo] [--showtopoaccess]
[--showtopoweight] [--showtopohops] [--showtopotype] [--showtoponuma]
[--showenergycounter] [--shownodesbw] [--showcomputepartition]
[--showmemorypartition] [-r] [--resetfans] [--resetprofile]
[--resetpoweroverdrive] [--resetxgmierr] [--resetperfdeterminism]
[--resetcomputepartition] [--resetmemorypartition] [--setclock TYPE LEVEL]
[--setsclk LEVEL [LEVEL ...]] [--setmclk LEVEL [LEVEL ...]] [--setpcie
LEVEL [LEVEL ...]] [--setslevel SCLKLEVEL SCLK SVOLT] [--setmlevel
MCLKLEVEL MCLK MVOLT] [--setvc POINT SCLK SVOLT] [--setsrange SCLKMIN
SCLKMAX] [--setextremum min|max sclk|mclk CLK] [--setmrange MCLKMIN
MCLKMAX] [--setfan LEVEL] [--setperflevel LEVEL] [--setoverdrive %]
[--setmemoverdrive %] [--setpoweroverdrive WATTS] [--setprofile
SETPROFILE] [--setperfdeterminism SCLK] [--setcomputepartition
{CPX,SPX,DPX,TPX,QPX,cpx,spx,dpx,tpx,qpx}] [--setmemorypartition
{NPS1,NPS2,NPS4,NPS8,nps1,nps2,nps4,nps8}] [--rasenable BLOCK ERRTYPE]
[--rasdisable BLOCK ERRTYPE] [--rasinject BLOCK] [--gpureset] [--load FILE
| --save FILE] [--autorespond RESPONSE] [--loglevel LEVEL] [--json]
[--csv]
AMD ROCm System Management Interface | ROCM-SMI version: 2.2.0
- -b, --showbw
- Show estimated PCIe use
- -c,
--showclocks
- Show current clock frequencies
- -g,
--showgpuclocks
- Show current GPU clock frequencies
- -l,
--showprofile
- Show Compute Profile attributes
- -M,
--showmaxpower
- Show maximum graphics package power this GPU will consume
- -m,
--showmemoverdrive
- Show current GPU Memory Clock OverDrive level
- -o,
--showoverdrive
- Show current GPU Clock OverDrive level
- -p,
--showperflevel
- Show current DPM Performance Level
- -S,
--showclkvolt
- Show supported GPU and Memory Clocks and Voltages
- -s,
--showclkfrq
- Show supported GPU and Memory Clock
- --showmeminfo
TYPE [TYPE ...]
- Show Memory usage information for given block(s) TYPE
- --showpids
[VERBOSE]
- Show current running KFD PIDs (pass details to VERBOSE for detailed
information)
- --showpidgpus
[SHOWPIDGPUS ...]
- Show GPUs used by specified KFD PIDs (all if no arg given)
- --showreplaycount
- Show PCIe Replay Count
- --showrasinfo
[SHOWRASINFO ...]
- Show RAS enablement information and error counts for the specified
block(s) (all if no arg given)
- --showvc
- Show voltage curve
- --showxgmierr
- Show XGMI error information since last read
- --showtopo
- Show hardware topology information
- --showtopoaccess
- Shows the link accessibility between GPUs
- --showtopoweight
- Shows the relative weight between GPUs
- --showtopohops
- Shows the number of hops between GPUs
- --showtopotype
- Shows the link type between GPUs
- --showtoponuma
- Shows the numa nodes
- --showenergycounter
- Energy accumulator that stores amount of energy consumed
- --shownodesbw
- Shows the numa nodes
- --showcomputepartition
- Shows current compute partitioning
- --showmemorypartition
- Shows current memory partition
- --setclock TYPE
LEVEL
- Set Clock Frequency Level(s) for specified clock (requires manual Perf
level)
- --setsclk LEVEL
[LEVEL ...]
- Set GPU Clock Frequency Level(s) (requires manual Perf level)
- --setmclk LEVEL
[LEVEL ...]
- Set GPU Memory Clock Frequency Level(s) (requires manual Perf level)
- --setpcie LEVEL
[LEVEL ...]
- Set PCIE Clock Frequency Level(s) (requires manual Perf level)
- --setslevel
SCLKLEVEL SCLK SVOLT
- Change GPU Clock frequency (MHz) and Voltage (mV) for a specific
Level
- --setmlevel
MCLKLEVEL MCLK MVOLT
- Change GPU Memory clock frequency (MHz) and Voltage for (mV) a specific
Level
- --setvc POINT SCLK
SVOLT
- Change SCLK Voltage Curve (MHz mV) for a specific point
- --setsrange
SCLKMIN SCLKMAX
- Set min and max SCLK speed
- --setextremum
min|max sclk|mclk CLK
- Set min/max of SCLK/MCLK speed
- --setmrange
MCLKMIN MCLKMAX
- Set min and max MCLK speed
- --setfan
LEVEL
- Set GPU Fan Speed (Level or %)
- --setperflevel
LEVEL
- Set Performance Level
- --setoverdrive
%
- Set GPU OverDrive level (requires manual|high Perf level)
- --setmemoverdrive
%
- Set GPU Memory Overclock OverDrive level (requires manual|high Perf
level)
- --setpoweroverdrive
WATTS
- Set the maximum GPU power using Power OverDrive in Watts
- --setprofile
SETPROFILE
- Specify Power Profile level (#) or a quoted string of CUSTOM Profile
attributes "# # # #..." (requires manual Perf level)
- --setperfdeterminism
SCLK
- Set clock frequency limit to get minimal performance variation
- --setcomputepartition
{CPX,SPX,DPX,TPX,QPX,cpx,spx,dpx,tpx,qpx}
- Set compute partition
- --setmemorypartition
{NPS1,NPS2,NPS4,NPS8,nps1,nps2,nps4,nps8}
- Set memory partition
- --rasenable
BLOCK ERRTYPE
- Enable RAS for specified block and error type
- --rasdisable
BLOCK ERRTYPE
- Disable RAS for specified block and error type
- --rasinject
BLOCK
- Inject RAS poison for specified block (ONLY WORKS ON UNSECURED
BOARDS)
- --loglevel
LEVEL
- How much output will be printed for what program is doing, one of
debug/info/warning/error/critical
- --json
- Print output in JSON format
- --csv
- Print output in CSV format
The full documentation for rocm-smi is maintained as a
Texinfo manual. If the info and rocm-smi programs are properly
installed at your site, the command
- info rocm-smi
should give you access to the complete manual.