我得到nvidia-smi
得到Memory-Usage
是这样的
$nvidia-smi -i 0,1
Wed Mar 4 16:20:07 2020
+-----------------------------------------------------------------------------+
| NVIDIA-SMI 418.113 Driver Version: 418.113 CUDA Version: 10.1 |
|-------------------------------+----------------------+----------------------+
| GPU Name Persistence-M| Bus-Id Disp.A | Volatile Uncorr. ECC |
| Fan Temp Perf Pwr:Usage/Cap| Memory-Usage | GPU-Util Compute M. |
|===============================+======================+======================|
| 0 GeForce RTX 208... Off | 00000000:18:00.0 Off | N/A |
| 27% 37C P8 1W / 250W | 10789MiB / 10989MiB | 0% Default |
+-------------------------------+----------------------+----------------------+
| 1 GeForce RTX 208... Off | 00000000:3B:00.0 Off | N/A |
| 41% 50C P2 67W / 250W | 10893MiB / 10989MiB | 2% Default |
+-------------------------------+----------------------+----------------------+
+-----------------------------------------------------------------------------+
| Processes: GPU Memory |
| GPU PID Type Process name Usage |
|=============================================================================|
| 0 231853 C tensorflow_model_server 10779MiB |
| 1 120908 C python 10883MiB |
+-----------------------------------------------------------------------------+
这个内存使用率
是99%,但是
当我得到像这样的 nvidia-smi dmon
$nvidia-smi dmon -i 0,1
# gpu pwr gtemp mtemp sm mem enc dec mclk pclk
# Idx W C C % % % % MHz MHz
0 1 37 - 0 0 0 0 405 300
1 67 50 - 0 0 0 0 6800 1350
0 1 37 - 0 0 0 0 405 300
1 67 50 - 0 0 0 0 6800 1350
0 1 37 - 0 0 0 0 405 300
1 67 50 - 0 0 0 0 6800 1350
这个mem%
有0%,有时是0~3%。
为什么会有这样的差异?
最佳答案
Memory-Usage
来自nvidia-smi
是内存的使用。
mem%
来自nvidia-smi dmon
是内存的利用率。
Memory-Usage = used memory / total memory.
Utilization = time over the past sample period / global (device) memory was being read or written * 100%
关于tensorflow - 内存使用指标 `nvidia-smi dmon` 与 `nvidia-smi` 之间有何差异,我们在Stack Overflow上找到一个类似的问题: https://stackoverflow.com/questions/60521808/