Hi,
try the option "--ratewatchdog" to monitor the hash rate and automatic restart of a dropping GPU.
Or use the cast-xmr-ui for monitoring:
https://github.com/anadventureisu/cast-xmr-uiBtw. are you using the Blockchain Beta or 18.3.4 driver?
Regards,
glph3k
I'm using --ratewatchdog and Blockchain Beta From August
It seems like after sometime if watchdog kicks in, the hash rate drops lower. It doesn't appear as if my graphics driver crashed, because normally it will reset all of my OC settings. Which also I tried to adjust OC settings however I'm not having much luck.
In the above sample I posted, each card will start off by doing roughly 1870Hs, but after a few kernel resets I guess over the course of the day, some of the cards just become obsolete. Maybe ratewatchdog is not fully bringing the vega to it's potential. When the hashrate drops, I also notice that particular card no longer reports hashes that often. As you can tell by that small insert, GPU 1 started reporting hashes but 0 2 3 slowly fell off. So it's interesting!
OC is 0 Core 905 Mem and -20% power
Thank you for any advice you have.
UPDATE: I was able to watch this happen today and I wanted to share the log[19:04:10] GPU2 | 48°C | Fan 4229 RPM | 1896.3 H/s
[19:04:11] GPU0 | 56°C | Fan 4634 RPM | 1863.8 H/s
[19:04:11] GPU3 Kernel Resetted.
[19:04:11] GPU3 | 54°C | Fan 4328 RPM | 86.0 H/s
[19:04:14] GPU1 | 55°C | Fan 4342 RPM | 1880.0 H/s
[19:04:16] GPU2 | 48°C | Fan 4219 RPM | 1892.3 H/s
[19:04:16] GPU0 | 56°C | Fan 4636 RPM | 1861.8 H/s
[19:04:20] GPU1 | 55°C | Fan 4352 RPM | 1883.0 H/s
[19:04:21] GPU2 | 51°C | Fan 4180 RPM | 1894.6 H/s
[19:04:22] GPU0 | 56°C | Fan 4636 RPM | 1862.8 H/s
[19:04:26] GPU1 | 55°C | Fan 4352 RPM | 1879.7 H/s
[19:04:27] GPU2 | 51°C | Fan 4178 RPM | 1894.3 H/s
[19:04:28] GPU0 | 56°C | Fan 4650 RPM | 1864.7 H/s
[19:04:31] GPU1 | 55°C | Fan 4363 RPM | 1880.7 H/s
[19:04:33] GPU2 | 51°C | Fan 4178 RPM | 1895.0 H/s
[19:04:34] GPU0 | 56°C | Fan 4632 RPM | 1865.0 H/s
[19:04:37] GPU1 | 55°C | Fan 4362 RPM | 1881.0 H/s
[19:04:39] GPU2 | 52°C | Fan 4169 RPM | 1895.3 H/s
[19:04:39] GPU0 | 56°C | Fan 4629 RPM | 1864.7 H/s
[19:04:43] GPU1 | 55°C | Fan 4373 RPM | 1881.4 H/s
[19:04:44] GPU2 | 52°C | Fan 4144 RPM | 1894.6 H/s
[19:04:45] GPU0 | 56°C | Fan 4620 RPM | 1865.0 H/s
[19:04:48] GPU1 | 55°C | Fan 4368 RPM | 1883.0 H/s
[19:04:50] GPU2 | 49°C | Fan 4274 RPM | 1893.0 H/s
[19:04:51] GPU0 | 57°C | Fan 4631 RPM | 1865.4 H/s
[19:04:54] GPU1 | 55°C | Fan 4356 RPM | 1883.0 H/s
[19:04:56] GPU2 | 49°C | Fan 4273 RPM | 1893.0 H/s
[19:04:57] GPU0 | 56°C | Fan 4630 RPM | 1865.0 H/s
[19:05:00] GPU1 | 55°C | Fan 4362 RPM | 1883.7 H/s
[19:05:01] GPU2 | 52°C | Fan 4279 RPM | 1884.7 H/s
[19:05:03] GPU0 | 56°C | Fan 4636 RPM | 1860.9 H/s
[19:05:06] GPU1 | 55°C | Fan 4363 RPM | 1871.2 H/s
[19:05:07] GPU2 | 52°C | Fan 4278 RPM | 1886.0 H/s
[19:05:08] GPU0 | 57°C | Fan 4669 RPM | 1866.7 H/s
[19:05:11] GPU1 | 55°C | Fan 4358 RPM | 1881.7 H/s
[19:05:11] GPU3 | 54°C | Fan 4301 RPM | 59.7 H/s
[19:05:13] GPU2 | 49°C | Fan 4249 RPM | 1900.7 H/s
[19:05:14] GPU0 | 56°C | Fan 4636 RPM | 1862.1 H/s
[Pool: 'cryptonightv7.usa.nicehash.com:3363' | Connected: 2018-04-19 12:02:04]
7:03:11 (100%) Online
0:00:00 ( 0%) Offline
[Job: #446 | Difficulty: 400015 | Running: 95.0 sec | Avg Job Time: 56.8 sec]
[Hash Rate Avg: 7443.7 H/s]
1852.4 H/s GPU0
1867.1 H/s GPU1
1879.9 H/s GPU2
1844.3 H/s GPU3
[Shares Found: 621 | Avg Search Time: 40.8 sec]
620 (100%) Accepted
1 ( 0%) Rejected by pool
0 ( 0%) Invalid result computation failed
0 ( 0%) Could not be submitted because of network error
0 ( 0%) Outdated because of job change
[19:05:17] GPU1 | 55°C | Fan 4336 RPM | 1882.0 H/s
[19:05:18] GPU2 | 50°C | Fan 4244 RPM | 1896.6 H/s
[19:05:20] GPU0 | 57°C | Fan 4623 RPM | 1873.5 H/s
[19:05:23] GPU1 | 55°C | Fan 4341 RPM | 1881.7 H/s
[19:05:24] GPU2 | 50°C | Fan 4216 RPM | 1894.6 H/s
[19:05:26] GPU0 | 57°C | Fan 4618 RPM | 1864.4 H/s
[19:05:28] GPU1 | 55°C | Fan 4329 RPM | 1881.0 H/s
[19:05:30] GPU2 | 52°C | Fan 4185 RPM | 1889.6 H/s
[19:05:31] GPU0 | 56°C | Fan 4605 RPM | 1866.7 H/s
[19:05:34] GPU1 | 55°C | Fan 4322 RPM | 1878.1 H/s
[19:05:35] GPU2 | 52°C | Fan 4171 RPM | 1890.6 H/s
[19:05:37] GPU0 | 55°C | Fan 4615 RPM | 1858.9 H/s
[19:05:40] GPU1 | 55°C | Fan 4317 RPM | 1881.4 H/s
[19:05:41] GPU2 | 49°C | Fan 4172 RPM | 1893.3 H/s
Now after GPU 3 was reset > it dropped to 56 hashes and it's no longer showing up like GPU 0 1 2 (3 the one which was just reset is now MIA)