Давно хотел посмотреть, как работает опция рестарта драйвера при потере CUDA GPU. На этом же риге с дополнительной немайнинговой карточкой наконец-то смоделировалась подходящая ситуация и, похоже, опция не очень работает. На ходу прямо под форкфиксом с включенной опцией сломать одну из карт пока не удалось (но, вероятно, удастся), а при уже подвисшей карточке программа на старте ругается, что один из GPU сдох, но ничего не предпринимает.
Опцию рестарта драйвера только включил; без неё при умирании карточки перестаёт совпадать нумерация карт в форкфиксе (в нём она не меняется, будто ничего не произошло) и в майнерах. В результате, к примеру, тирекс больше не может стартовать, т.к. не может применить на 650 memtweak, предназначенный для 1070, а нбмайнер выругивается на четвёртую карту (которая в форкфиксе при этом пятая и отключена), но работает на трёх.
Журнал NvidiaGPUGetDataHost (эти пары строк добаляются каждые несколько секунд):
[2021-06-08 05:16:34] [INFO] [NvidiaGPUGetDataHost] Start
[2021-06-08 05:16:34] [INFO] [NvidiaGPUGetDataHost] nvmlDeviceGetCount error: Uninitialized
[2021-06-08 05:16:43] [INFO] [NvidiaGPUGetDataHost] Start
[2021-06-08 05:16:43] [INFO] [NvidiaGPUGetDataHost] nvmlDeviceGetCount error: Uninitialized
[2021-06-08 05:16:50] [INFO] [NvidiaGPUGetDataHost] Start
[2021-06-08 05:16:50] [INFO] [NvidiaGPUGetDataHost] nvmlDeviceGetCount error: Uninitialized
[2021-06-08 05:16:56] [INFO] [NvidiaGPUGetDataHost] Start
[2021-06-08 05:16:56] [INFO] [NvidiaGPUGetDataHost] nvmlDeviceGetCount error: Uninitialized
[2021-06-08 05:17:02] [INFO] [NvidiaGPUGetDataHost] Start
[2021-06-08 05:17:02] [INFO] [NvidiaGPUGetDataHost] nvmlDeviceGetCount error: Uninitialized
[2021-06-08 05:17:08] [INFO] [NvidiaGPUGetDataHost] Start
[2021-06-08 05:17:08] [INFO] [NvidiaGPUGetDataHost] nvmlDeviceGetCount error: Uninitialized
[2021-06-08 05:17:14] [INFO] [NvidiaGPUGetDataHost] Start
[2021-06-08 05:17:14] [INFO] [NvidiaGPUGetDataHost] nvmlDeviceGetCount error: Uninitialized
[2021-06-08 05:17:20] [INFO] [NvidiaGPUGetDataHost] Start
[2021-06-08 05:17:20] [INFO] [NvidiaGPUGetDataHost] nvmlDeviceGetCount error: Uninitialized
[2021-06-08 05:17:26] [INFO] [NvidiaGPUGetDataHost] Start
[2021-06-08 05:17:26] [INFO] [NvidiaGPUGetDataHost] nvmlDeviceGetCount error: Uninitialized
Журнал старта собственно клиента:
[2021-06-08 05:16:31] [INFO] [NICEHASH] Starting up NiceHashMiner Legacy Fork Fix: Build date 06/06/2021 11:29:08
[2021-06-08 05:16:31] [INFO] [NICEHASH] Selected language: English
[2021-06-08 05:16:31] [INFO] [NICEHASH] Critical error: missing language
[2021-06-08 05:16:31] [INFO] [NICEHASH] WMI service seems to be running, ManagementObjectSearcher returned success.
[2021-06-08 05:16:31] [INFO] [NICEHASH] Start Form_Main
[2021-06-08 05:16:31] [INFO] [NICEHASH] Start InitializeComponent
[2021-06-08 05:16:31] [INFO] [NICEHASH] Start InitLocalization
[2021-06-08 05:16:32] [INFO] [SystemSpecs] FreePhysicalMemory = 13966388
[2021-06-08 05:16:32] [INFO] [SystemSpecs] FreeSpaceInPagingFiles = 50331648
[2021-06-08 05:16:32] [INFO] [SystemSpecs] FreeVirtualMemory = 58458796
[2021-06-08 05:16:32] [INFO] [SystemSpecs] LargeSystemCache = 0
[2021-06-08 05:16:32] [INFO] [SystemSpecs] MaxNumberOfProcesses = 4294967295
[2021-06-08 05:16:32] [INFO] [SystemSpecs] MaxProcessMemorySize = 137438953344
[2021-06-08 05:16:32] [INFO] [SystemSpecs] NumberOfLicensedUsers = 0
[2021-06-08 05:16:32] [INFO] [SystemSpecs] NumberOfProcesses = 129
[2021-06-08 05:16:32] [INFO] [SystemSpecs] NumberOfUsers = 2
[2021-06-08 05:16:32] [INFO] [SystemSpecs] OperatingSystemSKU = 4
[2021-06-08 05:16:32] [INFO] [SystemSpecs] SizeStoredInPagingFiles = 50331648
[2021-06-08 05:16:32] [INFO] [SystemSpecs] SuiteMask = 272
[2021-06-08 05:16:32] [INFO] [SystemSpecs] TotalSwapSpaceSize = 0
[2021-06-08 05:16:32] [INFO] [SystemSpecs] TotalVirtualMemorySize = 66980824
[2021-06-08 05:16:32] [INFO] [SystemSpecs] TotalVisibleMemorySize = 16649176
[2021-06-08 05:16:32] [INFO] [SystemSpecs] ProcessorCount = 32
[2021-06-08 05:16:32] [INFO] [NICEHASH] Start query RAM
[2021-06-08 05:16:32] [INFO] [NICEHASH] Total RAM: 16258MB
[2021-06-08 05:16:32] [INFO] [NICEHASH] Page File Size: 49152MB
[2021-06-08 05:16:32] [INFO] [Windows Defender] expired
[2021-06-08 05:16:32] [INFO] [ExtraLaunchParameters] Creating internal params config MinerOptionPackage_ccminer
[2021-06-08 05:16:32] [INFO] [ExtraLaunchParameters] Creating internal params config MinerOptionPackage_hsrneoscrypt
[2021-06-08 05:16:32] [INFO] [ExtraLaunchParameters] Creating internal params config MinerOptionPackage_CryptoDredge
[2021-06-08 05:16:32] [INFO] [ExtraLaunchParameters] Creating internal params config MinerOptionPackage_trex
[2021-06-08 05:16:32] [INFO] [ExtraLaunchParameters] Creating internal params config MinerOptionPackage_NBMiner
[2021-06-08 05:16:32] [INFO] [ExtraLaunchParameters] Creating internal params config MinerOptionPackage_miniZ
[2021-06-08 05:16:32] [INFO] [ExtraLaunchParameters] Creating internal params config MinerOptionPackage_Kawpowminer
[2021-06-08 05:16:32] [INFO] [ExtraLaunchParameters] Creating internal params config MinerOptionPackage_TTMiner
[2021-06-08 05:16:32] [INFO] [ExtraLaunchParameters] Creating internal params config MinerOptionPackage_ZEnemy
[2021-06-08 05:16:32] [INFO] [ExtraLaunchParameters] Creating internal params config MinerOptionPackage_ccminer_CryptoNight
[2021-06-08 05:16:32] [INFO] [ExtraLaunchParameters] Creating internal params config MinerOptionPackage_ethminer_OCL
[2021-06-08 05:16:32] [INFO] [ExtraLaunchParameters] Creating internal params config MinerOptionPackage_ethminer_CUDA
[2021-06-08 05:16:32] [INFO] [ExtraLaunchParameters] Creating internal params config MinerOptionPackage_mkxminer
[2021-06-08 05:16:32] [INFO] [ExtraLaunchParameters] Creating internal params config MinerOptionPackage_teamredminer
[2021-06-08 05:16:32] [INFO] [ExtraLaunchParameters] Creating internal params config MinerOptionPackage_lolMiner
[2021-06-08 05:16:32] [INFO] [ExtraLaunchParameters] Creating internal params config MinerOptionPackage_lolMinerBEAM
[2021-06-08 05:16:32] [INFO] [ExtraLaunchParameters] Creating internal params config MinerOptionPackage_cpuminer_opt
[2021-06-08 05:16:32] [INFO] [ExtraLaunchParameters] Creating internal params config MinerOptionPackage_nheqminer_CPU
[2021-06-08 05:16:32] [INFO] [ExtraLaunchParameters] Creating internal params config MinerOptionPackage_nheqminer_CUDA
[2021-06-08 05:16:32] [INFO] [ExtraLaunchParameters] Creating internal params config MinerOptionPackage_eqm_CUDA
[2021-06-08 05:16:32] [INFO] [ExtraLaunchParameters] Creating internal params config MinerOptionPackage_ClaymoreZcash
[2021-06-08 05:16:32] [INFO] [ExtraLaunchParameters] Creating internal params config MinerOptionPackage_ClaymoreNeoscrypt
[2021-06-08 05:16:32] [INFO] [ExtraLaunchParameters] Creating internal params config MinerOptionPackage_ClaymoreCryptoNight
[2021-06-08 05:16:32] [INFO] [ExtraLaunchParameters] Creating internal params config MinerOptionPackage_XmrigAMD
[2021-06-08 05:16:32] [INFO] [ExtraLaunchParameters] Creating internal params config MinerOptionPackage_XmrigNVIDIA
[2021-06-08 05:16:32] [INFO] [ExtraLaunchParameters] Creating internal params config MinerOptionPackage_WildRig
[2021-06-08 05:16:32] [INFO] [ExtraLaunchParameters] Creating internal params config MinerOptionPackage_SRBMiner
[2021-06-08 05:16:32] [INFO] [ExtraLaunchParameters] Creating internal params config MinerOptionPackage_OptiminerZcash
[2021-06-08 05:16:32] [INFO] [ExtraLaunchParameters] Creating internal params config MinerOptionPackage_excavator
[2021-06-08 05:16:32] [INFO] [ExtraLaunchParameters] Creating internal params config MinerOptionPackage_ClaymoreDual
[2021-06-08 05:16:32] [INFO] [ExtraLaunchParameters] Creating internal params config MinerOptionPackage_Phoenix
[2021-06-08 05:16:32] [INFO] [ExtraLaunchParameters] Creating internal params config MinerOptionPackage_EWBF
[2021-06-08 05:16:32] [INFO] [ExtraLaunchParameters] Creating internal params config MinerOptionPackage_Nanominer
[2021-06-08 05:16:32] [INFO] [ExtraLaunchParameters] Creating internal params config MinerOptionPackage_GMiner
[2021-06-08 05:16:32] [INFO] [ExtraLaunchParameters] Creating internal params config MinerOptionPackage_Bminer
[2021-06-08 05:16:32] [INFO] [ExtraLaunchParameters] Creating internal params config MinerOptionPackage_Xmrig
[2021-06-08 05:16:32] [INFO] [ExtraLaunchParameters] Creating internal params config MinerOptionPackage_CastXMR
[2021-06-08 05:16:32] [INFO] [ExtraLaunchParameters] Creating internal params config MinerOptionPackage_lyclMiner
[2021-06-08 05:16:32] [INFO] [ExtraLaunchParameters] Creating internal params config MinerOptionPackage_dstm
[2021-06-08 05:16:32] [INFO] [ComputeDeviceManager.Query]
QueryVideoControllers:
Win32_VideoController detected:
Name NVIDIA GeForce GTX 1070
Description NVIDIA GeForce GTX 1070
Manufacturer ASUS (1043)
PNPDeviceID PCI\VEN_10DE&DEV_1B81&SUBSYS_85DB1043&REV_A1\4&2BFC19AD&0&000B
DeviceID VideoController2
DriverVersion 27.21.14.5751
Status OK
InfSection Section094
AdapterRAM 4293918720
Win32_VideoController detected:
Name NVIDIA GeForce GTX 1070
Description NVIDIA GeForce GTX 1070
Manufacturer ASUS (1043)
PNPDeviceID PCI\VEN_10DE&DEV_1B81&SUBSYS_85A01043&REV_A1\4&384B6BB9&0&0019
DeviceID VideoController3
DriverVersion 27.21.14.5751
Status OK
InfSection Section094
AdapterRAM 4293918720
Win32_VideoController detected:
Name NVIDIA GeForce GTX 650
Description NVIDIA GeForce GTX 650
Manufacturer Gigabyte (1458)
PNPDeviceID PCI\VEN_10DE&DEV_0FC6&SUBSYS_35681458&REV_A1\4&2A522381&0&0019
DeviceID VideoController4
DriverVersion 27.21.14.5751
Status OK
InfSection Section004
AdapterRAM 1073741824
Win32_VideoController detected:
Name NVIDIA GeForce GTX 1070
Description NVIDIA GeForce GTX 1070
Manufacturer ASUS (1043)
PNPDeviceID PCI\VEN_10DE&DEV_1B81&SUBSYS_85DC1043&REV_A1\4&1E02D175&0&000B
DeviceID VideoController5
DriverVersion 27.21.14.5751
Status OK
InfSection Section094
AdapterRAM 4293918720
Win32_VideoController detected:
Name NVIDIA GeForce GTX 1070
Description NVIDIA GeForce GTX 1070
Manufacturer ASUS (1043)
PNPDeviceID PCI\VEN_10DE&DEV_1B81&SUBSYS_85A01043&REV_A1\6&1FE67AA1&0&00180209
DeviceID VideoController6
DriverVersion 27.21.14.5751
Status OK
InfSection Section094
AdapterRAM 4293918720
[2021-06-08 05:16:33] [INFO] [ComputeDeviceManager.Query] HasNvidiaVideoController: True
[2021-06-08 05:16:33] [INFO] [ComputeDeviceManager.Query] Adding NVML to PATH='C:\Program Files\NVIDIA Corporation\NVSMI'
[2021-06-08 05:16:33] [INFO] [ComputeDeviceManager.Query] NVSMI Error: GPUIsLost
[2021-06-08 05:16:33] [INFO] [ComputeDeviceManager.Query] QueryCudaDevices START
[2021-06-08 05:16:33] [INFO] [NVAPI] Found handle for busid 66
[2021-06-08 05:16:33] [INFO] [NVAPI] Bus ID get failed with status: NVIDIA_DEVICE_NOT_FOUND
[2021-06-08 05:16:33] [INFO] [NVAPI] Found handle for busid 9
[2021-06-08 05:16:33] [INFO] [NVAPI] Found handle for busid 10
[2021-06-08 05:16:33] [INFO] [NVAPI] Found handle for busid 65
[2021-06-08 05:16:33] [INFO] [NVML] System.Exception: NVML init failed with code GPUIsLost
в NiceHashMiner.Devices.ComputeDeviceManager.Query.Nvidia.QueryCudaDevices() в D:\NiceHashMinerLegacy\NiceHashMinerLegacy\NiceHashMiner\Devices\ComputeDeviceManager.cs:строка 959
[2021-06-08 05:16:33] [INFO] [GPU MEMORY: ] 8589934592 bytes - GeForce GTX 1070
[2021-06-08 05:16:33] [INFO] [GPU MEMORY: ] 8589934592 bytes - GeForce GTX 1070
[2021-06-08 05:16:33] [INFO] [GPU MEMORY: ] 8589934592 bytes - GeForce GTX 1070
[2021-06-08 05:16:33] [INFO] [GPU MEMORY: ] 1073741824 bytes - GeForce GTX 650
[2021-06-08 05:16:33] [INFO] [ComputeDeviceManager.Query]
CudaDevicesDetection:
ADDED device:
ID: 0
pciBusID: 9
NAME: GeForce GTX 1070
MANUFACTURER: ASUS (1043)
VENDOR: ASUS
UUID: GPU-3bb7ad41-fa5e-913c-e21a-b2ac589b7309
Monitor: 0
MEMORY: 8589934592
ETHEREUM: YES
ADDED device:
ID: 1
pciBusID: 10
NAME: GeForce GTX 1070
MANUFACTURER: ASUS (1043)
VENDOR: ASUS
UUID: GPU-1a24743d-6f1c-707f-57af-ff286eedd4bd
Monitor: 0
MEMORY: 8589934592
ETHEREUM: YES
ADDED device:
ID: 2
pciBusID: 65
NAME: GeForce GTX 1070
MANUFACTURER: ASUS (1043)
VENDOR: ASUS
UUID: GPU-51be13ca-8b7f-6194-a948-c742dfd1cb8f
Monitor: 0
MEMORY: 8589934592
ETHEREUM: YES
ADDED device:
ID: 3
pciBusID: 66
NAME: GeForce GTX 650
MANUFACTURER: Gigabyte (1458)
VENDOR: Gigabyte
UUID: GPU-38e35ef4-5685-9d0f-d3ed-162c4bcb67e2
Monitor: 0
MEMORY: 1073741824
ETHEREUM: NO
[2021-06-08 05:16:33] [INFO] [ComputeDeviceManager.Query] QueryCudaDevices END
[2021-06-08 05:16:33] [INFO] [ComputeDeviceManager.Query] Skipping AMD device detection, settings set to disabled
[2021-06-08 05:16:33] [INFO] [ComputeDeviceManager.Query] Cuda NVIDIA/CUDA device count BAD!!!
[2021-06-08 05:16:33] [INFO] [ComputeDeviceManager.Query] AMD GPU device count GOOD
[2021-06-08 05:16:33] [INFO] [NVIDIA driver]
[2021-06-08 05:16:33] [INFO] [QueryDevices] ID: 0 BusID: 9 IDByBus: -1 Index: 0 lolMinerBusID:-1 GeForce GTX 1070
[2021-06-08 05:16:33] [INFO] [QueryDevices] ID: 1 BusID: 10 IDByBus: -1 Index: 1 lolMinerBusID:-1 GeForce GTX 1070
[2021-06-08 05:16:33] [INFO] [QueryDevices] ID: 2 BusID: 65 IDByBus: -1 Index: 2 lolMinerBusID:-1 GeForce GTX 1070
[2021-06-08 05:16:33] [INFO] [QueryDevices] ID: 3 BusID: 66 IDByBus: -1 Index: 3 lolMinerBusID:-1 GeForce GTX 650
[2021-06-08 05:16:33] [INFO] [ComputeDeviceManager.Query] virtual memory size GOOD
[2021-06-08 05:16:33] [INFO] [GPU MEMORY: ] 8589934592 bytes - GeForce GTX 1070
[2021-06-08 05:16:33] [INFO] [GPU MEMORY: ] 8589934592 bytes - GeForce GTX 1070
[2021-06-08 05:16:33] [INFO] [GPU MEMORY: ] 8589934592 bytes - GeForce GTX 1070
[2021-06-08 05:16:33] [INFO] [GPU MEMORY: ] 1073741824 bytes - GeForce GTX 650
[2021-06-08 05:16:33] [INFO] [NICEHASH] Setting environment variables
[2021-06-08 05:16:33] [INFO] [NvidiaGPUGetDataHost] Starting OK
[2021-06-08 05:16:34] [INFO] [NICEHASH] Trying to enable/disable Windows error reporting
[2021-06-08 05:16:34] [INFO] [NICEHASH] Current DontShowUI value: 1
[2021-06-08 05:16:34] [INFO] [GITHUB] Check new version
[2021-06-08 05:16:34] [INFO] [GITHUB] Current version: 38
[2021-06-08 05:16:34] [INFO] [GITHUB] Current build: 20210606.112908
[2021-06-08 05:16:35] [INFO] [GITLAB] Fork_Fix_38
[2021-06-08 05:16:36] [INFO] [GITHUB] GITHUB Version: 38
[2021-06-08 05:16:36] [INFO] [GITHUB] GITHUB Build: 20210531.064908
[2021-06-08 05:16:36] [INFO] [GITLAB] GITLAB Version: 38
[2021-06-08 05:16:36] [INFO] [NHSMA] Try initialize SMA
[2021-06-08 05:16:36] [INFO] [SOCKET] Using previous SMA
[2021-06-08 05:16:36] [INFO] [SOCKET-address:] wss://nhmws.nicehash.com/v3/nhml
[2021-06-08 05:16:36] [INFO] [SOCKET] Connecting
[2021-06-08 05:16:36] [INFO] [SOCKET] 06/08/2021 05:16:36|Warn |Logger.set_Output:181|The current output action has been changed.
[2021-06-08 05:16:36] [INFO] [SOCKET] 06/08/2021 05:16:36|Debug|WebSocket.sendHttpRequest:2056|A request to the server:
GET /v3/nhml HTTP/1.1
User-Agent: websocket-sharp/1.0
Host: nhmws.nicehash.com
Upgrade: websocket
Connection: Upgrade
Sec-WebSocket-Key: IGwAekd6dnjaPXQMb4bCuQ==
Sec-WebSocket-Version: 13
[2021-06-08 05:16:36] [INFO] [SOCKET] 06/08/2021 05:16:36|Debug|WebSocket.sendHttpRequest:2058|A response to this request:
HTTP/1.1 101 Switching Protocols
Date: Tue, 08 Jun 2021 02:16:36 GMT
Connection: upgrade
Upgrade: websocket
Sec-WebSocket-Accept: pXLZh3ydyQnNNCEc8O5QRLXKNy0=
CF-Cache-Status: DYNAMIC
cf-request-id: 0a8b016164000023139c969000000001
Expect-CT: max-age=604800, report-uri="https://report-uri.cloudflare.com/cdn-cgi/beacon/expect-ct"
Strict-Transport-Security: max-age=15552000; includeSubDomains
Server: cloudflare
CF-RAY: 65be9e7bdbb72313-KBP
alt-svc: h3-27=":443"; ma=86400, h3-28=":443"; ma=86400, h3-29=":443"; ma=86400, h3=":443"; ma=86400
[2021-06-08 05:16:36] [INFO] [SOCKETNEW] Sending data: {"method":"login","version":"NHM/3.0.5.6","protocol":3,"btc":"---","worker":"R3","group":"","rig":"0-lNjZO3--OVSCdCUUKIis-g"}
[2021-06-08 05:16:37] [INFO] [NVAPI] Bus ID get failed with status: NVIDIA_DEVICE_NOT_FOUND
[2021-06-08 05:16:37] [INFO] [NVAPI] Found handle for busid 9
[2021-06-08 05:16:37] [INFO] [NVAPI] Bus ID get failed with status: NVIDIA_DEVICE_NOT_FOUND
[2021-06-08 05:16:37] [INFO] [NVAPI] Found handle for busid 10
[2021-06-08 05:16:37] [INFO] [NVAPI] Bus ID get failed with status: NVIDIA_DEVICE_NOT_FOUND
[2021-06-08 05:16:37] [INFO] [NVAPI] Found handle for busid 65
[2021-06-08 05:16:37] [INFO] [NVAPI] Found handle for busid 66