Topic: [Mining OS] SimpleMining.net - Manage Your GPU farm the easy way! (30 days free) - page 223. (Read 835838 times)

legendary
Activity: 2660
Merit: 1096
Simplemining.net Admin
This didn't work for me:

3 rigs are 500 km away and SM shows them as "offline", but they are still working; I have access to the router and can also see the power consumption...
In SM I see the software is RX 1142, and the network has IPv6...
It worked fine up until today.

PLEASE help ASAP

Please write to me at [email protected]
I would need SSH access to one of your rigs to check it out, or TeamViewer access to any Windows machine on the same LAN as the rigs.

I also see that about 0.5% of users sometimes have a problem when Cloudflare changes the IP addresses of my domain.
It seems that some DNS resolvers on the client side are not working properly, which would be the fault of the ISP or the client's router.
Still, I would like to check it.
newbie
Activity: 1
Merit: 0
Hello everyone,

I am making my first NVIDIA based rig:

Asus Z270F STRIX Gaming
6 x Palit GTX 1060 6GB GDDR5 (Hynix)

Latest BIOS, all recommended BIOS settings are there - to the best of my knowledge.
Started with one card first, no problems mining with the card in Windows 10 with latest drivers or in nvOC after updating the drivers.

Etched simpleminer-RX-NV-rc2.img onto a 16GB SanDisk USB drive and booted, but unfortunately the card is not being properly detected by the OS.
I am unable to update the drivers to another version that could support this 1060, as nvidia-390 (and nvidia-384 and nvidia-387) crash during the build (kernel too old? I am not an expert).
I am definitely forgetting something basic, as no one seems to be complaining about lack of support for the 1060 GPU.

Some output from a freshly etched pendrive:

miner@simpleminer:~$ nvidia-smi -L
Unable to determine the product name for gpu 0000:02:00.0: Unknown Error

miner@simpleminer:~$  lspci | grep NVIDIA
02:00.0 VGA compatible controller: NVIDIA Corporation Device 1c06 (rev a1)
02:00.1 Audio device: NVIDIA Corporation Device 10f1 (rev a1)

miner@simpleminer:~$ cat /proc/driver/nvidia/version
NVRM version: NVIDIA UNIX x86_64 Kernel Module  387.22  Wed Oct 25 23:13:21 PDT 2017
GCC version:  gcc version 5.4.0 20160609 (Ubuntu 5.4.0-6ubuntu1~16.04.5)

Needless to say, miners cannot communicate with the card...
Would happily give more info if needed.

Please advise, thank you.
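
For anyone comparing this kind of output across rigs, here is a minimal Python sketch that pulls the driver version out of the `/proc/driver/nvidia/version` text and lists the NVIDIA devices that `lspci` printed without a name. The function names are made up for illustration, the sample strings are copied from the post above, and none of this is part of SimpleMining OS.

```python
import re

def driver_version(proc_text):
    """Return the NVIDIA kernel module version string, or None if absent."""
    match = re.search(r"Kernel Module\s+(\d+(?:\.\d+)+)", proc_text)
    return match.group(1) if match else None

def unnamed_nvidia_devices(lspci_text):
    """Return (bus_address, device_id) pairs that lspci printed as 'Device xxxx'."""
    pattern = re.compile(
        r"^(\S+) .*NVIDIA Corporation Device ([0-9a-f]{4})", re.MULTILINE
    )
    return pattern.findall(lspci_text)

# Sample text copied from the post above.
PROC_SAMPLE = (
    "NVRM version: NVIDIA UNIX x86_64 Kernel Module  387.22  "
    "Wed Oct 25 23:13:21 PDT 2017"
)
LSPCI_SAMPLE = """02:00.0 VGA compatible controller: NVIDIA Corporation Device 1c06 (rev a1)
02:00.1 Audio device: NVIDIA Corporation Device 10f1 (rev a1)"""

print(driver_version(PROC_SAMPLE))
print(unnamed_nvidia_devices(LSPCI_SAMPLE))
```

Note that an unnamed "Device xxxx" entry only means the PCI ID database on that image has no name for the ID; on its own it does not show whether the installed driver supports the card.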
newbie
Activity: 9
Merit: 0
I'm not an SMOS dev, but I believe it should not impact the system in any way, since it only makes some REST API calls to the website backend.
newbie
Activity: 22
Merit: 0
Hello SimpleMining Dev,
Question: I know you recently advised people not to use SSH, i.e. not to open SSH ports.
But honestly, SSH is perfect for live viewing of multiple rigs, so I have a question.
I went ahead and changed the root and miner passwords to prevent an attacker from switching my rigs to mine something else.
Would this cause issues for SimpleMining OS, say with updating or anything like that?
To be frank, the website view of the rig is not really an option for me, as it lacks the ability to watch the miner live and to send it keyboard commands, such as + and - for dual mining or S for stats.
So please let me know if changing the passwords is an issue for you (dev) and whether it would cause issues down the line when you push updates and so on.
newbie
Activity: 9
Merit: 0
Yes, I did that too and it seems to be ok now.
It's strange: all the cards are identical and, I believe, from the same lot, but two of them don't overclock as much.
Now I have 4 with:
1100 core/2100 mem/900 undervolt
2 with:
1050/2050/900

Seems to be stable at least for now.
full member
Activity: 140
Merit: 100
Hi,

I'm facing a problem and don't know how to approach it correctly.
I'm running SimpleMining with 6 x XFX RX470 8GB Hynix, and two things are happening:

1. After about 10 minutes of mining, the hashrate of GPU3 drops from 28 MH/s to about 20-22 MH/s and stays there; after some time it eventually crashes with
the following message in dmesg:
[ 1084.587016] amdgpu 0000:07:00.0: GPU fault detected: 147 0x05708801
[ 1084.587016] amdgpu 0000:07:00.0: VM_CONTEXT1_PROTECTION_FAULT_ADDR 0x00E0983C
[ 1084.587016] amdgpu 0000:07:00.0: VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x06088002

Lots of those messages.

2. Sometimes the hashrate of all GPUs drops to about 10 MH/s and CPU usage goes to 100% (claymore process); again, the same error (the GPU fault) appears in dmesg.

Now, I shut off GPU3 and identified the card by checking which one cooled down. The problem is that the dmesg errors are for PCI ID 03, not for PCI ID 04 (the card that drops in hashrate). Here is my lspci:

01:00.0 VGA compatible controller: Advanced Micro Devices, Inc. [AMD/ATI] Device 67df (rev cf)
01:00.1 Audio device: Advanced Micro Devices, Inc. [AMD/ATI] Device aaf0
02:00.0 VGA compatible controller: Advanced Micro Devices, Inc. [AMD/ATI] Device 67df (rev cf)
02:00.1 Audio device: Advanced Micro Devices, Inc. [AMD/ATI] Device aaf0
03:00.0 VGA compatible controller: Advanced Micro Devices, Inc. [AMD/ATI] Device 67df (rev cf)
03:00.1 Audio device: Advanced Micro Devices, Inc. [AMD/ATI] Device aaf0
04:00.0 VGA compatible controller: Advanced Micro Devices, Inc. [AMD/ATI] Device 67df (rev cf)
04:00.1 Audio device: Advanced Micro Devices, Inc. [AMD/ATI] Device aaf0
05:00.0 VGA compatible controller: Advanced Micro Devices, Inc. [AMD/ATI] Device 67df (rev cf)
05:00.1 Audio device: Advanced Micro Devices, Inc. [AMD/ATI] Device aaf0
06:00.0 VGA compatible controller: Advanced Micro Devices, Inc. [AMD/ATI] Device 67df (rev cf)
06:00.1 Audio device: Advanced Micro Devices, Inc. [AMD/ATI] Device aaf0


Now, I believe claymore sees the GPUs in order, right? I mean, GPU0 is 01:00, GPU1 is 02:00, etc. Is this correct?
Because the card that is dropping in hashrate is GPU3, but the errors in dmesg are for PCI ID 03:00, which should be GPU2 in claymore (GPU0, 1, 2: the 3rd card).

So, which card is faulty? The one that drops in hashrate, or the one that is referred to by the dmesg error?

Has anyone else encountered this type of problem?

Thanks!

I had this issue and had to lower my core and mem a little till it stopped.
newbie
Activity: 9
Merit: 0
Hi,

I'm facing a problem and don't know how to approach it correctly.
I'm running SimpleMining with 6 x XFX RX470 8GB Hynix, and two things are happening:

1. After about 10 minutes of mining, the hashrate of GPU3 drops from 28 MH/s to about 20-22 MH/s and stays there; after some time it eventually crashes with
the following message in dmesg:
[ 1084.587016] amdgpu 0000:07:00.0: GPU fault detected: 147 0x05708801
[ 1084.587016] amdgpu 0000:07:00.0: VM_CONTEXT1_PROTECTION_FAULT_ADDR 0x00E0983C
[ 1084.587016] amdgpu 0000:07:00.0: VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x06088002

Lots of those messages.

2. Sometimes the hashrate of all GPUs drops to about 10 MH/s and CPU usage goes to 100% (claymore process); again, the same error (the GPU fault) appears in dmesg.

Now, I shut off GPU3 and identified the card by checking which one cooled down. The problem is that the dmesg errors are for PCI ID 03, not for PCI ID 04 (the card that drops in hashrate). Here is my lspci:

01:00.0 VGA compatible controller: Advanced Micro Devices, Inc. [AMD/ATI] Device 67df (rev cf)
01:00.1 Audio device: Advanced Micro Devices, Inc. [AMD/ATI] Device aaf0
02:00.0 VGA compatible controller: Advanced Micro Devices, Inc. [AMD/ATI] Device 67df (rev cf)
02:00.1 Audio device: Advanced Micro Devices, Inc. [AMD/ATI] Device aaf0
03:00.0 VGA compatible controller: Advanced Micro Devices, Inc. [AMD/ATI] Device 67df (rev cf)
03:00.1 Audio device: Advanced Micro Devices, Inc. [AMD/ATI] Device aaf0
04:00.0 VGA compatible controller: Advanced Micro Devices, Inc. [AMD/ATI] Device 67df (rev cf)
04:00.1 Audio device: Advanced Micro Devices, Inc. [AMD/ATI] Device aaf0
05:00.0 VGA compatible controller: Advanced Micro Devices, Inc. [AMD/ATI] Device 67df (rev cf)
05:00.1 Audio device: Advanced Micro Devices, Inc. [AMD/ATI] Device aaf0
06:00.0 VGA compatible controller: Advanced Micro Devices, Inc. [AMD/ATI] Device 67df (rev cf)
06:00.1 Audio device: Advanced Micro Devices, Inc. [AMD/ATI] Device aaf0


Now, I believe claymore sees the GPUs in order, right? I mean, GPU0 is 01:00, GPU1 is 02:00, etc. Is this correct?
Because the card that is dropping in hashrate is GPU3, but the errors in dmesg are for PCI ID 03:00, which should be GPU2 in claymore (GPU0, 1, 2: the 3rd card).

So, which card is faulty? The one that drops in hashrate, or the one that is referred to by the dmesg error?

Has anyone else encountered this type of problem?

Thanks!
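
To make the index question concrete, here is a rough Python sketch that lists the VGA controllers in the order `lspci` prints them and counts "GPU fault detected" lines per PCI address from dmesg. It assumes, as the post does, that the miner numbers GPUs in bus order starting at 0; that is the poster's assumption and is not confirmed anywhere in this thread. The helper names are hypothetical and the sample lines are shortened copies of the ones pasted above.

```python
import re
from collections import Counter

def vga_bus_order(lspci_text):
    """List bus addresses of VGA controllers in the order lspci printed them."""
    return re.findall(r"^(\S+) VGA compatible controller", lspci_text, re.MULTILINE)

def faults_per_device(dmesg_text):
    """Count 'GPU fault detected' lines per PCI address (domain prefix stripped)."""
    addresses = re.findall(
        r"amdgpu (?:[0-9a-f]{4}:)?(\S+): GPU fault detected", dmesg_text
    )
    return Counter(addresses)

def index_of(bus_address, order):
    """Zero-based position in lspci order, or None if the address is not listed."""
    return order.index(bus_address) if bus_address in order else None

# Shortened samples based on the post above (one audio line kept to show filtering).
LSPCI_SAMPLE = "\n".join(
    f"0{n}:00.0 VGA compatible controller: Advanced Micro Devices, Inc. [AMD/ATI] Device 67df (rev cf)"
    for n in range(1, 7)
) + "\n01:00.1 Audio device: Advanced Micro Devices, Inc. [AMD/ATI] Device aaf0"
DMESG_SAMPLE = "[ 1084.587016] amdgpu 0000:07:00.0: GPU fault detected: 147 0x05708801"

order = vga_bus_order(LSPCI_SAMPLE)
for address, count in faults_per_device(DMESG_SAMPLE).items():
    print(address, "faults:", count, "index in lspci order:", index_of(address, order))
```

With the pasted samples, the faulting address in the dmesg excerpt is 07:00.0, which is not in the pasted lspci listing at all, so the sketch reports no index for it. That may be worth double-checking before swapping cards.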
member
Activity: 107
Merit: 11
live long and prosper
This would be cool, AMD1950X plus GPU !

Is there a system that already supports cpu+gpu?

pure linux + windows excluded, of course


Hey, dev. Do you plan to add the ability to mine on CPU? And a separate setting for each GPU.
member
Activity: 107
Merit: 11
live long and prosper
This didn't work for me:

3 rigs are 500 km away and SM shows them as "offline", but they are still working; I have access to the router and can also see the power consumption...
In SM I see the software is RX 1142, and the network has IPv6...
It worked fine up until today.

PLEASE help ASAP

I was wondering how smOS behaves when there is no internet connection for a couple of hours. Does it freeze, or how does it behave?

I have Win10 + Claymore running atm. It runs for weeks, but if I lose internet for 30 minutes or so, the whole rig freezes and you have to physically switch it off and on...

I just fixed some connection problems.
It appears that if someone has IPv6 on their network, curl was taking as long as 6.9 seconds to connect to the simplemining.net server.
I released an update in which it now uses IPv4 by default, and the problem appears to be fixed. Now it connects in under 0.5 seconds.

UPDATE v1143 (already on your rig)
- fixed - case where a rig could not connect to simplemining, register, and get a new config (curl now uses IPv4 as the default protocol). This caused the rig to use the old written config, which on a freshly flashed pendrive meant XMR mining.

Anyway, if the simplemining servers are down for some reason, if there is planned maintenance, or if, for example, the internet connection to simplemining doesn't work,
then the rig uses the last downloaded settings.

newbie
Activity: 197
Merit: 0
Can anybody explain what exactly the power stage is?
newbie
Activity: 6
Merit: 0
Tytanick Please please please give us a PHI miner!
Yes, it would be great!!
Thanks!
newbie
Activity: 50
Merit: 0
Tytanick Please please please give us a PHI miner!
newbie
Activity: 20
Merit: 0
It's not bad power; that wouldn't cause one card to flake out. It could be a bad riser, a cable, or the little PCIe chip that plugs into the slot. I've had bad risers that ran fine for days or weeks, and then all of a sudden cards would drop off. I hate to state the obvious, but have you updated the BIOS and tried setting the Gen settings for the slots, etc.? I have one ASUS board and the rest are ASRock H110 Pros. The H110s are set up for mining and fired right up; I have six 10-card RX580 rigs running SMOS. The ASUS board runs 8 cards and was kind of a PITA to get running. I had to set everything related to PCIe to Gen1, disable the SATA controller, the audio controller, etc. It has finally been running for over a week. I read tons about boards before setting up my 10-card rigs and stayed away from the B250, as it seemed to have overheating issues and to be unstable compared to the H110 or other boards. Hell, I paid $129 for my Asus Z270-P and it runs 8 cards...

Thanks for your answer.

In the B250 bios there is a mining mode where everything gets set properly right away (gen1, above 4g decoding).  I turned off the audio already but I can try and turn off all the things I don't use.

Tonight I'll do that and also embark on a mission to identify the faulty riser, if any. I don't want to run with 1 riser at a time to identify the faulty one... I guess I'll try removing them 1 by 1 instead, even if it might not be as accurate (if I have 2 faulty ones).

Otherwise I already ordered a new 6 pack of risers earlier this week since I have 3 more GPUs on the way.

Make sure you aren't trying to boot with a monitor plugged into one of the mining cards on the B250. Also, I found that mining mode doesn't really get you exactly where you need to be. You still need to set all your PCIe slots to Gen2 and enable CSM (I think that's what it's called). And make sure to enable onboard video.

Thanks.

I use onboard video already.  I had to enable CSM to even be able to boot so that's the first thing I ever did.
I played around with the PCI settings.. not sure if I tried GEN2 already.  

I'll test that AND I'll start writing down everything I do; otherwise I'll end up doing everything twice while forgetting other things to try.

Update to my situation.

I replaced 1 PCIe riser, just the small part you plug into the PCIe slot on the mobo; it had something that did not look normal. The board was not showing that PCIe slot as having something plugged in, but SMOS was detecting the card and using it fine (normal hashrate). I replaced it to be safe, not because it was not working.

But this fixes nothing.

I have 6 GPUs plugged with 6 risers.  

I tested with GPU 1-2-3-4 (with their respective risers unchanged) for 12 hours and everything runs perfectly.
Then I tested with GPU 3-4-5-6 (with their respective risers unchanged) for 12 hours and everything runs perfectly as well.

Am I right to assume all these risers are fine because of this?
In fact I pretty much know they are good, I used them to mine smoothly on a Z170 PC Mate with 5 GPUs for weeks.  (all 6 were used, I switched some around).

However, the moment I plug in 1 more GPU (number 1, with the same riser it was tested with before), I have 4 GPUs mining normally and 1 mining at 4 MH/s.

I tried both Gen1 and Gen2.
I disabled all the Sata but the one I use.
I disabled the audio controller.
I have CSM enabled.
I use on-board graphics, not PEG.
I have a fan pointing straight at the little chip that can get hot (in the middle of the PCIe slots).

Any help is appreciated.
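
For what it's worth, the two 12-hour tests above can be tallied with a few lines of Python. This is only bookkeeping under the assumption that a clean 12-hour run clears every card and riser in that group; `summarize` is a made-up helper, not something from SMOS.

```python
def summarize(all_gpus, passing_groups):
    """Report which GPUs never ran in a clean group and the largest clean group size."""
    covered = set().union(*passing_groups) if passing_groups else set()
    return {
        "untested": sorted(set(all_gpus) - covered),
        "largest_passing_group": max((len(g) for g in passing_groups), default=0),
    }

# The two clean runs described above: GPUs 1-2-3-4 and GPUs 3-4-5-6.
result = summarize(range(1, 7), [{1, 2, 3, 4}, {3, 4, 5, 6}])
print(result)
```

Every card and riser shows up in at least one clean run, and no clean run has more than 4 cards. That fits the suspicion that the risers are fine and that the trouble depends on how many cards are connected rather than on one particular card, though it does not prove it.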
legendary
Activity: 2660
Merit: 1096
Simplemining.net Admin
Hey Tytan,

I imaged a new USB stick because the update I pushed out to this system failed. Also, another system I want to bring online has the same problem. When updating files, it gets to Claymore 10.2, gets stuck in a loop for a while, and finally finishes booting after many attempts. Any rig group using Claymore will not work now for that system or for the other one that I'm trying to bring online.

Error is:

Unexpected EOF in archive
Please contact me on [email protected]
I would like to debug this problem.
newbie
Activity: 1
Merit: 0
Hey Tytan,

I imaged a new USB stick because the update I pushed out to this system failed. Also, another system I want to bring online has the same problem. When updating files, it gets to Claymore 10.2, gets stuck in a loop for a while, and finally finishes booting after many attempts. Any rig group using Claymore will not work now for that system or for the other one that I'm trying to bring online.

Error is:

Unexpected EOF in archive
legendary
Activity: 2660
Merit: 1096
Simplemining.net Admin
Hey, dev. Do you plan to add the ability to mine on CPU? And a separate setting for each GPU.
Totally not my priority, and I would really need to run many tests, because this could lower the speed of GPU mining and we don't want that...
Anyway, I am not planning this for at least the next 3 months.
full member
Activity: 434
Merit: 101
In crypto we trust!
Hey, dev. Do you plan to add the ability to mine on CPU? And a separate setting for each GPU.
legendary
Activity: 2660
Merit: 1096
Simplemining.net Admin
I was wondering how smOS behaves when there is no internet connection for a couple of hours. Does it freeze, or how does it behave?

I have Win10 + Claymore running atm. It runs for weeks, but if I lose internet for 30 minutes or so, the whole rig freezes and you have to physically switch it off and on...

I just fixed some connection problems.
It appears that if someone has IPv6 on their network, curl was taking as long as 6.9 seconds to connect to the simplemining.net server.
I released an update in which it now uses IPv4 by default, and the problem appears to be fixed. Now it connects in under 0.5 seconds.

UPDATE v1143 (already on your rig)
- fixed - case where a rig could not connect to simplemining, register, and get a new config (curl now uses IPv4 as the default protocol). This caused the rig to use the old written config, which on a freshly flashed pendrive meant XMR mining.
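
If you want to check whether your own network shows the same IPv4 versus IPv6 difference, a small Python sketch along these lines can time a single TCP connect per address family. It is not the code SMOS uses (the rigs use curl, and the post does not say how the IPv4 default is set); `time_connect` is a hypothetical helper.

```python
import socket
import time

def time_connect(host, port, family=socket.AF_INET, timeout=10.0):
    """Resolve host for one address family, connect once, return elapsed seconds."""
    start = time.monotonic()
    # Restrict name resolution to the requested family (AF_INET or AF_INET6).
    infos = socket.getaddrinfo(host, port, family, socket.SOCK_STREAM)
    family, socktype, proto, _canon, sockaddr = infos[0]
    with socket.socket(family, socktype, proto) as sock:
        sock.settimeout(timeout)
        sock.connect(sockaddr)
    return time.monotonic() - start
```

Calling `time_connect("simplemining.net", 443)` and then the same with `family=socket.AF_INET6` should show whether one family is noticeably slower from your LAN; the IPv6 call raises an error if the name does not resolve or the network has no IPv6 route. On the command line, curl's `-4` / `--ipv4` option forces IPv4 in a similar way.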

Anyway, if the simplemining servers are down for some reason, if there is planned maintenance, or if, for example, the internet connection to simplemining doesn't work,
then the rig uses the last downloaded settings.
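
The fallback described here can be pictured with a short Python sketch: try to fetch fresh settings, and if that fails, reuse the last copy saved on disk. This is an illustration only; the real SMOS scripts are not shown in this thread, and `load_config`, the JSON format, and the cache file name are all assumptions.

```python
import json
import os
import tempfile

def load_config(fetch, cache_path):
    """Try to fetch fresh settings; on failure, fall back to the last saved copy."""
    try:
        config = fetch()
    except OSError:
        # Server down or no internet: reuse whatever was downloaded last time.
        with open(cache_path) as fh:
            return json.load(fh), "cached"
    # Fetch worked: remember these settings for the next outage.
    with open(cache_path, "w") as fh:
        json.dump(config, fh)
    return config, "fresh"

def offline():
    """Stand-in for a fetch that fails because the server is unreachable."""
    raise ConnectionError("no route to server")

cache = os.path.join(tempfile.mkdtemp(), "last_config.json")
print(load_config(lambda: {"miner": "example"}, cache))
print(load_config(offline, cache))
```

In this picture, a freshly flashed pendrive that has never reached the server only has whatever config shipped on the image, which would match the XMR mining case mentioned in the v1143 note.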
full member
Activity: 238
Merit: 100
Hi,

I registered at simplemining.net on 25th January 2018 and received an activation email, but the link doesn't work.

I tried to reply to the email, but the email address ([email protected]) also doesn't work.


What can I do? Can the admin help me? My registered email is [email protected].

Thanks in advance,

Zorro23hu
czr
newbie
Activity: 27
Merit: 0
I was wondering how smOS behaves when there is no internet connection for a couple of hours. Does it freeze, or how does it behave?

I have Win10 + Claymore running atm. It runs for weeks, but if I lose internet for 30 minutes or so, the whole rig freezes and you have to physically switch it off and on...

It will keep trying forever until the internet comes back.