Author

Topic: [ mining os ] nvoc - page 225. (Read 418546 times)

member
Activity: 119
Merit: 10
September 16, 2017, 04:54:33 PM
I have recently built a new rig with 13 P106-100 ASUS cards on nv0019 running -200cc 1550mc and PL90. I have a weird problem that after some time the hashrate on all cards drop by 50% or more and doesn't go back up, the miner doesn't restart and it just keeps running with lower hashrate.  This time it did this after 7 hours , with 7 cards running it was up for over 12hours and no problems. Has anyone experienced this before and maybe you know where the problem is?

How is the temperature of your cards? Have you configured nvOC to use auto temp control? If you have, it may be the case that you set the temp too low for what can be reached with 100% fan speed. In this case, auto temp control will smartly reduce the power limit to cool the cards, but of course this reduces the hash rate. Just a guess...
newbie
Activity: 50
Merit: 0
September 16, 2017, 04:47:34 PM
Hi guys,

I have recently built a new rig with 13 P106-100 ASUS cards on nv0019 running -200cc 1550mc and PL90. I have a weird problem that after some time the hashrate on all cards drop by 50% or more and doesn't go back up, the miner doesn't restart and it just keeps running with lower hashrate.  This time it did this after 7 hours , with 7 cards running it was up for over 12hours and no problems. Has anyone experienced this before and maybe you know where the problem is?

I am obviously running headless and using SSH to monitor the rig and adjust the settings.

PS. The miner or the OS doesn't restart the miner or reboot the system, just the hashrate drops pretty badly from 328 to 140 MH/s.

Thanks in advance to everyone who can help and also thanks to fullzero for the new version!

full member
Activity: 378
Merit: 104
nvOC forever
September 16, 2017, 04:44:14 PM
hi i get strange error when trying nicehash or mph with json:

LAUNCHING:  SALFTER_MPH_PROFIT_SWITCHING

Traceback (most recent call last):
  File "/home/m1/mph_switch", line 14, in
    cfg=json.loads(open(sys.argv[1]).read())
  File "/usr/lib/python2.7/json/__init__.py", line 339, in loads
    return _default_decoder.decode(s)
  File "/usr/lib/python2.7/json/decoder.py", line 364, in decode
    obj, end = self.raw_decode(s, idx=_w(s, 0).end())
  File "/usr/lib/python2.7/json/decoder.py", line 382, in raw_decode
    raise ValueError("No JSON object could be decoded")
ValueError: No JSON object could be decoded



I've seen this too; mostly a restart should fix or try to fire those url's (mph, coinbase, nicehash) in RIG's browser, see whether you can browse through.
newbie
Activity: 16
Merit: 0
September 16, 2017, 04:28:42 PM
Guys what is SRR option?

also when i set minimum temp=25 or lower, it wont go lower than 30

(using v0019 with updating)

SRR = Simple Rig Resetter
it's a little board that connects to your power/reset pins on the motherboard, it then listens for heartbeats from specific rigs, when the heartbeat stops, it means the rig froze, it hard re-sets the specific rig.

https://simplemining.net/download/SRR/PDF/SRR-manual-2017-02-10.pdf

I've been using one for a few months, saved me lots of time at the beginning.
Does Linux ever hard freeze?
Been using ubuntu since 2006,
Got lucky and have never experienced it.

I'm running my own 176 cards and hosting another 388 for friends and family and turning this into a hosting business.

After trying basically all the setups possible (with 6,7,8,9, 13 cards) i've settled on 3 cards per rig. Never looking back.

If you're using second hand HP Compaq DC7800 or DC7900 (you get it for 40 usd in my country with 4 G RAM, 3 pciex slots) and a dual core duo CPU you get the vPro awesomeness of full remote power on and off and Intel AMT out of band remote management included. Nobody can't beat that. Upgrade the firmware on the motherboards and BAM! you have a remote KVM built in. You can see your screen and do whatever you want from anywhere in the world.

If i am running 1080 cards and above, i buy a 750 W server PSU for another 50 USD and i can have a full rig up and running with the total cost of 90 USD per rig. If i am running 1060 cards, i simply upgrade for 10 usd the PSU in the SFF desktop to a 500 W one...

All the rigs are beautiful, no cabling nightmare, whenever a rig crashes i scripted a full power cycle using a raspberry pi as a watchdog.
SRR's are expensive when you have a lot of rigs.
newbie
Activity: 88
Merit: 0
September 16, 2017, 04:21:01 PM
hi i get strange error when trying nicehash or mph with json:

LAUNCHING:  SALFTER_MPH_PROFIT_SWITCHING

Traceback (most recent call last):
  File "/home/m1/mph_switch", line 14, in
    cfg=json.loads(open(sys.argv[1]).read())
  File "/usr/lib/python2.7/json/__init__.py", line 339, in loads
    return _default_decoder.decode(s)
  File "/usr/lib/python2.7/json/decoder.py", line 364, in decode
    obj, end = self.raw_decode(s, idx=_w(s, 0).end())
  File "/usr/lib/python2.7/json/decoder.py", line 382, in raw_decode
    raise ValueError("No JSON object could be decoded")
ValueError: No JSON object could be decoded

full member
Activity: 420
Merit: 106
https://steemit.com/@bibi187
September 16, 2017, 04:09:47 PM
hi need help is there any way i can schedule miner to run like only from 6pm-11am ?


U have multiple solution, depend what u want exactly.

PC have to be turned off or only mining process ?

If u need to pc be turned off, you probably can buy a power timer.

If u let PC run and just turn off miner, u can do a crontab Wink

Code:
crontab -e 0 11 * * * miner1 pkill -e screen && pkill -f 3main
contrab -e 0 18 * * * miner1 bash /home/m1/3main &

First line will kill your process every day at 11am
Second line will launch miner process every day at 6pm (18)
newbie
Activity: 16
Merit: 0
September 16, 2017, 04:09:27 PM
There are NO such limitations; nvOC is designed to max of 13 cards without any modification of 1bash & 3main (can add more by modifying these two files). Try to verify the risers (swap them around with the working ones).

Hash rates dropped with your 8 cards??
OK, I think same - no limitations...

I've rate ~6400 Sol/s with 8 cards.

After connection next 3 cards, rate dropped to ~5100 Sol/s only. nvOS didn't detect more than 8 cards, as you can saw on nvidia-smi.

Well, risers maybe problem - I'm novice in mining, so I ordered more types. I using version v006C and v008S. On cards plugged on v006C blinking LEDs below power connectors on card. I've ordered 13 pcs of v008S (these: www.aliexpress.com/item/TISHRIC-2018-Hot-VER008S-Molex-4PIN-15PIN-SATA-6PIN-3-in-1-PCIE-PCI-Express-Riser/32830747032.html). I hope that will works well.

Many thanks :-)

What is your CPU ?
I had the exact same issue with AsRock and 12 Ti's and it all went smooth when i switched the CPU's from the rig with my own. For me it worked putting a i5 7400 on the motherboard. Maybe it works with a less expensive CPU but it was the only one i had around.
member
Activity: 119
Merit: 10
September 16, 2017, 03:43:33 PM
Does Linux ever hard freeze?
Been using ubuntu since 2006,
Got lucky and have never experienced it.

Yes it does... Of course it historically has always been more stable than, say, Windows, but hard freezes can and do happen mostly for these two reasons: kernel (including modules) bugs, and hardware issues, as pointed before. More rarely, a bug in the "init" process can also do this. These can trigger what's called in the Unix world a "kernel panic", the equivalent of Windows' BSOD. In some cases, even in a kernel trap, you can do some debugging (like dumping memory) and take a few actions, like syncing the filesystem and rebooting, but not always.

That's one of the reasons why enterprise and "stable" distros (RHEL / CentOS, SLES, Debian stable, Ubuntu LTS) are more conservative with new kernel and software versions. They tend to use the long-term kernel tree, which doesn't introduce too many features, focusing more on fixing bugs.

Now, personally, with nvOC (which uses Ubuntu LTS) I've never had a kernel panic. Smiley But you never know, if you need full remote control this SRR seems a cool addition, or an intelligent power switch, KVM etc.
sr. member
Activity: 395
Merit: 250
September 16, 2017, 02:49:19 PM
hi need help is there any way i can schedule miner to run like only from 6pm-11am ?
full member
Activity: 420
Merit: 106
https://steemit.com/@bibi187
September 16, 2017, 02:25:05 PM
guys, when I do putty, what is the username and password?

Default user is m1 and password is miner1 Wink

you are the best! thanks! oh and how do we change the bash file or the miner settings from putty? and how to stop the miner and start? thanks!


For modify 1bash from putty just do

nano 1bash

To reset your miner follow this, thanks @papampi


In v0019 you dont need to kill 3main. just killing miner will make wdog re-init 3main and run it with your new edited 1bash
but if you want to kill best way is :

Code:
pkill -e screen # to kill miner, wdog and temp
pkill -f 3main # to kill 3main
bash /home/m1/3main # to load 3main again
one line cmd :

Code:
pkill -e screen && pkill -f 3main && bash /home/m1/3main &
Click enter when 3main loaded, and miner started, dont do ctrl+c
newbie
Activity: 44
Merit: 0
September 16, 2017, 01:31:13 PM
Guys what is SRR option?

also when i set minimum temp=25 or lower, it wont go lower than 30

(using v0019 with updating)

SRR = Simple Rig Resetter
it's a little board that connects to your power/reset pins on the motherboard, it then listens for heartbeats from specific rigs, when the heartbeat stops, it means the rig froze, it hard re-sets the specific rig.

https://simplemining.net/download/SRR/PDF/SRR-manual-2017-02-10.pdf

I've been using one for a few months, saved me lots of time at the beginning.
Does Linux ever hard freeze?
Been using ubuntu since 2006,
Got lucky and have never experienced it.

Linux is very stable, but it won't protect you from hardware issues so if you OC the cards too much any OS will freeze, updates can cause issues, drivers etc.
Use case:
Remote rigs - it can correct a remote issue in minutes before you're even aware.
USB failure issues - I've had USB issues with the Asrock BTC+ 110H boards where sometimes they don't recognize the USB and reboot into bios... this cycles the power and the USB is seen again.
I've had rigs hang when OC'ing cards too hard, or heat buildup during summer at a remote site.
The next Mobo manufacturer that includes this functionality in their mining Mobo's is gonna do well, all server class machines already have similar capabilities for the same reasons.



full member
Activity: 686
Merit: 140
Linux FOREVER! Resistance is futile!!!
September 16, 2017, 12:33:17 PM
Guys what is SRR option?

also when i set minimum temp=25 or lower, it wont go lower than 30

(using v0019 with updating)

SRR = Simple Rig Resetter
it's a little board that connects to your power/reset pins on the motherboard, it then listens for heartbeats from specific rigs, when the heartbeat stops, it means the rig froze, it hard re-sets the specific rig.

https://simplemining.net/download/SRR/PDF/SRR-manual-2017-02-10.pdf

I've been using one for a few months, saved me lots of time at the beginning.
Does Linux ever hard freeze?
Been using ubuntu since 2006,
Got lucky and have never experienced it.
newbie
Activity: 44
Merit: 0
September 16, 2017, 12:17:00 PM
Guys what is SRR option?

also when i set minimum temp=25 or lower, it wont go lower than 30

(using v0019 with updating)

SRR = Simple Rig Resetter
it's a little board that connects to your power/reset pins on the motherboard, it then listens for heartbeats from specific rigs, when the heartbeat stops, it means the rig froze, it hard re-sets the specific rig.

https://simplemining.net/download/SRR/PDF/SRR-manual-2017-02-10.pdf

I've been using one for a few months, saved me lots of time at the beginning.
full member
Activity: 378
Merit: 104
nvOC forever
September 16, 2017, 12:12:49 PM
Guys; I thought of keeping all the best OC settings in one place for all the available cards in the market.

I would need your help to input your best OC finds bit of details; happy to update it time to time; it would help us hit the rock bottom (even for a beginner).

You can PM me or Post a reply on this post or request on this post :

https://bitcointalksearch.org/topic/all-oc-settings-in-one-place-2176936

Hope I get good support from our nvOC community members to make this possible Smiley

This is URL :

http://krypto-mining.blogspot.co.uk/

I've only started; any suggestions welcome Smiley

I have added a link to the OP.  This should be helpful for members.  

Thanks fullzero for that Smiley
full member
Activity: 224
Merit: 100
September 16, 2017, 11:08:07 AM
Guys, I am having issues unzipping the NVOS (nvOC_v0019.zip) on both MAC and PC. The file size shows 6,436,759,477 bytes (6.44 GB on disk), but when I am trying to unzip the size of the image is getting huge (over 250GB!). I downloaded the google drive version of NVOS zip from this forum. Has anyone encountered this issue and can give some advice? Thank you in advance!

if you really tried on 2 different computers then your original must be corrupted. Try downloading it again.
full member
Activity: 686
Merit: 140
Linux FOREVER! Resistance is futile!!!
September 16, 2017, 11:04:09 AM
Guys what is SRR option?

also when i set minimum temp=25 or lower, it wont go lower than 30

(using v0019 with updating)
Why you wana set it to 25 or lower ?
Even a standby GPU temp is higher than that.
Dont kill your fans just to take temp down.
newbie
Activity: 8
Merit: 0
September 16, 2017, 10:21:40 AM
Guys what is SRR option?

also when i set minimum temp=25 or lower, it wont go lower than 30

(using v0019 with updating)
full member
Activity: 686
Merit: 140
Linux FOREVER! Resistance is futile!!!
September 16, 2017, 07:05:49 AM
I just ssh'd in with m1/miner1 and edited ~/1bash file.

Then, "screen -r miner" to switch to miner and Ctrl+C to cancel mining.

The miner will start itself back up and apply any changes  you made to 1bash file.

Not sure if this is the right workflow but it worked for me.

-phil

Nop, miner will get restart but with same OC setting, and dont check 1bash.

But if you use Paralallax mode (use pastabin) if you modify 1bash your miner will get restart with new setup.

To load new setting after u modify 1bash directly on miner, i am not sure on this new version but i assume you have to kill some process.

pkill -e 3main
pkill -e miner
pkill -e wdog

After just type

bash 1bash or maybe you have other thing to launch like i said is a new version.

In v0019 you dont need to kill 3main. just killing miner will make wdog re-init 3main and run it with your new edited 1bash
but if you want to kill best way is :

Code:
pkill -e screen # to kill miner, wdog and temp
pkill -f 3main # to kill 3main
bash /home/m1/3main # to load 3main again
one line cmd :

Code:
pkill -e screen && pkill -f 3main && bash /home/m1/3main &
Click enter when 3main loaded, and miner started, dont do ctrl+c
full member
Activity: 420
Merit: 106
https://steemit.com/@bibi187
September 16, 2017, 06:36:46 AM
I just ssh'd in with m1/miner1 and edited ~/1bash file.

Then, "screen -r miner" to switch to miner and Ctrl+C to cancel mining.

The miner will start itself back up and apply any changes  you made to 1bash file.

Not sure if this is the right workflow but it worked for me.

-phil

Nop, miner will get restart but with same OC setting, and dont check 1bash.

But if you use Paralallax mode (use pastabin) if you modify 1bash your miner will get restart with new setup.

To load new setting after u modify 1bash directly on miner, i am not sure on this new version but i assume you have to kill some process.

pkill -e 3main
pkill -e miner
pkill -e wdog

After just type

bash 1bash or maybe you have other thing to launch like i said is a new version.
full member
Activity: 686
Merit: 140
Linux FOREVER! Resistance is futile!!!
September 16, 2017, 12:16:07 AM
Thanks a lot @salfter.
SALFTER_MPH_PROFIT_SWITCHING is working now,
But I think it does not change the power limit based on 1bash algo values
It does change CC and MC.

Code:
Myriad_Groestl_POWERLIMIT_WATTS=125
__Myriad_Groestl_CORE_OVERCLOCK=100
Myriad_Groestl_MEMORY_OVERCLOCK=100
Code:
   POWERLIMIT_WATTS=135
__CORE_OVERCLOCK=100
MEMORY_OVERCLOCK=600

screen start with this :
Code:
INDIVIDUAL_POWERLIMIT_0:  135
INDIVIDUAL_POWERLIMIT_1:  135
INDIVIDUAL_POWERLIMIT_2:  135
INDIVIDUAL_POWERLIMIT_3:  135
INDIVIDUAL_POWERLIMIT_4:  135
INDIVIDUAL_POWERLIMIT_5:  135
INDIVIDUAL_POWERLIMIT_6:  135
INDIVIDUAL_POWERLIMIT_7:  135
INDIVIDUAL_POWERLIMIT_8:  135
INDIVIDUAL_POWERLIMIT_9:  135
INDIVIDUAL_POWERLIMIT_10:  135
INDIVIDUAL_POWERLIMIT_11:  135
INDIVIDUAL_POWERLIMIT_12:  135
INDIVIDUAL_POWERLIMIT_13:  135

TARGET_TEMP_0: 75
TARGET_TEMP_1: 75
TARGET_TEMP_2: 75
TARGET_TEMP_3: 75
TARGET_TEMP_4: 75
TARGET_TEMP_5: 75
TARGET_TEMP_6: 75
TARGET_TEMP_7: 75
TARGET_TEMP_8: 75
TARGET_TEMP_9: 75
TARGET_TEMP_10: 75
TARGET_TEMP_11: 75
TARGET_TEMP_12: 75
TARGET_TEMP_13: 75

FAN_ADJUST:  5
POWER_ADJUST:  5
ALLOWED_TEMP_DIFF:  3
RESTORE_POWER_LIMIT:  90


nvidia-smi :

Code:
m1@m1-desktop-101:~$ nvidia-smi
Thu Sep 14 15:53:00 2017
+-----------------------------------------------------------------------------+
| NVIDIA-SMI 384.69                 Driver Version: 384.69                    |
|-------------------------------+----------------------+----------------------+
| GPU  Name        Persistence-M| Bus-Id        Disp.A | Volatile Uncorr. ECC |
| Fan  Temp  Perf  Pwr:Usage/Cap|         Memory-Usage | GPU-Util  Compute M. |
|===============================+======================+======================|
|   0  GeForce GTX 1070    Off  | 00000000:01:00.0  On |                  N/A |
| 65%   74C    P2   102W / 135W |    421MiB /  8113MiB |     94%      Default |
+-------------------------------+----------------------+----------------------+
|   1  GeForce GTX 1070    Off  | 00000000:02:00.0 Off |                  N/A |
| 65%   71C    P2   133W / 135W |    180MiB /  8114MiB |     84%      Default |
+-------------------------------+----------------------+----------------------+
|   2  GeForce GTX 1070    Off  | 00000000:03:00.0 Off |                  N/A |
| 65%   68C    P2   120W / 135W |    180MiB /  8114MiB |     97%      Default |
+-------------------------------+----------------------+----------------------+
|   3  GeForce GTX 1070    Off  | 00000000:05:00.0 Off |                  N/A |
| 65%   69C    P2   112W / 135W |    180MiB /  8114MiB |     96%      Default |
+-------------------------------+----------------------+----------------------+
|   4  GeForce GTX 1070    Off  | 00000000:06:00.0 Off |                  N/A |
| 65%   74C    P2   141W / 135W |    180MiB /  8114MiB |     91%      Default |
+-------------------------------+----------------------+----------------------+
|   5  GeForce GTX 1070    Off  | 00000000:07:00.0 Off |                  N/A |
| 65%   74C    P2   123W / 135W |    180MiB /  8114MiB |     90%      Default |


Another question.
Is it possible to use multi algo switch from this link https://miningpoolhub.com/index.php?page=api&action=getautoswitchingandprofitsstatistics
instead of coin switch ?
Multi algo will switch coins based on their profits without stopping minner,
For example if mining zcash and zclassic goes more profitable it switches within the pool, and miner wont get restart...

I think I made a logic error in that multiple implementations require:

Code:
POWERLIMIT="YES"

to work.  I will fix this if it turns out to be the problem.

Already have it in 1bash

Code:

SSH="YES"                       # YES NO
 
POWERLIMIT="YES"                # YES NO
 
 
        POWERLIMIT_WATTS=130
__CORE_OVERCLOCK=120
MEMORY_OVERCLOCK=700
Jump to: