Author

Topic: [ mining os ] nvoc - page 289. (Read 418549 times)

member
Activity: 66
Merit: 10
July 21, 2017, 08:12:39 AM
Hi @fullzero

when I recently upgraded my rig to run nvOC 0018, it did something nasty to an ssd that was connected to the rig which had a Windows 8.1 install on it. I remember reading something after it booted about how it was doing something to Windows partitions, and now if I remove the USB stick so the BIOs tries to boot from the SSD it says something like "NO OPERATING SYSTEM INSTALLED".

I do like to be able to swap to Windows to try out different things - do you have any idea how I may be able to repair the SSD so Windows will boot again? Or do I just have to go through the pain of installing it from scratch and unplug it when I use nvOC?

Install from scratch.

Then make sure it is not attached or you will kill it

BUMMER! A warning would have been nice.

Newmz, I had the identical thing happen to me. Most likely what happened is the boot record was wiped out on the hard drive where you had windows installed. I have not tried to recover mine yet but it should be a very simple thing. I would try something before taking philipma1957's advice; I would try putting a windows install disk into your DVD or USB key. Then run a recover. There is also (if I recall) an FSISK /MBR you can run.

I am going to try this myself but your windows partition should be fine, you're just missing an MBR. -No big deal really!

So give that some googling and you'll find it's easy to recover. Otherwise when I get to it this weekend. I will try to recover mine and let you know what I did to hopefully fix the issue.
full member
Activity: 153
Merit: 100
July 21, 2017, 07:39:23 AM
The following can be handy for ppl using ssh for their rig. Add to your .bashrc file the following lines:
Code:
#Alias for the screens
alias miner='screen -x miner'
alias temp='screen -x temp'
alias wdog='screen -x wdog'

After the command 'exec bash' or logging out and in again you can type miner / temp / wdog to go to the respective screens directly.
newbie
Activity: 12
Merit: 0
July 21, 2017, 03:30:33 AM
I added logic based on Avarets experience with P106-100 (I don't have any of these so I am going on your report Avarets; let me know if these changes work)

Please let me know if there are any issues with these updates.

The script doesn't seem to start automatically.
If I run it manually there are some errors but mining process starts:

Code:
Invalid MIT-MAGIC-COOKIE-1 keyFailed to connect to Mir: Failed to connect to server socket: No such file or directory
Unable to init server: Could not connect: Connection refused

ERROR: The control display is undefined; please run `nvidia-settings --help` for usage information.


My guess is X isn't running; or isn't running properly.  
You've probably changed hostname? You have to add the new name to Xorg. If so, see http://ubuntuhandbook.org/index.php/2016/06/change-hostname-ubuntu-16-04-without-restart/

No, I didn't.
I reverted previous version of script with my desribed changes and it works again.

Also I can confirm that with updated script overclocking parameters for P106-100 are applied properly.
full member
Activity: 153
Merit: 100
July 21, 2017, 03:11:27 AM
I added logic based on Avarets experience with P106-100 (I don't have any of these so I am going on your report Avarets; let me know if these changes work)

Please let me know if there are any issues with these updates.

The script doesn't seem to start automatically.
If I run it manually there are some errors but mining process starts:

Code:
Invalid MIT-MAGIC-COOKIE-1 keyFailed to connect to Mir: Failed to connect to server socket: No such file or directory
Unable to init server: Could not connect: Connection refused

ERROR: The control display is undefined; please run `nvidia-settings --help` for usage information.


My guess is X isn't running; or isn't running properly. 
You've probably changed hostname? You have to add the new name to Xorg. If so, see http://ubuntuhandbook.org/index.php/2016/06/change-hostname-ubuntu-16-04-without-restart/
sr. member
Activity: 372
Merit: 250
The road of excess leads to the palace of wisdom
July 20, 2017, 11:41:54 PM
Hi @fullzero

when I recently upgraded my rig to run nvOC 0018, it did something nasty to an ssd that was connected to the rig which had a Windows 8.1 install on it. I remember reading something after it booted about how it was doing something to Windows partitions, and now if I remove the USB stick so the BIOs tries to boot from the SSD it says something like "NO OPERATING SYSTEM INSTALLED".

I do like to be able to swap to Windows to try out different things - do you have any idea how I may be able to repair the SSD so Windows will boot again? Or do I just have to go through the pain of installing it from scratch and unplug it when I use nvOC?

Install from scratch.

Then make sure it is not attached or you will kill it

BUMMER! A warning would have been nice.
legendary
Activity: 4354
Merit: 9201
'The right to privacy matters'
July 20, 2017, 11:37:59 PM
Hi @fullzero

when I recently upgraded my rig to run nvOC 0018, it did something nasty to an ssd that was connected to the rig which had a Windows 8.1 install on it. I remember reading something after it booted about how it was doing something to Windows partitions, and now if I remove the USB stick so the BIOs tries to boot from the SSD it says something like "NO OPERATING SYSTEM INSTALLED".

I do like to be able to swap to Windows to try out different things - do you have any idea how I may be able to repair the SSD so Windows will boot again? Or do I just have to go through the pain of installing it from scratch and unplug it when I use nvOC?

Install from scratch.

Then make sure it is not attached or you will kill it
member
Activity: 118
Merit: 10
July 20, 2017, 11:32:18 PM
First off all THANK YOU Fullzero for nvOC v0018!
Simply love it!

I have a question - WhatToMine shows great rev for DGB-Groestl (DGB), but I can not mine DGB with Myriad-Groestl algo
It always trying to mine with Skein algo.
If I'm not mistaken nvOC has this algo for Diamond(DMD)
so, how DGB-Groestl could be mined?
sr. member
Activity: 372
Merit: 250
The road of excess leads to the palace of wisdom
July 20, 2017, 11:11:57 PM
Hi @fullzero

when I recently upgraded my rig to run nvOC 0018, it did something nasty to an ssd that was connected to the rig which had a Windows 8.1 install on it. I remember reading something after it booted about how it was doing something to Windows partitions, and now if I remove the USB stick so the BIOs tries to boot from the SSD it says something like "NO OPERATING SYSTEM INSTALLED".

I do like to be able to swap to Windows to try out different things - do you have any idea how I may be able to repair the SSD so Windows will boot again? Or do I just have to go through the pain of installing it from scratch and unplug it when I use nvOC?
newbie
Activity: 15
Merit: 0
July 20, 2017, 10:03:57 PM
Thank you very much for making such a wonderful OS.

Does NVOC v0018 have log file when miner restart after GPU soft crash?

In the v0018 1bash it does; in this updated version it only logs restarts.  This is because logging slightly decreases stability with using USB keys.  I will make watchdog logs a YES/NO option for the next 1bash.  For now you can open the watchdog file:

Code:
IAmNotAJeep_and_Maxximus007_WATCHDOG

go to line 86:

Code:
kill $target #| tee -a ${LOG_FILE}

and remove the # so it reads:

Code:
kill $target | tee -a ${LOG_FILE}

and it will log soft crashes.


Thank you very much. I'll try.
newbie
Activity: 9
Merit: 0
July 20, 2017, 07:14:43 PM
fullzero, can I mine Pascal Lite via NvOc? And can I mine it solo, without pool?
newbie
Activity: 50
Merit: 0
July 20, 2017, 04:57:32 PM
Hi Fullzero,

what about extract all the wallet's altcoin address from 1bash and to put them in a specific file named wallet.cabinet ?

this way it would be easier when we want to upgrade the 1bash file for instance

cheers
newbie
Activity: 12
Merit: 0
July 20, 2017, 03:55:39 PM
I added logic based on Avarets experience with P106-100 (I don't have any of these so I am going on your report Avarets; let me know if these changes work)

Please let me know if there are any issues with these updates.

The script doesn't seem to start automatically.
If I run it manually there are some errors but mining process starts:

Code:
Invalid MIT-MAGIC-COOKIE-1 keyFailed to connect to Mir: Failed to connect to server socket: No such file or directory
Unable to init server: Could not connect: Connection refused

ERROR: The control display is undefined; please run `nvidia-settings --help` for usage information.
member
Activity: 66
Merit: 10
July 20, 2017, 01:15:02 PM
Hi Guys,

I have been happily working with the new nv0018 image for about 4 days now. It's going well and I plan to continue my work with it! I thank everyone for their hard work and contributions to the project ...especially Fullzero of course.

Anyway, last night I tried to shut my rigs down for some changes I was going to make. I realized I couldn't shutdown! Every time it shut down it would boot right back up.

I have seen this before and it's an ACPI issue more often than not. But for kicks and giggles, I tried booting to my windows drive on the same hardware. It shut down without any issue at all. So I know that the problem is not a BIOS setting as some posts suggest. Windows shuts the rig down just fine.

I toyed with the kernel settings GRUB_CMDLINE_LINUX_DEFAULT="quiet splash acpi=force"
( https://www.unixmen.com/fix-shutdown-power-computer-ubuntu-14-04/ )
but it didn't help...

I'm just wondering if anyone else has run into this? I'm using the Asus Z270-A motherboard:
https://www.newegg.com/Product/Product.aspx?item=N82E16813132936

These are nice MB's and let you run 7 cards on one Rig.
newbie
Activity: 13
Merit: 0
July 20, 2017, 10:07:45 AM
Love the v0018 release and all the functionality!  

However, POWERLIMIT NIGHTMARES!  

I have one major issue, I cannot lower the POWERLIMIT.  I run 8 rigs of 1050Ti and 125W is just way to high.  I have tried adjusting the base line and the individual POWERLIMIT settings and I am still seeing maximum power being utilized in NVIDIA-SMI and TEMP CONTROL.  I thought maybe the TEMP CONTROL was trumping the setting, but I don't think that is the case (at least based on what my 46 year old brain and eye balls looking at the 1bash code understands).  I thought maybe it was the correction in line 527, but that didn't change anything.

I tried "NO" for both WATCHDOG and TEMP CONTROL with POWERLIMIT set below MAX for the 1050Ti and I still see max power output.

I did notice during startup, of the three terminal screens that pop-up during startup that the second terminal session has the POWERLIMIT set correctly at 60.   However, something happens after the third terminal screen initiates (miner starting) that pushes the POWER back to MAX.

I added another rig of 1050Tis tonight and I saw more unusual behavior from POWER settings again where GPU0 goes to 125W as the max power limit and the rest of the GPUs all complied with my setting of 65Watts.   I have no idea what is causing this inconsistency in power limit settings.

I also noticed in the Guake terminal that the TEMP CONTROL module is displaying continuous notifications that 125W is not a valid power limit (even after changing the settings in the module to 60-65).

I normally run all my rigs at 60W, which keeps the current draw low enough to run 3 rigs of 8 GPUs on each 15 AMP circuit.  Also, extremely efficient.

I am still hunting for what is causing the forced 125W power setting.

Try the new 1bash and additional files posted on the OP.  Let me know if it doesn't solve this for you.

[/quote]

Fullzero, yes this solved all my POWERLIMIT problems.  All 8 rigs up and hashing away.  Thank you very much!
sr. member
Activity: 340
Merit: 250
July 20, 2017, 10:05:12 AM

I got a BIOSTAR TB250-BTC PRO (12x gpu) today Link

I made a 12x 1060 rig with it.

ensure Mining Mode is enabled in the bios. 

ensure Max TOLUD is set to 3.5 GB in the bios.

NOTE: you must first only connect 6x GPUs, boot, make Bios changes, save and reboot, shutdown, add the other 6x GPUs, boot





I like the 13x out the box + m2 ssd ready ASRock more; but this is also a good mobo. 

Biostar sadly still can't handle; making mining settings the default.

It's good to see somebody got this board to work. I have problems with this board. I have a tons of PCIe bus errors. Are you sure you haven't changed anything in the bios? PCIe bus speeds auto or gen2, or above 4G MMIO? Also do you use the IGFX or one of the mining card?
Could you share your settings? Smiley

The settings are in the quote you posted; also on the OP:

ensure Mining Mode is enabled in the bios. LINK to PICTURE 

ensure Max TOLUD is set to 3.5 GB in the bios. LINK to PICTURE

NOTE: you must first only connect 6x GPUs, boot, make Bios changes, save and reboot, shutdown, add the other 6x GPUs, attach the USB or SSD and boot

Connect a monitor to the GPU connected to the 16x slot; nvOC and rxOC do not currently support integrated graphics.

Thanks, my problem was my skylake cpu. I needed to add one extra line to grub kernel and it worked fine. ( I couldn't make NVoC work, but I could make SMOS work with 12 cards)
sr. member
Activity: 1414
Merit: 487
YouTube.com/VoskCoin
July 20, 2017, 08:25:42 AM
I have a major issue, all of my miners just completely turned off 30 minutes ago,

Room was around 80 degrees, they never rebooted, breaker wasn't tripped, I have one asic miner in there and it was mining away when I walked in while every other machine was sitting there off?

Any idea on what happened? How can I figure out more and how can I prevent this from happening in the future?

If they are all powered off; I'm not sure what would have caused that.

If they are still on / but not mining:

I would look at one and see if there is a connection problem with the pool.  This is the most likely reason for a large number of rigs to simultaneously stop mining.

pool disconnect detection and mitigation / auto failover can be improved in a later version

Right now; if the pool server goes down: the rig will reinitialize 1bash 5 times then, reboot.  This will occur in a larger loop until the pool server is reachable.  

This occurs because when the pool is not providing work, the GPU utilization will be below 90.




Any idea why they would stay turned off though? The pool did not go down / I called my buddy his rigs on your software on the same pool had no issue?

I have a whole house surge protector, but could it still be a powe surge?
sr. member
Activity: 1414
Merit: 487
YouTube.com/VoskCoin
July 20, 2017, 08:06:41 AM
I have a major issue, all of my miners just completely turned off 30 minutes ago,

Room was around 80 degrees, they never rebooted, breaker wasn't tripped, I have one asic miner in there and it was mining away when I walked in while every other machine was sitting there off?

Any idea on what happened? How can I figure out more and how can I prevent this from happening in the future?
newbie
Activity: 12
Merit: 0
July 20, 2017, 03:56:43 AM
My configuration:
v0018
Biostar TB250-BTC PRO + 12 Zotac P106-100 cards (without output).
When I run it with LOCAL (GT 730 for monitor + 7 P106-100 cards) I see it works.
But when I remove GT 730 adapter and monitor and attach all 12 P106-100 cards and use REMOTE and connect by SSH it doesn't seem to be working.
I tried to run it manually but the OS was rebooted with Xorg error.
Any ideas how to fix it?

P.S. I tried new 1bash - still the same issue.

Code:
m1@m1-desktop:~$ pkill -e miner
m1@m1-desktop:~$ export DISPLAY=:0
m1@m1-desktop:~$ screen -r miner
There is no screen to be resumed matching miner.
m1@m1-desktop:~$ bash /home/m1/1bash


workername: nv045

Xorg PROBLEM DETECTED

Restoring Xorg

Rebooting in 5

FIrst: ensure you have made the 2x bios changes as indicated in the OP for this mobo; and saved / restarted as directed.  If you have made additional bios changes then you should restore the default settings and perform the procedure in the OP. 

Second while troubleshooting I recommend attaching the GPU with output to the primary 16x slot and using 11 of the mining GPUs in the other slots.  Run in local mode.

If you have significantly changed the GPU configuration; especially in regard to the the primary GPU it is likely the system will need to restore the xorg and reboot.  If it does this once it is expected; if it does this in a loop (ie multiple times in a row there is a problem).

Let me know how this goes.

PS: I highly recommend using the ASRock 13x mobo to get out the box; easy setup.  If I was having a lot of trouble with this mobo, I would get one of the ASRock and then return the Biostar when I had the rig running with 13x.




I figured out this was because of wrong xorg.conf.
Used this command:
Code:
sudo nvidia-xconfig -a --cool-bits=28 --allow-empty-initial-configuration
Also commented out this part and forced XORG to be OK:
Code:
XORG="OK"

#if grep -q "28800" /etc/X11/xorg.conf;
#then
#XORG="OK"
#fi
Now the script starts fine.

One more thing. The script doesn't support P106-100 overclokling because of this part:
Code:
___1050_or_1050ti="NO"

NORMAL="NO"

nvidia-smi -L > /tmp/tempa

if grep -q "1050" /tmp/tempa;
then
___1050_or_1050ti="YES"
fi

if grep -q "1060" /tmp/tempa;
then
NORMAL="YES"
fi

"nvidia-smi -L > /tmp/tempa" in case of P106-100 is like this:
Code:
m1@m1-desktop:~$ cat /tmp/tempa
GPU 0: P106-100 (UUID: GPU-afea0b93-e083-bde7-f6dd-fb5b9f55ae98)
GPU 1: P106-100 (UUID: GPU-191d50dc-d599-de1d-fa4b-54493a9035c6)
GPU 2: P106-100 (UUID: GPU-2ae0b358-33bb-8438-f47b-2a2ce8088f88)
GPU 3: P106-100 (UUID: GPU-66bce3b8-51aa-9f9d-f3c5-fce4e667f994)
GPU 4: P106-100 (UUID: GPU-bae124b9-96ad-5086-20f4-32bdb6d2663f)
GPU 5: P106-100 (UUID: GPU-a9664776-7549-499a-6cfa-3b74a6c6c843)
GPU 6: P106-100 (UUID: GPU-4b57123b-20b9-20c6-ffb9-0203a51cf009)
GPU 7: P106-100 (UUID: GPU-f851be56-15e7-adf2-5a65-7508a25e6e66)
GPU 8: P106-100 (UUID: GPU-1249a132-7df6-a1d3-4794-947cd1e1887a)
GPU 9: P106-100 (UUID: GPU-f31fca46-13ad-4eee-5024-177de21d36f9)
GPU 10: P106-100 (UUID: GPU-4161850e-1f6a-7c6b-fda6-03d58826f758)
GPU 11: P106-100 (UUID: GPU-af9286f8-e0c5-2139-87f1-7019b8a1ccca)


So I manually set "TI=2" and now overcloking values are applied.

Code:
TI="2"

if [ $___1050_or_1050ti == "YES" ]
then
    TI="2"
if [ $NORMAL == "YES" ]
then
    TI="2 3"
fi
fi
newbie
Activity: 4
Merit: 0
July 20, 2017, 12:22:16 AM

BaliMiner please provide a BTC address for the next version.


Hil Fullzero this is my BTC address: 1HbzxQ6AVeWYvFm322KtxZcJJLAqfJHpN8
newbie
Activity: 14
Merit: 0
July 20, 2017, 12:15:36 AM
fullzero -- did you managed to get ccminer_alexei78 into this new v18 build?

I still need to add some more ccminer versions; v0018 doesn't have the version I believe you are looking for.



I have compiled the correct alexis78 version for nvoc with arch flags for 10x series cards.

https://mega.nz/#!p64lHS4Q!BpaOMyEx5pL8GhkEXx6WTfgILxMa5FjvreN7jwLxuVE
Jump to: