
Topic: cgmon - mining monitor for Linux - auto restart, reboot, sick gpu, ASIC, &more - page 3. (Read 48365 times)

sr. member
Activity: 269
Merit: 250
newbie
Activity: 7
Merit: 0
I think you solved the problem... I had the wrong path to cgmon.tcl in crontab.

Now it shows it's running in the log file.

Will see if it restarts cgminer when a miner goes down.

Thank you
sr. member
Activity: 269
Merit: 250
Is cgmon running every few minutes in your crontab?  If you're not sure, check the log file.



newbie
Activity: 7
Merit: 0
When that happens, run this command and send me the result.

Code:
php -f /tmp/cgmon-api.php notify

I may be able to add support for gridseed with that information.

Ok, miner GSD 3 went down.

Code:
--------------------------------------------------------------------------------
 [P]ool management [S]ettings [D]isplay options [Q]uit
 GSD 0: 8D8A16685449  875 MHz | 370.1K/370.2Kh/s | A:254784 R:512 HW: 51 WU: 1.4/m N: 259 255[49] 244 236 219[2
 GSD 1: 8D841F994849  888 MHz | 375.8K/375.7Kh/s | A:266880 R:  0 HW: 32 WU: 1.5/m N: 231 255 257[28] 241[4] 26
 GSD 2: 8D82348D5753  850 MHz | 359.5K/359.6Kh/s | A:263616 R:256 HW:164 WU: 1.5/m N: 270[65] 278[76] 279[22] 2
 GSD 3: 6D8514714857  913 MHz | OFF   /320.9Kh/s | A:226112 R:256 HW:  5 WU: 1.1/m N: 170 203 186 174[5] 188
--------------------------------------------------------------------------------

 [2014-04-05 13:21:26] Accepted 4a13a3d4 Diff 885/256 GSD 1 pool 0
 [2014-04-05 13:21:32] Accepted 2728d78c Diff 1.67K/256 GSD 2 pool 0
 [2014-04-05 13:21:46] Accepted cdf2dcd2 Diff 318/256 GSD 2 pool 0
 [2014-04-05 13:22:35] Accepted a150cecd Diff 406/256 GSD 2 pool 0
 [2014-04-05 13:22:37] Stratum from pool 0 detected new block
 [2014-04-05 13:22:38] Accepted 0a66c626 Diff 6.3K/256 GSD 1 pool 0
 [2014-04-05 13:22:49] Accepted 551185f5 Diff 770/256 GSD 0 pool 0
 [2014-04-05 13:22:53] Accepted 9b0cd043 Diff 423/256 GSD 1 pool 0
 [2014-04-05 13:23:01] Accepted 2cd1cb1c Diff 1.46K/256 GSD 2 pool 0
 [2014-04-05 13:23:10] Accepted a76977fd Diff 391/256 GSD 1 pool 0
 [2014-04-05 13:23:14] Accepted 5dd9ddf1 Diff 698/256 GSD 0 pool 0
 [2014-04-05 13:23:17] Stratum from pool 0 detected new block


Code:
pi@raspberrypi ~ $ php -f /tmp/cgmon-api.php notify
 notify returned 'STATUS=S,When=1396718524,Code=60,Msg=Notify,Description=cgminer 3.7.2|NOTIFY=0,Name=GSD,ID=0,Last Well=1396718524,Last Not Well=0,Reason Not Well=None,*Thread Fail Init=0,*Thread Zero Hash=0,*Thread Fail Queue=0,*Dev Sick Idle 60s=0,*Dev Dead Idle 600s=0,*Dev Nostart=0,*Dev Over Heat=0,*Dev Thermal Cutoff=0,*Dev Comms Error=0,*Dev Throttle=0|NOTIFY=1,Name=GSD,ID=1,Last Well=1396718524,Last Not Well=0,Reason Not Well=None,*Thread Fail Init=0,*Thread Zero Hash=0,*Thread Fail Queue=0,*Dev Sick Idle 60s=0,*Dev Dead Idle 600s=0,*Dev Nostart=0,*Dev Over Heat=0,*Dev Thermal Cutoff=0,*Dev Comms Error=0,*Dev Throttle=0|NOTIFY=2,Name=GSD,ID=2,Last Well=1396718524,Last Not Well=0,Reason Not Well=None,*Thread Fail Init=0,*Thread Zero Hash=0,*Thread Fail Queue=0,*Dev Sick Idle 60s=0,*Dev Dead Idle 600s=0,*Dev Nostart=0,*Dev Over Heat=0,*Dev Thermal Cutoff=0,*Dev Comms Error=0,*Dev Throttle=0|NOTIFY=3,Name=GSD,ID=3,Last Well=1396710329,Last Not Well=1396710329,Reason Not Well=Thread got zero hashes,*Thread Fail Init=0,*Thread Zero Hash=1,*Thread Fail Queue=0,*Dev Sick Idle 60s=0,*Dev Dead Idle 600s=0,*Dev Nostart=0,*Dev Over Heat=0,*Dev Thermal Cutoff=0,*Dev Comms Error=0,*Dev Throttle=0|'
Array
(
    [STATUS] => Array
        (
            [STATUS] => S
            [When] => 1396718524
            [Code] => 60
            [Msg] => Notify
            [Description] => cgminer 3.7.2
        )

    [NOTIFY0] => Array
        (
            [NOTIFY] => 0
            [Name] => GSD
            [ID] => 0
            [Last Well] => 1396718524
            [Last Not Well] => 0
            [Reason Not Well] => None
            [*Thread Fail Init] => 0
            [*Thread Zero Hash] => 0
            [*Thread Fail Queue] => 0
            [*Dev Sick Idle 60s] => 0
            [*Dev Dead Idle 600s] => 0
            [*Dev Nostart] => 0
            [*Dev Over Heat] => 0
            [*Dev Thermal Cutoff] => 0
            [*Dev Comms Error] => 0
            [*Dev Throttle] => 0
        )

    [NOTIFY1] => Array
        (
            [NOTIFY] => 1
            [Name] => GSD
            [ID] => 1
            [Last Well] => 1396718524
            [Last Not Well] => 0
            [Reason Not Well] => None
            [*Thread Fail Init] => 0
            [*Thread Zero Hash] => 0
            [*Thread Fail Queue] => 0
            [*Dev Sick Idle 60s] => 0
            [*Dev Dead Idle 600s] => 0
            [*Dev Nostart] => 0
            [*Dev Over Heat] => 0
            [*Dev Thermal Cutoff] => 0
            [*Dev Comms Error] => 0
            [*Dev Throttle] => 0
        )

    [NOTIFY2] => Array
        (
            [NOTIFY] => 2
            [Name] => GSD
            [ID] => 2
            [Last Well] => 1396718524
            [Last Not Well] => 0
            [Reason Not Well] => None
            [*Thread Fail Init] => 0
            [*Thread Zero Hash] => 0
            [*Thread Fail Queue] => 0
            [*Dev Sick Idle 60s] => 0
            [*Dev Dead Idle 600s] => 0
            [*Dev Nostart] => 0
            [*Dev Over Heat] => 0
            [*Dev Thermal Cutoff] => 0
            [*Dev Comms Error] => 0
            [*Dev Throttle] => 0
        )

    [NOTIFY3] => Array
        (
            [NOTIFY] => 3
            [Name] => GSD
            [ID] => 3
            [Last Well] => 1396710329
            [Last Not Well] => 1396710329
            [Reason Not Well] => Thread got zero hashes
            [*Thread Fail Init] => 0
            [*Thread Zero Hash] => 1
            [*Thread Fail Queue] => 0
            [*Dev Sick Idle 60s] => 0
            [*Dev Dead Idle 600s] => 0
            [*Dev Nostart] => 0
            [*Dev Over Heat] => 0
            [*Dev Thermal Cutoff] => 0
            [*Dev Comms Error] => 0
            [*Dev Throttle] => 0
        )

)
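In the reply above, GSD 3 is the only record whose "Reason Not Well" is not None (it also has *Thread Zero Hash=1), while the Dev Sick/Dev Dead counters stay at 0. A minimal sketch of pulling that out of the raw reply, assuming the record layout shown here ("|" between records, "," between fields); `sick_devices` is a made-up helper name, not part of cgmon:

```shell
#!/bin/sh
# Hypothetical helper: list devices that the "notify" reply marks as not well.
# Reads the raw reply on stdin; records are separated by "|", fields by ",".
sick_devices() {
    tr '|' '\n' | awk -F',' '
        /^NOTIFY=/ {
            name = ""; id = ""; reason = ""
            for (i = 1; i <= NF; i++) {
                split($i, kv, "=")
                if (kv[1] == "Name") name = kv[2]
                if (kv[1] == "ID") id = kv[2]
                if (kv[1] == "Reason Not Well") reason = kv[2]
            }
            # Report only records with a real reason set
            if (reason != "" && reason != "None") print name " " id ": " reason
        }'
}

# Example: php -f /tmp/cgmon-api.php notify | sick_devices
```

The print_r dump that follows the raw line is ignored, since none of its lines start with NOTIFY=.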
sr. member
Activity: 269
Merit: 250
bro

we need a way to manage and edit cgmon settings for multiple machines.

If somebody has 50-100 PCs, it's very hard to change all the conf info in your file on each PC.

Could you help with that?

Something like a client file and a server file for mass control of cgmon across PCs.

Place a customized copy of cgmon.tcl on a webserver and download it to each of your clients as needed.  

You could even change the auto update URL to your custom copy and then update your cgmon via the built-in update function:
Code:
./cgmon.tcl update
The update function preserves the unique configuration options on each machine.
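For the first option (pulling a customized copy from your own webserver), a rough sketch could look like the following. The URL and paths are placeholders and `fetch_cgmon` is a made-up helper name. Unlike `./cgmon.tcl update`, this replaces the whole file, including any settings edited inside it.

```shell
#!/bin/sh
# Hypothetical helper: replace the local cgmon.tcl with a copy from your own webserver.
# FETCH must print the downloaded file on stdout; "wget -qO-" works as well.
FETCH=${FETCH:-curl -fsSL}

fetch_cgmon() {
    url=$1      # assumed URL of your customized copy
    dest=$2     # local path of cgmon.tcl
    tmp="$dest.new"
    # Download to a temporary file first so a failed transfer never clobbers the working copy
    if $FETCH "$url" > "$tmp" && [ -s "$tmp" ]; then
        chmod +x "$tmp" && mv "$tmp" "$dest"
    else
        rm -f "$tmp"
        echo "download of $url failed, keeping existing $dest" >&2
        return 1
    fi
}

# Example (placeholder URL and path):
# fetch_cgmon http://example.com/cgmon.tcl /home/username/cgmon.tcl
```

Running this from cron or over ssh on each client would keep a larger farm in sync with one master copy.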

sr. member
Activity: 264
Merit: 252
alt coins trader/miner.
bro

we need a way to manage and edit cgmon settings for multiple machines.

If somebody has 50-100 PCs, it's very hard to change all the conf info in your file on each PC.

Could you help with that?

Something like a client file and a server file for mass control of cgmon across PCs.
sr. member
Activity: 269
Merit: 250
If your path to cgmon.tcl was /home/username/cgmon.tcl you could use this to add the line to crontab:

Code:
echo "*/2 * * * *  root	/home/username/cgmon.tcl >/dev/null 2>&1" >> /etc/crontab
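Since a wrong path in crontab is an easy mistake to make, here is a slightly more defensive sketch of the same step. It checks that the script exists and is executable, and it skips the append if an entry is already there. `add_cgmon_cron` is a made-up helper name, and writing to /etc/crontab still needs root.

```shell
#!/bin/sh
# Hypothetical helper: add the cgmon entry to a crontab file only once,
# and only if the given path really points at an executable script.
add_cgmon_cron() {
    script=$1                    # full path to cgmon.tcl
    crontab=${2:-/etc/crontab}   # system crontab by default
    if [ ! -x "$script" ]; then
        echo "$script is missing or not executable" >&2
        return 1
    fi
    entry="*/2 * * * *  root  $script >/dev/null 2>&1"
    if grep -qF "$script" "$crontab" 2>/dev/null; then
        echo "entry for $script already present in $crontab"
    else
        echo "$entry" >> "$crontab"
    fi
}

# Example (as root): add_cgmon_cron /home/username/cgmon.tcl
```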
hero member
Activity: 530
Merit: 500
Sorry, I'm a pre-school level Linux user. Could you break step 2 in the installation instructions down a little more? I'm using BAMT.
sr. member
Activity: 269
Merit: 250
When that happens, run this command and send me the result.

Code:
php -f /tmp/cgmon-api.php notify

I may be able to add support for gridseed with that information.
newbie
Activity: 7
Merit: 0
Hello, need some help.

I have a RaspPi and some gridseeds. Sometimes miners randomly fail and I have to restart cgminer.

Code:
[2014-04-03 03:18:57] GSD 2 GetResults usb read err:(-1) LIBUSB_ERROR_IO
[2014-04-03 03:18:57] No response from 2
[2014-04-03 03:18:57] GSD 2 failure, disabling!
[2014-04-03 03:19:01] Stratum from pool 1 detected new block
[2014-04-03 03:19:03] Accepted aaeacd03 Diff 383/256 GSD 3 pool 1
[2014-04-03 03:19:10] Accepted 8fc9cf7f Diff 456/256 GSD 1 pool 1

I followed everything in the CGmon installation instructions.

I ran ./cgmon.tcl and cgminer starts up, but when a miner fails, cgminer does not restart.

So, is there a way to get CGmon to work with gridseeds?
sr. member
Activity: 269
Merit: 250
I've added the install instructions to the first post in this thread (it's 4 steps).

So you think that this shouldn't interfere with Step 6: Create cgminer startup script or Step 7: Create auto-start scripts?

http://www.cryptobadger.com/2013/04/build-a-litecoin-mining-rig-linux/


Correct.  It shouldn't interfere :)
sr. member
Activity: 269
Merit: 250
Thanks for the script, it's really useful as my rig randomly hangs within a few hours. You saved me from endless nights of panic and waking up to a dead miner. I do have a question about the conf of the script:

Is there an option for CGmon to reboot the miner if CGminer fails to start?

I notice that sometimes after a reboot, CGminer just won't start and gives "Invalid MIT-MAGIC-COOKIE-1 key" and a segmentation fault; a cold reboot fixes the problem nicely. But right now CGmon goes into an endless loop saying "can't find cgminer running and check your conf file" and no reboot takes place. Did I miss the option, or does it not come with this option yet? Right now I just use a small cron script that checks for CGminer every 15 minutes. If CGmon had this built in already, that would be really great!


You can make your system reboot at that point with a small customization.  Find this line in cgmon.tcl:
Code:
				sendmail "[stamp] $hostname - $conf(cgminer_exec) failed to start" "$hostname $conf(cgminer_exec) failed to start.\n mining command was: $mining_command\n"

Add a new line below it with this one command:

Code:
reboot
sr. member
Activity: 269
Merit: 250
Hi,

A few times now I've woken up to a dead system with disk errors.  I press F to fix the errors and everything is fine.  I guess the last time the errors were too much, as this didn't work.  Manually running fsck used up all memory and crashed.  The same thing happened when I ran a live CD and attempted to fix the disk.  So I re-installed the OS :(

I'm wondering if there is a more graceful way to shut down the system and avoid this or are we using best practices already?

Cgmon uses standard shutdown procedures - nothing that would cause file system corruption.

Sounds like you might want a new hard drive on that rig... It might last for a while but it's probably failing.



newbie
Activity: 34
Merit: 0
Hi,

A few times now I've woken up to a dead system with disk errors.  I press F to fix the errors and everything is fine.  I guess the last time the errors were too much, as this didn't work.  Manually running fsck used up all memory and crashed.  The same thing happened when I ran a live CD and attempted to fix the disk.  So I re-installed the OS :(

I'm wondering if there is a more graceful way to shut down the system and avoid this or are we using best practices already?
newbie
Activity: 1
Merit: 0
Thanks for the script, it's really useful as my rig randomly hangs within a few hours. You saved me from endless nights of panic and waking up to a dead miner. I do have a question about the conf of the script:

Is there an option for CGmon to reboot the miner if CGminer fails to start?

I notice that sometimes after a reboot, CGminer just won't start and gives "Invalid MIT-MAGIC-COOKIE-1 key" and a segmentation fault; a cold reboot fixes the problem nicely. But right now CGmon goes into an endless loop saying "can't find cgminer running and check your conf file" and no reboot takes place. Did I miss the option, or does it not come with this option yet? Right now I just use a small cron script that checks for CGminer every 15 minutes. If CGmon had this built in already, that would be really great!

newbie
Activity: 9
Merit: 0
I've added the install instructions to the first post in this thread (it's 4 steps).

So you think that this shouldn't interfere with Step 6: Create cgminer startup script or Step 7: Create auto-start scripts?

http://www.cryptobadger.com/2013/04/build-a-litecoin-mining-rig-linux/
sr. member
Activity: 269
Merit: 250
I've added the install instructions to the first post in this thread (it's 4 steps).
sr. member
Activity: 269
Merit: 250
The instructions are at the top of the cgmon.tcl file and are pretty much step by step.  You just need to know how to edit a text file in Unix (/etc/crontab) and know what paths you're already using for your cgminer setup.  cgmon won't interfere with any of your cryptobadger stuff.
newbie
Activity: 9
Merit: 0
I've been looking for something like this as I'm getting really frustrated with random hangs... For example I went on a training course for 10 hours and discovered a minute after I left my house cgminer hung for the whole time grrrrr.

The problem I have is I'm a linux n00b (shock horror!) and I'll need a little help if possible please!

As background I followed the cryptobadger guide completely for my rig, to the letter, and my concern is that I'm not too sure what I'd need to disable and how to integrate this. Has anyone got the knowhow to help me? I'd need a step by step what to type to stop the cryptobadger and move to this if possible.

If it is straightforward and I don't need to do much, feel free to point me in the right direction!

Of course, it goes without saying, whoever can get me up and running (without these infuriating crashes) will indeed get a donation as I would very much appreciate your time :)

I am an IT professional FYI, just not in Linux...

sr. member
Activity: 269
Merit: 250
One of my GPUs always goes sick/dead... the temperature decreases until it goes dead. I moved it to different PCI slots and got the same thing. Mining lasts for maybe 2-5 minutes and then has to restart.

I definitely have more than enough PSU, as I have the Corsair 1300 watt and am only using 3 HIS IceQ 7950 GPUs, and my Kill-A-Watt says 850 watts.

I'd try lowering your intensity or gpu-engine settings until that card is stable.  Each card is unique.  Your PSU is fine.  Cheers.
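As a rough illustration, cgminer takes comma separated per-card values for both settings, in device order, so one card can be dialed back on its own. The numbers below are placeholders rather than recommendations, `--scrypt` assumes scrypt mining, pool options are left out, and `miner_cmd` is a made-up helper that only prints the command for review:

```shell
#!/bin/sh
# Hypothetical helper: build a cgminer command line with per-card settings.
miner_cmd() {
    intensity=$1     # e.g. "13,13,12", third card one step lower
    gpu_engine=$2    # e.g. "1000,1000,950", engine clocks in MHz
    echo "cgminer --scrypt -I $intensity --gpu-engine $gpu_engine"
}

# Print the command instead of launching the miner
miner_cmd "13,13,12" "1000,1000,950"
```

Lower the value for the unstable card one step at a time until it stops going sick.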