Pages:
Author

Topic: Smartcoin Linux mining administration. [MULTI-MACHINE SUPPORT NOW IN!] - page 6. (Read 105029 times)

hero member
Activity: 504
Merit: 502
Hey johndecker

Ive recently started to try out poclbm and with smartcoin it doesnt seem to read at all on the "dashboard"

The processes do load in screen but on the dashboard it simply shows that the gpu's run at 99% and on the gpu shares monitoring they just remain as
can you detach from smartcoin and run
Code:
screen -r miner
and post the output of one of the miners?

Hi John

The output of the miner screen session (all instances of gpus start correctly) is that of running poclbm instance.

There is no error, just poclbm running as it should on each gpu instance behind miner screen.

The problem is of course that these screen instances do not reflect as running on the dashboard since it only shows for each gpu.
full member
Activity: 238
Merit: 100
Update r633e now available:
- New installer uses the new AutoDetection routine (the same one that runs on remote machines). It also uses the new settings code.
- When adding a new machine, default settings are now filled out
- The status screen now only grabs settings information for the current machine





All that is left to do now for full multi-machine support is figure out how to get the miner screen instance to launch on a remote host.  I may play with that a bit today, but I expect to have it figured out in a day or two


Something new with failover?
As you see profile 1 CPU fail. And same time in profile 2 is CPU happily hashing with full hashpower.

Code:
Smartcoin r633e 23:39:46
----------------------------------------
Host: localhost
GPU[0]: Temp: 71.00 load: 99%
GPU[1]: Temp: 73.00 load: 99%
GPU[2]: Temp: 77.00 load: 99%
GPU[3]: Temp: 70.00 load: 99%
CPU Load Avgs: 6.59 6.63 6.60

Profile: Failover
--------BTCGuild--------
GPU[0]: [103.79 MHash/s] [194 OK] [1 Bad] [.515% Bad]
GPU[1]: [103.74 MHash/s] [199 OK] [4 Bad] [2.010% Bad]
GPU[2]: [103.78 MHash/s] [171 OK] [1 Bad] [.584% Bad]
GPU[3]: [103.79 MHash/s] [215 OK] [1 Bad] [.465% Bad]
CPU:    <<>
Total : [415.10 MHash/s] [779 OK] [7 Bad] [.898% Bad]

Failover to: Mt.Red
--------MtRed--------
GPU[0]: [105.89 MHash/s] [185 OK] [3 Bad] [1.621% Bad]
GPU[1]: [103.74 MHash/s] [248 OK] [4 Bad] [1.612% Bad]
GPU[2]: [103.79 MHash/s] [197 OK] [1 Bad] [.507% Bad]
GPU[3]: [103.78 MHash/s] [211 OK] [1 Bad] [.473% Bad]
CPU:    [17.6 MHash/s] [33 OK] [0 Bad] [0% Bad]
Total : [434.80 MHash/s] [874 OK] [9 Bad] [1.029% Bad]

Grand Total : [849.90 MHash/s] [1653 OK] [16 Bad] [.967% Bad]
{/code]

I just took a peek at your box and everything is running fine now - did you do anything that solves the  problem?

The "<<>>" status means that either the miner failed to launch, or crashes sometime after launch. If it happens again, can you post the output of the miner instance here?
full member
Activity: 238
Merit: 100
Hey johndecker

Ive recently started to try out poclbm and with smartcoin it doesnt seem to read at all on the "dashboard"

The processes do load in screen but on the dashboard it simply shows that the gpu's run at 99% and on the gpu shares monitoring they just remain as
can you detach from smartcoin and run
Code:
screen -r miner
and post the output of one of the miners?
newbie
Activity: 56
Merit: 0
Update r633e now available:
- New installer uses the new AutoDetection routine (the same one that runs on remote machines). It also uses the new settings code.
- When adding a new machine, default settings are now filled out
- The status screen now only grabs settings information for the current machine





All that is left to do now for full multi-machine support is figure out how to get the miner screen instance to launch on a remote host.  I may play with that a bit today, but I expect to have it figured out in a day or two


Something new with failover?
As you see profile 1 CPU fail. And same time in profile 2 is CPU happily hashing with full hashpower.

Code:
Smartcoin r633e 23:39:46
----------------------------------------
Host: localhost
GPU[0]: Temp: 71.00 load: 99%
GPU[1]: Temp: 73.00 load: 99%
GPU[2]: Temp: 77.00 load: 99%
GPU[3]: Temp: 70.00 load: 99%
CPU Load Avgs: 6.59 6.63 6.60

Profile: Failover
--------BTCGuild--------
GPU[0]: [103.79 MHash/s] [194 OK] [1 Bad] [.515% Bad]
GPU[1]: [103.74 MHash/s] [199 OK] [4 Bad] [2.010% Bad]
GPU[2]: [103.78 MHash/s] [171 OK] [1 Bad] [.584% Bad]
GPU[3]: [103.79 MHash/s] [215 OK] [1 Bad] [.465% Bad]
CPU:    <<>
Total : [415.10 MHash/s] [779 OK] [7 Bad] [.898% Bad]

Failover to: Mt.Red
--------MtRed--------
GPU[0]: [105.89 MHash/s] [185 OK] [3 Bad] [1.621% Bad]
GPU[1]: [103.74 MHash/s] [248 OK] [4 Bad] [1.612% Bad]
GPU[2]: [103.79 MHash/s] [197 OK] [1 Bad] [.507% Bad]
GPU[3]: [103.78 MHash/s] [211 OK] [1 Bad] [.473% Bad]
CPU:    [17.6 MHash/s] [33 OK] [0 Bad] [0% Bad]
Total : [434.80 MHash/s] [874 OK] [9 Bad] [1.029% Bad]

Grand Total : [849.90 MHash/s] [1653 OK] [16 Bad] [.967% Bad]
{/code]
hero member
Activity: 504
Merit: 502
Hey johndecker

Ive recently started to try out poclbm and with smartcoin it doesnt seem to read at all on the "dashboard"

The processes do load in screen but on the dashboard it simply shows that the gpu's run at 99% and on the gpu shares monitoring they just remain as
full member
Activity: 238
Merit: 100
Update r633e now available:
- New installer uses the new AutoDetection routine (the same one that runs on remote machines). It also uses the new settings code.
- When adding a new machine, default settings are now filled out
- The status screen now only grabs settings information for the current machine





All that is left to do now for full multi-machine support is figure out how to get the miner screen instance to launch on a remote host.  I may play with that a bit today, but I expect to have it figured out in a day or two
full member
Activity: 238
Merit: 100
Update r613 now available!

Jon, assuming that's the "e" Experimental release?  I'm only seeing r496s in the stable branch.  True?
Or do I need to do a complete re-install to move into the r6xx stable branch?

Yes, 'e' stands for experimental, 's' stands for stable.  You can change the settings in Edit Settings from the control screen.
496 is the latest stable release - though pretty much everything new in the experimental branch has tested to be safe.  One thing you could do if you wanted, is:
1) Switch to the experimental branch from Edit Settings
2) Do an update
3) Switch back to stable from Edit settings

What this will do, is update you to the latest experimental revision, and then set you back to stable so that any new updates from then on will wait until the next stable update rolls out (essentially this means that you will be at r627s right now, and you will get no further experimental updates until I increment the stable release counter again.  This is an undocumented feature.
full member
Activity: 238
Merit: 100
Update r627e now available:
- More improvements to the way SQL queries are retried if they fail. It is even more robust now
- Many many many lower-level changes for multiple-machine support (99% complete now)
- Lots of work on new autodetect routines (99% complete also)
- Experimenting with a new way to set the DISPLAY=:0 before aticonfig commands are called. Please report if you have any problems.




FOR THOSE WISHING TO TEST MULTI_MACHINE SUPPORT:
You can now test the Adding and Deleting of machines (editing not supported yet)
There is a hidden control screen menu item #13. From there you can add/delete machines to test the new routines.  Note that you won't be able to actually control the machines yet, so I wouldn't bother adding profiles for the other machines yest (though you can for testing purposes).  This information will be useful to let me know that the remote information is being detected correctly, and that the low-level routines which communicate over ssh sockets work correctly.  Please report back your experiences if you do play around with it a little.
member
Activity: 84
Merit: 10
Update r613 now available!

Jon, assuming that's the "e" Experimental release?  I'm only seeing r496s in the stable branch.  True?
Or do I need to do a complete re-install to move into the r6xx stable branch?
full member
Activity: 238
Merit: 100
italeffect:
thanks for the report.  I'm committing new code that may help
sr. member
Activity: 255
Merit: 250
Sorry posted prematurely. I was able to fix it by fully killing smartcoin and then relaunching with DISPLAY=:0 smartcoin
sr. member
Activity: 255
Merit: 250
Just updated to r617e.

While accessing my miners from ssh...
Seeing a repeating message at the bottom of my screen sessions (screen #1) that says "aticonfig: This program must be run as root when no X server is active".

Tried running as:

smartcoin
screen -x smartcoin
DISPLAY=:0 smartcoin
DISPLAY=:0 screen -x smartcoin

Can't get it to go away. And of course the % load and temps do not show up with this error.

Thanks for any help.
full member
Activity: 238
Merit: 100
Update r613 now available!

- The settings table now goes through a verification process every boot. This process ensures that the correct settings are available in the database, and there are no duplicates.  This also ensures that it stays up to date with changes automatically(i can change the description information, or re-order the settings in an update)

- The settings now have an information field with more detailed instructions on what they do. This information is now displayed when you edit a setting.

- The entire settings system is now re-written and fully supports multiple machines.

- The control interface has also been rewritten.  You now "drill down" the settings more logically (Edit Settings->General Settings->SettingName in the case of general settings, OR EditSettings->MachineSettings->Machine#->SettingName in the case of machine settings)

- There is a partial update (brakpoint) in this update.  You will fist  update to r607. After a restart of smartcoin, Running the update again will take you from r607 to r613.  I had to do it this way to avoid some potential problems with the massive change of the settings system.


Please post any problems or suggestions.
full member
Activity: 238
Merit: 100
Update r590e now available!
- The ugly dependency of needing a phoenix path setting is now gone. (the setting is still in the Edit Settings menu for now, though it is now defunct).  When miners are launched, it is checked whether or not the miner is phoenix, and if so use the path already stored in the phoenix miner's database entry instead of from the settings table.

- Some lower-level changes were made that should eventually lead to being able to do miner instance reloads/killing/starting on the fly without killing and recreating the screen session.

- The failover routine was optimized a bit. It used to update the table every iteration. It now only updates the table if there was an actual  change.

- The status script and the monitoring functions have all now been made multi-machine aware.  Sadly, I haven't had a chance to test the parts that run the commands on remote machines but I'll be doing those tests soon. For now, I do know that it knows when a command should run on the local machine, and works as it should.



To give an idea of what still needs done for multi-machine support:
- I'm 90% finished with the new AutoDetect routine which will be run locally by the installer, and also remotely when a machine is added. Most of what I need to do now revolves around testing.

- The settings table needs a revamp so that it holds general settings, as well as settings specific to each machine (each machine could have a different AMD/ATI SDK location for example).  This will require a schema update that uses the breakpoint feature of the update system.  I may push out this shema update later today.  While I'm at it, I'll add an extra field to put better descriptions of the settings so when you edit them you will be given a more complete description of what the setting does.

- The new settings descriptions will need added to the routine that initially populates the database, as well as pushed out as an update patch to existing installs.

- The Configure Settings routine will need revamped to separate out general and specific machine settings, and use the new description feature.  This will actually be pretty simple.

- The Add Machine routine is finished and tested already. The part that adds the information permanently to the database is deactivated for now for my own testing purposes.  I just need to add a call to the new AutoDetection routine so that the miners/devices etc. add to the database automatically.

- I need to implement the Edit Machine and Delete Machine routines.  This isn't very hard, and I may even wait on these as they aren't all that important at the moment.

-Currently, the routine that launches commands on remote machines queries the database for the remote machine information each call.  I need to optimize this to load this information once at the beginning so that we don't pound the database for no good reason. (another easy task)

- Here is the biggest thing right now. I have to rewrite the routine that creates the screen session and actually launches the miner instances so that this happens on the appropriate machine (locally or remotely).  Quite actually, multi-machine would work right now if this were implemented, the AMD/ATI location was the same on all machines and I enabled the new features.

As you can see, we are getting very close. Please report any bugs that you may find with these latest releases, as this will ensure that I catch any problems in the new multi-machine code (which is fully active and being used for the last several updates). Technically, if it works for running commands locally, it should work just as well running commands remotely - so getting this testing out of the way now will help ensure that when multi-machine stuff is finished, it will be a smooth transition!
newbie
Activity: 41
Merit: 0
Thank you for exclude option !
full member
Activity: 238
Merit: 100
Update r576e now available!
- General cleanup of the new AutoDetect routine
- 1, 5, and 15 minute load averages now displayed!
- Failover order can now exclude profiles.  Simply don't add the ones you don't want to the comma-separated-list.  The Set Failover Order menu option has been rewritten to clarify this as well.
full member
Activity: 238
Merit: 100
As there seems to be some confusion regarding the CPU load averages, I recommend that you change the text from "CPU Load" to "CPU load averages" and show all three averages instead of just the 1 minute number. That's what people are used to from running "w" and "top".

plantucha: If you're running CPU mining with CGminer, the load that puts on the CPU is included in the "CPU Load" that's displayed in smartcoin. It's the same number that's displayed when you run "top". It's not a percentage.
Yeah, good idea. I'll change the wording and display all 3 averages
full member
Activity: 168
Merit: 100
I'll have a steak sandwich and a... steak sandwich
As there seems to be some confusion regarding the CPU load averages, I recommend that you change the text from "CPU Load" to "CPU load averages" and show all three averages instead of just the 1 minute number. That's what people are used to from running "w" and "top".

plantucha: If you're running CPU mining with CGminer, the load that puts on the CPU is included in the "CPU Load" that's displayed in smartcoin. It's the same number that's displayed when you run "top". It's not a percentage.
full member
Activity: 238
Merit: 100
Just thought I would give a sneak-peak of a new feature that will be available once multi-machine support is in.
You will be able to define "Macros" (macro-profiles to be specific)
This will allow you to define a macro that basically says: (for example)
"Switch Machine 1 to profile 'A'"
"Switch Machine 2 to profile 'B'"
"Switch Machine 3 to profile 'A'"
"Leave machine 4 alone to continue to mining to its current profile"
"Switch Machine 5 to 'Failover'"
Switch Machine 6 to 'Automatic'"

Then by simply choosing one of the macros you defined, all machines will switch their profiles according to the rules you defined in your macro.  Of course, there will be some auto-created macros (Failover, Automatic and Manual Donate) which will switch all profiles on all machines to their Failover, Automatic or Manual Donate profile.

Just thought I'd post this information so you can start familiarizing yourselves with the concept.
sr. member
Activity: 897
Merit: 284
I'm holding off on switching the rest fo my rigs over to Linux until the multi-machine support is enabled. I use LinuxCoin with SmartCoin on 2 of them now, so I'm really, really anxious for the multi-machine support. Incredible work!
Pages:
Jump to: