Author

Topic: Another S9 not working (Read 488 times)

legendary
Activity: 2030
Merit: 1573
CLEAN non GPL infringing code made in Rust lang
July 10, 2019, 11:42:47 AM
#28
Yes, in addition to target temperature exposed in the ui there are two hidden parameters you can change: Hot, which is when fans go 100%, and dangerous which is the one that makes it stop mining/restart.

Here is a sample in the bOS thread.

Remember, it is your personal choice and responsibility, and you can damage your equipment if you mis-configure these parameters. You should find ways to improve the airflow in your place. It is also possible to disable sensors, or ignore just the chip sensors (there are two types of sensors, on chip (remote), and on-board). With: disable-sensors and disable-remote-sensors.

bOS is the most flexible firmware because it doesn't do any hand holding for you, but you alone assume the risks of your mistakes.
jr. member
Activity: 44
Merit: 10
July 10, 2019, 10:12:50 AM
#27
Thanks again.

Ive left them now but will play again tomorrow.

They are still running on slush so all good.

Is there away of changing the max temp on BOS? Checked my other S9 and they tend to run at mid 90s
legendary
Activity: 2030
Merit: 1573
CLEAN non GPL infringing code made in Rust lang
July 10, 2019, 09:41:38 AM
#26
9.4v is too much, if you have to go over 9v, its better to reduce speed in my opinion.

And yes, assume that board is failing, don't expect it to work at spec speed anymore, if it works fine at 500mhz, so be it.



When you install BrainsOS, go and lower those voltages. Yes it loves 9.4v, but don't. I know someone that starts at 8.7v always. Hundreds and hundreds of S9s...

If you went back to factory firmware, it probably changed its ip, you'll have to find it again. Depending on how you made your bOS image, there is a small chance you cloned up the mac address. Plug the miner controller directly to a computer and scan its ip, log in and inspect the hw mac address and compare it with the second one, just in case.

There is nothing factory software can do that you can't replicate manually with bOS. If anything, the factory firmware is going to limit you more. Also bOS is over protective by default, since it reboots if over 95°C while factory might leave it running until 115°C or such. bOS is giving you a second chance to fix your overheating, while factory will happily burn it. More sales right? Smiley

Also as told elsewhere, the sensors tend to fail. That's when you appreciate bOS the more, with factory you would think the hash board is useless, but still works using bOS...
jr. member
Activity: 44
Merit: 10
July 10, 2019, 07:14:30 AM
#25
I got this when I came on this morning.

https://imgur.com/qL1ZJ2N

I have about 30miners running in that room and don't have any issues with others.

Im not worried about damaging this one. but if I have to run it on the Bitmain software with the one of the boards disconnected its not the end of the world.

I have dropped the frequency to 550 and left the voltage at 9.4

I am wanting to install the rest of the s9s and T1's to Braiins but worried I'm going to have issues with them getting hot and turning off.



https://imgur.com/85rCnWJ

Just another update...

Its not working think I will just go back to the Bitmain software.
Unless you have any other ideas?



So I took the memory card out of the miner and moved jumper 4 back.

Now I can't find the miner with the IP address.

I also tried the Braiins firmware on a healthy miner. It got to hot and cut out within minutes. I took the sd card out of that miner and moved the jumper back and cant find that one either!

Please help!
legendary
Activity: 2436
Merit: 6643
be constructive or S.T.F.U
July 09, 2019, 05:48:13 PM
#24
Ok let's break this down

The first and second board are running pretty fine, very a few HW and the temps on the first are fine, and they should be fine on the middle board, maybe a bit warmer but that is still fine, I would not touch the first and second boards.

The 3rd board however has way to many errors, so now it's you call, you have one of two options as Artemis3 mentioned but let me explain a bit more

option 1: increase voltage

pros : you don't lose hash rate
cons :the board will run a hotter

option 2:decrease frequency

pros: board runs cooler
cons: you lose some hash power

do whatever you feel more comfortable with, you need to have a few HW per hour for the hash rate to actually count on the pool side.

after having fixed that, keep the miner on for a couple hours and then check the pool, if the hash rate on your miner is the similar to the one on the pool, you are good, if not then you have some internet/networking issue, pool is too far, latency ,etc.
legendary
Activity: 2030
Merit: 1573
CLEAN non GPL infringing code made in Rust lang
July 09, 2019, 01:28:32 PM
#23
Well as i said if you get too many errors you have to either lower the speed or bump the voltage. And remember the middle board tends to overheat more due to a Bitmain miss design (intake fan too close to the board).

I say your ambient temperatures are fine, they are supposed to work up to 40°C ambient, as long as you keep that air moving, 200cfm per s9 as some have suggested (fresh air separate from exhaust air). Humidity should be above 20% but under 70% (they say up to 90% is ok, but things like the fan grills get rust).
jr. member
Activity: 44
Merit: 10
July 09, 2019, 10:14:08 AM
#22
Thanks very much for your help!!

There has been an issue today this is what its showing now

https://imgur.com/CVgEYdS

https://imgur.com/yY0RyIo

What else can I try? Its 28 deg in the room there in and it gets to about 33max in summer
legendary
Activity: 2436
Merit: 6643
be constructive or S.T.F.U
July 09, 2019, 07:49:35 AM
#21
but under 90°C is still safe i guess, don't let them touch that temp.

This topic has been debated before , a dozen times, some say anything above 90 is bad, other say since Bitmain has 115 as the max chip temp so anything below that is fine, and the two different groups have some reasonable explanations on why their point is right

Now science and school aside, speaking from my own personal experience,  I think temps don't really affect the lifespan of the hash board !

believe it or not, I had some brand new gears run very very very cool and simply burn, I have a small farm built on the roof of my house, where the sun is frying them everyday and they operate above 90 almost most of the time, and some of those are super old gears and guess what? 0 problems !

I know the school boys will argue with that and explain to you how does heat reduce the lifespan of anything electronic , not saying they are wrong ( I do believe that is true ), but these boards are 99% luck based, doesn't matter how nicely you treat them  Roll Eyes
legendary
Activity: 2030
Merit: 1573
CLEAN non GPL infringing code made in Rust lang
July 09, 2019, 04:27:34 AM
#20
Chain 1 looks good.

Chain 2 and 3, you could try lowering the speed a little if you are not satisfied with chip/ideal rate or up the voltage a notch if you don't mind getting them hotter, tho 83°C and fans already maxed is looking a bit too much in the too hot side, but under 90°C is still safe i guess, don't let them touch that temp.

When you used Bitmain's, you probably saw a board at around 602ish and another at like 590ish, that should give you a hint.

I don't have experience with T1s, apparently it works fine with G19s and G29s, but i have seen people occasionally having issues with the 256mb ram controllers.
jr. member
Activity: 44
Merit: 10
July 09, 2019, 02:28:36 AM
#19
Thanks for that just under clocked it with all the boards connected, its been running for a cook of hours and I'm getting this.

https://imgur.com/o6wDmFB

I will see how it does as it gets warmer today.

Im thinking about doing the rest of my s9s and T1s.

I had a look which version to use for T1 and wasn't sure. Does it work well on the t1?
legendary
Activity: 2030
Merit: 1573
CLEAN non GPL infringing code made in Rust lang
July 08, 2019, 05:20:48 PM
#18
I have tried it with the middle board disconnected and it won't run with the Braiins. Well it does but just gets too hot!

It ran with two boards on Antminer Firmware.

You can (should) downclock the boards. If you used Bitmain firmware, you should remember the speed they had in Mhz, then use a speed close to that and lower voltages until you can see ideal and actual rate match, then go back up.

Ie. If 600 Mhz 8.7v, lower to 8.6, 8.5, then back up to 8.6. Etc. (Ignore what Braiins says for auto voltage).

Always seek the lowest voltage you can get while still getting good results (actual and ideal speed match, few or no hw errors).

You can also force the fans, so make sure you check that as well. You can also tweak the target speed (ie 70°C rather than 75°C). Remember that these parameters are not identical to Bitmain's, find your own sweet spot.
jr. member
Activity: 44
Merit: 10
July 08, 2019, 12:47:05 AM
#17
I have tried it with the middle board disconnected and it won't run with the Braiins. Well it does but just gets too hot!

It ran with two boards on Antminer Firmware.
legendary
Activity: 3472
Merit: 3217
Playbet.io - Crypto Casino and Sportsbook
July 05, 2019, 03:46:45 PM
#16
I finally managed to install the braiins os.

I am not getting this

~snip~

I don't know much about braiins OS but it looks like your miner running fine according to the image above.
The only problem is the temp in the 2nd board according to the image it's not showing any temp it means that the 2nd board temp sensor is defective or damaged this might be the reason why the other hashboard showing abnormal temp or status.

How about the reported hashrate on the pool? Are you getting correct hashrate speed?
If everything is fine with Braiins OS much better to stay with this OS just to bypass the 2nd board temp checking.
jr. member
Activity: 44
Merit: 10
July 05, 2019, 09:22:43 AM
#15
I finally managed to install the braiins os.

I am not getting this

https://imgur.com/XE2id7W

https://imgur.com/3tElH3y
legendary
Activity: 3472
Merit: 3217
Playbet.io - Crypto Casino and Sportsbook
June 14, 2019, 04:20:07 PM
#14
Hi Sorry the log was with chain 7 disconnected.

So there is still an issue.

I will try the software on this miner tomorrow when I get some time. I was confused with which firmware to download!

Which one of these do I download?

https://imgur.com/u4a2kdx

The recommended firmware already mention above.

Then step by step guide you can find here https://docs.braiins-os.org/#phase-1-creating-bootable-sd-card-image-antminer-s9-example
or Check this video https://www.youtube.com/watch?v=-W3N0HCObbA for windows or this one https://www.youtube.com/watch?v=6Ft68yIjwUE&vl=en for linux.

Hi Please can someone help me install the Braiins OS software?

Follow this https://docs.braiins-os.org/#option-a-install-from-sd-card
member
Activity: 113
Merit: 31
June 14, 2019, 09:11:58 AM
#13
If installing to an SD card download this file
https://feeds.braiins-os.org/braiins-os_am1-s9_sd_2019-06-05-0-0de55997.img

If installing directly to the controller via a web browser use this
https://feeds.braiins-os.org/braiins-os_am1-s9_web_2019-06-05-0-0de55997.tar.gz

jr. member
Activity: 44
Merit: 10
June 14, 2019, 01:17:41 AM
#12
Hi Sorry the log was with chain 7 disconnected.

So there is still an issue.

I will try the software on this miner tomorrow when I get some time. I was confused with which firmware to download!

Which one of these do I download?

https://imgur.com/u4a2kdx



Hi Please can someone help me install the Braiins OS software?

I don't understand which file to use from there website to flash my SD card with?

Thanks!
legendary
Activity: 3472
Merit: 3217
Playbet.io - Crypto Casino and Sportsbook
June 11, 2019, 09:35:31 AM
#11
It looks like the chain [7] is running fine now according to your kernel logs and the PIC fw is showing the same as other hashboard so I think your problem is solved?

Do you have another problem with the miner?

How about the miner's status in WebGUI?
legendary
Activity: 2030
Merit: 1573
CLEAN non GPL infringing code made in Rust lang
June 11, 2019, 05:24:38 AM
#10
Thanks just been looking on https://braiins-os.org. It all looks a bit complicated.

Do you know if I can download the old software from Bitmain?

There is no complication if your miner has the sd card reader (some don't).

All you do is:
  • put the image in the sd card
  • turn off miner
  • move jumper jp4
  • insert card
  • turn on miner

It cannot be more simpler than that. Well it actually can, if you decide to flash it to your controller instead...

Never ever use Bitmain firmware from 2019, it won't let you downgrade, and closes ssh.
jr. member
Activity: 44
Merit: 10
June 11, 2019, 12:21:05 AM
#9
legendary
Activity: 3472
Merit: 3217
Playbet.io - Crypto Casino and Sportsbook
June 10, 2019, 11:00:24 AM
#8
Do you know if I can download the old software from Bitmain?

Do you mean firmware?

You can download the old firmware from Bitmain website from here https://service.bitmain.com/support/download?product=Antminer%20S9

Are you still having an issue with one hashboard?

Can you post the whole Kernel logs here so that we can analyze your miner's issue.

You can find the kernel logs under system>kernel log> then copy them all and paste it to pastebin.com and put the link here or paste it here as a code.
Don't forget to put the Hashboard chain(7) then run for 15 minutes then start copying the kernel logs and post it here so that we can check if the chain (7) has PIC fw issue.
jr. member
Activity: 44
Merit: 10
June 10, 2019, 03:20:15 AM
#7
Thanks just been looking on https://braiins-os.org. It all looks a bit complicated.

Do you know if I can download the old software from Bitmain?
legendary
Activity: 2436
Merit: 6643
be constructive or S.T.F.U
June 10, 2019, 03:03:25 AM
#6
Ok great news, your miner is working now , seems like i guessed it right ( surprisingly  Grin) , despite there is no reason " or non that i know about" on why would a missing asic on a single board will stop the miner from mining,  still it is a fact.

so what do we know now ?

1- the control board is ok
2- fans are good
3- psu is probably good ( there is a chance that it can run 2 boards but not 3 - but this is very unlikely)

now the only way for you to get that other board to work  would be using a different firmware, you might get lucky and find a firmware that will still force that board to run despite the missing Asic/s, for a starter if you are not using 2019 then simply try old firmware such as the 2017 versions , try every firmware bitmain has except for the 2019 version.

if all fails, or if you are already using 2019 firmware , then i would follow @Artemis3's suggestion and try Braiins OS, in fact you have a better "hope" with it than with all bitmain firmware's combined, but for the sake of simplicity it's always wise to try the "easy" fix first.
jr. member
Activity: 44
Merit: 10
June 10, 2019, 02:02:47 AM
#5
Thanks everyone for the advice.

I have had to pinch a fan from it for another miner. When you say chain 7 do you mean the J7? Would this mean the middle hash board is faulty?



Ok I have just tried disconnecting J7 with using a fan from an A3

These are the results

https://imgur.com/5000xNY
legendary
Activity: 2030
Merit: 1573
CLEAN non GPL infringing code made in Rust lang
June 09, 2019, 07:56:07 AM
#4
Do try booting with Braiins OS, you can simply put an sd card and move jumper jp4 to boot from it. Braiins OS allows fixed frequency settings among other goodies. Take our the sd card and move the jumper back and its like nothing ever happened.

Far better than messing with Bitmain firmware,  especially avoid 2019 versions.
full member
Activity: 538
Merit: 175
June 08, 2019, 12:49:16 PM
#3
What firmware version are you running? It's unusual for an autotune miner to have that ideal rate. I would recommend trying the May 2019 firmware (or Dec 2018, if you are uneasy about no longer having SSH access to the miner), and see if that helps.
legendary
Activity: 2436
Merit: 6643
be constructive or S.T.F.U
June 07, 2019, 03:44:36 AM
#2
You need to post the complete Kernel log.

anyhow, the middle board/chain is missing 1 asic, shouldn't be a problem but I have personally seen a whole gear fail to start mining due a few asics missing from a single board. so here is a list of trouble shooting process you need to go through.

1- disconnect chain 7 , power the miner on and wait for 10-15 mins  see if this solves the problem ( very likely )

2- flash a different firmware version, roll back to old firmware 2017 if possible fixed frequency.

3-perform a hardware reset using the IP report method, while miner is powered-off , hold the IP report button, power on , don't let go of the IP report button , release after 10 seconds.

This issue can also very possibly be related to  PSU so if all the above fails i would test with a working PSU.
jr. member
Activity: 44
Merit: 10
June 06, 2019, 11:49:17 PM
#1
Hi.

I have  had this running for a while and have been having issues with it recently. I cleaned it out this morning with a compressor and now its showing this:

https://imgur.com/nZ9wgMw
https://imgur.com/dadN4tG

Please can someone help? Thanks.
Jump to: