Pages:
Author

Topic: Hacking KNC Titan / Jupiter / Neptune miners back to life. Why not? - page 28. (Read 76605 times)

newbie
Activity: 16
Merit: 0
Another question: isnt the standard password för SSH "admin"?
newbie
Activity: 16
Merit: 0
Does the green light go out when the ASICs are connected?

The white light coming on followed by green usually means the FPGA has booted and all is well. That's really weird.


The green light is on. This is really wierd. Have done a lot of reboot and hardware resets aswell.
legendary
Activity: 3094
Merit: 2239
I fix broken miners. And make holes in teeth :-)
Does the green light go out when the ASICs are connected?

The white light coming on followed by green usually means the FPGA has booted and all is well. That's really weird.
newbie
Activity: 16
Merit: 0
Hi!

I have a problem with my KNC Neptune. The Web interface is fucked up, the status-site and the advanced-site does now load. The others are ok.
Does anyone else have this problem, and did you manage to solve it? Or is it a hardware-failure?


others on here know more but off the top of my head get a new SD card. I use on my Titans Sandisk Class 10 type.

So it is an easy way to tell. I had it happen on a Titan where it just mucked up..and so just copied my settings and re-did it on a new SD cleared up.

Again others on here would say...also probably obvious put I assume you powered down and rebooted the works? (not sure of you expertise)


anyway something to do till better folk chime in here anyway



Ive rebooted the miner from the software a lot of times, and turned of the PSU a lot of times aswell. Reseted the controller with sd-card aswell. Have a Sandisk card.

There is a way to re-install the software on an SD card up on the KNC web site. For a neptune it's simply putting a bunch of files on a blank sd card, then plugging it in and letting it rip. That should restore it. For a Titan you use something like w32diskimager to re-install the software, then go from there.

If that doesn't work does the white light come on then off on the controller board or does it just flash?

C

Done that.  The status page and the advanced page are ok untill the asics are connected to the board. Then they stop working. The white light flashes one time and then the green starts to shine.
legendary
Activity: 3094
Merit: 2239
I fix broken miners. And make holes in teeth :-)
Hi,

I am running a titan with 4 cubes since a few weeks, one of the cubes started acting up.
It had 3 dies running (4th disables), suddenly one of the working dies went idle, no matter what I do it doesn't seem to work, so I disabled it.
Running with 2 dies.
After a few days another die started acting up, after a few hours (at best) runtime the whole unit stops hashing and the system tries to reconfigure the dies (gentarkin mod), last time it said it failed to reconfigure said die in the cube. I also saw a "Got nonce for unknown work in slot 3" message for that die at least once.
It was running on 325/-0.0513 , changed it to 300/-0.0513 and gonna let it run the night, will see how it goes in the morning. *update* Didn't help...

Someone said I should have it reflowed, would that be able to help or any chance it could make this worse? What should I do?

Thanks for any help Smiley

Hm. How good are you at taking things apart? It's possible you have a heat sink compound that is no longer connecting the unit to the sink. When you take the sink and bracket off, clear all the old stuff off with alcohol, all of it, then put a thin layer of compond on the ship and sink, and screw down the screws tight enough that the heat sink no longer can torque around. Doesn't need to be crushing tight just snug.

C
legendary
Activity: 3094
Merit: 2239
I fix broken miners. And make holes in teeth :-)
Hi!

I have a problem with my KNC Neptune. The Web interface is fucked up, the status-site and the advanced-site does now load. The others are ok.
Does anyone else have this problem, and did you manage to solve it? Or is it a hardware-failure?


others on here know more but off the top of my head get a new SD card. I use on my Titans Sandisk Class 10 type.

So it is an easy way to tell. I had it happen on a Titan where it just mucked up..and so just copied my settings and re-did it on a new SD cleared up.

Again others on here would say...also probably obvious put I assume you powered down and rebooted the works? (not sure of you expertise)


anyway something to do till better folk chime in here anyway



Ive rebooted the miner from the software a lot of times, and turned of the PSU a lot of times aswell. Reseted the controller with sd-card aswell. Have a Sandisk card.

There is a way to re-install the software on an SD card up on the KNC web site. For a neptune it's simply putting a bunch of files on a blank sd card, then plugging it in and letting it rip. That should restore it. For a Titan you use something like w32diskimager to re-install the software, then go from there.

If that doesn't work does the white light come on then off on the controller board or does it just flash?

C
newbie
Activity: 16
Merit: 0
Hi!

I have a problem with my KNC Neptune. The Web interface is fucked up, the status-site and the advanced-site does now load. The others are ok.
Does anyone else have this problem, and did you manage to solve it? Or is it a hardware-failure?


others on here know more but off the top of my head get a new SD card. I use on my Titans Sandisk Class 10 type.

So it is an easy way to tell. I had it happen on a Titan where it just mucked up..and so just copied my settings and re-did it on a new SD cleared up.

Again others on here would say...also probably obvious put I assume you powered down and rebooted the works? (not sure of you expertise)


anyway something to do till better folk chime in here anyway



Ive rebooted the miner from the software a lot of times, and turned of the PSU a lot of times aswell. Reseted the controller with sd-card aswell. Have a Sandisk card.
copper member
Activity: 2898
Merit: 1464
Clueless!
Hi!

I have a problem with my KNC Neptune. The Web interface is fucked up, the status-site and the advanced-site does now load. The others are ok.
Does anyone else have this problem, and did you manage to solve it? Or is it a hardware-failure?


others on here know more but off the top of my head get a new SD card. I use on my Titans Sandisk Class 10 type.

So it is an easy way to tell. I had it happen on a Titan where it just mucked up..and so just copied my settings and re-did it on a new SD cleared up.

Again others on here would say...also probably obvious put I assume you powered down and rebooted the works? (not sure of you expertise)


anyway something to do till better folk chime in here anyway

newbie
Activity: 16
Merit: 0
Hi!

I have a problem with my KNC Neptune. The Web interface is fucked up, the status-site and the advanced-site does now load. The others are ok.
Does anyone else have this problem, and did you manage to solve it? Or is it a hardware-failure?
newbie
Activity: 47
Merit: 0
Hi,

I am running a titan with 4 cubes since a few weeks, one of the cubes started acting up.
It had 3 dies running (4th disables), suddenly one of the working dies went idle, no matter what I do it doesn't seem to work, so I disabled it.
Running with 2 dies.
After a few days another die started acting up, after a few hours (at best) runtime the whole unit stops hashing and the system tries to reconfigure the dies (gentarkin mod), last time it said it failed to reconfigure said die in the cube. I also saw a "Got nonce for unknown work in slot 3" message for that die at least once.
It was running on 325/-0.0513 , changed it to 300/-0.0513 and gonna let it run the night, will see how it goes in the morning. *update* Didn't help...

Someone said I should have it reflowed, would that be able to help or any chance it could make this worse? What should I do?

Thanks for any help Smiley
legendary
Activity: 3094
Merit: 2239
I fix broken miners. And make holes in teeth :-)
This is interesting. So I fixed a bridge board for someone and since the source of the failure was their Pi was shorted hard (oh well) I got a new Rpi 2 model B.

Note: The one on the Titan is a Rpi MOdel B+ version 1.2 which is *NOT* source compatible with the Rpi2 in the boot loader code. Thus the default Titan code from KNC will not boot on a Model 2B+.

Instead you need the Rpi Model B+ version 1.2.

I could probably hack it to work but I feel a bit lazy today.
legendary
Activity: 3094
Merit: 2239
I fix broken miners. And make holes in teeth :-)
Well the taco board is working:

--------------------------------------------------------------------------------
 KNC 0:       | 43.62/40.47/40.58Mh/s | A:2478 R:3+0(.12%) HW: 331/.41%
 KNC 1:       | 74.94/74.33/74.54Mh/s | A:4749 R:5+0(.11%) HW: 863/.59%
--------------------------------------------------------------------------------

KNC0 runs with two dies at 300mhz each, the other two cause the weird mumbling error with lost nonces all over the place. Turn them off and it runs more happily. KNC1 is the control running at 275mhz.

Fixing this sag required me to put the board in the jig, snug it up to add some tension, then pre-heat to 300c followed by 390c hot air on the chip for 2 minutes. That seems to have melted the solder (yes, lots of 951 flux) balls enough to re-seat part of the chip. Further heat probably will not fix the other two dies, and I am advising the owner that 2 dies is a lot better than 0. :-)

On to the next problem.
legendary
Activity: 3094
Merit: 2239
I fix broken miners. And make holes in teeth :-)
Here's a new one: A board that is shaped kind of like a Taco:





And the back is burned as well:



Note: Don't screw down the heat sink too hard. Adding washers should be done with a lot of care as this can cause the board to flex, get hot, then delaminate under the chip.

Hm......
legendary
Activity: 3094
Merit: 2239
I fix broken miners. And make holes in teeth :-)
So a busy weekend. On this second board that came in with the totally burned +12v lines I decided to try and hook every supply into the +12v bypass bus like so:



For a total of seven connections to the board (the last supply is too close to the power bus to be able to connect it directly). Once again the board works fine at 200mhz on all dies but when I brought supplies 1,2,3,4 (the four supplies up by the front of the board, not the back near the supply) to 250mhz I would get running fine for hours, then one supply would drop off and shut down that die. Odd.

So adding supply points doesn't seem to fix that. Still, it's a 55-60mh cube now instead of a 0mh cube so that's nice. But it seems that wiring every supply does not improve the performance to full.

Back to figuring out the shorts in the chip dies, but at this point I seem to be able to fix totally cremated plugs as long as the chip is not shorted (can be seen by checking resistance of pins 4,6,8 to ground)
legendary
Activity: 3094
Merit: 2239
I fix broken miners. And make holes in teeth :-)
Had to rebuild another board with a burned +12 supply line. One pin was gone, one pin had blown the inside open, the third pin had 20 ohms of resistance. Not too great.

Up and running right now at 200mhz, purrs along. Tomorrow the new Rpi comes in, so we can test if the transfer boards can be fixed. Should be do-able, just need to find my wire-wrap wire.....

legendary
Activity: 3094
Merit: 2239
I fix broken miners. And make holes in teeth :-)
Not sure. Why would you want to go to 1.00? 1.06 runs fine.
hero member
Activity: 754
Merit: 500
1xBit the largest casino
How can i flash Firmware from 1.06 to 1.00.

The Current Neptune firmware persist everytime.


How can i can flash it via SSH ?. "downgrade" for jupiter.
legendary
Activity: 3094
Merit: 2239
I fix broken miners. And make holes in teeth :-)
One of my bridge boards did this, exactly as you described.  I've got a new (identical) replacement pi.  Would you say that the copper line repair is more of a novice or advanced job?  One of my copper lines seems to have survived, the other smoked.
Eh, not that bad. To do it right you need to remove one of the sockets with air heat, then clear the fault and run a nice 24 gauge wire-wrap wire between that post and pin 3 on the bottom pin set. But you could just glue down the bad bits and jumper directly with said wire if you're in a hurry. And who isn't in a hurry.

But you need to keep that burned line secured with glue or something in the center of the connector, otherwise it is going to short to a gpio line and all hell *will* break loose.
newbie
Activity: 14
Merit: 0
One of my bridge boards did this, exactly as you described.  I've got a new (identical) replacement pi.  Would you say that the copper line repair is more of a novice or advanced job?  One of my copper lines seems to have survived, the other smoked.
legendary
Activity: 3094
Merit: 2239
I fix broken miners. And make holes in teeth :-)
Well, here is how the Titan bridge boards blow up:



I can fix that More to the point the failure chain is this:

Raspberry Pi shorts out it's internal DC-DC converter circuits.

Results in high resistance on +5/gnd on pins 1/2, but as soon as power is applied the board shorts as it tries to bring up the low voltage supplies.

Big +5 supply from the controller board shorts, blows up via line.

Fixable.
Pages:
Jump to: