Pages:
Author

Topic: Avalon ASIC users thread - page 79. (Read 438582 times)

sr. member
Activity: 294
Merit: 250
September 17, 2013, 07:26:27 PM
Guys, my batch3 3 module unit has two 12V EPS connectors on PDU. Do I have to connect them both?
Thanks


yes u have to...
legendary
Activity: 4592
Merit: 1851
Linux since 1997 RedHat 4
September 17, 2013, 07:08:48 PM
Or you could just plug it via USB into a computer and run cgminer on that computer
(i.e. the tp-link being the problem use a computer instead)
newbie
Activity: 27
Merit: 0
September 17, 2013, 02:20:10 PM
Good day everyone, I just wanted to post a State of the Hanns message...

Since I last posted, in order to rule out the TP-Link as the issue, we wired the modules from Hanns (broken Avalon) into Ada (working Avlaon).

Using the USB cord that would normally go from Ada's Control Unit to Ada's TP-Link, we instead plugged into Hanns' TP-Link.

The result was that Ada was able to mine through Hanns' TP-Link totally normally (even showed up with Hanns' credentials in the pool).  So this would seem to mean that the problem isn't with the TP-Link itself.  Next I was thinking of moving the Control Unit from Hanns and swapping into Ada to confirm that the issue is with the Control Unit. 

There is a blue and white ribbon cable running from the CU to the PDU and I wanted to confirm that it is able to be removed from the Control Unit?

I'll also be trying a few of the software fixes mentioned here, but since we were able to mine with the card using the TP-Link from the broken machine, it would seem that it isn't a software issue related to CGMiner or the router itself.

One final thing, I was in contact with Christian over at Bitmine.  He has been very helpful and I can't say enough good things about their customer service for being so prompt and honest in their dealings.  What I have now ascertained is that the Control Units that Bitmine is able to supply don't work with the PDU for Batch 3 Avalons. 

Does anyone know more about the difference between the Batch 3 and previous batches PDU's and where one might obtain such a rare and valuable gem? 

Anyways, hopefully that helps someone along to claiming the 1BTC bounty that I am currently offering for anyone who can provide a fix for this ailing machine. 

newbie
Activity: 14
Merit: 0
September 17, 2013, 01:24:38 PM
Guys, my batch3 3 module unit has two 12V EPS connectors on PDU. Do I have to connect them both?
Thanks


Mine do too, and yes, I did connect them all: 3 x 8-pin pci-e, 2 x 8-pin EPS 12v, 1 x 24pin mobo

All the higher wattage PSU's I've seen come with two eps 12v connectors anyway (one 8pin and one 2x4pin).

I have another one running stable for couple of days, there I connected 1x24pin, 2xEPS and 2xPCIe(!).
I just wanted to know if I can get some cheaper 650-750W PS which usually come with 1xEPS.
I will probably test it myself but if anyone has feedback it would be appreciated.
sr. member
Activity: 278
Merit: 250
September 17, 2013, 01:17:54 PM
Guys, my batch3 3 module unit has two 12V EPS connectors on PDU. Do I have to connect them both?
Thanks


Mine do too, and yes, I did connect them all: 3 x 8-pin pci-e, 2 x 8-pin EPS 12v, 1 x 24pin mobo

All the higher wattage PSU's I've seen come with two eps 12v connectors anyway (one 8pin and one 2x4pin).
newbie
Activity: 14
Merit: 0
September 17, 2013, 01:10:38 PM
Guys, my batch3 3 module unit has two 12V EPS connectors on PDU. Do I have to connect them both?
Thanks


Can you take a photo?  How many PCIe connectors does it have?

Can't take photo now, but it has two plus one 8-pin PCIe, 2xEPS,
1x24 ATX and 4x external (module) connectors.
donator
Activity: 1218
Merit: 1079
Gerald Davis
September 17, 2013, 12:49:08 PM
Guys, my batch3 3 module unit has two 12V EPS connectors on PDU. Do I have to connect them both?
Thanks


Can you take a photo?  How many PCIe connectors does it have?
newbie
Activity: 14
Merit: 0
September 17, 2013, 12:46:48 PM
Guys, my batch3 3 module unit has two 12V EPS connectors on PDU. Do I have to connect them both?
Thanks
full member
Activity: 175
Merit: 100
September 17, 2013, 06:52:09 AM
Have you tried connecting your laptop to the fpga controller with a new usb cable?
Or even just swap the usb cables between units and see if the problem follows.

To disable wifi completely remove it from Network - Interfaces tab then from Network - DHCP and DNS tab uncheck "This is the only DHCP in the local network" option.

You can also free up some memory by removing the wifi kernel modules: https://bitcointalksearch.org/topic/m.2907980

Also if you have an rpi you can try replacing the tplink with it and a new cable: https://bitcointalksearch.org/topic/m.2931964
sr. member
Activity: 294
Merit: 250
September 17, 2013, 06:36:52 AM
Ok then.
Stupid question, you got the other unit fully working on usb mining? just so we are sure that the usb mining setup is correct.
It is important to connect all power cables wherever they is a space, it is way too much current for one or two cables. There is proof of a guy frying his unit when only connecting one powercable (all current on one cable).
Then, you can try the boards one by one in the miner, see if you get them to work with only one unit (make sure to sett the proper mining amount cards in the cgminer settings).
Last step, would be to test each of the distribution cards from the working miner in your faulty one.
To set lan, just login to your unit, set the lan interface to a fixed ip on the same subnet as your router, disconnect from the wifi, connect the cable to your router.
you can the access the unit using lan on the ip that you just set it to.
newbie
Activity: 27
Merit: 0
September 16, 2013, 06:48:28 PM
did you try skipping the router, plugging usb directly to pc? disconnect inside avalon.
sometimes the router pcb faults due to overheating.. the cards are 20usd on ebay..
disable wlan. only use lan. seems your lan goes on and off?
did you try modules one by one in your working module to see if one is faulty?

I've run CGMiner directly using the USB and received all the same error messages. 

Just now I've used the working machine to run the modules from the sick machine (named Hanns) and can now confirm that all the modules work and are able to mine when connected to the still-working Avalon, an identical Batch 3 machine named Ada.  This would seem to narrow down the problem a little bit, we used the cords from Ada, so it is possible that there is a problem somewhere with a cable gone wrong.

I'll try to get it mining via lan-only, but I've had trouble configuring that in the past for some reason.  Any tips or a link to a decent guide (or even half a guide) on how to mine via lan would be extremely helpful.  It seems that I'm a little confused as to how to actually disable the wireless.

As a curious side note, when I opened up Ada only 2 of the 4 plugs going from the PSU to the machine were actually plugged in inside.  In contrast Hanns had all four 8 pin connections in place.  I found this odd as both machines have been mining for quite some time with this configuration and was almost certain that the way Hanns came was the way it should be set up.  Seeing that Ada hasn't had any problems and was only opened up so I could test the modules in Hanns and do a little cleaning, I was curious if anyone else had found their units to be like this inside when open?

Anyways, will be checking this thread often and trying everything.  Currently am pursuing a replacement FPGA as it seems like one of the leading candidates for being the issue and wouldn't be a bad thing to have laying around if it isn't.

Thanks again everyone.  Let's save Hanns! 

P.S. Did I mention the 1BTC bounty?
member
Activity: 77
Merit: 10
September 16, 2013, 04:56:51 PM
Regarding the wire mod, I did it on B3, 3-mod system. I also improved the cooling by replacing the original thingamajig with thermal compound on each chip.  Before any changes, I was at ~83.5 Mhs with 356 MHz. After the wire + cooling mods, I got up to 85.9 Mhs with 366 MHz, and Device Hardware% => 1.6485.
Hopefully that helps you to decide whether to do the mods.

I'm also finalizing the resistor mod with 33k's. I'll have an update on that shortly.
sr. member
Activity: 294
Merit: 250
September 16, 2013, 04:46:35 PM
did you try skipping the router, plugging usb directly to pc? disconnect inside avalon.
sometimes the router pcb faults due to overheating.. the cards are 20usd on ebay..
disable wlan. only use lan. seems your lan goes on and off?
did you try modules one by one in your working module to see if one is faulty?
newbie
Activity: 27
Merit: 0
September 16, 2013, 01:49:35 PM
goto processes - find cgminer - kill it, then once the page refreshes kill it again. then goto system and startup and find cgminer and then push RESTART.

Just tried this to no avail.  Same status after I kill the process.  Of note is that before killing the process it does say that cgminer is at 120% memory usage.  Killed twice, the process disappears from the list.  Go back and Restart (not Start) and CGMiner starts back up but doesn't mine and produces the same result.  For good measure I restarted the whole machine and tried this process again.  Also proceeded to reflash the firmware and tried this method again.  No luck.

Of interest to anyone who is after the Bounty or who can otherwise help, the first time I went to start up the Avalon to try this method, I was unable to get the fans to spin up using the original power supply. I was working on this unit less than 12 hours previous, nothing had been touched and now I was seeing the Red and Yellow LED's and no spinning fans from the old PSU.  (whereas in my previous post I indicated that I was able to at least get the unit to power up with the old unit)

So, I went back to the new confirmed working PSU that I had bought, and this time instead of reusing the PCI-E cables that come with the Avalon, I used the PCI-E cables that came with my new PSU.  (Reusing the old cables made me fairly confident that I had plugged in PCI-E to PCI-E and CPU to EPS before)  The new cables or just the divine providence of Satoshi made it so that the new PSU actually powered up the fans and got me back to the Green and Yellow lights and the same state of affairs as before.  Not mining.

Since I now have a working new PSU, I'm going back and trying everything that has been tried before (disconnecting and reconnecting modules individually, etc...) to make sure I haven't missed a step in checking everything.

Thanks to everyone who has tried to help so far, hopefully someone will get us pointed in the right direction and the bounty gets paid out soon.
legendary
Activity: 966
Merit: 1000
September 16, 2013, 12:58:29 PM
Are you sure you connected the new power supplies correctly?

Pay careful attention to which connectors are for the PCIe power cables, and which is for the EPS 12V.  They should be labeled on the PCB.

legendary
Activity: 1036
Merit: 1000
Nighty Night Don't Let The Trolls Bite Nom Nom Nom
September 16, 2013, 12:51:18 PM
goto processes - find cgminer - kill it, then once the page refreshes kill it again. then goto system and startup and find cgminer and then push RESTART.
newbie
Activity: 27
Merit: 0
September 16, 2013, 12:38:45 PM

Newest error message

CGminer Status (freshly flashed with only network settings changed)

CGminer, Kernal and System Logs are in link below
https://bitcointalksearch.org/topic/m.3089903


If anyone has any idea of what could be wrong, help is greatly appreciated..
Thanks in advance

1. Your top 2 pictures/screen shots are too small ... can't read a thing
2. Firmware wise .. use this one: http://ck.kolivas.org/apps/cgminer/avalon/20130821/openwrt-ar71xx-generic-tl-wr703n-v1-squashfs-factory.bin
[
Hi there, thanks for the respownse.  I'll post up a better screenshot later today, but the biggest takeaway from it would be the large number of Local GetWorks that were being produced, as well as an error message "AVA0: Get avalon read get err 0"

The issue of the Local GetWork is also illustrated in the larger picture of the output from Luci.  In the meantime, I will give a brief summary and offer up a 1BTC bounty for whoever can help solve this issue:

After happily mining along at 350mhz, this Batch 3 four module unit suddenly just halts all mining.  I was home when it happened and it wasn't any kind of slow wind-down, it just kicked off.  This would happen every so often and this machine or its sister unit (identical batch, modules, settings) would always just restart themselves, no worries.  (ckolivas, thank you so very much for all the time and effort you have put into making CGMiner the great program that it is, btw)

Of course, this time, the LED's on the back just stayed Green and Yellow.  I attempted the following fixes:

Made sure that all cables internally were secure
Reset the firmware to factory by flashing it via the button on the back (failsafe mode)
Reflashed the firmware to the one you linked to above (as well as a couple of others, just for good measure)
Removed the power and data cables from all modules and reconnected them individually
Attempted to mine using the USB and running CGMiner from a laptop
And finally tried to replace the power unit with two different compatible units (will explain the results of that below)

On the last point, both power units when swapped in would produce the same result.  The top LED would turn Red and the bottom would remain Yellow and then even after turning off and disconnecting the power unit from the wall, the LED's would remain lit for a short period of time.  The fans would never spin up and I never tried to communicate with or otherwise use the Avalon in that state.  When putting the original PSU back in, the fans would start-up and the LED's would go back the the (now) usual state of being Green and Yellow.

A few other things of note: 

It will ping to the outside when testing it using the Luci console and trying to reach openwrt.org

The FPGA shows a single solid green LED

Hopefully this is enough for someone to provide some guidance on what to try moving forward, I'd love for nothing more than to have to pay this bounty today. Smiley

CGminer, Kernal and System Logs are in link below
https://bitcointalksearch.org/topic/m.3089903
sr. member
Activity: 278
Merit: 250
September 16, 2013, 11:09:35 AM
Just my luck... another burnt part, in a different unit.  Unlike my other one there were no unusual symptoms beforehand.  Same settings as the last unit.




Now I'm wondering if unplugging the affected miner from the bus would work.  Might be tough to do, but it would let me use the other seven miners on that module without having to try and repair the broken one.


I suspect you might just be overclocking way too aggressively.

I have mine (Batch 1) overlcocked to 350 (constant) and I don't let it pass from 45C. The fans are set to 40%-100% with them running mostly at about 2000 to 2400 RPM~. Note: That is because I am using dust filters on the intake. So if I take them off the fan RPM would go even lower....but the unit become dirty on the inside and needs to be cleaned out.

I let avalon-auto do the overclocking.  350 is where this one was staying, for days and days.  Fans are at 100%, it's a batch 3 and the temp always hovered in the mid 60's.

Perhaps the recommendations being adhered to were by a very good batch 3 that could take thermals that high?

Yours seems to be suffering...if a third module goes out then I think we can probably say that not all Avalon Batch 3's should be allowed to go that high in temps. Especially when Avalon usually has a very poor thermal interface with the metal heatsink.

Oh and I also have a heavy duty laptop cooler beneath my Avalon Batch 1 to cool the bottom aluminum plate.

http://www.amazon.com/gp/product/B004G603E2/ref=oh_details_o03_s02_i00?ie=UTF8&psc=1


You might be right about the recommended temp being too high.  I'm torn between keeping my two unaffected units on avalon-auto up to 350Mhz and 70c (to squeeze out every last satoshi while I can), and lowering the target temp to prolong their lives.


hero member
Activity: 516
Merit: 500
September 16, 2013, 09:40:41 AM

Newest error message

CGminer Status (freshly flashed with only network settings changed)

CGminer, Kernal and System Logs are in link below
https://bitcointalksearch.org/topic/m.3089903


If anyone has any idea of what could be wrong, help is greatly appreciated..
Thanks in advance

1. Your top 2 pictures/screen shots are too small ... can't read a thing
2. Firmware wise .. use this one: http://ck.kolivas.org/apps/cgminer/avalon/20130821/openwrt-ar71xx-generic-tl-wr703n-v1-squashfs-factory.bin
newbie
Activity: 27
Merit: 0
September 15, 2013, 05:49:08 PM
http://i44.tinypic.com/20j3hxv.jpg

Newest error message

http://i39.tinypic.com/29gfm78.png

CGminer Status (freshly flashed with only network settings changed)

CGminer, Kernal and System Logs are in link below
https://bitcointalksearch.org/topic/m.3089903





If anyone has any idea of what could be wrong, help is greatly appreciated..
Thanks in advance
Pages:
Jump to: