Pages:
Author

Topic: Avalon ASIC users thread - page 100. (Read 438596 times)

full member
Activity: 206
Merit: 100
August 14, 2013, 02:42:38 PM
Some power measurements for Avalon.

Unit is batch#3 w/4 modules, fw 20130723.
Temperature target 65C. Keep in mind this is equiv with 45 for batch#1&2.
Temp1 (input temperature) is 21C .
Temp3 (output temperature) is roughly 15C lower then Temp2.
Wall is wall power in watts.
Errors is percentage of HW errors versus diff1 shares.

freqwallfan1fan3temp2errors
256 685 2280 2040 60
270 722 2280 2160 60
282 755 2400 2160 61
300 803 2400 2160 63 0.94%
325 873 2520 2280 66 1.08%
350 943 3480 3240 63 1.35%

Produces arround 109-110Gh/s @350MHz, confirmed by btcguild (more or less).
At 375MHz unit cannot maintain 65C temperature target. Hash is less, around 105Gh/s, due to large percentage of HW errors.  
-ck
legendary
Activity: 4088
Merit: 1631
Ruu \o/
August 14, 2013, 09:46:49 AM
I am running 0703 and kind of scared to update to 0813/0814.  One person got bricked, several others report decreased hash rates.  Anyone got positive results yet?
Me?
legendary
Activity: 1246
Merit: 1002
August 14, 2013, 09:12:49 AM
Was happily running 20130703 since release with no issues at all. Highly recommended (on 20 or so units - batch 1 and 2, on my batch 3's I just run stock because of the temp sensor change etc). I saw the new firmware and decided to give it a shot on a batch 1 and it bricked it. No reset or nothing could get it back.

I figured no problem. I picked up some extra TL-MR3020 to replace the TP-LINK TL-WR703N router but those didn't flash and in the end I gave up on messing with that (from what I can tell you need a custom one or something).


A retail TP-Link 703n can be found on eBay or Amazon for about $20.  It can be flashed with the 20130703 firmware, and then plugged into the USB cable inside the Avalon.  The LAN cable or WiFi can be used to access it.  It has a mini-USB power cable.  I ran a for-repair Avalon a few hours this way before I got the soldering iron and made a permanent repair.

full member
Activity: 238
Merit: 100
August 14, 2013, 08:44:28 AM
I am running 0703 and kind of scared to update to 0813/0814.  One person got bricked, several others report decreased hash rates.  Anyone got positive results yet?
member
Activity: 110
Merit: 11
August 14, 2013, 08:22:13 AM
Was happily running 20130703 since release with no issues at all. Highly recommended (on 20 or so units - batch 1 and 2, on my batch 3's I just run stock because of the temp sensor change etc). I saw the new firmware and decided to give it a shot on a batch 1 and it bricked it. No reset or nothing could get it back.

I figured no problem. I picked up some extra TL-MR3020 to replace the TP-LINK TL-WR703N router but those didn't flash and in the end I gave up on messing with that (from what I can tell you need a custom one or something).

In the end I found a very simple solution that works awesome:
1) Unplug the USB extender from your TL-WR703N and Ethernet cord
2) Remove the USB extender you won't need it anymore
3) Connect a Raspberry PI to the USB (no power cord needed) and Ethernet
4) Download the latest minepeon from http://sourceforge.net/projects/minepeon/
5) Write this to an SD card for your Raspberry PI
6) Its plug and play so just configure your pools with the web interface

Overclocking (Optional): Warning if you do : Batch 1 upgrading your PSU is mandatory, Batch 2 I didn't upgrade and had no issues. But I definately burned out some batch 1 PSUs.
7) If you want to overclock it which you probably do then just SSH into your PI user: minepeon and pass: peon
- to find PIs easy on your network just install arp-scan (pacman -S arp-scan or yum install arp-scan or apt-get arp-scan)
   and run "sudo arp-scan --interface=eth0 --localnet | grep b8:27:eb" this will show you all your PIs on your network
Cool Once you have SSHed into your PI edit /opt/minepeon/etc/ and change the config file to look something like:
{"algo":"c","avalon-options":"115200:24:10:50:350","api-listen":true,"api-port":"4028","avalon-freq":"350","avalon-temp":"20","avalon-auto":true,"expiry":"120","hotplug":"5","log":"5","no-pool-disable":true,"queue":"1","scan-time":"60","shares":"0","kernel-path":"\/opt\/minepeon\/bin","api-allow":"W:0\/0","pools":[{"url":"stratum.btcguild.com:3333","user":"your_user","pass":"x"},{"url":"stratum.btcguild.com:3333","user":"your_user","pass":"x"},{"url":"stratum.btcguild.com:3333","user":"your_user","pass":"x"}]}

Hopefully this helps someone else out there who has burned out a router or maybe someone wants to try overclocking without messing with their firmware. If you need any help or have any questions just shoot me a PM.

member
Activity: 76
Merit: 10
August 13, 2013, 10:16:00 PM
Maybe I misunderstood something I read earlier, I'm a bit confused with the different batches wanting different temperatures.

I thought with the new firmware that "70 was the new 50" for Batch 2 Avalons.  Did I get that totally wrong?
For batch 3 Avalons, so yes you got it wrong. Remove those lines ASAP.

Thank you so much for your advice, support, and software!
I've removed the temp settings, now only have this:
--avalon-auto --avalon-freq 325-375

I'll throw some BTC your way later today or tomorrow, need to eat now.

Thanks again!

_theJestre
-ck
legendary
Activity: 4088
Merit: 1631
Ruu \o/
August 13, 2013, 09:56:04 PM
Maybe I misunderstood something I read earlier, I'm a bit confused with the different batches wanting different temperatures.

I thought with the new firmware that "70 was the new 50" for Batch 2 Avalons.  Did I get that totally wrong?
For batch 3 Avalons, so yes you got it wrong. Remove those lines ASAP.
member
Activity: 76
Merit: 10
August 13, 2013, 09:55:00 PM
--avalon-auto --avalon-cutoff 80 --avalon-freq 300-360 --avalon-temp 70
Interesting. I uploaded a 20130814 (which is basically the same as the 20130813-1 firmware, just with the cgminer 3.3.4 tag). What I've found with the changed code is that my (batch2) avalon hovers at a lower frequency of 345 instead of 352, but ends up with the same hashrate. If there was something I'd recommend for you on batch 2, it is NOT setting your temperature so high. That is almost certainly contributing to your hardware errors and may damage your chips. If you really want to run them high, I'd suggest 60/70 instead of 70/80, but the defaults are there for a reason.

Maybe I misunderstood something I read earlier, I'm a bit confused with the different batches wanting different temperatures.

I thought with the new firmware that "70 was the new 50" for Batch 2 Avalons.  Did I get that totally wrong?

Thanks,

_theJestre
-ck
legendary
Activity: 4088
Merit: 1631
Ruu \o/
August 13, 2013, 09:45:40 PM
Hi all,

Just saw the new firmware 20130813-1 and thought I'd upgrade from 20130703, which was giving me ~78Gh/s on my batch 2 Avalon.  I was using just --avalon-auto to achieve this.

I'm getting less Gh/s now, probably about 72 or so.  I'm using these options:

--avalon-auto --avalon-cutoff 80 --avalon-freq 300-360 --avalon-temp 70

Am I doing something wrong, do these numbers look completely off base?

Should batch 2 Avalons just stick to the 20130703 firmware?

Thanks,

_theJestre
Interesting. I uploaded a 20130814 (which is basically the same as the 20130813-1 firmware, just with the cgminer 3.3.4 tag). What I've found with the changed code is that my (batch2) avalon hovers at a lower frequency of 345 instead of 352, but ends up with the same hashrate. If there was something I'd recommend for you on batch 2, it is NOT setting your temperature so high. That is almost certainly contributing to your hardware errors and may damage your chips. If you really want to run them high, I'd suggest 60/70 instead of 70/80, but the defaults are there for a reason.
member
Activity: 76
Merit: 10
August 13, 2013, 09:17:09 PM
Hi all,

Just saw the new firmware 20130813-1 and thought I'd upgrade from 20130703, which was giving me ~78Gh/s on my batch 2 Avalon.  I was using just --avalon-auto to achieve this.

I'm getting less Gh/s now, probably about 72 or so.  I'm using these options:

--avalon-auto --avalon-cutoff 80 --avalon-freq 300-360 --avalon-temp 70

Am I doing something wrong, do these numbers look completely off base?

Should batch 2 Avalons just stick to the 20130703 firmware?

Thanks,

_theJestre
legendary
Activity: 1484
Merit: 1026
In Cryptocoins I Trust
August 13, 2013, 05:53:25 PM
Has anyone produced a graph of clock frequency vs wall power on a batch #2 unit?

I have a GX 850 power supply, and I have mixed messages whether it is enough to power a 4 module unit clocked at 347 MHz.

Although I was advised against this, I've had all 7 four module Avalons running at 350mhz with Avalon's stock 850w PSU running stable for exactly 2 weeks now. I did have one PSU die the first day, but I have had no problems with the others. I just replaced the PSU and she was back to hashing within minutes. I assume you would have to be incredibly unlucky for a blown PSU to damage the Avalon's components. In all my years (aka. 1 year  Wink) of GPU mining, never has a blown PSU caused any extra hardware failure (again, in my experience... your mileage may vary.)

As we used to say when I was a snowboard bum in Colorado for a few years after graduating high school, "go big or home." (then proceed to pee your pants while hucking a 50 ft cliff)  Wink
-ck
legendary
Activity: 4088
Merit: 1631
Ruu \o/
August 13, 2013, 05:43:32 PM
2x 4 module with 1250W PSU and firmware 20130723.

I'm using --avalon-auto --avalon-fan 90-100 and Chip Frequency: 350M, MHS5s shows ~105000 for both.


Questions:

1) Is there a way to determine if the chips are really hashing and not just producing what I know as "Hardware Errors"?

2) One of the machines only shows a number under "Fan3", the other one under "Fan1" and "Fan3". Need I be worried?

1. Hashrate is only calculated from non-HW errors on the avalon driver
2. No
legendary
Activity: 1246
Merit: 1002
August 13, 2013, 04:46:30 PM
I used firmware 20130703 before upgrading to 20130813-1, I get more HW errors and the chips are clocked to 346mhz instead of the normal 350-355mhz.
It is a batch2 avalon.

What could be the case?

I have updated from 20130703, to 20130810 for a few hours, then to 20130810-1 most of today.  I will let it run overnight, but I seem to see a decrease also.
donator
Activity: 1890
Merit: 1010
Parental Advisory Explicit Content
August 13, 2013, 04:37:31 PM
I used firmware 20130703 before upgrading to 20130813-1, I get more HW errors and the chips are clocked to 346mhz instead of the normal 350-355mhz.
It is a batch2 avalon.

What could be the case?
sr. member
Activity: 472
Merit: 250
August 13, 2013, 02:17:04 PM
Has anyone produced a graph of clock frequency vs wall power on a batch #2 unit?

I have a GX 850 power supply, and I have mixed messages whether it is enough to power a 4 module unit clocked at 347 MHz.


I wouldn't even dare.

4 module B3 stock @300mhz draws ~850W from the wall.  Load a PSU @ 100% 24/7 is asking for a bad time!
legendary
Activity: 1246
Merit: 1002
August 13, 2013, 02:10:39 PM
Has anyone produced a graph of clock frequency vs wall power on a batch #2 unit?

I have a GX 850 power supply, and I have mixed messages whether it is enough to power a 4 module unit clocked at 347 MHz.

sr. member
Activity: 259
Merit: 250
Dig your freedom
August 13, 2013, 01:35:21 PM
2x 4 module with 1250W PSU and firmware 20130723.

I'm using --avalon-auto --avalon-fan 90-100 and Chip Frequency: 350M, MHS5s shows ~105000 for both.


Questions:

1) Is there a way to determine if the chips are really hashing and not just producing what I know as "Hardware Errors"?

2) One of the machines only shows a number under "Fan3", the other one under "Fan1" and "Fan3". Need I be worried?

2weiX thx for ask for that.

Is it possible to perform  tests which will determine which chip in the chain is  generates errors/are broken?
Usually chains contain 10 chips (Avalon and Burnin BB)
Any one can write some procedure which will return the test result of individual chips ?
full member
Activity: 203
Merit: 100
August 13, 2013, 12:54:52 PM
Have you seen this wiki post:

 About [usb 1-1: clear tt 1 (8030) error -71]

Not all 703n have this problem. ignore this section if you never meet this error

    There is a power issue with the 703N, The 703N is drawing to much power it caused the USB HUB chip on Senseless's FPGA controller to nearly destroy itself. See the destruction [ http://www.mysenselesslife.com/avalon/DSCN5212.JPG here]

    In order to fix it you need had to power down the WiFi modem by disable it, use Eithernet instead. The kernel no long report -71 errors. thanks to senseless and others who help on identify the issue.

    If your avalon was far away from your router. eithernet cable not fit. you may want try those kind of devices
        TP-LINK TL-PA500(For mainland China) : http://www.tp-link.com.cn/product_adapter_263.html
        TP-LINK TL-PA511 : http://www.tp-link.com/en/products/details/?model=TL-PA511
member
Activity: 84
Merit: 10
August 13, 2013, 12:34:26 PM
I am using WIFI...
Today (after 3 days of not checking the unit), i found it with LOAD of over 18, and hashing at 5xxx MH only...
I did a cgminer restart in system-services and hash rates returned to normal... (90.000MH), but load stayed over 15 ...
any ideas?

i checked kernel log, and looks normal until approx 3,5day into operation when bunch of entries like these show:
[316582.730000] usb 1-1: clear tt 1 (9031) error -71
[316584.160000] usb 1-1: clear tt 1 (8030) error -71
[316584.180000] usb 1-1: clear tt 1 (8030) error -71
[316584.190000] usb 1-1: clear tt 1 (8030) error -71
[316584.200000] usb 1-1: clear tt 1 (0030) error -71
[316589.840000] usb 1-1: clear tt 1 (8030) error -71
[316589.850000] usb 1-1: clear tt 1 (8030) error -71
[316589.860000] usb 1-1: clear tt 1 (8030) error -71
...
..
16824.590000] usb 1-1: clear tt 1 (8030) error -71
[316824.620000] usb 1-1: clear tt 1 (0030) error -71
[316828.000000] usb 1-1.1: USB disconnect, device number 3
[316828.150000] usb 1-1: reset high-speed USB device number 2 using ehci-platform
[316828.630000] usb 1-1.1: new full-speed USB device number 4 using ehci-platform
[316828.810000] ftdi_sio 1-1.1:1.0: FTDI USB Serial Device converter detected
[316828.850000] usb 1-1.1: Detected FT232RL
[316828.850000] usb 1-1.1: Number of endpoints 2
[316828.850000] usb 1-1.1: Endpoint 1 MaxPacketSize 16384
[316828.860000] usb 1-1.1: Endpoint 2 MaxPacketSize 16384
[316828.860000] usb 1-1.1: Setting MaxPacketSize 64
[316828.870000] usb 1-1.1: FTDI USB Serial Device converter now attached to ttyUSB0
[316830.890000] ftdi_sio ttyUSB0: FTDI USB Serial Device converter now disconnected from ttyUSB0
[316830.900000] ftdi_sio 1-1.1:1.0: device disconnected




I notice that sometimes the total MHS5s hashrate drops to weird low number (like 4 digits only)... is this normal?

No.

If you are using an ethernet connection, you must delete the WWAN device from the 'network' interfaces, and under network/wifi/avalon_ap make sure it shows 'wireless network is disabled'. Until I did both of these step I had the same problem.



Since this describes my problem, i tried this link below, but its not working no more?
ideas?

thanks,
Jaka

One of my batch 3 avalons is performing very poorly due to a high HW errors.  Unplugging all by one module, each module individually has between a 25 and 75% HW error rate at either 256 or 300Mhz.

This degraded performance starting after about 6 hours of mining at 300Mhz.

Is there anything I should try before trying to make a warranty claim?
I was calculating the hardware error rate wrong.  It should be HW/LocalWork, which brings the error rates i'm seeing to a much more reasonable 1%.

Regardless, 3 out of 4 of my batch 3 units end up with <10Gh/s after 3-12hrs of mining, some more frequently than others.

Could this behavior be due to the new temperature throttling feature?  I have the default temperature limits of 70C target and 90C cutoff, but I don't see the temps going over 70C.

I'm having this same issue.

EDIT: so is the solution is to disable wifi or to implement the load monitor with auto-restart? Is this issue resolved?

For me, disabling/removing wifi/wlan wasn't enough.

If you already removed wifi and it's still a problem, look at your load avg, if it's spiking, then you probably need the auto-restart I posted at https://bitcointalk.org/index.phptopic=140539.msg2898478#msg2898478

If the load avg is low (under 1.0) then you should check all the connectors inside your avalon.  I did have one of the wide ribbon cables wiggle loose (I probably bumped it installing the psu) and the result was about 20%+ lower peak hashes and odd hangups.


legendary
Activity: 2058
Merit: 1005
this space intentionally left blank
August 13, 2013, 11:16:09 AM
2x 4 module with 1250W PSU and firmware 20130723.

I'm using --avalon-auto --avalon-fan 90-100 and Chip Frequency: 350M, MHS5s shows ~105000 for both.


Questions:

1) Is there a way to determine if the chips are really hashing and not just producing what I know as "Hardware Errors"?

2) One of the machines only shows a number under "Fan3", the other one under "Fan1" and "Fan3". Need I be worried?
Pages:
Jump to: