Topic: [FIXED] Avalon URGENT ISSUE: both of my avalon do not work any more

legendary
Activity: 1890
Merit: 1003


If you are not familiar, you'll have plenty of fun trying to master vi first though ;-)
Fun....yes....lots of um fun.  Grin
legendary
Activity: 1890
Merit: 1003
With your collective help I got it going.

Despite following the Wiki's instructions I did not know I had to restart the cgminer process.

Like JohnyJ, I assumed it would automatically "work" if you just plugged it in. Strangely, even a hard reset didn't help. Only when I manually restarted the process did it pick up the changes.
sr. member
Activity: 336
Merit: 251
Avalon ASIC Team
The easiest way is to flash it without "keep settings" (you just need to add the pool and user).
The other way, if you upgraded the firmware:
after you set the password,
use WinSCP to connect to the IP of the Avalon using an SCP connection, user root, and the password that you set;
navigate to the folder /etc/init.d/, select cgminer, press F4 to edit, add the IP, and save;
then, from System, stop and start the cgminer process.

No need, flashing new firmware does NOT overwrite existing settings.
legendary
Activity: 1112
Merit: 1000
had the same problem
if you flash without keep settings you need to change the cgminer file in /etc/init.d/ and add "W:127.0.0.1"
https://en.bitcoin.it/wiki/Avalon#20130321

How do I go about doing that? (I don't know the linux system very well)

You don't need to log into the system as root with SSH if you don't feel like it, you can add the W:127.0.0.1 value in
the Cgminer Configuration tab, the bottom field "API Allow"

If you want to do it by hand, then log into the system with SSH as user root, go to /etc/config/
and edit the file cgminer by adding at the end of the file:

Quote
   option api_allow 'W:127.0.0.1'

if you restart cgminer it should pick up the new config values

Check the wiki for instructions

If you are not familiar, you'll have plenty of fun trying to master vi first though ;-)
sr. member
Activity: 388
Merit: 250
The easiest way is to flash it without "keep settings" (you just need to add the pool and user).
The other way, if you upgraded the firmware:
after you set the password,
use WinSCP to connect to the IP of the Avalon using an SCP connection, user root, and the password that you set;
navigate to the folder /etc/init.d/, select cgminer, press F4 to edit, add the IP, and save;
then, from System, stop and start the cgminer process.
sr. member
Activity: 249
Merit: 250
had the same problem
if you flash without keep settings you need to change the cgminer file in /etc/init.d/ and add "W:127.0.0.1"
https://en.bitcoin.it/wiki/Avalon#20130321

How do I go about doing that? (I don't know the linux system very well)

LuCI Web Upgrade Process
1. Download a suitable OpenWrt firmware image file.
2. Log in to the web interface of the router (default: http://192.168.1.1).
3. Select System ⇒ System ⇒ Custom Files.
4. Select System ⇒ Flash Firmware.
5. Upload the OpenWrt image file you downloaded to your PC in step 1 to your router via LuCI.
6. LuCI will calculate the MD5 checksum of the file; if it's correct, you are good to go.
7. Wait until the router comes back online.

Edit: Bitsyncom beat me to it lol.
sr. member
Activity: 336
Merit: 251
Avalon ASIC Team
had the same problem
if you flash without keep settings you need to change the cgminer file in /etc/init.d/ and add "W:127.0.0.1"
https://en.bitcoin.it/wiki/Avalon#20130321

How do I go about doing that? (I don't know the linux system very well)

http://wiki.openwrt.org/doc/howto/generic.sysupgrade#luci.web.upgrade.process
legendary
Activity: 1890
Merit: 1003
had the same problem
if you flash without keep settings you need to change the cgminer file in /etc/init.d/ and add "W:127.0.0.1"
https://en.bitcoin.it/wiki/Avalon#20130321

How do I go about doing that? (I don't know the linux system very well)
sr. member
Activity: 388
Merit: 250
had the same problem
if you flash without keep settings you just need to add the pool and user
if not, you need to change the cgminer file in /etc/init.d/ and add "W:127.0.0.1"
https://en.bitcoin.it/wiki/Avalon#20130321
hero member
Activity: 798
Merit: 1000
try different pool and/or different protocol?
check all internal connections?
try to change PS?
legendary
Activity: 1890
Merit: 1003
I applied the API allow fix, didn't work.

I reflashed the firmware without saving the previous configuration. It started to hash away on the default settings that come with the firmware.

As soon as I changed the settings to BTCGuild (with stratum), it went right back to the same error.

------------------------

Any ideas?
legendary
Activity: 1890
Merit: 1003
I also find these lines in the Kernel Log:

[   27.800000] br-lan: received packet on eth0 with own address as source address
[   30.800000] br-lan: received packet on eth0 with own address as source address
[   30.830000] br-lan: received packet on eth0 with own address as source address
[   31.030000] br-lan: received packet on eth0 with own address as source address
[  961.170000] br-lan: received packet on eth0 with own address as source address
[ 1090.480000] br-lan: received packet on eth0 with own address as source address
legendary
Activity: 1890
Merit: 1003
I ran into the same problem tonight. It started with 30 minutes of no internet connectivity due to my ISP's problem with their gateway (20 minutes); then cgminer was restarted several times by cron and eventually entered a mode that only throws error messages like this on the status page:

Socket connect failed: Connection refused
Socket connect failed: Connection refused
Socket connect failed: Connection refused
Socket connect failed: Connection refused

After the network connection came back, even a hard restart didn't clear cgminer's error, so flashing the firmware was the only choice, and it fixed the problem for now.
I have come across the exact same problem. It won't mine.

Did you flash the firmware without the previous configuration?
legendary
Activity: 1988
Merit: 1012
Beyond Imagination
I ran into the same problem tonight. It started with 30 minutes of no internet connectivity due to my ISP's problem with their gateway (20 minutes); then cgminer was restarted several times by cron and eventually entered a mode that only throws error messages like this on the status page:

Socket connect failed: Connection refused
Socket connect failed: Connection refused
Socket connect failed: Connection refused
Socket connect failed: Connection refused

After the network connection came back, even a hard restart didn't clear cgminer's error, so flashing the firmware was the only choice, and it fixed the problem for now.
hero member
Activity: 896
Merit: 532
Former curator of The Bitcoin Museum
Thanks for sharing.
But not everyone will have this problem; it seems you were unlucky. Sad

Shouldn't 297 other people have this problem?
legendary
Activity: 1064
Merit: 1000
so what happened, why did the firmware need to be reinstalled/upgraded?

Due to early shipment. They released a patch after that particular unit had been shipped.
sr. member
Activity: 379
Merit: 250
Thanks for sharing.
But not everyone will have this problem; it seems you were unlucky. Sad
hero member
Activity: 675
Merit: 507
Freedom to choose
so what happened, why did the firmware need to be reinstalled/upgraded?
legendary
Activity: 1064
Merit: 1000
firmware updated: problem fixed.
newbie
Activity: 46
Merit: 0
So the issue is related to the OpenWrt firmware? Does cgminer itself work well or not?
hero member
Activity: 924
Merit: 1000
To all gentlemen,

Issue fixed immediately after I uploaded the latest firmware to the Avalon.   Grin

Thanks to the Avalon team for their prompt support.

+ 1 To Avalon.
+ 1 to Liberty for sharing your problem.
+ 1 to all those who offered help.
sr. member
Activity: 388
Merit: 250
there is also the oc version

20130127:
  * Add overclock code
  * Change the cgminer configure to UCI system
  * Add the simple web ui

openwrt-ar71xx-generic-tl-wr703n-v1-squashfs-factory-20130128-32-oc.bin 27-Feb-2013 20:05  3.8M

Can you check it out?

Regards
Thorvald
full member
Activity: 137
Merit: 100
Can't open that. Sad  Would you send it to me? [email protected]

Sir, check your email please.
legendary
Activity: 1918
Merit: 1570
Bitcoin: An Idea Worth Spending
To all gentlemen,

Issue fixed immediately after I uploaded the latest firmware to the Avalon.   Grin

Thanks to the Avalon team for their prompt support.

Note, Avalon CS in China working at 1:30 PM Saturday afternoon, with issue starting ~5 hours earlier.
full member
Activity: 126
Merit: 100
I made like three posts asking you to update to latest firmware, dude, including the first reply of the thread (even though I edited it in 30 seconds after making the post).
Really happy to see that it fixed the issue for you.  Smiley
legendary
Activity: 1064
Merit: 1000
To all gentlemen,

Issue fixed immediately after I uploaded the latest firmware to the Avalon.   Grin

Thanks to the Avalon team for their prompt support.

So where to get the latest firmware?

can you please put a link in first post? there is one friend in need.

thanks Smiley
legendary
Activity: 1064
Merit: 1001
To all gentlemen,

Issue fixed immediately after I uploaded the latest firmware to the Avalon.   Grin

Thanks to the Avalon team for their prompt support.

Fantastic!

Thanks for the update, and happy mining Wink
full member
Activity: 137
Merit: 100
To all gentlemen,

Issue fixed immediately after I uploaded the latest firmware to the Avalon.   Grin

Thanks to the Avalon team for their prompt support.
legendary
Activity: 1064
Merit: 1000
Does anyone know if it's possible to run both connected to only one 703n? It's USB, and there are USB hubs. Did you try that?
foo
sr. member
Activity: 409
Merit: 250
Did issue begin when the difficulty recently readjusted, perhaps?

Bingo! Seems to me that OP's problems started exactly then. If that is the case then jgarzik's Avalon should also have b0rked, unless the new firmware fixed something.

I did not adjust anything recently.  For btcguild it is 32. For ozco it is -1 (means dynamic).
Network difficulty, not share size. The difficulty changed from 3,651,011 to 4,367,876 at 2013-03-01 17:39:09 UTC. Was that when your Avalons stopped?
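
To put rough numbers on the share-size versus network-difficulty distinction, here is a minimal Python sketch. It uses only figures quoted in this thread (share difficulty 32 at btcguild, network difficulty 3,651,011 before and 4,367,876 after the retarget); the function and constant names are made up for illustration.

```python
# Figures quoted in the thread; all names here are illustrative only.
OLD_NETWORK_DIFF = 3_651_011
NEW_NETWORK_DIFF = 4_367_876
SHARE_DIFF = 32  # per-share difficulty configured at the pool


def percent_change(old: float, new: float) -> float:
    """Relative change from old to new, in percent."""
    return (new - old) / old * 100.0


def shares_per_block(network_diff: float, share_diff: float) -> float:
    """Average number of shares of size share_diff per block found."""
    return network_diff / share_diff


if __name__ == "__main__":
    change = percent_change(OLD_NETWORK_DIFF, NEW_NETWORK_DIFF)
    print(f"network difficulty change: +{change:.1f}%")
    ratio = shares_per_block(NEW_NETWORK_DIFF, SHARE_DIFF)
    print(f"shares per block at diff {SHARE_DIFF}: {ratio:,.0f}")
```

The share size only sets how often the miner reports to the pool; the network difficulty is what changed on 2013-03-01.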
full member
Activity: 137
Merit: 100
Did you change anything?

No. I did not change anything. In fact it was quite stable in the past 14 days. Never had issue before.

Is there free space?

Code:
root@OpenWrt:~# df
Filesystem           1K-blocks      Used Available Use% Mounted on
rootfs                     320       224        96  70% /
/dev/root                 2816      2816         0 100% /rom
tmpfs                    30916        88     30828   0% /tmp
tmpfs                      512         0       512   0% /dev
/dev/mtdblock3             320       224        96  70% /overlay
overlayfs:/overlay         320       224        96  70% /
full member
Activity: 137
Merit: 100
Did issue begin when the difficulty recently readjusted, perhaps?

Bingo! Seems to me that OP's problems started exactly then. If that is the case then jgarzik's Avalon should also have b0rked, unless the new firmware fixed something.

I did not adjust anything recently.  For btcguild it is 32. For ozco it is -1 (means dynamic).
foo
sr. member
Activity: 409
Merit: 250
Did issue begin when the difficulty recently readjusted, perhaps?

Bingo! Seems to me that OP's problems started exactly then. If that is the case then jgarzik's Avalon should also have b0rked, unless the new firmware fixed something.
legendary
Activity: 1064
Merit: 1001
Sounds like a software or network issue since it's unlikely that both units would simultaneously have hardware failure.  I'd drag it over to a friends place or something and see if it works on their network, update the firmware, reset it to whatever defaults it has, etc.

This is my thought as well... near-simultaneous failures generally (at least in the SysAdmin world) rule out hardware errors (though they're still possible). Have any recent updates been applied (intentionally or not), can any other pools be reached, or have any networking changes occurred (not being in China, I don't know how much they'd limit connections for Bitcoin)?

Just some food for thought. I'll think about it for a bit, though I'm sure there are dozens of capable minds in this community that can far surpass mine. I'm curious to see what BitSyncom comes back with.
hero member
Activity: 608
Merit: 500
Sounds like a software or network issue since it's unlikely that both units would simultaneously have hardware failure.  I'd drag it over to a friends place or something and see if it works on their network, update the firmware, reset it to whatever defaults it has, etc.
legendary
Activity: 966
Merit: 1000
Did issue begin when the difficulty recently readjusted, perhaps?

You might try mining on testnet and see if that behaves any differently.

I'm not sure how much a pool hides the network difficulty from a miner.  cgminer somehow seems to always know the network difficulty and display it for me regardless of what pool I use.
full member
Activity: 196
Merit: 100
Code:
 [2013-03-02 04:02:40] New block: 00fb4966084a69b4... diff 18.4E                    
Bus error

18.4E is 18.4 * 10^18. This also happens to be the maximum value of a 64 bit unsigned int. That is not the correct difficulty.

The bus error suggests an attempt to read from memory outside the real range.

Both of these point to some kind of memory corruption issue, which is either a cgminer bug (not very likely) or a bug in the avalon driver (quite likely). (A hardware error on the control board is also possible, but unlikely on two independent devices.)
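
A quick way to check the 18.4E arithmetic is the small Python sketch below. The helper name is hypothetical, and the formatting only mimics the "18.4E" style seen in the log; it is not taken from cgminer's code.

```python
# Largest value a 64-bit unsigned integer can hold (all 64 bits set).
UINT64_MAX = 2**64 - 1


def in_exa(value: int) -> str:
    """Render value in units of 10^18 with one decimal and an 'E' suffix."""
    return f"{value / 1e18:.1f}E"


if __name__ == "__main__":
    print(UINT64_MAX)          # 18446744073709551615
    print(in_exa(UINT64_MAX))  # 18.4E
    # -1 reinterpreted as an unsigned 64-bit value gives the same number.
    print((-1) % 2**64 == UINT64_MAX)
```

So a value of -1 (or any all-ones bit pattern) landing in an unsigned 64-bit field would print exactly like this, which fits the corruption theory without proving it.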
full member
Activity: 196
Merit: 100
Another block in the wall
This is what happens when ya forgot to update.....
full member
Activity: 221
Merit: 100
And the remaining window for GPU mining got some fraction wider. Good thing I didn't turn off back in October when they told me the sky was falling.
legendary
Activity: 1176
Merit: 1001
Crashed within one second of one another, at 2 different locations and certainly with 2 different running times... Doesn't make any freaking sense. How far away were the locations from each other?

Since it's a "modular design" and you get a bus error, have you tried disconnecting some of the modules and booting with only one or two modules?

Maybe the pool sent a "share of death" that killed your hardware. How funny would that be? (Sorry for the joke)
hero member
Activity: 658
Merit: 500
Did you change anything? Is there free space?
full member
Activity: 137
Merit: 100
You might also try the firmware update.

Ok. I have emailed them for latest firmware.
newbie
Activity: 54
Merit: 0
How about opening the case and uploading some pictures to see if there's something obvious?
legendary
Activity: 966
Merit: 1000
You might also try the firmware update.
hero member
Activity: 518
Merit: 500
Manateeeeeeees
Well, I guess we'll find out how good Avalon support/RMA is!
hero member
Activity: 938
Merit: 501
Try using stratum+tcp://us.ozco.in:3333
legendary
Activity: 889
Merit: 1000
Bitcoin calls me an Orphan
Sir, I opened the Avalon case and reconnected all the connection cables, but it doesn't seem to have helped:


Damn.. now i am bummed for ya. I was almost certain this to be the case. Hopefully BitSyncom will be on this evening if not tomorrow morning.
full member
Activity: 137
Merit: 100
Sir, I opened the Avalon case and reconnected all the connection cables, but it doesn't seem to have helped:

Code:
root@OpenWrt:~# cgminer -S /dev/ttyUSB0 -o stratum.ozco.in:3333 -O denis1111.1:123 -D --avalon-options 115200:24:10:45:282
 [2013-03-02 04:02:37] Started cgminer 2.10.4                   
 [2013-03-02 04:02:37] Avalon Detect: Attempting to open /dev/ttyUSB0 (baud=115200 miner_count=24 asic_count=10 timeout=45 frequency=282)                   
 [2013-03-02 04:02:37] Avalon: Sent(1):                   
 [2013-03-02 04:02:37] 00000000   ad                                               |.               |                   
 [2013-03-02 04:02:37] Avalon: Sent: Buffer delay: 36000000                   
 [2013-03-02 04:02:37] Avalon: Sent: Buffer full: No                   
 [2013-03-02 04:02:37] Avalon: get:                   
 [2013-03-02 04:02:37] 00000000   aa 55 aa 55 00 00 00 00  00 00 00 00 00 00 00 00 |.U.U............|                   
 [2013-03-02 04:02:37] 00000010   00 00 00 00 00 00 00 00  00 00 00 00 00 00 00 00 |................|                   
 [2013-03-02 04:02:37] 00000020   00 00 00 00 00 00 00 00  00 00 00 00 41 06 d1 bf |............A...|                   
 [2013-03-02 04:02:37] 00000030   00 00 00 00 00 00 00 00  00 00 00 00 0a 0a 32 18 |..............2.|                   
 [2013-03-02 04:02:37] Avalon: Reset succeeded                   
 [2013-03-02 04:02:37] Avalon Detect: Found at /dev/ttyUSB0, mark as 0                   
 [2013-03-02 04:02:37] Probing for an alive pool                   
 [2013-03-02 04:02:37] Testing pool http://stratum.ozco.in:3333                   
 [2013-03-02 04:02:37] Popping work to stage thread                   
 [2013-03-02 04:02:37] Probing for GBT support                   
 [2013-03-02 04:02:38] HTTP request failed: Empty reply from server                   
 [2013-03-02 04:02:38] Failed to connect in json_rpc_call                   
 [2013-03-02 04:02:38] No GBT coinbase + append support found, using getwork protocol                   
 [2013-03-02 04:02:39] HTTP request failed: Empty reply from server                   
 [2013-03-02 04:02:39] Failed to connect in json_rpc_call                   
 [2013-03-02 04:02:40] Stratum authorisation success for pool 0                   
 [2013-03-02 04:02:40] Pool 0 http://stratum.ozco.in:3333 active                   
 [2013-03-02 04:02:40] Pushing ping to thread 0                   
 [2013-03-02 04:02:40] Avalon: Sent(1):                   
 [2013-03-02 04:02:40] 00000000   ad                                               |.               |                   
 [2013-03-02 04:02:40] Avalon: Sent: Buffer delay: 36000000                   
 [2013-03-02 04:02:40] Avalon: Sent: Buffer full: No                   
 [2013-03-02 04:02:40] Avalon: get:                   
 [2013-03-02 04:02:40] 00000000   aa 55 aa 55 00 00 00 00  00 00 00 00 00 00 00 00 |.U.U............|                   
 [2013-03-02 04:02:40] 00000010   00 00 00 00 00 00 00 00  00 00 00 00 00 00 00 00 |................|                   
 [2013-03-02 04:02:40] 00000020   00 00 00 00 00 00 00 00  00 00 00 00 41 06 d1 bf |............A...|                   
 [2013-03-02 04:02:40] 00000030   00 00 00 00 00 00 00 00  00 00 00 00 0a 0a 32 18 |..............2.|                   
 [2013-03-02 04:02:40] Avalon: Reset succeeded                   
 [2013-03-02 04:02:40] Avalon: Opened on /dev/ttyUSB0                   
 [2013-03-02 04:02:40] Generated stratum merkle 8d10ecdb6bea28fae28b45275a14233a42908095a0db3831e10ae5688087f7df                   
 [2013-03-02 04:02:40] Generated stratum header 00000002501dfd42865c726006c430cc2cc729fe69b4e81f4966084a000000fb000000008d10ecdb6bea28fae28b45275a14233a42908095a0db3831e10ae5688087f7df51314eab1a03d74b00000000000000800000000000000000000000000000000000000000000000000000000000000000000000000000000080020000                   
 [2013-03-02 04:02:40] Work job_id 7740 nonce2 00000000 ntime 51314eab                   
 [2013-03-02 04:02:40] Generated target 0000000000000000000000000000000000000000000000000000ffff00000000                   
 [2013-03-02 04:02:40] Generated stratum work                   
 [2013-03-02 04:02:40] Pushing work from pool 0 to hash queue                   
 [2013-03-02 04:02:40] New block: 00fb4966084a69b4... diff 18.4E                   
Bus error
root@OpenWrt:~#
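
For comparison with the "diff 18.4E" line, the network difficulty can be recomputed by hand from the header in the log above. This is a rough Python sketch under two assumptions: that the 4 bytes right after ntime 51314eab in the generated stratum header (1a03d74b) are the standard compact "bits" field, and that difficulty is measured against the usual difficulty-1 target. The function names are illustrative only.

```python
# Difficulty-1 target in the usual pool convention: 0xFFFF followed by 26 zero bytes.
DIFF1_TARGET = 0xFFFF << 208


def bits_to_target(bits_hex: str) -> int:
    """Expand a compact 'bits' value (8 hex chars) into the full 256-bit target."""
    bits = int(bits_hex, 16)
    exponent = bits >> 24          # size of the target in bytes
    mantissa = bits & 0x007FFFFF   # top three bytes of the target
    return mantissa << (8 * (exponent - 3))  # assumes exponent >= 3


def bits_to_difficulty(bits_hex: str) -> float:
    """Network difficulty implied by a compact 'bits' value."""
    return DIFF1_TARGET / bits_to_target(bits_hex)


if __name__ == "__main__":
    # Assumed to be the bits field from the header logged at 04:02:40.
    print(round(bits_to_difficulty("1a03d74b")))  # 4367876
```

That agrees with the 4,367,876 network difficulty mentioned elsewhere in the thread, which would suggest the header data itself was sane and the 18.4E value arose somewhere in how it was processed.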

hero member
Activity: 910
Merit: 550
Figured I'd share an odd story about connections. I'm an auto mechanic and worked on a 14-year-old van the other day. We've been working on it since it was new. The driver's window stopped working, so I went to the switch to check it out, and the switch wasn't plugged in. That door had never been apart before; it must not have been plugged in fully at the factory and came loose over time. Plugged it in and it has been working fine since. Connections can be oddly fickle.
legendary
Activity: 1330
Merit: 1026
Mining since 2010 & Hosting since 2012
It's not your network when you're showing something like this.

Yes. I think so.

 
Do you feel comfortable enough to open your unit and make sure the connections inside are good? I know there is a controller board and such that could have been bumped around in shipping. This is where I would start. And yes I know it was fine for 14 days. However, it could have had a good enough connection for now.. who knows.. but something to check

Sir, after I got my Avalons ten days ago, I connected them to power and never touched them again. And both Avalons crashed at the same time. I think it should have nothing to do with internal connections.

Thinking it should be is different from checking. Please make sure all the cables are properly seated.

D
full member
Activity: 137
Merit: 100
You posted a Bus Error. This is something I would personally check myself. The error just leads that way.

BTW.. I have read about Avalons being bumped in shipping.. this is also why i would look into these things

Sir, I agree with your idea. I will try to open the avalon case and check it now.
full member
Activity: 137
Merit: 100
Code:
root@OpenWrt:~# df
Filesystem           1K-blocks      Used Available Use% Mounted on
rootfs                     320       240        80  75% /
/dev/root                 2816      2816         0 100% /rom
tmpfs                    30916        88     30828   0% /tmp
tmpfs                      512         0       512   0% /dev
/dev/mtdblock3             320       240        80  75% /overlay
overlayfs:/overlay         320       240        80  75% /

full member
Activity: 126
Merit: 100
I think jgarzik had problems with getwork. Could you try btcguild with Stratum?
full member
Activity: 137
Merit: 100
Now I run cgminer with debug option:

Code:
root@OpenWrt:~# cgminer -S /dev/ttyUSB0 -o http://us.ozco.in:8331 -O xxxxxxx.0:yyyyyy -D --avalon-options 115200:24:10:45:282
 [2013-03-02 00:21:59] Started cgminer 2.10.4                    
 [2013-03-02 00:21:59] Avalon Detect: Attempting to open /dev/ttyUSB0 (baud=115200 miner_count=24 asic_count=10 timeout=45 frequency=282)                    
 [2013-03-02 00:21:59] Avalon: Sent(1):                    
 [2013-03-02 00:21:59] 00000000   ad                                               |.               |                    
 [2013-03-02 00:21:59] Avalon: Sent: Buffer delay: 36000000                    
 [2013-03-02 00:21:59] Avalon: Sent: Buffer full: No                    
 [2013-03-02 00:21:59] Avalon: get:                    
 [2013-03-02 00:21:59] 00000000   aa 55 aa 55 00 00 00 00  00 00 00 00 00 00 00 00 |.U.U............|                    
 [2013-03-02 00:21:59] 00000010   00 00 00 00 00 00 00 00  00 00 00 00 00 00 00 00 |................|                    
 [2013-03-02 00:21:59] 00000020   00 00 00 00 00 00 00 00  00 00 00 00 41 06 d1 bf |............A...|                    
 [2013-03-02 00:21:59] 00000030   00 00 00 00 00 00 00 00  00 00 00 00 0a 0a 32 18 |..............2.|                    
 [2013-03-02 00:22:00] Avalon: Reset succeeded                    
 [2013-03-02 00:22:00] Avalon Detect: Found at /dev/ttyUSB0, mark as 0                    
 [2013-03-02 00:22:00] Probing for an alive pool                    
 [2013-03-02 00:22:00] Testing pool http://us.ozco.in:8331                    
 [2013-03-02 00:22:00] Probing for GBT support                    
 [2013-03-02 00:22:00] Popping work to stage thread                    
 [2013-03-02 00:22:17] HTTP request failed: The requested URL returned error: 404                    
 [2013-03-02 00:22:17] Failed to connect in json_rpc_call                    
 [2013-03-02 00:22:17] No GBT coinbase + append support found, using getwork protocol                    
 [2013-03-02 00:22:19] X-Roll-Ntime expiry set to 120                    
 [2013-03-02 00:22:20] Calculating midstate locally                    
 [2013-03-02 00:22:20] Successfully retrieved and deciphered work from pool 0 http://us.ozco.in:8331                    
 [2013-03-02 00:22:20] Pushing pooltest work to base pool                    
 [2013-03-02 00:22:20] New block: 0171c0e18fb4ae38... diff 18.4E                    
Bus error
root@OpenWrt:~#
legendary
Activity: 889
Merit: 1000
Bitcoin calls me an Orphan

Sir, after I got my Avalons ten days ago, I connected them to power and never touched them again. And both Avalons crashed at the same time. I think it should have nothing to do with internal connections.

Yes I totally understand that. However if you feel comfortable enough to double check the connections I would do so. You posted a Bus Error. This is something I would personally check myself. The error just leads that way.

BTW.. I have read about Avalons being bumped in shipping.. this is also why i would look into these things
full member
Activity: 137
Merit: 100
It's not your network when you're showing something like this.

Yes. I think so.

 
Do you feel comfortable enough to open your unit and make sure the connections inside are good? I know there is a controller board and such that could have been bumped around in shipping. This is where I would start. And yes I know it was fine for 14 days. However, it could have had a good enough connection for now.. who knows.. but something to check

Sir, after I got my Avalons ten days ago, I connected them to power and never touched them again. And both Avalons crashed at the same time. I think it should have nothing to do with internal connections.
legendary
Activity: 889
Merit: 1000
Bitcoin calls me an Orphan
It's not your network when you're showing something like this.

Quote
root@OpenWrt:~# cgminer -S /dev/ttyUSB0 -o http://us.ozco.in:8331 -O xxxxxx.0:yyyyy --avalon-options 115200:24:10:45:282
 [2013-03-01 22:47:22] Started cgminer 2.10.4                    
 [2013-03-01 22:47:22] Avalon: Reset succeeded                    
 [2013-03-01 22:47:22] Probing for an alive pool                    
 Bus error

root@OpenWrt:~#

Do you feel comfortable enough to open your unit and make sure the connections inside are good? I know there is a controller board and such that could have been bumped around in shipping. This is where I would start. And yes I know it was fine for 14 days. However, it could have had a good enough connection for now.. who knows.. but something to check
full member
Activity: 137
Merit: 100
One thing I can think of that might cause both to fail at the same time that doesn't have to do with software is a power surge. I have no direct experience but I can imagine the power quality in China might not be the best. Surges from industrial equipment turning on can damage all electronics. The power supply in theory should provide some protection but not always. Do you live in an industrial area?

Sir,

I think it has nothing to do with a power surge. Here in Beijing, China, the power supply is quite stable.

And we can see OpenWrt is still running well inside the Avalon.

And I still have other Icarus units running quite stably in the same room.
full member
Activity: 137
Merit: 100


ifconfig

Code:

root@OpenWrt:~# ifconfig
br-lan    Link encap:Ethernet  HWaddr 14:CF:92:6D:56:E0 
          inet addr:192.168.0.100  Bcast:192.168.0.255  Mask:255.255.255.0
          UP BROADCAST MULTICAST  MTU:1500  Metric:1
          RX packets:0 errors:0 dropped:0 overruns:0 frame:0
          TX packets:0 errors:0 dropped:0 overruns:0 carrier:0
          collisions:0 txqueuelen:0
          RX bytes:0 (0.0 B)  TX bytes:0 (0.0 B)

eth0      Link encap:Ethernet  HWaddr 14:CF:92:6D:56:E0 
          UP BROADCAST MULTICAST  MTU:1500  Metric:1
          RX packets:0 errors:0 dropped:0 overruns:0 frame:0
          TX packets:0 errors:0 dropped:0 overruns:0 carrier:0
          collisions:0 txqueuelen:1000
          RX bytes:0 (0.0 B)  TX bytes:0 (0.0 B)
          Interrupt:4

lo        Link encap:Local Loopback 
          inet addr:127.0.0.1  Mask:255.0.0.0
          UP LOOPBACK RUNNING  MTU:16436  Metric:1
          RX packets:236 errors:0 dropped:0 overruns:0 frame:0
          TX packets:236 errors:0 dropped:0 overruns:0 carrier:0
          collisions:0 txqueuelen:0
          RX bytes:22038 (21.5 KiB)  TX bytes:22038 (21.5 KiB)

wlan0     Link encap:Ethernet  HWaddr 14:CF:92:6D:56:E0 
          inet addr:192.168.1.106  Bcast:192.168.1.255  Mask:255.255.255.0
          UP BROADCAST RUNNING MULTICAST  MTU:1500  Metric:1
          RX packets:4441 errors:0 dropped:45 overruns:0 frame:0
          TX packets:1241 errors:0 dropped:0 overruns:0 carrier:0
          collisions:0 txqueuelen:32
          RX bytes:527105 (514.7 KiB)  TX bytes:161184 (157.4 KiB)

root@OpenWrt:~#


route -n

Code:

root@OpenWrt:~# route -n
Kernel IP routing table
Destination     Gateway         Genmask         Flags Metric Ref    Use Iface
0.0.0.0         192.168.1.1     0.0.0.0         UG    0      0        0 wlan0
192.168.0.0     0.0.0.0         255.255.255.0   U     0      0        0 br-lan
192.168.1.0     0.0.0.0         255.255.255.0   U     0      0        0 wlan0
root@OpenWrt:~#


cat /etc/resolv.conf

Code:

root@OpenWrt:~# cat /etc/resolv.conf
search lan
nameserver 127.0.0.1
root@OpenWrt:~#


ping -nc5 4.2.2.2

Code:

ping -nc5 4.2.2.2

root@OpenWrt:~# ping -nc5 4.2.2.2
ping: invalid option -- n
BusyBox v1.19.4 (2013-01-12 16:12:39 CST) multi-call binary.

Usage: ping [OPTIONS] HOST

Send ICMP ECHO_REQUEST packets to network hosts

        -4,-6           Force IP or IPv6 name resolution
        -c CNT          Send only CNT pings
        -s SIZE         Send SIZE data bytes in packets (default:56)
        -t TTL          Set TTL
        -I IFACE/IP     Use interface or IP address as source
        -W SEC          Seconds to wait for the first response (default:10)
                        (after all -c CNT packets are sent)
        -w SEC          Seconds until ping exits (default:infinite)
                        (can exit earlier with -c CNT)
        -q              Quiet, only displays output at start
                        and when finished



ping us.ozco.in

Code:

root@OpenWrt:~# ping us.ozco.in
PING us.ozco.in (66.207.163.131): 56 data bytes
64 bytes from 66.207.163.131: seq=0 ttl=50 time=280.340 ms
64 bytes from 66.207.163.131: seq=1 ttl=50 time=250.218 ms
64 bytes from 66.207.163.131: seq=2 ttl=50 time=251.694 ms
64 bytes from 66.207.163.131: seq=3 ttl=50 time=252.998 ms
64 bytes from 66.207.163.131: seq=4 ttl=50 time=267.599 ms
64 bytes from 66.207.163.131: seq=5 ttl=50 time=297.384 ms
64 bytes from 66.207.163.131: seq=6 ttl=50 time=308.474 ms
^C
--- us.ozco.in ping statistics ---
7 packets transmitted, 7 packets received, 0% packet loss
round-trip min/avg/max = 250.218/272.672/308.474 ms


hero member
Activity: 560
Merit: 500
One thing I can think of that might cause both to fail at the same time that doesn't have to do with software is a power surge. I have no direct experience but I can imagine the power quality in China might not be the best. Surges from industrial equipment turning on can damage all electronics. The power supply in theory should provide some protection but not always. Do you live in an industrial area?
full member
Activity: 126
Merit: 100
  • Start cgminer with your regular commands and add -D --verbose
Code:
cgminer -D --verbose -S /dev/ttyUSB0 -o http://us.ozco.in:8331 -O xxxxxx.0:yyyyy --avalon-options 115200:24:10:45:282
legendary
Activity: 966
Merit: 1000
Plz post the results of these commands:

ifconfig

route -n

cat /etc/resolv.conf

ping -nc5 4.2.2.2
full member
Activity: 137
Merit: 100
Since they're failing at the same time, I would actually think it's a software issue. Very unlikely that hardware would fail at the same time. However, if it's software, the same problem could happen to both machines at the same time.

I think so. But the strange thing is that I restarted cgminer several times, rebooted the Avalon several times, and changed mining pools too. So it is not easy to understand.
full member
Activity: 126
Merit: 100
In list of things to try:
  • Can you try connecting them via a different network, like wireless to your laptop and then VPN or something?

jgarzik, how's your Avalon?
Can you say in red that you don't have an Avalon.  Grin
hero member
Activity: 631
Merit: 500
I have to mention both Avalons because they crashed at the very same time. That seems quite abnormal to me.

...

I am afraid it is a hardware-related issue.


Since they're failing at the same time, I would actually think it's a software issue. Very unlikely that hardware would fail at the same time. However, if it's software, the same problem could happen to both machines at the same time.
full member
Activity: 126
Merit: 100
Things to try
  • Get the latest firmware from jgarzik or some other Avalon customer who downloaded it
  • Check if it's related to change in difficulty. Are you mining with stratum and high enough difficulty?
legendary
Activity: 1973
Merit: 1007
Same time different rooms, maybe network issue? Don't they come with a static ip? Perhaps they are trying to use the same address? Maybe try running only one of them. Or, send one to my house and I'll let you know if it still works.
hero member
Activity: 631
Merit: 500
Firmware question:
Did you get the firmware update that jgarzik got?


Doesn't look like it. He is on cgminer 2.10.4.
legendary
Activity: 1890
Merit: 1003
ATX power supplies always provide power on some of their rails unless they're physically disconnected or have a hard switch. So I really hope this is just a hardware controller hanging (inconsistent state cleared by power reset) or something like that.

Sir, I unplugged the power cable from the wall and plugged it back in after a while. Nothing helped.
What temperature is the room at when you ran the devices?
full member
Activity: 126
Merit: 100
Firmware question:
Did you get the firmware update that jgarzik got?
full member
Activity: 137
Merit: 100
ATX power supplies always provide power on some of their rails unless they're physically disconnected or have a hard switch. So I really hope this is just a hardware controller hanging (inconsistent state cleared by power reset) or something like that.

Sir, I unplugged the power cable from the wall and plugged it back in after a while. Nothing helped.
full member
Activity: 137
Merit: 100
You had 2 working Avalons for 14 days and thought it was not worth

I have to mention both Avalons because they crashed at the very same time. That seems quite abnormal to me.


anyways, a bus error might indicate a core dump, which can have many causes, such as a change in the data supplied (increase in difficulty to 4*10^6) or overheating

I am afraid it is a hardware-related issue.


Can you point it to another pool?

Sure, as I mentioned. I tested them with ozco and nothing changed.


Turn one machine off and let it cool down for an hour?

Yes.


What does dmesg give you as unusual errors?

Nothing useful. It seems the Avalon does not write error messages to the system log.
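
When the kernel log is long, one way to narrow it down is to filter for likely failure keywords. The following is only a sketch: the function name `filter_kernel_errors` and the keyword list are illustrative assumptions, not anything documented for the Avalon.

```shell
#!/bin/sh
# Hypothetical filter: read log text on stdin and keep only the lines
# that mention one of a few common failure keywords (case-insensitive).
filter_kernel_errors() {
    grep -iE 'bus error|segfault|oops|out of memory|i/o error|ttyusb'
}
```

It could be used as `dmesg | filter_kernel_errors`, or as `logread | filter_kernel_errors` for the OpenWrt system log. An empty result only means none of these keywords appeared, not that the system is healthy.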
full member
Activity: 160
Merit: 100
Hopefully this is not related to the comment in their most recent email:

Quote from: Avalon email
Lessons learned. Batch #2 process will have improvements, minor design adjustments and other goodies.
full member
Activity: 126
Merit: 100
ATX power supplies always provide power on some of their rails unless they're physically disconnected or have a hard switch. So I really hope this is just a hardware controller hanging (inconsistent state cleared by power reset) or something like that.
full member
Activity: 137
Merit: 100
Do you not have physical access, or a remote controlled PDU? Sometimes hard reset (power off, and on again) might be needed.

Nothing.  Anyway, I will unplug the power cable and plug it back in to check now.
newbie
Activity: 55
Merit: 0
My Avalons ran well for the past 14 days and then crashed at the same time!
You had 2 working Avalons for 14 days and thought it was not worth even mentioning that in the relevant threads where people are begging for delivery info? Wow... that is going to score karma points...

anyways, a bus error might indicate a core dump, which can have many causes, such as a change in the data supplied (increase in difficulty to 4*10^6) or overheating

Can you point it to another pool? Turn one machine off and let it cool down for an hour?

What does dmesg give you as unusual errors?
full member
Activity: 137
Merit: 100
check the cable connections inside the box too

Sir, there are two Avalons located in different rooms. No one touched them, because they were both working well last night before I went to sleep.
full member
Activity: 137
Merit: 100

Unlikely both have failed at exactly the same time. More likely is you started them at the same time and some sort of memory leak / OS error is causing this. I expect a restart will probably fix the problem.

Sir, I started them at the same time 14 days ago. And based on the btcguild picture above, we can see they stopped mining at the same time.

I have restarted them many times and nothing helps.
full member
Activity: 126
Merit: 100
Do you not have physical access, or a remote controlled PDU? Sometimes hard reset (power off, and on again) might be needed.
full member
Activity: 137
Merit: 100
You're one of the Chinese customers, right?

Yes.


have you tried turning it off and on again?

Sure.  I tried the reboot command from the OpenWrt console several times. Each time after the Avalon rebooted, I could see with the ps command that cgminer disappeared soon afterward.
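
To see exactly when cgminer disappears after a restart, the ps check can be repeated in a loop. This is a minimal sketch under assumptions: the function names `listing_has_process` and `watch_cgminer` are hypothetical, and it assumes the BusyBox `ps` on OpenWrt, which lists all processes when run without options.

```shell
#!/bin/sh
# Hypothetical check: succeed if a process listing read from stdin
# contains a line mentioning the given name.
listing_has_process() {
    grep -q -- "$1"
}

# Poll once per second and report when cgminer is no longer listed.
# "grep -v grep" drops any grep process from the listing first.
# Note: a script whose own file name contains "cgminer" would match too.
watch_cgminer() {
    while ps | grep -v grep | listing_has_process cgminer; do
        sleep 1
    done
    echo "cgminer gone at $(date)"
}
```

Starting the miner with `/etc/init.d/cgminer start` and then calling `watch_cgminer` would print a timestamp at the moment the process vanishes, which can be compared against the timestamps in cgminer's own output.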
hero member
Activity: 631
Merit: 500
check the cable connections inside the box too
full member
Activity: 160
Merit: 100
Silly of me to ask maybe, but have you tried turning it off and on again?

Unlikely both have failed at exactly the same time. More likely is you started them at the same time and some sort of memory leak / OS error is causing this. I expect a restart will probably fix the problem.
full member
Activity: 126
Merit: 100
You're one of the Chinese customers, right?

Silly of me to ask maybe, but have you tried turning it off and on again? Did you get the firmware update that jgarzik got?
full member
Activity: 137
Merit: 100
This morning I logged in to btcguild to check my Avalons. To my surprise, I found both of them IDLE there with a mining speed of 0.  Shocked



I was not sure what had happened, so I logged in to the OpenWrt console and ran the following commands:

Code:
root@OpenWrt:~# /etc/init.d/cgminer stop
no cgminer found; none killed


Code:
root@OpenWrt:~# 
root@OpenWrt:~# /etc/init.d/cgminer start
ntpd: resolved peer 3.openwrt.pool.ntp.org to 202.112.31.197
ntpd: sent query to 202.112.31.197
ntpd: resolved peer 2.openwrt.pool.ntp.org to 202.118.1.130
ntpd: sent query to 202.118.1.130
ntpd: resolved peer 1.openwrt.pool.ntp.org to 202.112.29.82
ntpd: sent query to 202.112.29.82
ntpd: resolved peer 0.openwrt.pool.ntp.org to 218.75.4.130
ntpd: sent query to 218.75.4.130
ntpd: reply from 202.112.31.197: reach 0x01 offset -0.325146 delay 0.656282 status 0x24 strat 2 refid 0xca76012e rootdelay 0.049805
ntpd: reply from 202.118.1.130: reach 0x01 offset -0.275834 delay 0.559704 status 0x24 strat 2 refid 0xca76012e rootdelay 0.000137
ntpd: reply from 202.112.29.82: reach 0x01 offset -0.178994 delay 0.365631 status 0x24 strat 2 refid 0x7bc773cb rootdelay 0.048386
ntpd: reply from 218.75.4.130: reach 0x01 offset -0.003093 delay 0.060521 status 0x24 strat 2 refid 0xd1510907 rootdelay 0.171008
ntpd: sent query to 218.75.4.130
ntpd: reply from 218.75.4.130: reach 0x03 offset -0.009608 delay 0.048428 status 0x24 strat 2 refid 0xd1510907 rootdelay 0.171008
ntpd: sent query to 202.112.31.197
ntpd: sent query to 202.118.1.130
ntpd: sent query to 202.112.29.82
ntpd: sent query to 218.75.4.130
ntpd: reply from 218.75.4.130: reach 0x07 offset -0.007173 delay 0.051018 status 0x24 strat 2 refid 0xd1510907 rootdelay 0.171008
ntpd: reply from 202.112.31.197: reach 0x03 offset -0.026169 delay 0.058811 status 0x24 strat 2 refid 0xca76012e rootdelay 0.049805
ntpd: reply from 202.118.1.130: reach 0x03 offset -0.025689 delay 0.060743 status 0x24 strat 2 refid 0xca76012e rootdelay 0.000137
ntpd: reply from 202.112.29.82: reach 0x03 offset -0.024491 delay 0.062413 status 0x24 strat 2 refid 0x7bc773cb rootdelay 0.048386
ntpd: sent query to 202.112.29.82
ntpd: sent query to 218.75.4.130
ntpd: reply from 218.75.4.130: reach 0x0f offset -0.006441 delay 0.050414 status 0x24 strat 2 refid 0xd1510907 rootdelay 0.171008
ntpd: reply from 202.112.29.82: reach 0x07 offset -0.022051 delay 0.057496 status 0x24 strat 2 refid 0x7bc773cb rootdelay 0.048386
ntpd: sent query to 202.112.31.197
ntpd: sent query to 202.118.1.130
ntpd: sent query to 202.112.29.82
ntpd: reply from 202.118.1.130: reach 0x07 offset -0.022068 delay 0.058948 status 0x24 strat 2 refid 0xca76012e rootdelay 0.000137
ntpd: reply from 202.112.31.197: reach 0x07 offset -0.024856 delay 0.061325 status 0x24 strat 2 refid 0xca76012e rootdelay 0.049805
ntpd: reply from 202.112.29.82: reach 0x0f offset -0.020955 delay 0.061241 status 0x24 strat 2 refid 0x7bc773cb rootdelay 0.048386
ntpd: sent query to 202.118.1.130
ntpd: sent query to 218.75.4.130
ntpd: reply from 218.75.4.130: reach 0x1f offset -0.007505 delay 0.047845 status 0x24 strat 2 refid 0xd1510907 rootdelay 0.171008
ntpd: reply from 202.118.1.130: reach 0x0f offset -0.023846 delay 0.054938 status 0x24 strat 2 refid 0xca76012e rootdelay 0.000137
ntpd: sent query to 202.112.31.197
ntpd: sent query to 202.112.29.82
ntpd: reply from 202.112.31.197: reach 0x0f offset -0.019475 delay 0.062804 status 0x24 strat 2 refid 0xca76012e rootdelay 0.049805
ntpd: reply from 202.112.29.82: reach 0x1f offset -0.020469 delay 0.069275 status 0x24 strat 2 refid 0x7bc773cb rootdelay 0.048386
ntpd: sent query to 202.112.31.197
ntpd: sent query to 202.118.1.130
ntpd: sent query to 218.75.4.130
ntpd: reply from 218.75.4.130: reach 0x3f offset -0.006091 delay 0.052049 status 0x24 strat 2 refid 0xd1510907 rootdelay 0.171008
root@OpenWrt:~#

Then I ran the ps command and still could not find cgminer running.

I was not sure whether it was related to the btcguild pool, so I changed the pool to ozco and ran the above commands again. Still no luck.


Next, I ran the following command:

Code:
root@OpenWrt:~# cgminer -S /dev/ttyUSB0 -o http://us.ozco.in:8331 -O xxxxxx.0:yyyyy--avalon-options 115200:24:10:45:282
 [2013-03-01 22:47:22] Started cgminer 2.10.4                    
 [2013-03-01 22:47:22] Avalon: Reset succeeded                    
 [2013-03-01 22:47:22] Probing for an alive pool                    
 Bus error

root@OpenWrt:~#

Based on the above, I am afraid both of my Avalons are broken.
How strange it is!
Both of them broke at the same time!
My Avalons ran well for the past 14 days and then crashed at the same time!

Can anyone make sense of it?

Because I failed to reach nzhang by email and phone, I have to post this topic here.

YiFu, nzhang, xiangfu: would any one of you three please contact me by phone or email? I will set up TeamViewer so that you can log in to my Avalons and take a look.

Urgent help needed!
