Hi,
I am having some problems with my Avalon since it arrived yesterday. I don't really mind if it restarts a few (like 2-5) times a day. If it automatically gets back up working in a minute, it is fine for me. What is crucial for me is to prevent it manually switching off and on. It happened this (its first) night that it somehow crashed and in the morning I thought it was operating well (fans worked normally) but I was not able to connect to it at all. Had to manually switch it off and on to have it on the network and hashing again.
I am using firmware 20130703 from CKolivas (runs much better than the original, which restarted every 10-15 minutes), flashed with Keep settings enabled, default settings mostly (except for my new attempts to solve the problems - see below), default clock 300.
I would like to ask you some questions:
1) The default "ideal" operating temperature is set to 50 C. How sensitive is Avalon to this? Might it help if I lower it down to 45 C? I might be able to cool it down even to 40 C, so I would like to know if there is any benefit of doing so. Any clues?
2) It is quite simple to "analyze" what is happening there if it just stops hashing and restart cgminer. That can be seen in the logs and I do have these ugly guys when it restarts:
...
Sat Jul 6 05:55:10 2013 auth.emerg kernel: [ 3266.570000] usb 1-1: clear tt 1 (0030) error -71
Sat Jul 6 05:55:15 2013 auth.emerg kernel: [ 3271.630000] usb 1-1: clear tt 1 (0030) error -71
Sat Jul 6 05:55:20 2013 auth.emerg kernel: [ 3276.690000] usb 1-1: clear tt 1 (0030) error -71
Sat Jul 6 05:55:21 2013 auth.info kernel: [ 3277.860000] usb 1-1.1: USB disconnect, device number 3
Sat Jul 6 05:55:22 2013 auth.info kernel: [ 3278.000000] usb 1-1: reset high-speed USB device number 2 using ehci-platform
Sat Jul 6 05:55:22 2013 auth.info kernel: [ 3278.460000] usb 1-1.1: new full-speed USB device number 4 using ehci-platform
Sat Jul 6 05:55:22 2013 auth.info kernel: [ 3278.600000] ftdi_sio 1-1.1:1.0: FTDI USB Serial Device converter detected
Sat Jul 6 05:55:22 2013 auth.info kernel: [ 3278.600000] usb 1-1.1: Detected FT232RL
Sat Jul 6 05:55:22 2013 auth.info kernel: [ 3278.610000] usb 1-1.1: Number of endpoints 2
Sat Jul 6 05:55:22 2013 auth.info kernel: [ 3278.610000] usb 1-1.1: Endpoint 1 MaxPacketSize 16384
Sat Jul 6 05:55:22 2013 auth.info kernel: [ 3278.620000] usb 1-1.1: Endpoint 2 MaxPacketSize 16384
Sat Jul 6 05:55:22 2013 auth.info kernel: [ 3278.620000] usb 1-1.1: Setting MaxPacketSize 64
Sat Jul 6 05:55:22 2013 auth.info kernel: [ 3278.630000] usb 1-1.1: FTDI USB Serial Device converter now attached to ttyUSB0
Sat Jul 6 05:55:25 2013 auth.info kernel: [ 3281.740000] ftdi_sio ttyUSB0: FTDI USB Serial Device converter now disconnected from ttyUSB0
Sat Jul 6 05:55:25 2013 auth.info kernel: [ 3281.760000] ftdi_sio 1-1.1:1.0: device disconnected
...
But my question is now - how to analyze what happened after I had to switch it off and on - i.e. logs gone. Or is this just try/fail all the time?
Those error -71 are quite known here. Some of you get rid of them just using the new firmware. Somehow this did not work for me. I now tried to use avalon-temp 45 + disable DHCP server and we'll see. But since the problem with total network disconnect happened after several hours of working, I'm not sure how to test it properly. I do not know if this -71 has something to do with that total crash, I just know it is related to cgminer restarts.
I would really hate to have to open the case. I do not really trust myself with this
Switching jumpers, playing with cables, screwing something somewhere up and get brick ...
So unless it is necessary I would like to operate on the level of web interface and ssh.
Thanks for any help, clues, links.