Anyone running 8 GPUs with bamt?
Yes. Well, uh .... no! I *tried* but got stuck where you are standing now.
I've got 4x 6870x2 (DUAL-GPU) that I've been struggling with for some days.
Goal is to put them on a single board altogether.
When I try to boot BAMT (0.4b, all recent fixes), I got the same problem. This probably hasn't got
to do with BAMT directly, but with X, or more specifically with the AMD fglrx drivers that X calls.
When I remove one of the Dual-GPUs, hence only 6 GPU's left in the system, BAMT boots happily
doing its work.
I've put some time into that issue today, because I'm very keen on putting on this rig ultimately with
all FOUR cards. Operating with only three cards is simply not an option.
So, when i had 8 GPUs installed, BAMT died when it tried to display the GUI. So it made it through the BAMT startup screen and the initial startup.
Exactly same thing here. What happens is this:
X tries to start up and initialize all cards. When 8 GPU are at work the fglrx driver segfaults with following error:
Backtrace:
0: /usr/bin/X (xorg_backtrace+0x3b) [0x80adedb]
1: /usr/bin/X (0x8048000+0x5aab5) [0x80a2ab5]
2: (vdso) (__kernel_rt_sigreturn+0x0) [0xb778740c]
3: /usr/lib/xorg/modules/drivers/fglrx_drv.so (xdl_x750_atiddxPreInit+0x2554) [0xb6946d84]
4: /usr/bin/X (InitOutput+0x5c8) [0x80b09b8]
5: /usr/bin/X (0x8048000+0x1e7f0) [0x80667f0]
6: /lib/i686/cmov/libc.so.6 (__libc_start_main+0xe6) [0xb74bec76]
7: /usr/bin/X (0x8048000+0x1e5a1) [0x80665a1]
Segmentation fault at address 0x8
Fatal server error:
Caught signal 11 (Segmentation fault). Server aborting
The full Xorg.0.log shows that all 8 GPUs (plus the primary iGPU) get detected correctly,
and then the drivers are loaded. After this, the backtrace shows up and X can't get started.
Because you have the same issue, but probably are connected through the first (primary) GPU
with your monitor, you are not seeing anything because on bailout of X the cards get reset
and the system get stuck in an unstable state. I could only see this after enabling the iGPU
(onboard gfx) from the mobo and plugging the monitor there. (Didn't help on the effect, though)
Unfortunately, I'm not sure if upgrading to latest 11.8 helps. Frankly, I doubt it.
I removed the GPU on the 1x extender and all is well. Is this a problem of me needing a MB with 4 16x PCIe slot or is it a problem with BAMT?
It's probably not the extender and the mobo is well, too, as long as you get to boot into the kernel and stop right before X starts and bails out with the segfault. If it stops way before, i.e. throwing things at you before booting into the FS and INIT, you're encountering a different problem.
Has anyone had a similar or the same problem while migrating to BAMT? I already found this one: http://blog.zorinaq.com/?e=46, but there's said that 8 GPU work fine whereas 10 GPU are currently the limiting number.