At least for GCN 3 devices (like Tonga), ethminer is within 1-2% of the memory read performance limit of the cards. GCN assembler won't make it any faster for those cards.
Plus with the profitability of eth mining to drop significantly in the next few months, miner developers are generally not interested in doing more work into eth.
Now, this is technical and i'm out of my element, but the Tonga cards should be able to workout 384-bit mem bus, problem is, after a bit of discussion i found out, you would have to pretty much write a bios from scrap, but, theoretically remove that bottleneck giving a 50% increase in performance, the 380x would suffer less if there are any bottlenecks, and the memory straps could also be modded in the way that certain ETH mods gave hashrates of 25+/s (mind you with wild results with regard to heat and stability the higher you went)
Tonga cards have a 256-bit memory bus, not 384. If you are referring to the fact that the GPU die supports a 384-bit bus, even if you could enable the extra two memory controllers, those bus lines aren't even connected to the package.