The C1 only doubled the hashrate because they doubled the boards: 4 boards in the C1 vs. 2 boards in the S3. They could do this because 4 boards fit into the existing S3 case with only minor alterations. Now, if they were to double the boards again (which would double the chips), it could be a 2 TH/s machine. However, that is a lot of heat in a very small case, and I don't think it would be possible to dissipate the heat fast enough with 4 boards so close together. The chips might be fine, but the rest of the PCB would probably be toast quite quickly.

If one were to mount the boards in something like an S4 case, that would in theory allow enough room for a 3-4 TH/s unit with a watercooling block sandwiched between 2 boards. I would say 6 boards total for approx. 3.3 TH/s, keeping in mind the need for 2 separate PSUs to power the boards, enough room for cooling lines to be run, fans to cool the PCBs, and an extra controller (unless they develop one that can carry more than 4 boards). At max I could see potential for a 3.8-4.5 TH/s setup, as the chips can be chained up to 256 chips, but I think that would take a lot more R&D.

Also, I don't know about pricing. It would be really cool to see something like this, but the cost would likely skyrocket with this type of setup. One can dream though.
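
For anyone who wants to play with the numbers, here is a rough back-of-envelope sketch. It assumes hashrate scales linearly with board count and takes the C1 at about 1 TH/s on 4 boards (my reading of the 2 TH/s figure for doubling, not a measured spec). The function names are made up for illustration.

```python
# Back-of-envelope sketch only: assumes hashrate scales linearly with boards.
# C1_TH and C1_BOARDS are assumptions read off the post (doubling the C1's
# 4 boards is said to give about 2 TH/s, so the C1 is taken as about 1 TH/s).
C1_TH = 1.0
C1_BOARDS = 4


def scaled_hashrate(boards, base_th=C1_TH, base_boards=C1_BOARDS):
    """Estimated TH/s for a given board count under linear scaling."""
    return base_th * boards / base_boards


def per_board_needed(target_th, boards):
    """TH/s each board must deliver to reach target_th with this many boards."""
    return target_th / boards


if __name__ == "__main__":
    print("8 boards, linear scaling:", scaled_hashrate(8), "TH/s")
    print("C1 per board:", round(scaled_hashrate(1), 2), "TH/s")
    print("per board for 3.3 TH/s on 6 boards:",
          round(per_board_needed(3.3, 6), 2), "TH/s")
```

On those assumptions a C1 board does about 0.25 TH/s, while 3.3 TH/s from 6 boards needs about 0.55 TH/s per board, so that guess leans on more chips per board or higher clocks than the C1 runs.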
If Bitmain cared to donate enough boards to me, I could see what I could come up with...