You could limit the set of allowed miners per block by using POS. This is sybil attack proof.
Then if you are allowed - you use your POW.
That's an interesting idea and it's certainly sybil attack proof. Using POS to limit the set of POW miners would also solve an issue that I initially overlooked in my proposal. In the pre-selection phase, nodes who are interested in mining have a disincentive to not broadcast any hash puzzle they receive since that would lessen their chance of being among the first n puzzle solvers who can make it into the actual mining phase.
The problem with POS is the fact that there are no guarantees that someone with a high stake is also interested in mining coins by POW. So, you would probably have to admit a (very) large set of nodes to ensure that there will be enough active miners, which adversely affects the goal of having small mining sets to reduce power consumption.
Maybe we could use a completely different approach to enforce nodes to only mine periodically, thus saving energy. The approach is based on the idea of having two separate blockchains instead of one. The first block chain serves as a meta-chain with the purpose of selecting the allowed miners of the second chain and is empty otherwise. The second chain is the block chain that includes all the transactions. Let's call it transaction chain.
1) Meta-chain. It is built like the regular Bitcoin blockchain by POW. However, in order to create new blocks (that are accepted by your peers), a condition must be met which is not given most of the time (this is to ensure low power consumption). The condition is that the number of eligible miners of the transaction chain is within a certain range [min,max]. Miners of the meta-chain would thus only create blocks to extend the chain to its new length once the number of eligible miners falls below the threshold. So, mining would only take place every now and then. When creating a new block of the meta-chain, the node has to include his address (public key) for later identification while there's no immediate block reward for this block.
2) Transaction chain. The transaction chain is built without any POW/POS mechanism whatsoever. Instead, the mining nodes (and thus the order of the blocks) are determined by a deterministic pseudo-random selector function (with a public seed) that is applied to the meta-chain. For example, the PRNG outputs a random value in the node address range. The node whose address has the lowest hamming distance to the value is selected as the next block creator and is then allowed to build the next transaction block to get a reward. For proper identification, each node has to sign the block with his private key so that the others can check it against his address (public key).
Block creation is performed according to a fixed time schedule with regular intervals. If a node fails to deliver his block in the allotted time, the other nodes would only accept a block built by the node who comes next according the the PRNG function. This ensures a well-arranged sequence of blocks. As the order of the miners is known in advance and blocks are created at a regular pace, the block time could be set much lower than in standard Bitcoin.
As it's not desirable to allow miners of the meta-chain to build transaction blocks forever, we have to set an upper limit for each participating node. Once a node (or more exactly: an instance of his address enshrined in the meta-chain) has built k blocks, it will be disabled from creating more blocks. That is, his address won't take part in the PRNG raffle any longer. As a result, more and more nodes will be barred over time, so that eventually the number of remaining eligible nodes will fall below the min threshold. In that case, step 1) will be reactivated and the meta-chain will be extended until the number of eligible nodes reaches the max threshold again.
I think that this approach is sybil-proof. Although you can create as many addresses as you like, your computing power limits your ability to build the meta-chain and to become a node who is allowed to build the transaction chain.