Lets make it a bit easier so we can think more thoroughly about this.
1000 Miners are active ...
These are the jobs at the beginning
Job 1:
WCET: 100
POW Reward: 1
Workers: 0
Job 2:
WCET: 200
POW Reward: 1
Workers: 0
So you are right, if they all "fetch work simultaneously" this would happen
Job 1:
WCET: 100
POW Reward: 1
Workers: 1000
Job 2:
WCET: 200
POW Reward: 1
Workers: 0
We ignore the target, since it is just there to throttle the POW frequency, let's assume is quickly enough converges so that the miners who are involved finds exactly 10 POW per block.
Now, assume that 1000 Workers in Job 1 have exactly the same computation power. Thinking further .. again to make it easy ... assume 1 WCET equals to 1 Second of computation.
Each of the 1000 miners in this case has the chance of 10*1/1000 = 1 in 100 (10 out of 1000 miners find a POW in each block) to find a POW in a block, hence has to work 100*100WCET*1s = 10000 seconds on average to find one POW solution.
It is, for the first miner to "think this through" more lucrative to work on the second job, if one miner switches he has a 100% chance to fining at least 1 POW per block, since he is the only miner. If two switch, the chance is still 50%, and so on.
Lets say, one after the other, miners reevaluate the best thing to do
The equilibrium in this case, where every miner has found the best selfish strategy, however would be:
Job 1:
WCET: 100
POW Reward: 1
Workers: 667
Job 2:
WCET: 200
POW Reward: 1
Workers: 333
In this case everyone in the first group has a chance of 10/667 = 1/66.7 to find a POW which takes on average 100WCET*66.7*1s = 6670s (a lot better than the 10000 seconds from before the equilibrium was reached).
The second group has a chance of 10/333 = 1/33, but has a higher WCET taking on average 200WCET*33*1s = 6660s to find one POW.
Switching to either job makes no sense for any of the two groups since they both operate at the "local optimum", switching would mean they increase their own "time to POW".
Ob course, the second job has less workers, but he has to blame himself ... he could have increased his POW rewards ;-)
The only problem that might arise is, when miners synchronize and "decide" on the strategy at the same time. Then such oscillations (the ones you described) can occur.
Not sure how we could prevent such synchronizations (which could occur as all miners will probably reevaluate when a new block is found)
Maybe a pseudorandomly long "lock" on a job, each different for each miner, could be the solution.