That's correct, originally it was intended to be CPU minable and now it's GPU minable, and the ability to mine on GPUs and only GPUs is one of the major selling points for it right now. This will help maintain GPU minability.
The problem is getting the user base to adopt this... BIPs are notoriously difficult to implement. Since this change will not occur until about another couple years though, all coblee really needs to do is simply add this to the blockchain parameters in the program and get users to download the new version. The miners will also have to be updated, but that should be trivial.
Probably someone will need to do a testnet version as well that rapidly changes r values, so that we can ensure the pliability of r and test GPU mining with various r values. But these are all really minor changes. Coblee hasn't said anything about this one way or the other, maybe because he's too busy with other things.
I really think now's the time to future-proof the chain, though.
Whether or not the chain is ASIC minable or not remains to be seen, too. We know that a scrypt core only requires about 20,000 circuits (not much different than a SHA256 core). The clock rate of this core should be about 0.7x the memory. If an ASIC is only operating at 200-300 MHz, it shouldn't be hard to slap on some DDR3 memory clocked at ~800MHz and get vastly higher power efficiency than a GPU. The major difference will be that it will not be 100x faster than a GPU, and will probably only mine slightly faster than a GPU. The power consumption of this ASIC would probably be about 25-35W versus 200W for a GPU. That means you're looking at less than an order of magnitude enhancement in efficiency moving to ASICs, which should offput their widespread adoption for some time.
The easiest way to make a chain non-ASIC minable is to make the circuits insanely large and force them to utilize all the onboard components of the AMD GPUs. It will need to be architecture specific to ensure that the entirety of the GPU is being used, and there will need to be extensive testing to ensure that it does not have vulnerabilities in circuit simplification. The construction of such an algorithm will be a complicated matter.
Edit: I neglected to remember that SHA256 is included in the implementation used in Litecoin, so then I think ~45,000 circuits would be require for a whole core on the ASIC.