Looking at the ASCII picture of a Merkle tree on this page might help you understand:
https://en.bitcoin.it/wiki/Dump_format#CBlockNotice that the transaction hashes are added to the bottom of the tree in order of where they appeared in the block.
Is the merkle branch for my transaction not stored in my wallet until after someone on the network has included my transaction in a new block? (I guess this is obvious, but looking for validation)
Right.
When a node is creating a new block with my transaction how does it go about creating the merkle tree, is that arbitrary? Does the block creator only include transactions he wants in any order he wants?
The order is arbitrary, though Bitcoin currently sorts them by time of acceptance. The structure of the tree is based on the transactions: given the same transaction hashes in the same order, you will calculate the same tree.
Assuming I understand this correctly, once the block is created does the block creator then send out a merkle branch to each person involved in a transaction contained in that block?
No. They publish the block, which contains the transactions. Everyone downloads the block and generates the Merkle tree from the transactions. Trees aren't normally transmitted over the network.
Lightweight clients throw away the block once they've generated the tree. Ultra-lightweight clients that don't download full blocks rely on the sender of BTC to give them the Merkle branch by some means.
When a merkle branch is stored in a wallet, I assume it stores the hash at each level of the branch, does it also store something that links the hash at that level to another transaction in that block?
I'm not sure what you mean. The Merkle tree contains transaction hashes (at the very bottom of the tree), which are for transactions in the block.
It seems like if something isn't stored that links a hash at a level in my merkle branch to a transaction then I would only need to spoof one hash in my merkle branch to convince someone a spoofed transaction was in the block chain. Does that make sense?
You would "only" need to spoof one hash, but you'd need to modify the tree in such a way that the Merkle root remains the same. This is nearly impossible.