You are using words that don't seem to make sense.
You are saying that the miner needs to "broadcast the transaction just before broadcasting the block of transactions". That's a bit like saying that if you want to drive your car to the store you need to drive your engine to the store before you drive your car there.
The miner certainly can broadcast the transaction by itself before he broadcasts the entire block of transactions, but he isn't required to. He can just send the block (which has the transaction in it).
Depending on the peers that the miner is communicating with there might be some importance to the order that the miner transmits the portions of the block, but that is a matter of communications protocol with the peer and not an intrinsic requirement for bitcoin's operation.
There is no need to transmit any transaction separately before broadcasting the solved block of transactions.
The header is 80 bytes. It consists of:
- A 4 byte version number
- The 32 byte SHA256 hash of the most recently solved previous block header (this is what turns the blocks into a "chain)
- The 32 byte SHA256 merkle root of the transactions in the block
- A 4 byte timestamp (this is chosen when the block header is built and can be different from actual time by as much as a few hours)
- A 4 byte representation of the current difficulty target
- A 4 byte nonce
Since most of the network will already have most of the transactions from a block, there is an opportunity for faster propagation of blocks if the block header and merkle tree are sent first, and then peers are allowed to request the transactions that are in the merkle tree which they don't yet have. In this way the miner doesn't need to broadcast all transactions, they just need to hand over the transactions that any peers haven't seen yet. Each peer should not consider the block to be valid, or solved, until they have received all the missing transactions and can verify that the block and all transactions are valid.