A multi-head Turing machine is a Turing machine with a single tape but multiple heads. Initially, all the heads are positioned above the first cell of the input. The transition function is modified to allow reading, writing, and moving of the heads. Formally,
$\delta: Q \times \Gamma^k \rightarrow Q \times \Gamma^k \times \{L, R\}^k$
where $k$ is the number of heads. The expression
$\delta(q_i, a_1, ..., a_k) = (q_j, b_1, ..., b_k, L, R, ..., L)$
means that if the machine is in state $q_i$ and the heads from 1 to $k$ read symbols $(a_1, ..., a_k)$, then the machine transitions to state $q_j$, writes symbols $(b_1, ..., b_k)$, and moves the heads left and right as specified.
Prove that any multi-head Turing machine can be simulated by a single-tape deterministic Turing machine.
I already know how to demonstrate multi-tape Turing machines are equivalent to normal Turing machines. I could easily demonstrate that for any multi-head Turing machine $M_{MH}$, there exists some multi-tape Turing machine $M_{MT}$ such that L($M_{MH}$) = L($M_{MH}$).
The exercise asks you to explicitly use single-tape deterministic Turing machine, unfortunately even using different markers and settings for $w$ I can't solve the exercise.
Any hint?
