When a real sound originates from the right of the listener, it is received by the right ear before the left ear. This timing difference between the ears allows the listener to localize the sound as coming from the right.

Although the music is played normally, the translations embedded in the song are delayed between the ears to match the timing that would occur if the sound had originated from the right. This lets the listener distinguish the two sources more clearly and shift their attention to whichever one they want.
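The interaural delay described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the project's actual implementation: the function names `apply_itd` and `mix_with_music`, the use of plain lists of samples, and the default delay of 0.6 ms (roughly the largest timing difference a human head produces) are all hypothetical choices made for the example.

```python
def apply_itd(mono, sample_rate, itd_seconds=0.0006):
    """Split a mono signal into (left, right) with the left channel delayed.

    Delaying the left ear mimics a source on the listener's right.
    The 0.6 ms default is an assumed, illustrative value.
    """
    delay = round(itd_seconds * sample_rate)  # delay in whole samples
    right = list(mono) + [0.0] * delay        # right ear receives it first
    left = [0.0] * delay + list(mono)         # left ear receives it later
    return left, right


def mix_with_music(music, speech, sample_rate, itd_seconds=0.0006):
    """Play music identically in both ears and add the delayed speech."""
    left_speech, right_speech = apply_itd(speech, sample_rate, itd_seconds)
    n = max(len(music), len(left_speech))

    def pad(samples):
        # Zero-pad to a common length so the channels can be summed.
        return list(samples) + [0.0] * (n - len(samples))

    music_padded = pad(music)
    left = [m + s for m, s in zip(music_padded, pad(left_speech))]
    right = [m + s for m, s in zip(music_padded, pad(right_speech))]
    return left, right
```

Because the music is identical in both channels, it is heard as centered, while the speech arrives earlier at the right ear and is therefore heard as coming from the right.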