Why doesn't the audio match the text?

Follow