This is something which subtitle programs use, you get shown a diagram like this which includes the amplitude over time, together with the playing position.
It would be nice to have this also for music files or files for language learning.
Is there a chance that this could be implemented?