I couldn’t get past the first few seconds of each speaker repeating the question and their slow inefficient speaking. What an infuriating way to consume information, what’s wrong with just plain text? Especially since some of these people basically repeat the same things. What’s next, a video version so we can waste bandwidth on their pointless hand gestures and facial expressions? Respect people’s attention spans.
Two things I'd like to see
1) show me the progress of the clip I'm listening to. You've got the length of the clip, but I have to listen to the whole thing, I can't skip through it. I can barely get through the first clip which is only 1:30
2) let me play at a faster rate. I'd easily listen to this at 1.25x or 1.5x