This is mindblowing. Like this could be real, and I'm learning stuff from it: <i>There's an Indian epic that's 10 times as long as ...</i><p>There's some audio distortion (sounds like clips cut together, little "notches" in the soundscape) but apart from that, and some weirdness in sensing "the spatial location" where this audio was recorded...the concepts and the dialog are amazing.<p>Some parts are weird...but people can be weird. If you tidied this up, and added the right sound effects and audio processing, without the cue that this is AI generated...holy fuck, I think people would believe it. Particularly if you cut it together as a "highlights reel". Jobs does sound a bit off tho, a bit thin...there should be enough data on him to do a sparse reconstruction of his voice to a level of accuracy beyond human discernment tho.<p>The thing this got wrong about Jobs's voice cadence, tho, is: Jobs speaks a lot more slowly and deliberately, and with a lot more pauses, than here. I suspect the cadence / timing is not so emphatically modelled by this AI.<p>I think they're also missing some emotional trajectory coherence in both their voices. Like the emotional register of the voice does not sound or transition as naturally, and is less diverse.<p>Incredible PoC. AI folks are the new dark wizards. WTF can they not do? That list is shorter