Does anyone remember about 10 years ago, maybe 15, there was a guy somewhere in Europe that created something similar, as his graduate or doctoral thesis. He demonstrated it by taking the video of Michael Jackson denying sexual abuse allegations. However, he, the creator, was speaking into the system, and the video and audio of Michael Jackson "saying" the same thing was playing in near real time.<p>He explained his system at a fairly detailed level for a short 5 minute video. He was breaking down the source clip into very small snippets of time, and with some algorithm, giving each snippet a single value, or a score. Then, as his realtime audio was analyzed in the same manner, the source snippet with the closest score to the realtime snippet was output, both audio and video. I've always wondered what happened to that project, and I've since forgotten the creator or the name of his project.
So I shared the video with a friend yesterday. To my surprise they were not convinced. Which makes me wonder if it breaks down for very adept lip readers.
The future is scary.<p>* deepfake can superimpose a face on another face<p>* Adobe is developing a program to make a voice say anything (from a couple of hours of voice input)<p>* this can sync you mouth to the new voice generated by adobe.<p>Combine all three.
I would love to hear a digital forensics expert weigh in on the doomsday prophesies surrounding this sort of thing. Yes,digital fakery is going to get more and more convincing, but we are not without weapons in the defense of truth. <a href="http://ieeexplore.ieee.org/abstract/document/5487389/" rel="nofollow">http://ieeexplore.ieee.org/abstract/document/5487389/</a>