I played the example video ("how to change language in Slack"). I liked the result as a video, but I kept thinking that I'd like it even more as a regular article.<p>If the input is photo and text then I think I'd prefer to consume it by just looking at photos and text. What's the value of generating a video out of them?<p>For context, I do watch how to videos too, but I do that for actions that are harder to describe with just text and pictures; like learning an instrument, whittling, skateboarding, etc. It's very likely that I'm not the target audience.