In the is voice the AI slop fix article we managed to automate the generation of the short explainer video fully. The biggest achievement was creating it entirely using AI with no human involvement at all. It is not perfect, but it is exponentially better than what was possible twelve months ago.
Why bother? I believe that with this tooling we should aspire to create ever more multi-modal articles. The idea is to have a short two-minute video that plays with the article, giving readers a sense of what's contained and helping them decide whether to read further, save for later, or take another action. I believe there is little value in having a brand-new technology and simply doing old things faster.

