Someone generated a telekinesis short film with a single prompt: "Use mental powers to make a TV appear to levitate—when the hand forms a fist, the TV gets crushed mid-air."
The timing and pacing could still be tightened, but it's wild that this level of physics-like interaction and object manipulation can come from one text prompt. AI models now appear to handle spatial reasoning, force dynamics, and temporal sequencing that would have required manual VFX work not long ago.
This kind of prompt-to-physics generation hints at where video synthesis is heading: less about frame interpolation, more about modeling causality and physical constraints in 3D space.