Wan 2.1 is solid but you start to get pretty bad continuity / drift issues when genning more than 81 frames (approx 5 seconds of video) whereas FramePack lets you generate 1+ minute.
Wow, the examples are fairly impressive and the resources used to create them are practically trivial. Seems like inference can be run on previous generation consumer hardware. I'd like to see throughput stats for inference on a 5090 too at some point.
This is the first decent video generation model that runs on consumer hardware. Big deal and I expect ControlNet pose support soon too.
https://github.com/Lightricks/LTX-Video
It can leave LLMs behind...
'Cause LLMs don't dance, and if they don't dance, well, they're no friends of mine.