this is so cool - generating an image, then animating it all orchestrated by a minimal agent. full open source stack: agent, llm, image generator and video generator tied together with mcp. gpu poor? use hugging face generous zero gpu allowance to try.
Vaibhav (VB) Srivastav
Vaibhav (VB) Srivastav14.8. klo 01.10
OpenAI gpt-oss 120B orchestrates a full video using Hugging Face spaces! 🤯 All of it, in one SINGLE prompt: create an image of a Labrador and use it to generate a simple video of it 🛠️ Tools used: 1. Flux.1 Krea Dev by @bfl_ml 2. LTX Fast by @Lightricks That's it, gpt-oss 120B is one of the BEST open source models I've used for tool calling so far! Kudos @OpenAI 🤗
7,62K