Zhipu AI just dropped RealVideo, a real-time conversational video system that creates animated responses straight from text prompts. The model is now live on Hugging Face, giving developers a new tool for building AI-generated characters and interactive digital assistants. This puts Zhipu AI right in the mix with other companies building multimodal AI that combines text, audio, and video into dynamic outputs.
RealVideo works as a streaming pipeline that connects an LLM text generator with voice synthesis and video production. The process starts with the LLM generating a text response, which is immediately converted into cloned-voice audio. That audio conditions a DiT (diffusion transformer) model that shapes the visual stream, and a VAE (variational autoencoder) decoder then produces the final real-time video. "The system supports character initialization through a reference image, enabling consistent identity and lip-sync throughout the interaction," according to the technical documentation.
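To make that flow concrete, here is a minimal Python sketch of the described text-to-speech-to-video chain. Every name below (generate_text, synthesize_voice, dit_shape_visuals, vae_decode, and the 20 ms-per-token timing) is a hypothetical stand-in for illustration, not Zhipu AI's actual API.

```python
from dataclasses import dataclass
from typing import Iterator

@dataclass
class AudioChunk:
    samples: bytes      # raw PCM audio for one slice of synthesized speech
    timestamp_ms: int   # position of this slice within the utterance

@dataclass
class VideoFrame:
    pixels: bytes       # decoded RGB frame data
    timestamp_ms: int

def generate_text(prompt: str) -> Iterator[str]:
    """Stand-in for the LLM stage: stream the reply token by token."""
    for token in ["Hello,", " nice", " to", " meet", " you."]:
        yield token

def synthesize_voice(token: str, start_ms: int) -> AudioChunk:
    """Stand-in for the cloned-voice TTS stage."""
    return AudioChunk(samples=b"\x00" * 320, timestamp_ms=start_ms)

def dit_shape_visuals(audio: AudioChunk, reference_image: bytes) -> bytes:
    """Stand-in for the DiT stage: speech audio plus the reference
    identity image condition the latent visual stream."""
    return b"latents"

def vae_decode(latents: bytes, timestamp_ms: int) -> VideoFrame:
    """Stand-in for the VAE decoder: latents become a displayable frame."""
    return VideoFrame(pixels=b"\x00" * 3, timestamp_ms=timestamp_ms)

def respond(prompt: str, reference_image: bytes) -> Iterator[VideoFrame]:
    """Chain the stages so frames stream out while text is still arriving."""
    clock_ms = 0
    for token in generate_text(prompt):
        audio = synthesize_voice(token, clock_ms)
        latents = dit_shape_visuals(audio, reference_image)
        yield vae_decode(latents, audio.timestamp_ms)
        clock_ms += 20  # assume 20 ms of speech per token, purely for illustration

if __name__ == "__main__":
    for frame in respond("Introduce yourself.", reference_image=b"<face bytes>"):
        print(f"frame ready @ {frame.timestamp_ms} ms")
```

The key property the sketch captures is that each stage hands its output downstream as soon as it is produced, rather than waiting for the full response to finish.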
Being able to generate continuous, synced video in real time is a big step up from older batch-processed video models. RealVideo runs as a looped streaming engine where each text input triggers new speech and video frames. It's built for conversational uses like AI companions, digital presenters, virtual call agents, and interactive media tools.
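The looped design can be pictured as a simple session loop. Continuing the hypothetical sketch above (reusing its respond() and VideoFrame stand-ins), each user turn feeds the same pipeline while the reference image stays fixed, which is what keeps the character's identity stable across turns.

```python
def display(frame: VideoFrame) -> None:
    """Stand-in for the client-side renderer; a real app would push
    the frame to a video surface instead of printing."""
    print(f"rendering frame @ {frame.timestamp_ms} ms")

def conversation_loop(reference_image: bytes) -> None:
    # One session: the reference image is set once at initialization,
    # then every user turn triggers a fresh pass through the
    # text -> speech -> video pipeline, streaming frames as they arrive.
    while True:
        user_text = input("you> ")
        if not user_text or user_text.lower() in {"quit", "exit"}:
            break
        for frame in respond(user_text, reference_image):
            display(frame)
```

Nothing here is batch-rendered: frames are displayed as soon as the upstream stages emit them, which is what separates this style of engine from older offline video generators.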
RealVideo's launch comes as major AI companies race to deliver more immersive multimodal systems that blend video, audio, and natural language at scale. For investors, this signals the rapid growth of real-time generative media, a sector that could drive future demand for compute infrastructure and AI-driven applications. Zhipu AI's decision to release the model on Hugging Face also shows how open-access distribution continues to shape adoption trends across the AI ecosystem.
Peter Smith