Updated: April 18, 2026
Track 01: AI Video Agents
This track assignment is an inference from the public BetaHacks page as of April 18, 2026. StoryCast also overlaps with Content Automation, but AI Video Agents is the strongest fit.
StoryCast is an autonomous multimodal agent that turns a single topic into a narrated one-minute short film using the BytePlus Seed generation stack.
StoryCast takes one plain-English idea, such as "the death of a star", and autonomously writes a scene-based narrative, generates storyboard images, synthesizes scene narration, animates each scene into video, and assembles the final film. The result is a complete explainer-style video from a single prompt.
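The stage order described above can be sketched in Python as follows. This is an illustrative outline only, not StoryCast's actual code: the names SceneAssets, run_pipeline, mux_command, and concat_command are hypothetical, the model calls are passed in as plain callables so that no BytePlus API is assumed, and the ffmpeg arguments are one common way to mux and concatenate clips rather than the flags the project necessarily uses. The sketch only builds the ffmpeg command lists; it does not execute them.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple


@dataclass
class SceneAssets:
    """Files produced for one scene as it moves through the stages (hypothetical type)."""
    index: int
    narration: str
    image_path: str = ""
    audio_path: str = ""
    clip_path: str = ""
    muxed_path: str = ""


def mux_command(clip: str, audio: str, out: str) -> List[str]:
    # Copy the video stream, encode the narration as AAC,
    # and stop at the end of the shorter of the two inputs.
    return ["ffmpeg", "-y", "-i", clip, "-i", audio,
            "-map", "0:v:0", "-map", "1:a:0",
            "-c:v", "copy", "-c:a", "aac", "-shortest", out]


def concat_command(list_file: str, out: str) -> List[str]:
    # ffmpeg's concat demuxer reads a text file of "file '<path>'" lines.
    # Writing that list file is left out of this sketch.
    return ["ffmpeg", "-y", "-f", "concat", "-safe", "0",
            "-i", list_file, "-c", "copy", out]


def run_pipeline(
    topic: str,
    plan: Callable[[str], List[str]],
    make_image: Callable[[SceneAssets], str],
    make_audio: Callable[[SceneAssets], str],
    make_clip: Callable[[SceneAssets], str],
) -> Tuple[List[SceneAssets], List[List[str]]]:
    """Run the stages in order and return the scenes plus the ffmpeg commands to execute."""
    scenes = [SceneAssets(index=i, narration=text) for i, text in enumerate(plan(topic))]
    commands: List[List[str]] = []
    for scene in scenes:
        scene.image_path = make_image(scene)   # storyboard frame
        scene.audio_path = make_audio(scene)   # scene narration
        scene.clip_path = make_clip(scene)     # image-to-video; can read scene.image_path
        scene.muxed_path = f"scene_{scene.index:02d}.mp4"
        commands.append(mux_command(scene.clip_path, scene.audio_path, scene.muxed_path))
    # Final step: join the per-scene files into one film (file names are placeholders).
    commands.append(concat_command("scenes.txt", "storycast.mp4"))
    return scenes, commands
```

Passing the stages in as callables keeps each step swappable and testable with stubs, which matches the modular, inspectable flow the project describes. Each returned command could then be run with subprocess.run(cmd, check=True).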
StoryCast is built to show what a real video agent looks like when the orchestration itself is part of the product. A user provides one topic, and the system breaks it into a structured scene blueprint with narration, visual direction, motion cues, and tone metadata. Each scene becomes a storyboard frame, each frame becomes a video clip, each scene receives narration, and the pipeline merges everything into a polished 60-second film. The full process is visible, modular, and explainable, which makes it easy to demo and easy for judges to evaluate as true agentic execution rather than a single opaque generation call.
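As a rough illustration of what a structured scene blueprint could look like, the Python sketch below parses a planner's JSON output into typed scenes and checks that the scene durations add up to roughly 60 seconds. The field names (narration, visual_direction, motion_cue, tone, duration_s), the top-level "scenes" key, and the one-second tolerance are assumptions made for this example; the actual StoryCast schema is not documented here.

```python
import json
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class Scene:
    """One entry in the scene blueprint (hypothetical field names)."""
    narration: str
    visual_direction: str
    motion_cue: str
    tone: str
    duration_s: float


def parse_blueprint(raw_json: str, target_s: float = 60.0,
                    tolerance_s: float = 1.0) -> List[Scene]:
    """Parse planner output and reject blueprints that miss the target length."""
    payload = json.loads(raw_json)
    # Scene(**item) raises TypeError if a field is missing or unexpected,
    # so malformed planner output fails here instead of in a later stage.
    scenes = [Scene(**item) for item in payload["scenes"]]
    if not scenes:
        raise ValueError("blueprint contains no scenes")
    total = sum(scene.duration_s for scene in scenes)
    if abs(total - target_s) > tolerance_s:
        raise ValueError(f"scenes total {total:.1f}s, expected about {target_s:.0f}s")
    return scenes
```

Validating the blueprint before any image, speech, or video generation starts means a bad plan is caught before the slower generation stages run.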
Seed 2.0: scene planning and narrative structure
Seedream 5.0: keyframe image generation
Seedance 2.0: image-to-video clip generation
Seed Speech or temporary fallback TTS: scene narration
ffmpeg: clip muxing and final concatenation
FastAPI: demo surface
GitHub Actions: cloud render path inside the public repository

0:00-0:20: Open with the finished StoryCast video output
0:20-1:05: Show the user prompt, generation flow, and final result inside the app or GitHub workflow
1:05-1:35: Walk through the architecture and explain the model roles
1:35-2:00: Explain the future vision for education, science communication, and brand storytelling

The current successful public render used an ElevenLabs fallback for TTS because BytePlus Speech credentials were not available at run time. For strict BytePlus-only final compliance, switch the narration layer back to Seed Speech and rerun the pipeline with real BYTEPLUS_TTS_APP_ID, BYTEPLUS_TTS_TOKEN, and BYTEPLUS_TTS_CLUSTER values.
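The narration fallback described above could be selected along these lines. This is a minimal sketch under stated assumptions: the three environment variable names come from the note above, while the function name choose_tts_provider, the strict flag, and the provider labels "seed_speech" and "elevenlabs" are hypothetical and not taken from the StoryCast code.

```python
import os
from typing import Mapping, Optional

# Credential names taken from the note above.
REQUIRED_BYTEPLUS_VARS = (
    "BYTEPLUS_TTS_APP_ID",
    "BYTEPLUS_TTS_TOKEN",
    "BYTEPLUS_TTS_CLUSTER",
)


def choose_tts_provider(env: Optional[Mapping[str, str]] = None,
                        strict: bool = False) -> str:
    """Return a provider label; the labels themselves are hypothetical.

    With strict=True, missing BytePlus credentials raise an error instead of
    silently falling back, which suits a BytePlus-only compliance run.
    """
    env = os.environ if env is None else env
    # Treat unset and empty variables the same way.
    missing = [name for name in REQUIRED_BYTEPLUS_VARS if not env.get(name)]
    if not missing:
        return "seed_speech"
    if strict:
        raise RuntimeError("Missing BytePlus Speech credentials: " + ", ".join(missing))
    return "elevenlabs"
```

Calling choose_tts_provider(strict=True) in the final render job would make a run fail early if any of the three values is absent, rather than producing a film narrated by the fallback provider.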