Inspiration
Most AI video tools feel magical at first — until you try to change one small detail.
Remove a logo. Fix a hand. Change a background.
Suddenly, you’re forced to regenerate the entire video, losing consistency, spending more compute, and repeating the same trial-and-error loop.
Professional creators don’t work this way. They iterate. They refine. They make targeted changes without destroying everything else. Editors expect to scrub timelines, lock clips, tweak individual frames, and build layer by layer.
Segmenta was inspired by a simple question:
What if AI video generation worked like real editing — with localized changes, deterministic control, and iterative refinement?
Instead of treating AI video as a one-shot gamble, we wanted to treat it as a controllable creative medium.
Lights. Camera. Segmenta.
What it does
Segmenta is an AI-orchestrated video iteration system.
Instead of generating one monolithic video, Segmenta:
- Breaks a video into logical segments
- Allows users to grab and edit individual frames
- Applies surgical AI edits
- Regenerates only the affected segment
- Preserves the rest of the timeline unchanged
This transforms AI video creation from:
generate → discard → retry
into:
generate → refine → lock → iterate
With Segmenta, creators can make small visual corrections without starting over, maintain consistency across edits, control costs by avoiding full regenerations, and treat AI video like a living timeline instead of a static output.
How we built it
User Prompt / Assets ↓ Gemini Orchestrator (planning & segmentation) ↓ Segment Objects (constraints per clip) ↓ Video Renderer (Replicate / compatible models) ↓ Timeline Assembly (FFmpeg)
Frontend:
- React-based timeline editor
- Multi-track video and audio lanes
- Frame grabber and preview canvas
- Inspector panel for AI-powered frame surgery
AI Orchestration:
- Gemini breaks prompts into structured segments
- Maintains narrative and visual consistency
Frame-Level Editing:
- Users edit individual frames
- Edits become constraints for regeneration
Assembly:
- FFmpeg stitches segments deterministically
Challenges we ran into
- Designing an intuitive workflow
- Preventing visual drift
- Working within compute limits
- Explaining that Segmenta is a workflow, not just a generator
Accomplishments that we're proud of
- Built a real timeline-based AI editor
- Enabled localized regeneration
- Created renderer-agnostic architecture
- Delivered an end-to-end prototype
What we learned
- Generative AI is more useful with structure
- Control matters more than novelty
- UX is as important as models
What's next for Segmenta
- More renderers
- Consistency checking
- Multi-language audio
- Brand style memory
- Collaboration tools
Segmenta’s long-term vision is to become the reference workflow for controllable AI video production.
Lights. Camera. Segmenta.
Log in or sign up for Devpost to join the conversation.