Inspiration

Most AI video tools feel magical at first — until you try to change one small detail.

Remove a logo. Fix a hand. Change a background.

Suddenly, you’re forced to regenerate the entire video, losing consistency, spending more compute, and repeating the same trial-and-error loop.

Professional creators don’t work this way. They iterate. They refine. They make targeted changes without destroying everything else. Editors expect to scrub timelines, lock clips, tweak individual frames, and build layer by layer.

Segmenta was inspired by a simple question:

What if AI video generation worked like real editing — with localized changes, deterministic control, and iterative refinement?

Instead of treating AI video as a one-shot gamble, we wanted to treat it as a controllable creative medium.

Lights. Camera. Segmenta.


What it does

Segmenta is an AI-orchestrated video iteration system.

Instead of generating one monolithic video, Segmenta:

  • Breaks a video into logical segments
  • Allows users to grab and edit individual frames
  • Applies surgical AI edits
  • Regenerates only the affected segment
  • Preserves the rest of the timeline unchanged

This transforms AI video creation from:

generate → discard → retry

into:

generate → refine → lock → iterate

With Segmenta, creators can make small visual corrections without starting over, maintain consistency across edits, control costs by avoiding full regenerations, and treat AI video like a living timeline instead of a static output.


How we built it

User Prompt / Assets ↓ Gemini Orchestrator (planning & segmentation) ↓ Segment Objects (constraints per clip) ↓ Video Renderer (Replicate / compatible models) ↓ Timeline Assembly (FFmpeg)

Frontend:

  • React-based timeline editor
  • Multi-track video and audio lanes
  • Frame grabber and preview canvas
  • Inspector panel for AI-powered frame surgery

AI Orchestration:

  • Gemini breaks prompts into structured segments
  • Maintains narrative and visual consistency

Frame-Level Editing:

  • Users edit individual frames
  • Edits become constraints for regeneration

Assembly:

  • FFmpeg stitches segments deterministically

Challenges we ran into

  • Designing an intuitive workflow
  • Preventing visual drift
  • Working within compute limits
  • Explaining that Segmenta is a workflow, not just a generator

Accomplishments that we're proud of

  • Built a real timeline-based AI editor
  • Enabled localized regeneration
  • Created renderer-agnostic architecture
  • Delivered an end-to-end prototype

What we learned

  • Generative AI is more useful with structure
  • Control matters more than novelty
  • UX is as important as models

What's next for Segmenta

  • More renderers
  • Consistency checking
  • Multi-language audio
  • Brand style memory
  • Collaboration tools

Segmenta’s long-term vision is to become the reference workflow for controllable AI video production.

Lights. Camera. Segmenta.

Share this project:

Updates