A Very Big Video Reasoning Suite

We are betting on a future in which video reasoning becomes the next fundamental intelligence paradigm after language reasoning, one in which spatiotemporal, embodied experience of the world can be captured more naturally.

Data Engines

circle_central_dot

Knowledge out-of-domain test set

Prompt

A row of dots is shown. Circle the dot that is in the middle by count (the one with an equal number of dots on each side).
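The selection rule in this prompt can be sketched in a few lines. This is a minimal illustration, not the data engine's own code: the function name `middle_dot_index` and the representation of dots as a list of x-coordinates are assumptions made for the example. It also assumes that a row with an even number of dots has no valid answer.

```python
def middle_dot_index(xs):
    """Return the index (into xs) of the dot that has an equal number of
    dots on each side, or None if the count is even and no such dot exists.

    xs is a hypothetical list of dot x-coordinates, in any order.
    """
    if len(xs) % 2 == 0:
        return None  # an even count has no dot with equal neighbours on both sides
    # Sort the indices by x-position, then pick the one in the middle by count.
    order = sorted(range(len(xs)), key=lambda i: xs[i])
    return order[len(xs) // 2]
```

Note that "middle by count" depends only on the ordering of the dots, not on their spacing, so unevenly spaced dots are handled the same way.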

First Frame

Last Frame

Video

shape_outline_then_move

Abstraction in-domain test set

Prompt

The scene shows an analogy A→B→C :: D→?→? with two rows of shapes and arrows. On the top row, a filled trapezoid first becomes an outline-only trapezoid (step 1), then moves up by a small amount (step 2). On the bottom row, the heart starts filled. Apply the same two-step transformation: first convert it to outline-only style, then move it up by a small amount, keeping its shape and size the same while only the style and position change.

First Frame

Last Frame

Video

find_keys_and_open_doors

Spatiality training set

Prompt

In the maze, the agent is the green circle. First move the agent to collect the key (diamond shape), then move the agent to the door (hollow rectangle). Use the shortest path for each movement. Show the complete movement step by step.
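One way to compute a reference solution for this kind of prompt is breadth-first search on a grid, run once from the agent to the key and once from the key to the door. The sketch below is an illustration under stated assumptions, not the suite's generator: the grid encoding (`#` for walls), the 4-connected movement, and the function names `shortest_path` and `key_then_door` are all hypothetical. It also treats the door as an ordinary walkable cell during the first leg.

```python
from collections import deque


def shortest_path(grid, start, goal):
    """Breadth-first search on a 4-connected grid.

    grid is a list of equal-length strings where '#' marks a wall
    (an assumed encoding). Returns the list of (row, col) cells from
    start to goal inclusive, or None if the goal is unreachable.
    """
    rows, cols = len(grid), len(grid[0])
    prev = {start: None}  # visited cells mapped to their predecessor
    queue = deque([start])
    while queue:
        cell = queue.popleft()
        if cell == goal:
            path = []
            while cell is not None:  # walk predecessors back to the start
                path.append(cell)
                cell = prev[cell]
            return path[::-1]
        r, c = cell
        for nr, nc in ((r - 1, c), (r + 1, c), (r, c - 1), (r, c + 1)):
            inside = 0 <= nr < rows and 0 <= nc < cols
            if inside and grid[nr][nc] != "#" and (nr, nc) not in prev:
                prev[(nr, nc)] = cell
                queue.append((nr, nc))
    return None


def key_then_door(grid, agent, key, door):
    """Concatenate the shortest agent-to-key and key-to-door paths."""
    to_key = shortest_path(grid, agent, key)
    to_door = shortest_path(grid, key, door)
    if to_key is None or to_door is None:
        return None
    return to_key + to_door[1:]  # drop the duplicated key cell
```

Because every move has the same cost, breadth-first search returns a shortest path for each leg, which matches the prompt's requirement of "the shortest path for each movement".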

First Frame

Last Frame

Video

multiple_occlusions_horizontal

Transformation training set

Prompt

The scene shows 3 objects arranged horizontally on the right side of the frame, with a dark rectangular mask initially positioned on the left side. Move the mask horizontally to the right in a continuous motion until it leaves the frame. As it moves, the mask passes in front of the objects, temporarily blocking them from view.
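The occlusion in this prompt reduces to a one-dimensional interval-overlap check along the horizontal axis. The following sketch assumes that objects and the mask are described by a left edge and a width in the same pixel units; the function name `occluded_objects` and the sample coordinates are hypothetical and not taken from the data engine.

```python
def occluded_objects(mask_left, mask_width, objects):
    """Return the indices of objects at least partially covered by the mask.

    objects is a list of (left, width) horizontal extents. Two half-open
    intervals overlap when each one starts before the other ends.
    """
    mask_right = mask_left + mask_width
    return [
        i
        for i, (left, width) in enumerate(objects)
        if left < mask_right and mask_left < left + width
    ]


# Hypothetical layout: three objects on the right of a 100-pixel-wide frame.
objects = [(60, 10), (75, 10), (90, 10)]
# Sweep a 20-pixel-wide mask from the left edge until it has left the frame.
timeline = [occluded_objects(x, 20, objects) for x in range(0, 101, 10)]
```

Evaluating the check at each mask position gives, frame by frame, which objects should be hidden and which should reappear unchanged once the mask has passed.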

First Frame

Last Frame

Video

identify_pentagons

Perception out-of-domain test set

Prompt

Multiple polygons are shown; exactly one of them is a pentagon (5 sides). Identify that pentagon and mark it with a red circle that expands from the inside out to encircle the shape. Do not change anything else.

First Frame