Cross-View Splatter: Feed-Forward View Synthesis with Georeferenced Images

Matias Turkulainen^1*, Akshay Krishnan², Filippo Aleotti³, Mohamed Sayed³, Guillermo Garcia-Hernando³, Juho Kannala^1,4, Arno Solin^1,5, Gabriel Brostow^3,6, Daniyar Turmukhambetov³

¹Aalto University, ²Georgia Tech, ³Niantic Spatial, ⁴University of Oulu, ⁵ELLIS Institute Finland, ⁶UCL

🏠 Website 📚 BibTeX

News

[2026-03-26] Release geolocalized Tanks and Temples and DL3DV dataset information.

Overview

This repository contains code and data for Cross-View Splatter.

Code coming soon...

Geolocalized Benchmark Dataset

We propose a new task, referred to as "novel-view synthesis with georeferenced images" and construct a suitable benchmark dataset by pairing selected outdoor scenes from Tanks and Temples and DL3DV datasets with orhographic satellite imagery. We manually register and 3DoF align satellite images to COLMAP reconstructions for these scenes. In ./data folder, we release JSON files for Tanks and Temples and DL3DV containing, for each scene, the queried satellite image location (latitude, longitude) together with an SE(3) transform that maps the COLMAP point cloud into the satellite-aligned reference frame.

Each JSON entry provides the scene identifier (name), the satellite query location (latitude, longitude), and a transform with quaternion rotation, 3D translation, and isotropic scale.

COLMAP-to-geoaligned transform for a scene entry can be constructed as follows:

import json
import numpy as np
from scipy.spatial.transform import Rotation as R

def colmap_to_geoaligned(scene):
	"""Return the 4x4 transform that aligns the COLMAP point cloud to the satellite image at the query location."""
	transform_data = scene["fields"]["transform"]
	rot = transform_data["rotation"]
	trans = transform_data["translation"]
	scale = transform_data["scale"]["x"]

	rot = [rot["_x"], rot["_y"], rot["_z"], rot["_w"]]
	translation_vec = [trans["x"], trans["y"], trans["z"]]

	colmap2georef = np.identity(4)
	rotation_matrix_3x3 = R.from_quat(np.array(rot)).as_matrix()
	colmap2georef[0:3, 0:3] = rotation_matrix_3x3 * scale
	colmap2georef[0:3, 3] = np.array(translation_vec)
	return colmap2georef

with open("./data/dl3dv_point_cloud_transform_enu.json", "r") as f:
	scene_entries = json.load(f)

scene = scene_entries[0]
colmap2georef = colmap_to_geoaligned(scene)

BibTeX

If you find this work useful for your research, please consider citing our paper:

@inproceedings{turkulainen2026crossviewsplatter,
	title     = {{Cross-View Splatter: Feed-Forward View Synthesis with Georeferenced Images}},
	author    = {Turkulainen, Matias and Krishnan, Akshay and Aleotti, Filippo and Sayed, Mohamed and Garcia-Hernando, Guillermo and Kannala, Juho and Solin, Arno and Brostow, Gabriel and Turmukhambetov, Daniyar},
	booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
	year      = {2026}
}

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
data		data
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Cross-View Splatter: Feed-Forward View Synthesis with Georeferenced Images

News

Overview

Geolocalized Benchmark Dataset

BibTeX

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

Cross-View Splatter: Feed-Forward View Synthesis with Georeferenced Images

News

Overview

Geolocalized Benchmark Dataset

BibTeX

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Packages