Aram Davtyan

I am a Postdoctoral Researcher in the Computer Vision Group at the University of Bern. I earned my Ph.D. in Computer Science from the University of Bern in 2024, where I was supervised by Prof. Dr. Paolo Favaro. Prior to that, I completed a Specialist degree (equivalent to B.S. + M.S.) in Fundamental Mathematics and Mechanics at MSU in 2020. Additionally, I graduated from YSDA in 2018. My research interests include Machine Learning, Computer Vision, Generative AI, and World Models.

CV Scholar Github Twitter

Publications

Rethinking Visual Intelligence: Insights from Video Pretraining

Pablo Acuaviva, Aram Davtyan, Mariam Hassan, Sebastian Stapf, Ahmad Rahimi, Alexandre Alahi, Paolo Favaro

arXiv, 2025

The second version of the Gen2Gen paper, in which we demonstrate that VDMs are more data efficient than LLMs in learning new visual tasks.

PDF arXiv

From Generation to Generalization: Emergent Few-Shot Learning in Video Diffusion Models

Pablo Acuaviva, Aram Davtyan, Mariam Hassan, Sebastian Stapf, Ahmad Rahimi, Alexandre Alahi, Paolo Favaro

arXiv, 2025

A few-shot fine-tuning framework that repurposes VDMs for new tasks using only a handful of examples.

Project Page PDF arXiv Code

KOALA++: Efficient Kalman-Based Optimization of Neural Networks with Gradient-Covariance Products

Zixuan Xia, Aram Davtyan, Paolo Favaro

NeurIPS, 2025

An extension of KOALA, a neural network optimization algorithm based on Kalman filtering, with implicit full weights covariance matrix.

PDF arXiv

MIRAGE: Unsupervised Single Image to Novel View Generation with Cross Attention Guidance

Llukman Cerkezi, Aram Davtyan, Sepehr Sameni, Paolo Favaro

ICCVW, 2025

Single image to novel view synthesis without any supervision.

PDF arXiv

Faster Inference of Flow-Based Generative Models via Improved Data-Noise Coupling

Aram Davtyan, Leello Tadesse Dadi, Volkan Cevher, Paolo Favaro

ICLR, 2025

A method that straightens sampling trajectories in the flow matching framework via storing and exchanging locally optimal data-noise couplings across minibatches.

Project Page Code

GEM: A Generalizable Ego-Vision Multimodal World Model for Fine-Grained Ego-Motion, Object Dynamics, and Scene Composition Control

Mariam Hassan, Sebastian Stapf, Ahmad Rahimi, Pedro M B Rezende, Yasaman Haghighi, David Brüggemann, Isinsu Katircioglu, Lin Zhang, Xiaoran Chen, Suman Saha, Marco Cannici, Elie Aljalbout, Botao Ye, Xi Wang, Aram Davtyan, Mathieu Salzmann, Davide Scaramuzza, Marc Pollefeys, Paolo Favaro, Alexandre Alahi

CVPR, 2025

A multi-modal and multi-domain ego-vision world model with precise control over object dynamics, ego-agent motion and human poses.

Project Page PDF arXiv Code

CAGE: Unsupervised Visual Composition and Animation for Controllable Video Generation

Aram Davtyan, Sepehr Sameni, Björn Ommer, Paolo Favaro

AAAI, 2025

A model to compose and animate scenes from sparse sets of visual features.

Project Page PDF arXiv Code

Learn the Force We Can: Enabling Sparse Motion Control in Multi-Object Video Generation

Aram Davtyan, Paolo Favaro

AAAI, 2024

A model to animate single frames with sparse motion control.

Project Page PDF arXiv Code

Efficient Video Prediction via Sparsely Conditioned Flow Matching

Aram Davtyan, Sepehr Sameni, Paolo Favaro

ICCV, 2023

Conditioning only on a few randomly chosen past frames at each denoising step of flow matching results into a more efficient training procedure.

Project Page PDF arXiv Code

Controllable Video Generation through Global and Local Motion Dynamics

Aram Davtyan, Paolo Favaro

ECCV, 2022

A model to discover agents' action spaces from a dataset of videos in an unsupervised way. The action spaces are decomposed into global (2D shifts) and local (discrete) actions.

Project Page PDF arXiv Code

KOALA: A Kalman Optimization Algorithm with Loss Adaptivity

Aram Davtyan, Sepehr Sameni, Llukman Cerkezi, Givi Meishvili, Adam Bielski, Paolo Favaro

AAAI, 2022

A neural network optimization algorithm based on Kalman filtering.

Project Page PDF arXiv Code