Experience
π‘ Sakana AI
Research Scientist - Open-endedness
Tokyo, Japan
10/2024-Present
Open-Ended Model and Synthetic Task Coevolution: Implement system for endlessly coevolving an LLM-based multi-agent system alongside a continuously evolving synthetic task pool.
Model Archive Evolution: Leverage evolutionary model merging as genetic operators and quality-diversity algorithms for selection. Our population achieves better coverage than baselines on downstream benchmarks.
Task Archive Evolution: Design LLM-based agent to generate novel synthetic tasks implemented in code.
Infrastructure: Implement system using Celery orchestration across GPU workers, handling model merging, real-time vLLM inference, and sandboxed Docker evaluation with seamless scalability.
Diffusion for LM reasoning: Distill the reasoning process of an LLM into a Diffusion Model.
VLM Webagents: Develop Process Reward Model (PRM) for training VLM-based Webagents with RL.
π TU Darmstadt
Student Researcher - Reliable Multimodal AI Lab (Prof. Marcus Rohrbach)
Darmstadt, Germany
10/2023-09/2024
Modelling: Develop multimodal LLM (MLLM) for video understanding using PyTorch. Finetune image-text pretrained MLLM on video moment localization data, leveraging distributed training across 8 A100 GPUs.
Results: Beat state-of-the-art models by up to 11% on 3 benchmarks. First-author paper Chrono.
π¬ Fraunhofer Heinrich Hertz Institute
Student Researcher - Applied Machine Learning
Berlin, Germany
04/2022-09/2023
Data Engineering: Implement scraping and preprocessing units, retrieving data from varying sources.
Neural Network Modelling: Develop, implement, and tune graph neural network architecture for time series forecasting of dynamic
traffic flow using PyTorch.
π¬ Fraunhofer Heinrich Hertz Institute
Student Researcher - Photonic Components
Berlin, Germany
02/2019-03/2022
Education
π Technical University of Berlin, MSc
MSc Computer Science
GPA: 1.2/1.0 (equivalent to first-class honors)
Berlin, Germany
10/2021-09/2024
Thesis: Exploring Multimodal Large Language Models for Video and Language Tasks (Grade 1.0/1.0)
π Technical University of Berlin, BSc
BSc Computer Engineering
GPA: 1.8/1.0
Berlin, Germany
10/2018-09/2021
Thesis: Deployment and Evaluation of Deep-Reinforcement-Learning-Based Navigation Approaches on Real
Robots (Grade 1.0/1.0)
Community Contributions
π Reviewing Experience
I try to be a reviewer at top conferences that people donβt hate :)
2025: CVPR (Outstanding Reviewer), ICML, ICCV
2024: CVPR
π Teaching
I have a YouTube and Medium channel where I publish educational Machine Learning content targeted more towards beginners.