About me

Gemma Canet Tarrés

I have recently completed my PhD in Computer Vision, advised by Prof. John Collomosse and Dr Andrew Gilbert at the Centre for Vision, Speech and Signal Processing at the University of Surrey. During my PhD studies, I completed two research internships at Adobe Research, under the supervision of Soo Ye Kim, and one research internship in Francesc Moreno-Noguer's team at Amazon Science.

My research interests include image and video generation and editing, object compositing, style transfer, and visual storytelling. During my PhD, I explored how multimodal inputs can be leveraged to improve controllability in image generation models. This work has led to publications at top-tier conferences such as CVPR and ECCV, including a highlight paper at CVPR 2025.

Previously, I studied Mathematics and Physics Engineering at the Universitat Politècnica de Catalunya (UPC) under the double degree program of CFIS (Center for high interdisciplinary training). During my last year, I completed my thesis at the University of Toronto, advised by Prof. Sven Dickinson. Later on, I completed the Master's in Computer Vision organized by the Universitat Autònoma de Barcelona (UAB), where I worked with Montse Pardàs on my dissertation.

During my undergraduate studies, I worked as a software developer in two summer internships, at Wiris Math and BaseTIS. While studying for my master's, I worked part-time at the former start-up Vilynx, taking part in several NLP projects. Finally, right before starting my PhD, I spent six months at InterDigital, working as a research intern under the supervision of Louis Chevallier.

If you’re interested in knowing more about my work, check out my publications or feel free to contact me.


2025