Thomas Melistas, Nikos Spyrou, Nefeli Gkouti: Benchmarking Counterfactual Image Generation

Abstract:

Generative AI has revolutionised visual content editing, empowering users to effortlessly modify images and videos. However, not all edits are equal. To perform realistic edits in domains such as natural image or medical imaging, modifications must respect causal relationships inherent to the data generation process. Such image editing falls into the counterfactual image generation regime. Evaluating counterfactual image generation is substantially complex: not only it lacks observable ground truths, but also requires adherence to causal constraints. Although several counterfactual image generation methods and evaluation metrics exist, a comprehensive comparison within a unified setting is lacking. In this talk we will present our work on building a comparison framework to thoroughly benchmark counterfactual image generation methods. We integrate all models that have been used for the task at hand and expand them to novel datasets and causal graphs, demonstrating the superiority of Hierarchical VAEs across most datasets and metrics. Our published framework is implemented in a user-friendly Python package that can be extended to incorporate additional SCMs, causal methods, generative models, and datasets for the community to build on.

Biography:

Thomas Melistas is currently a PhD student at the Archimedes Research Unit (Athena RC) supervised by Prof. Sotirios Tsaftaris, Prof. Yannis Panagakis (National and Kapodistrian University of Athens) and Dr. Giorgos Papanastasiou (Pfizer). His main research interests are causal machine learning and generative modeling. He is excited about the application of the above in the domains of medical imaging and computer vision. He received an Integrated MEng in Electrical and Computer Engineering from the National Technical University of Athens. His master’s thesis was on symbolic music generation. Prior to his PhD, he has worked as a Machine Learning Engineer and did research on the fields of speech emotion recognition and multimodal learning.

Nikos Spyrou is a PhD student at Archimedes RU / Athena RC and at the National and Kapodistrian University of Athens, coadvised by Prof. Yannis Panagakis and Prof. Sotirios Tsaftaris. His research interests mainly involve computer vision, representation learning, causality, generative models and applications in the medical domain. He obtained a MEng in Electrical and Computer Engineering from the National Technical University of Athens with specialization in computer science.

Nefeli Gkouti is a PhD student at Archimedes RU / Athena RC and at the National and Kapodistrian University of Athens, supervised by Prof. Yannis Panagakis and Prof. Sotirios Tsaftaris. Her main research interests include causality and interpretability in machine learning, representation learning, generative models and their application in healthcare. She holds a MSc degree in Computer Science from the Department of Informatics of the Athens University of Economics and Business, with specialization in machine learning, and a Bachelor degree from the Department of Mathematics of the National and Kapodistrian University of Athens.

Related events

Nov 27 2024 13.00 - 14.00

Jonathan Mason: Translating research from IDCOM to cancer clinics worldwide

Industry took a keen interest in our work, and I have been working at Elekta, a leading radiotherapy device manufacturer, since 2020, where we have directly translated work from our research at IDCOM into a product and have realised the benefits we envisioned. In this talk, I will share this journey and the things I have learnt about industrial research, IP and product development.

Further information

Thomas Melistas, Nikos Spyrou, Nefeli Gkouti: Benchmarking Counterfactual Image Generation

Abstract:

Biography:

Tags

Thomas Melistas, Nikos Spyrou, Nefeli Gkouti: Benchmarking Counterfactual Image Generation