Seminars and Talks

Integration-free Kernels for Equivariant Gaussian Process Modelling

by Tim Steinert

Date:	Friday, Nov. 21
Time:	14:45
Location:	N10_302, Institute of Computer Science

Details

Our guest speaker is Tim Steinert from the University of Bern.

You are all cordially invited to the CVG Seminar on November 21st, 2025 at 2:45 pm CEST

in person at the Institute of Computer Science: room 302, Neubrückstrasse 10, 3012 Bern
via Zoom (passcode is 621775).

Abstract

We study the incorporation of equivariances into vector-valued GPs and more general classes of random field models. While kernels guaranteeing equivariances have been investigated previously, their evaluation is often computationally prohibitive due to required integrations over the involved groups. In this work, we provide a kernel characterization of stochastic equivariance for centred second-order vector-valued random fields and we construct integration-free equivariant kernels based on the notion of fundamental regions of group actions. We establish data-efficient and computationally lightweight GP models for velocity fields and molecular electric dipole moments and demonstrate that proposed integration-free kernels may also be leveraged to extract equivariant components from data.

Bio

Tim Steinert is a PhD student in statistics at the University of Bern, working in the research group of Prof. David Ginsbourger. His research focuses on kernel design and inference for Gaussian process models, with applications to equivariant modeling of molecular data. He holds a Master’s Degree in Applied Mathematics from ETH Zurich.

Mean-field Transformer Models

by Giuseppe Bruno

Date:	Friday, Nov. 7
Time:	14:45
Location:	N10_302, Institute of Computer Science

Details

Our guest speaker is Giuseppe Bruno from the Institute of Mathematical Statistics and Actuarial Science (IMSV) at the University of Bern.

You are all cordially invited to the CVG Seminar on November 7th, 2025 at 2:45 pm CEST

in person at the Institute of Computer Science: room 302, Neubrückstrasse 10, 3012 Bern
via Zoom (passcode is 459500).

Abstract

While transformers have revolutionized machine learning, a fundamental understanding of how they construct internal representations remains a central challenge. This talk will present a recent theoretical framework that models the evolution of tokens as a mean-field interacting particle system, with network depth interpreted as time. The resulting mathematical description of the token distribution shows that, under certain regimes, tokens self-organize into clusters across multiple timescales, creating structure from initially random states. This mechanism offers a potential explanation for how meaning emerges in these models, while uncovering links to classical mathematical equations and other machine learning paradigms, and raising several open problems.

Bio

Giuseppe Bruno is a PhD student at the Institute of Mathematical Statistics and Actuarial Science (IMSV) at the University of Bern, working in the research group of Prof. Andrea Agazzi. His research explores the mathematical foundations of machine learning, with a specific focus on interacting particle systems and the theory of transformer models. He holds a Master's Degree in Mathematics from the University of Pisa.

Quantitative convergence of trained neural networks to Gaussian processes

by Eloy Mosig

Date:	Friday, Oct. 3
Time:	14:45
Location:	N10_302, Institute of Computer Science

Details

Our guest speaker is Eloy Mosig from the University of Pisa.

You are all cordially invited to the CVG Seminar on October 3rd, 2025 at 2:45 pm CEST

in person at the Institute of Computer Science: room 302, Neubrückstrasse 10, 3012 Bern
via Zoom (passcode is 315364).

Abstract

In this talk, we study the quantitative convergence of trained shallow neural networks to their associated Gaussian processes in the infinite width limit. While previous work has established qualitative convergence under broad settings, precise, finite-width estimates remain limited, particularly during training. We provide explicit upper bounds on the quadratic Wasserstein distance between the network output and its Gaussian approximation at any positive training time, demonstrating polynomial decay with network width. Our results quantify how architectural parameters, such as width and input dimension, influence convergence, and how training dynamics affect the approximation error. This is joint work with Andrea Agazzi and Dario Trevisan.

Bio

Eloy Mosig is a PhD student at University of Pisa which is currently visiting Professor Andrea Agazzi's team at IMSV in Bern. His main research interests lie at the intersection of probability theory, machine learning and applied topology. He holds a Master's degree from University of Bologna.

Reconstructing Highly Folded Cortices - A Few-Shot Learning Approach to Investigate Universal Brain Folding

by Timo Blattner

Date:	Friday, Jun. 27
Time:	14:45
Location:	N10_302, Institute of Computer Science

Details

Our guest speaker is Timo Blattner. He will present his Master Thesis.

You are all cordially invited to the CVG Seminar on June 27th, 2025 at 2:45 pm CEST

in person at the Institute of Computer Science: room 302, Neubrückstrasse 10, 3012 Bern
via Zoom (passcode is 459500).

Abstract

Recently, it has been shown that all mammal brains fold in a similar fashion, following the same mechanical model of folding. However, cetaceans remain outliers, having a systematically more folded brain than expected. A current hypothesis suggests that this is due to the increase in ambient pressure on the brain when these species dive, but this remains to be shown. Reconstructing these cortical surfaces is extremely difficult due to their high degree of folding and has never been done accurately before. We present a novel cortical surface reconstruction method, based on a few-shot learning of 2D expert manual tracings in each scan, to segment the full 3D image. From the segmentation, we reconstruct the white matter surface and displace it to the pial surface using a diffeomorphism. We successfully reconstruct the brains of 3 non-cetacean and 4 cetacean brains. We investigate the number of labeled slices needed for training a model to accurately reconstruct the cortical surface, and benchmark our method in humans. We show that these models can be used to label unseen scans of anatomically similar species, eliminating the need for manual labor. Our measurements support the validity of this pressure hypothesis.

Bio

Timo Blattner is a Master's student in Computer Science at the University of Bern. During his studies, he worked part-time as a research assistant in the Neuroradiology Department at the University Hospital of Bern, where he focused on deep learning-based segmentation and neuro-morphometric measurements aimed at improving clinical diagnostics. His research sparked international collaborations with partners in the UK and Brazil, allowing him to broaden his knowledge from clinical applications to the wider field of comparative neuroscience and the foundational scaling of brain morphology.

Task Arithmetic for Removing Backdoor Attacks in Multi-Modal Foundation Models

by Christian-Alexandru Botocan

Date:	Friday, May. 23
Time:	14:45
Location:	N10_302, Institute of Computer Science

Details

Our guest speaker is Christian-Alexandru Botocan. He will present his Master Thesis.

You are all cordially invited to the CVG Seminar on May 23rd, 2025 at 2:45 pm CEST

in person at the Institute of Computer Science: room 302, Neubrückstrasse 10, 3012 Bern
via Zoom (passcode is 562026).

Abstract

Recent advancements in multi-modal models, like CLIP, have significantly enhanced AI tasks such as image classification, object recognition, and cross-modal retrieval by integrating image and language understanding. Assessing the robustness of Multi-Modal models is an important aspect for the safety of its users. In this talk, we will start with assessing the security of SOTA Multi-Modal models against L0-norm perturbation attacks by altereting less than 0.04% of the image. Then, we continue with the main talk focusing on the robustness of Multi-Modal Foundation Models against backdoor attacks. We will focus on addressing the issues of the current SOTA defence method and propose a new defence by using Task Arithmetic - a model-merging technique. The best proposed defense method incorporates Bayesian Optimization to find the optimal scaling factors of the task vectors representing different fine-tuned models. Our results show that these weighted combinations outperform the current SOTA defense, achieving a favorable balance between Attack Success Rate and Clean Accuracy.

Bio

Cristian-Alexandru Botocan recently graduated MSc. in Cybersecurity at EPFL-ETHZ. His academic journey starts with the Bachelor in Computer Science and Engineering at TU Delft where he opted for Data Science specialization, focusing on Reccomandation Systems both in academia and industry, with an internship at Amazon Music ML Team in Berlin. Cristian graduated his Bachelor with "Cum Laude" and also did additional research programme called "Honours Programme", where he was focusing on using AI for Side-Channel Attacks against Cryptographic Protocols. However, his research direction during the Master was in the AI security domain. Cristian did a research internship at armassuise Science + Technology, focusing on exploring the robustness of the Multi-Modal Models against Pixel-Perturbations (https://arxiv.org/pdf/2407.18251). His last research experience is represented by the Master Thesis, where he was focusing on a defence method against backdoor attacks for Multi-Modal Models.

begin
1(current)
2
3
end