with applications centered around AI4Science, robotics and PDE-constrained optimization.
Besides doing research, I am passionate about teaching
in academia and at
KI macht Schule
to bring AI and Machine Learning fundamentals to high school.
If you're passionate about research and interested in working with me, I'm always happy to hear from motivated
students. If you'd like to get involved, please reach out with a summary of your research interests,
along with your up-to-date CV and academic transcript.
Patents
Florian Wolf , Alexander Novy, Tim Harr
“Contrast Analysis Dashboard for Automated Anomaly Analysis of Vehicles”,
Mercedes-Benz Research and Development, Patent, 01/2023
Florian Wolf, Alexander Novy “Method for estimating energy consumption of electric vehicles and its use
for determining a navigation route”, Mercedes-Benz Research and Development, Patent, 11/2022,
public link
While maximum-entropy exploration is central to RL, real-world applications are often
hindered by safety constraints and a lack of additive structure. We propose Policy Gradient
Penalty (PGP), a single-loop method that enforces general occupancy-measure constraints via
quadratic penalty regularization. By leveraging hidden convexity,
we provide the first global last-iterate convergence guarantees for this non-convex setting.
First algorithms with provable global guarantees for constrained non-convex optimization via hidden convexity.
Our methods bypass constraint qualifications, handle hidden convex equality constraints,
and work directly with gradient oracles in the non-convex space.
Applications include safe control and reinforcement learning, where we establish global optimality and oracle
complexities matching unconstrained hidden convex optimization.
AE+SINDy-C: a data-efficient, interpretable, and scalable Dyna-style Model-Based RL framework for PDE control,
combining SINDy-C with autoencoders for dimensionality reduction. Applied to the 1D Burgers and 2D Navier-Stokes
equations, the method enables fast rollouts, reduces environment interactions by up to 10x,
and yields an interpretable latent dynamics model, outperforming a model-free baseline.
Bayesian spatio-temporal product partition model to cluster PM2.5 air quality data from multiple monitoring stations
in Northern Italy, capturing both spatial and temporal patterns. The model outperforms a spatial-only baseline in
predictive performance, offering smoother and more insightful pollution trend analysis.