An adaptively inexact first-order method for bilevel optimization with application to hyperparameter learning [slides available]

Name: An adaptively inexact first-order method for bilevel optimization with application to hyperparameter learning [slides available]
Start: 2024-12-05T00:00:00Z
Location: University of Sydney

Slides

Abstract

A common problem in data science is the determination of model hyperparameters. One approach for learning hyperparameters is to use bilevel optimisation, where the lower-level problem is the standard learning optimisation problem, and the upper-level problem is to learn the hyperparameters (e.g. by minimising validation error). In this setting, particularly for large-scale problems, neither exact function values nor exact gradients are attainable, necessitating methods that only rely on inexact evaluation of such quantities. I will present a new bilevel optimisation algorithm with adaptive inexactness suitable for hyperparameter learning. Numerical results on problems from imaging demonstrate its robustness and strong performance. This is joint work with Mohammad Sadegh Salehi, Matthias Ehrhardt (University of Bath) and Subhadip Mukherjee (IIT Kharagpur).

Date

5 Dec 2024

Event

Sydney Workshop on Mathematics of Data Science

Location

University of Sydney

Lindon Roberts

Lecturer

My research is in numerical analysis and data science, particularly nonconvex and derivative-free optimization.