Optimization – Machine Learning Research Blog

↧

Image may be NSFW.
Clik here to view.

Playing with positive definite matrices – I: matrix monotony and convexity

February 17, 2022, 12:31 pm

In a series of a few blog posts, I will present classical and non-classical results on symmetric positive definite matrices. Beyond being mathematically exciting, they arise naturally a lot in machine...

View Article

Image may be NSFW.
Clik here to view.

Playing with positive definite matrices – II: entropy edition

March 7, 2022, 1:26 pm

Symmetric positive semi-definite (PSD) matrices come up in a variety of places in machine learning, statistics, and optimization, and more generally in most domains of applied mathematics. When...

View Article

Image may be NSFW.
Clik here to view.

Information theory with kernel methods

April 3, 2022, 9:45 pm

In last month blog post, I presented the von Neumann entropy. It is defined as a spectral function on positive semi-definite (PSD) matrices, and leads to a Bregman divergence called the von Neumann...

View Article

Image may be NSFW.
Clik here to view.

Rethinking SGD’s noise

July 25, 2022, 6:32 am

It seemed a bit unfair to devote a blog to machine learning (ML) without talking about its current core algorithm: stochastic gradient descent (SGD). Indeed, SGD has become, year after year, the basic...

View Article

Image may be NSFW.
Clik here to view.

Rethinking SGD’s noise – II: Implicit Bias

September 18, 2022, 1:41 pm

In the previous post, we showed (or at least tried to!) how the inherent noise of the stochastic gradient descent algorithm (SGD), in the context of modern overparametrised architectures, is...

View Article

Image may be NSFW.
Clik here to view.

Sums-of-squares for dummies: a view from the Fourier domain

November 16, 2022, 4:53 am

In these last two years, I have been studying intensively sum-of-squares relaxations for optimization, learning a lot from many great research papers [1, 2], review papers [3], books [4, 5, 6, 7, 8],...

View Article

Image may be NSFW.
Clik here to view.

Discrete, continuous and continuized accelerations

December 15, 2022, 7:25 am

In optimization, acceleration is the art of modifying an algorithm in order to obtain faster convergence. Building accelerations and explaining their performance have been the subject of a countless...

View Article

Image may be NSFW.
Clik here to view.

Non-convex quadratic optimization problems

February 2, 2023, 6:46 am

Among continuous optimization problems, convex problems (with convex objectives and convex constraints) define a class that can be solved efficiently with a variety of algorithms and with arbitrary...

View Article

Image may be NSFW.
Clik here to view.

Revisiting the classics: Jensen’s inequality

March 13, 2023, 6:47 am

There are a few mathematical results that any researcher in applied mathematics uses on a daily basis. One of them is Jensen’s inequality, which allows bounding expectations of functions of random...

View Article

Image may be NSFW.
Clik here to view.

Unraveling spectral properties of kernel matrices – I

January 7, 2024, 10:18 am

Since my early PhD years, I have plotted and studied eigenvalues of kernel matrices. In the simplest setting, take independent and identically distributed (i.i.d.) data, such as in the cube below in 2...

View Article

Image may be NSFW.
Clik here to view.

Scaling laws of optimization

October 5, 2024, 8:16 am

Scaling laws have been one of the key achievements of theoretical analysis in various fields of applied mathematics and computer science, answering the following key question: How fast does my method...

View Article

Image may be NSFW.
Clik here to view.

My book is (at last) out!

December 21, 2024, 7:15 am

Just in time for Christmas, I received two days ago the first hard copies of my book! It is a mix of feelings of relief and pride after 3 years of work. As most book writers will probably acknowledge,...

View Article