
Documents Gobet, Emmanuel (31 results)


An introduction to BSDE - Imkeller, Peter (Speaker) | CIRM H

Multi angle

Backward stochastic differential equations have been a very successful and active tool for stochastic finance and insurance for some decades. More generally, they serve as a central method in applications of control theory in many areas. We introduce BSDE by looking at a simple utility optimization problem in financial stochastics. We shall derive an important class of BSDE by applying the martingale optimality principle to solve an optimal investment problem for a financial agent whose income is partly affected by market-external risk. We then present the basics of existence and uniqueness theory for solutions of BSDE whose coefficients satisfy global Lipschitz conditions.
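
In the notation standard to this literature (and not fixed by the abstract itself), a BSDE with globally Lipschitz driver reads

$Y_t = \xi + \int_{t}^{T} f(s, Y_s, Z_s)\,ds - \int_{t}^{T} Z_s\,dW_s, \quad 0 \le t \le T,$

where $\xi$ is the terminal condition, $f$ is the driver (Lipschitz in $(y,z)$ in the setting above), $W$ is a Brownian motion, and the unknown is the pair of adapted processes $(Y, Z)$.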

91B24 ; 60H15 ; 60H10 ; 91G80

We consider competitive capacity investment for a duopoly of two distinct producers. The producers are exposed to stochastically fluctuating costs and interact through aggregate supply. Capacity expansion is irreversible and modeled in terms of timing strategies characterized through threshold rules. Because the impact of changing costs on the producers is asymmetric, we are led to a nonzero-sum timing game describing the transitions among the discrete investment stages. Working in a continuous-time diffusion framework, we characterize and analyze the resulting Nash equilibrium and game values. Our analysis quantifies the dynamic competition effects and yields insight into dynamic preemption and over-investment in a general asymmetric setting. A case study considering the impact of fluctuating emission costs on power producers investing in nuclear and coal-fired plants is also presented.
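
To fix ideas on the threshold rules mentioned above (the notation here is illustrative, not taken from the abstract), producer $i$'s expansion time can be described as a first passage time of the cost diffusion $X$, namely $\tau_i = \inf\lbrace t \ge 0 : X_t \le x_i^* \rbrace$, so that the timing game reduces to the choice of the thresholds $x_i^*$.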

93E20 ; 91B38 ; 91A80


Cubature methods and applications - Crisan, Dan (Speaker) | CIRM H

Multi angle

The talk will have two parts. In the first part, I will go over some of the basic features of cubature methods for approximating solutions of classical SDEs and how they can be adapted to solve Backward SDEs. In the second part, I will introduce some recent results on the use of cubature methods for approximating solutions of McKean-Vlasov SDEs.

65C30 ; 60H10 ; 34F05 ; 60H35 ; 91G60


On the interplay between kinetic theory and game theory - Degond, Pierre (Speaker) | CIRM H

Multi angle

We propose a mean field kinetic model for systems of rational agents interacting in a game-theoretical framework. This model is inspired by non-cooperative anonymous games with a continuum of players and by Mean-Field Games. The large time behavior of the system is given by a macroscopic closure with a Nash equilibrium serving as the local thermodynamic equilibrium. Applications of the presented theory to social and economic models will be given.

91B80 ; 35Q82 ; 35Q91

Optimal vector quantization was originally introduced in signal processing as a discretization method for random signals, leading to an optimal trade-off between the speed of transmission and the quality of the transmitted signal. In machine learning, similar methods applied to a dataset are the historical core of the unsupervised classification methods known as "clustering". In both cases it appears as an optimal way to produce a set of weighted prototypes (or codebook) which makes up a kind of skeleton of a dataset, of a signal and, more generally, from a mathematical point of view, of a probability distribution.
Quantization has encountered in recent years a renewed interest in various application fields like automatic classification, learning algorithms, optimal stopping and stochastic control, Backward SDEs and, more generally, numerical probability. In all these applications, practical implementations of such clustering/quantization methods rely more or less on two procedures (and their countless variants): the Competitive Learning Vector Quantization (CLVQ), which appears as a stochastic gradient descent derived from the so-called distortion potential, and the (randomized) Lloyd's procedure (also known as the k-means algorithm, or nuées dynamiques), which is a fixed-point search procedure. Batch versions of these procedures can also be implemented when dealing with a dataset (or, more generally, a discrete distribution).
More formally, if $\mu$ is a probability distribution on a Euclidean space $\mathbb{R}^d$, the optimal quantization problem at level $N$ boils down to exhibiting an $N$-tuple $(x_{1}^{*}, \dots , x_{N}^{*})$, solution to

$\mathrm{argmin}_{(x_1,\dots,x_N)\in(\mathbb{R}^d)^N} \int_{\mathbb{R}^d} \min_{1\le i\le N} |x_i-\xi|^2 \,\mu(d\xi)$

and its distribution, i.e. the weights $(\mu(C(x_{i}^{*})))_{1\le i\le N}$, where $(C(x_{i}^{*}))_{1\le i\le N}$ is a (Borel) partition of $\mathbb{R}^d$ satisfying

$C(x_{i}^{*})\subset \lbrace\xi\in\mathbb{R}^d : |x_{i}^{*} -\xi|\le \min_{1\le j\le N} |x_{j}^{*}-\xi|\rbrace$.

To produce an unsupervised classification (or clustering) of a (large) dataset $(\xi_k)_{1\le k\le n}$, one considers its empirical measure

$\mu=\frac{1}{n}\sum_{k=1}^{n}\delta_{\xi_k}$

whereas in numerical probability $\mu = \mathcal{L}(X)$, where $X$ is an $\mathbb{R}^d$-valued simulatable random vector. In both situations, the CLVQ and Lloyd's procedures rely on massive sampling of the distribution $\mu$.
As for clustering, the classification into $N$ clusters is produced by the partition of the dataset induced by the Voronoi cells $C(x_{i}^{*}), i = 1, \dotsb, N$ of the optimal quantizer.
In this second case, which is of interest for solving nonlinear problems like optimal stopping problems (variational inequalities in terms of PDEs) or stochastic control problems (HJB equations) in medium dimensions, the idea is to produce a quantization tree optimally fitting the dynamics of (a time discretization of) the underlying structure process.
We will explore (briefly) this vast panorama with a focus on the algorithmic aspects, where few theoretical results coexist with many heuristics in a burgeoning literature. We will present a few simulations in two dimensions.
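
As a minimal sketch of the randomized Lloyd's procedure discussed above, here is a self-contained Python/NumPy implementation on a sample of $\mu$ (the function name and the toy example are illustrative, not from the talk): each iteration assigns every sample to its nearest codeword (its Voronoi cell) and then moves each codeword to the mean of its cell; the companion CLVQ procedure would instead update a single codeword per sample by a stochastic gradient step on the distortion.

import numpy as np

def lloyd(samples, N, n_iter=50, seed=0):
    # Randomized Lloyd's (k-means) fixed-point iteration on a sample of mu.
    rng = np.random.default_rng(seed)
    codebook = samples[rng.choice(len(samples), size=N, replace=False)].astype(float)
    for _ in range(n_iter):
        # nearest-codeword (Voronoi) assignment
        d2 = ((samples[:, None, :] - codebook[None, :, :]) ** 2).sum(axis=-1)
        cell = d2.argmin(axis=1)
        # fixed-point step: move each codeword to the mean of its cell
        for i in range(N):
            members = samples[cell == i]
            if len(members) > 0:
                codebook[i] = members.mean(axis=0)
    weights = np.bincount(cell, minlength=N) / len(samples)  # cell masses mu(C(x_i))
    return codebook, weights

# toy usage: quantize a 2-dimensional Gaussian sample at level N = 20
X = np.random.default_rng(1).normal(size=(10_000, 2))
codebook, weights = lloyd(X, N=20)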

62L20 ; 93E25 ; 94A12 ; 91G60 ; 65C05


Model-free control and deep learning - Bellemare, Marc (Speaker) | CIRM H

Multi angle

In this talk I will present some recent developments in model-free reinforcement learning applied to large state spaces, with an emphasis on deep learning and its role in estimating action-value functions. The talk will cover a variety of model-free algorithms, including variations on Q-Learning, and some of the main techniques that make the approach practical. I will illustrate the usefulness of these methods with examples drawn from the Arcade Learning Environment, the popular set of Atari 2600 benchmark domains.
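
As a small, hedged illustration of the Q-Learning family mentioned above (tabular version only, not the deep-network variant used on Atari; the function name and toy setup are illustrative), one update step in Python:

import numpy as np

def q_learning_update(Q, s, a, r, s_next, alpha=0.1, gamma=0.99):
    # Move the action-value estimate Q(s, a) toward the bootstrapped one-step target.
    target = r + gamma * np.max(Q[s_next])
    Q[s, a] += alpha * (target - Q[s, a])
    return Q

# toy usage on a 5-state, 2-action problem
Q = np.zeros((5, 2))
Q = q_learning_update(Q, s=0, a=1, r=1.0, s_next=2)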

68Q32 ; 91A25 ; 68T05

We will first recall, for a general audience, the use of Monte Carlo and Multilevel Monte Carlo methods in the context of Uncertainty Quantification. Then we will discuss the recently developed Adaptive Multilevel Monte Carlo (MLMC) Methods for (i) Itô Stochastic Differential Equations, (ii) Stochastic Reaction Networks modeled by Pure Jump Markov Processes and (iii) Partial Differential Equations with random inputs. In this context, the notion of adaptivity includes several aspects such as mesh refinements based on either a priori or a posteriori error estimates, the local choice of different time stepping methods and the selection of the total number of levels and the number of samples at different levels. Our Adaptive MLMC estimator uses a hierarchy of adaptively refined, non-uniform time discretizations, and, as such, it may be considered a generalization of the uniform discretization MLMC method introduced independently by M. Giles and S. Heinrich. In particular, we show that our adaptive MLMC algorithms are asymptotically accurate and have the correct complexity, with an improved control of the multiplicative constant factor in the asymptotic analysis. In this context, we developed novel techniques for the estimation of parameters needed in our MLMC algorithms, such as the variance of the difference between consecutive approximations. These techniques take particular care of the deepest levels, where for efficiency reasons only a few realizations are available to produce essential estimates. Moreover, we show the asymptotic normality of the statistical error in the MLMC estimator, justifying in this way our error estimate, which allows prescribing both the required accuracy and confidence level in the final result. We present several examples to illustrate the above results and the corresponding computational savings.
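
For orientation, here is a bare-bones Python sketch of the plain (uniform, non-adaptive) MLMC estimator that the adaptive method generalizes. The helper diff_sampler(level, M), which must return M samples of the correction Y_level - Y_{level-1} (and of Y_0 itself at level 0), is hypothetical and not part of the talk.

import numpy as np

def mlmc_estimate(diff_sampler, samples_per_level):
    # Telescoping multilevel Monte Carlo estimator: sum of sample means of level corrections.
    estimate = 0.0
    for level, M in enumerate(samples_per_level):
        corrections = diff_sampler(level, M)  # M samples of Y_level - Y_{level-1}
        estimate += np.mean(corrections)
    return estimate

# toy usage with a dummy sampler whose corrections shrink geometrically with the level
rng = np.random.default_rng(0)
dummy = lambda level, M: rng.normal(loc=2.0 ** (-level), scale=2.0 ** (-level), size=M)
print(mlmc_estimate(dummy, samples_per_level=[4000, 1000, 250]))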

65C30 ; 65C05 ; 60H15 ; 60H35 ; 35R60

We describe and analyze the Multi-Index Monte Carlo (MIMC) and the Multi-Index Stochastic Collocation (MISC) methods for computing statistics of the solution of a PDE with random data. MIMC is both a stochastic version of the combination technique introduced by Zenger, Griebel and collaborators and an extension of the Multilevel Monte Carlo (MLMC) method first described by Heinrich and Giles. Instead of using first-order differences as in MLMC, MIMC uses mixed differences to reduce the variance of the hierarchical differences dramatically. These mixed differences yield new and improved complexity results, which are natural generalizations of Giles's MLMC analysis, and which increase the domain of problem parameters for which we achieve the optimal convergence. In the same vein, MISC is a deterministic combination technique based on mixed differences of spatial approximations and quadratures over the space of random data. Provided enough mixed regularity, MISC can achieve better complexity than MIMC. Moreover, we show that, in the optimal case, the convergence rate of MISC is only dictated by the convergence of the deterministic solver applied to a one-dimensional spatial problem. We propose optimization procedures to select the most effective mixed differences to include in MIMC and MISC. Such optimization is a crucial step that allows us to make MIMC and MISC computationally efficient. We show the effectiveness of MIMC and MISC in some computational tests using the mimclib open source library, including PDEs with random coefficients and Stochastic Interacting Particle Systems. Finally, we will briefly discuss the use of Markovian projection for the approximation of prices in the context of American basket options.
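
To make the mixed differences concrete (in the notation commonly used for MIMC, not taken verbatim from the abstract): for a multi-index $\alpha = (\alpha_1, \dots, \alpha_d)$ of discretization parameters and first-order difference operators $\Delta_i F_\alpha = F_\alpha - F_{\alpha - e_i}$ (with $\Delta_i F_\alpha = F_\alpha$ when $\alpha_i = 0$), the mixed difference is $\Delta F_\alpha = \big(\prod_{i=1}^{d}\Delta_i\big) F_\alpha$, and the MIMC estimator sums independent Monte Carlo averages of these mixed differences over a suitably chosen index set $\mathcal{I}$, $\mathbb{E}[F] \approx \sum_{\alpha\in\mathcal{I}} \frac{1}{M_\alpha}\sum_{m=1}^{M_\alpha} \Delta F_\alpha^{(m)}$, replacing the one-dimensional telescoping sum of MLMC.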

65C30 ; 65C05 ; 60H15 ; 60H35 ; 35R60 ; 65M70


Bandits in auctions (& more) - Perchet, Vianney (Speaker) | CIRM H

Multi angle

In this talk, I will introduce the classical theory of multi-armed bandits, a field at the junction of statistics, optimization, game theory and machine learning, discuss possible applications, and highlight the new perspectives and open questions that they raise.
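
As a compact, self-contained illustration of the classical multi-armed bandit theory referred to above (the UCB1 index policy; names and the toy arms are illustrative and not tied to the talk), in Python:

import math, random

def ucb1(pull, n_arms, horizon):
    # Play the arm maximizing "empirical mean + exploration bonus".
    counts = [0] * n_arms
    sums = [0.0] * n_arms
    for t in range(1, horizon + 1):
        if t <= n_arms:
            arm = t - 1  # initialization: play each arm once
        else:
            arm = max(range(n_arms),
                      key=lambda a: sums[a] / counts[a]
                                    + math.sqrt(2.0 * math.log(t) / counts[a]))
        reward = pull(arm)
        counts[arm] += 1
        sums[arm] += reward
    return counts, sums

# toy usage: two Bernoulli arms with means 0.4 and 0.6
means = [0.4, 0.6]
counts, sums = ucb1(lambda a: float(random.random() < means[a]), n_arms=2, horizon=1000)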

62L05 ; 68T05 ; 91A26 ; 91A80 ; 91B26
