En poursuivant votre navigation sur ce site, vous acceptez l'utilisation d'un simple cookie d'identification. Aucune autre exploitation n'est faite de ce cookie. OK
1

Floodgate: inference for model-free variable importance

Bookmarks Report an error
Virtualconference
Authors : Janson, Lucas (Author of the conference)
CIRM (Publisher )

Loading the player...

Abstract : Many modern applications seek to understand the relationship between an outcome variable of interest and a high-dimensional set of covariates. Often the first question asked is which covariates are important in this relationship, but the immediate next question, which in fact subsumes the first, is \emph{how} important each covariate is in this relationship. In parametric regression this question is answered through confidence intervals on the parameters. But without making substantial assumptions about the relationship between the outcome and the covariates, it is unclear even how to \emph{measure} variable importance, and for most sensible choices even less clear how to provide inference for it under reasonable conditions. In this paper we propose \emph{floodgate}, a novel method to provide asymptotic inference for a scalar measure of variable importance which we argue has universal appeal, while assuming nothing but moment bounds about the relationship between the outcome and the covariates. We take a model-X approach and thus assume the covariate distribution is known, but extend floodgate to the setting that only a \emph{model} for the covariate distribution is known and also quantify its robustness to violations of the modeling assumptions. We demonstrate floodgate's performance through extensive simulations and apply it to data from the UK Biobank to quantify the effects of genetic mutations on traits of interest.

Keywords : Variable importance; effect size; model-X; heterogeneous treatment effects; heritability

MSC Codes :
62G15 - Tolerance and confidence regions
62G20 - Nonparametric asymptotic efficiency

Additional resources :
https://www.cirm-math.com/uploads/2/6/6/0/26605521/janson.pdf

    Information on the Video

    Film maker : Hennenfent, Guillaume
    Language : English
    Available date : 15/06/2020
    Conference Date : 05/06/2020
    Subseries : Research talks
    arXiv category : Statistics ; Methodology
    Mathematical Area(s) : Probability & Statistics
    Format : MP4 (.mp4) - HD
    Video Time : 00:48:44
    Targeted Audience : Researchers
    Download : https://videos.cirm-math.fr/2020-06-05_Janson.mp4

Information on the Event

Event Title : Mathematical Methods of Modern Statistics 2 / Méthodes mathématiques en statistiques modernes 2
Event Organizers : Bogdan, Malgorzata ; Graczyk, Piotr ; Panloup, Fabien ; Proïa, Frédéric ; Roquain, Etienne
Dates : 15/06/2020 - 19/06/2020
Event Year : 2020
Event URL : https://www.cirm-math.com/cirm-virtual-...

Citation Data

DOI : 10.24350/CIRM.V.19641303
Cite this video as: Janson, Lucas (2020). Floodgate: inference for model-free variable importance. CIRM. Audiovisual resource. doi:10.24350/CIRM.V.19641303
URI : http://dx.doi.org/10.24350/CIRM.V.19641303

See Also

Bibliography



Imagette Video

Bookmarks Report an error