Zhang et al., 2023 - Google Patents

Conformal off-policy prediction

Document ID: 17145093134072104769
Author: Zhang Y; Shi C; Luo S
Publication year: 2023
Publication venue: International Conference on Artificial Intelligence and Statistics

Snippet

Off-policy evaluation is critical in a number of applications where new policies need to be evaluated offline before online deployment. Most existing methods focus on the expected return, define the target parameter through averaging and provide a point estimator only. In …

Continue reading at proceedings.mlr.press (PDF)

Classifications

- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
- G06N99/005—Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
- G06N5/02—Knowledge representation
- G06N5/022—Knowledge engineering, knowledge acquisition
- G06N5/025—Extracting rules from data
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N7/00—Computer systems based on specific mathematical models
- G06N7/005—Probabilistic networks
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6267—Classification techniques
- G06K9/6279—Classification techniques relating to the number of classes
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F19/00—Digital computing or data processing equipment or methods, specially adapted for specific applications
- G06F19/10—Bioinformatics, i.e. methods or systems for genetic or protein-related data processing in computational molecular biology
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6267—Classification techniques
- G06K9/6268—Classification techniques relating to the classification paradigm, e.g. parametric or non-parametric approaches
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
- G06N5/04—Inference methods or devices
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/12—Computer systems based on biological models using genetic models
- G06N3/126—Genetic algorithms, i.e. information processing using digital simulations of the genetic system
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6217—Design or setup of recognition systems and techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
- G06K9/6256—Obtaining sets of training patterns; Bootstrap methods, e.g. bagging, boosting
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6296—Graphical models, e.g. Bayesian networks

Similar Documents

Publication	Publication Date	Title
Ignatiadis et al.	2021	Covariate powered cross-weighted multiple testing
Zhang et al.	2023	Conformal off-policy prediction
Cauchois et al.	2024	Robust validation: Confident predictions even when distributions shift
Kallus et al.	2021	Causal inference under unmeasured confounding with negative controls: A minimax learning approach
Zhao et al.	2020	Individual calibration with randomized forecasting
US10936949B2 (en)	2021-03-02	Training machine learning models using task selection policies to increase learning progress
Wu et al.	2007	Robust truncated hinge loss support vector machines
US20180357566A1 (en)	2018-12-13	Unsupervised learning utilizing sequential output statistics
Feldman et al.	2017	Generalization for adaptively-chosen estimators via stable median
US11637858B2 (en)	2023-04-25	Detecting malware with deep generative models
Nguyen-Tang et al.	2021	Offline neural contextual bandits: Pessimism, optimization and generalization
Bouneffouf	2020	Online learning with corrupted context: Corrupted contextual bandits
US8626676B2 (en)	2014-01-07	Regularized dual averaging method for stochastic and online learning
Bomarito et al.	2023	Automated learning of interpretable models with quantified uncertainty
Kiyani et al.	2024	Conformal prediction with learned features
Wu et al.	2022	Mini-batch Metropolis–Hastings with reversible SGLD proposal
Ai et al.	2024	Not all distributional shifts are equal: Fine-grained robust conformal inference
Collier et al.	2023	Estimating propensity scores using neural networks and traditional methods: a comparative simulation study
Taheri et al.	2022	Balancing statistical and computational precision: a general theory and applications to sparse regression
Kong et al.	2023	Covariate balancing using the integral probability metric for causal inference
Hamman et al.	2024	Quantifying prediction consistency under model multiplicity in tabular LLMs
Dudoit et al.	2003	Asymptotics of cross-validated risk estimation in model selection and performance assessment
Lee et al.	2024	General frameworks for conditional two-sample testing
Guo et al.	2022	Robustness against weak or invalid instruments: Exploring nonlinear treatment models with machine learning
Liu et al.	2022	Black-box selective inference via bootstrapping