Zhang et al., 2023 - Google Patents
Conformal off-policy predictionZhang et al., 2023
View PDF- Document ID
- 17145093134072104769
- Author
- Zhang Y
- Shi C
- Luo S
- Publication year
- Publication venue
- International Conference on Artificial Intelligence and Statistics
External Links
Snippet
Off-policy evaluation is critical in a number of applications where new policies need to be evaluated offline before online deployment. Most existing methods focus on the expected return, define the target parameter through averaging and provide a point estimator only. In …
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
- G06N99/005—Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
- G06N5/02—Knowledge representation
- G06N5/022—Knowledge engineering, knowledge acquisition
- G06N5/025—Extracting rules from data
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N7/00—Computer systems based on specific mathematical models
- G06N7/005—Probabilistic networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6267—Classification techniques
- G06K9/6279—Classification techniques relating to the number of classes
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F19/00—Digital computing or data processing equipment or methods, specially adapted for specific applications
- G06F19/10—Bioinformatics, i.e. methods or systems for genetic or protein-related data processing in computational molecular biology
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6267—Classification techniques
- G06K9/6268—Classification techniques relating to the classification paradigm, e.g. parametric or non-parametric approaches
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
- G06N5/04—Inference methods or devices
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/12—Computer systems based on biological models using genetic models
- G06N3/126—Genetic algorithms, i.e. information processing using digital simulations of the genetic system
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6217—Design or setup of recognition systems and techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
- G06K9/6256—Obtaining sets of training patterns; Bootstrap methods, e.g. bagging, boosting
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6296—Graphical models, e.g. Bayesian networks
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Ignatiadis et al. | Covariate powered cross-weighted multiple testing | |
Zhang et al. | Conformal off-policy prediction | |
Cauchois et al. | Robust validation: Confident predictions even when distributions shift | |
Kallus et al. | Causal inference under unmeasured confounding with negative controls: A minimax learning approach | |
Zhao et al. | Individual calibration with randomized forecasting | |
US10936949B2 (en) | Training machine learning models using task selection policies to increase learning progress | |
Wu et al. | Robust truncated hinge loss support vector machines | |
US20180357566A1 (en) | Unsupervised learning utilizing sequential output statistics | |
Feldman et al. | Generalization for adaptively-chosen estimators via stable median | |
US11637858B2 (en) | Detecting malware with deep generative models | |
Nguyen-Tang et al. | Offline neural contextual bandits: Pessimism, optimization and generalization | |
Bouneffouf | Online learning with corrupted context: Corrupted contextual bandits | |
US8626676B2 (en) | Regularized dual averaging method for stochastic and online learning | |
Bomarito et al. | Automated learning of interpretable models with quantified uncertainty | |
Kiyani et al. | Conformal prediction with learned features | |
Wu et al. | Mini-batch Metropolis–Hastings with reversible SGLD proposal | |
Ai et al. | Not all distributional shifts are equal: Fine-grained robust conformal inference | |
Collier et al. | Estimating propensity scores using neural networks and traditional methods: a comparative simulation study | |
Taheri et al. | Balancing statistical and computational precision: a general theory and applications to sparse regression | |
Kong et al. | Covariate balancing using the integral probability metric for causal inference | |
Hamman et al. | Quantifying prediction consistency under model multiplicity in tabular llms | |
Dudoit et al. | Asymptotics of cross-validated risk estimation in model selection and performance assessment | |
Lee et al. | General frameworks for conditional two-sample testing | |
Guo et al. | Robustness against weak or invalid instruments: Exploring nonlinear treatment models with machine learning | |
Liu et al. | Black-box selective inference via bootstrapping |