Zhang et al., 2025 - Google Patents
MAT-agent: Adaptive multi-agent training optimizationZhang et al., 2025
View PDF- Document ID
- 3164363499596363827
- Author
- Zhang J
- Cai K
- Fan Y
- Liu N
- Wang K
- Publication year
- Publication venue
- arXiv preprint arXiv:2510.17845
External Links
Snippet
Multi-label image classification demands adaptive training strategies to navigate complex, evolving visual-semantic landscapes, yet conventional methods rely on static configurations that falter in dynamic settings. We propose MAT-Agent, a novel multi-agent framework that …
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
- G06N99/005—Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
- G06N5/04—Inference methods or devices
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06Q—DATA PROCESSING SYSTEMS OR METHODS, SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q10/00—Administration; Management
- G06Q10/06—Resources, workflows, human or project management, e.g. organising, planning, scheduling or allocating time, human or machine resources; Enterprise planning; Organisational models
- G06Q10/063—Operations research or analysis
- G06Q10/0639—Performance analysis
- G06Q10/06393—Score-carding, benchmarking or key performance indicator [KPI] analysis
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/0265—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion
- G05B13/027—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion using neural networks only
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G06N3/08—Learning methods
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N7/00—Computer systems based on specific mathematical models
- G06N7/005—Probabilistic networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
- G06N5/02—Knowledge representation
- G06N5/022—Knowledge engineering, knowledge acquisition
- G06N5/025—Extracting rules from data
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/12—Computer systems based on biological models using genetic models
- G06N3/126—Genetic algorithms, i.e. information processing using digital simulations of the genetic system
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/04—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06Q—DATA PROCESSING SYSTEMS OR METHODS, SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q10/00—Administration; Management
- G06Q10/04—Forecasting or optimisation, e.g. linear programming, "travelling salesman problem" or "cutting stock problem"
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F15/00—Digital computers in general; Data processing equipment in general
- G06F15/18—Digital computers in general; Data processing equipment in general in which a programme is changed according to experience gained by the computer itself during a complete run; Learning machines
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| Zhang et al. | MAT-agent: Adaptive multi-agent training optimization | |
| Wang et al. | Adaptive Long-Short Pattern Transformer for Stock Investment Selection. | |
| Fonteneau et al. | Batch mode reinforcement learning based on the synthesis of artificial trajectories | |
| Weiss et al. | Learning adaptive value of information for structured prediction | |
| Liu et al. | Surrogate-assisted evolutionary algorithms for expensive combinatorial optimization: a survey | |
| Liu et al. | CorrDQN-FS: a two-stage feature selection method for energy consumption prediction via deep reinforcement learning | |
| Sedlak et al. | Active inference on the edge: A design study | |
| Huang et al. | Reinforcement learning-based Q-learning approach for optimizing data mining in dynamic environments | |
| Campbell et al. | Multiagent allocation of markov decision process tasks | |
| CN119378706A (en) | Method, electronic device, and program product for generating a machine learning model | |
| Hodashinsky et al. | Feature selection: Comparative analysis of binary metaheuristics and population based algorithm with adaptive memory | |
| Bossens | Robust lagrangian and adversarial policy gradient for robust constrained markov decision processes | |
| Yang et al. | Spotlight News Driven Quantitative Trading Based on Trajectory Optimization. | |
| Xia et al. | Solving time-delay issues in reinforcement learning via transformers: B. Xia et al. | |
| Bonnet et al. | One step at a time: Pros and cons of multi-step meta-gradient reinforcement learning | |
| Bagatella et al. | Active fine-tuning of generalist policies | |
| Sun et al. | An embedding-based deterministic policy gradient model for spatial crowdsourcing applications | |
| Zheng et al. | Variance reduction based partial trajectory reuse to accelerate policy gradient optimization | |
| Fatima Ezzahra et al. | Multi-objective reinforcement learning for recommender systems: a comprehensive survey of methods, challenges, and future directions: Z. Fatima Ezzahra et al. | |
| Van Moffaert et al. | Risk-sensitivity through multi-objective reinforcement learning | |
| Corrêa et al. | Unraveling the Rainbow: can value-based methods schedule? | |
| Chand et al. | Optimization of Selective Disassembly Sequence Planning for Waste Electrical and Electronic Equipment Using a Hybrid Dual‐Advantage Reinforcement Learning Approach | |
| Silva et al. | CurL-AutoML: Curriculum Learning-based AutoML | |
| Wang et al. | Optimizing Demand Forecasting: A Framework With Bayesian Optimization Embedded Reinforcement Learning for Combined Algorithm Selection and Hyperparameter Optimization | |
| Zhao | Brain-Inspired Planning for Better Generalization in Reinforcement Learning |