Zhang et al., 2025 - Google Patents

MAT-agent: Adaptive multi-agent training optimization

Document ID: 3164363499596363827
Author: Zhang J; Cai K; Fan Y; Liu N; Wang K
Publication year: 2025
Publication venue: arXiv preprint arXiv:2510.17845

Snippet

Multi-label image classification demands adaptive training strategies to navigate complex, evolving visual-semantic landscapes, yet conventional methods rely on static configurations that falter in dynamic settings. We propose MAT-Agent, a novel multi-agent framework that …

Classifications

- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
- G06N99/005—Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
- G06N5/04—Inference methods or devices
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06Q—DATA PROCESSING SYSTEMS OR METHODS, SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q10/00—Administration; Management
- G06Q10/06—Resources, workflows, human or project management, e.g. organising, planning, scheduling or allocating time, human or machine resources; Enterprise planning; Organisational models
- G06Q10/063—Operations research or analysis
- G06Q10/0639—Performance analysis
- G06Q10/06393—Score-carding, benchmarking or key performance indicator [KPI] analysis
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/0265—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion
- G05B13/027—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion using neural networks only
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G06N3/08—Learning methods
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N7/00—Computer systems based on specific mathematical models
- G06N7/005—Probabilistic networks
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
- G06N5/02—Knowledge representation
- G06N5/022—Knowledge engineering, knowledge acquisition
- G06N5/025—Extracting rules from data
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/12—Computer systems based on biological models using genetic models
- G06N3/126—Genetic algorithms, i.e. information processing using digital simulations of the genetic system
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/04—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06Q—DATA PROCESSING SYSTEMS OR METHODS, SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q10/00—Administration; Management
- G06Q10/04—Forecasting or optimisation, e.g. linear programming, "travelling salesman problem" or "cutting stock problem"
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F15/00—Digital computers in general; Data processing equipment in general
- G06F15/18—Digital computers in general; Data processing equipment in general in which a programme is changed according to experience gained by the computer itself during a complete run; Learning machines

Similar Documents

Publication	Publication Date	Title
Zhang et al.	2025	MAT-agent: Adaptive multi-agent training optimization
Wang et al.	2022	Adaptive Long-Short Pattern Transformer for Stock Investment Selection
Fonteneau et al.	2013	Batch mode reinforcement learning based on the synthesis of artificial trajectories
Weiss et al.	2013	Learning adaptive value of information for structured prediction
Liu et al.	2024	Surrogate-assisted evolutionary algorithms for expensive combinatorial optimization: a survey
Liu et al.	2023	CorrDQN-FS: a two-stage feature selection method for energy consumption prediction via deep reinforcement learning
Sedlak et al.	2024	Active inference on the edge: A design study
Huang et al.	2025	Reinforcement learning-based Q-learning approach for optimizing data mining in dynamic environments
Campbell et al.	2013	Multiagent allocation of markov decision process tasks
CN119378706A (en)	2025-01-28	Method, electronic device, and program product for generating a machine learning model
Hodashinsky et al.	2019	Feature selection: Comparative analysis of binary metaheuristics and population based algorithm with adaptive memory
Bossens	2024	Robust lagrangian and adversarial policy gradient for robust constrained markov decision processes
Yang et al.	2023	Spotlight News Driven Quantitative Trading Based on Trajectory Optimization
Xia et al.	2024	Solving time-delay issues in reinforcement learning via transformers
Bonnet et al.	2021	One step at a time: Pros and cons of multi-step meta-gradient reinforcement learning
Bagatella et al.	2024	Active fine-tuning of generalist policies
Sun et al.	2021	An embedding-based deterministic policy gradient model for spatial crowdsourcing applications
Zheng et al.	2022	Variance reduction based partial trajectory reuse to accelerate policy gradient optimization
Fatima Ezzahra et al.	2025	Multi-objective reinforcement learning for recommender systems: a comprehensive survey of methods, challenges, and future directions
Van Moffaert et al.	2015	Risk-sensitivity through multi-objective reinforcement learning
Corrêa et al.	2025	Unraveling the Rainbow: can value-based methods schedule?
Chand et al.	2026	Optimization of Selective Disassembly Sequence Planning for Waste Electrical and Electronic Equipment Using a Hybrid Dual‐Advantage Reinforcement Learning Approach
Silva et al.	2021	CurL-AutoML: Curriculum Learning-based AutoML
Wang et al.	2024	Optimizing Demand Forecasting: A Framework With Bayesian Optimization Embedded Reinforcement Learning for Combined Algorithm Selection and Hyperparameter Optimization
Zhao	2025	Brain-Inspired Planning for Better Generalization in Reinforcement Learning