Computer Science > Machine Learning

arXiv:2305.18030 (cs)

[Submitted on 25 May 2023 (v1), last revised 5 Oct 2023 (this version, v3)]

Title: Automated Search-Space Generation Neural Architecture Search

Authors: Tianyi Chen, Luming Liang, Tianyu Ding, Ilya Zharkov

Abstract: To search for an optimal sub-network within a general deep neural network (DNN), existing neural architecture search (NAS) methods typically rely on handcrafting a search space beforehand. This requirement makes it challenging to extend them to general scenarios without significant human expertise and manual intervention. To overcome these limitations, we propose Automated Search-Space Generation Neural Architecture Search (ASGNAS), perhaps the first automated system to train general DNNs that cover all candidate connections and operations and produce high-performing sub-networks in a one-shot manner. Technically, ASGNAS delivers three notable contributions to minimize human effort: (i) automated search space generation for general DNNs; (ii) a Hierarchical Half-Space Projected Gradient (H2SPG) method that leverages the hierarchy and dependency within the generated search space to ensure network validity during optimization, and reliably produces a solution with both high performance and hierarchical group sparsity; and (iii) automated sub-network construction upon the H2SPG solution. Numerically, we demonstrate the effectiveness of ASGNAS on a variety of general DNNs, including RegNet, StackedUnets, SuperResNet, and DARTS, over benchmark datasets such as CIFAR10, Fashion-MNIST, ImageNet, STL-10, and SVHN. The sub-networks computed by ASGNAS achieve competitive or even superior performance compared to the starting full DNNs and other state-of-the-art methods. The library will be released at this https URL.

Comments:	Graph visualizations for DARTS and SuperResNet are omitted from the arXiv version because they exceed the page dimension limit. Please refer to the OpenReview version for the visualizations.
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2305.18030 [cs.LG]
	(or arXiv:2305.18030v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2305.18030

Submission history

From: Tianyi Chen
[v1] Thu, 25 May 2023 19:41:40 UTC (3,022 KB)
[v2] Tue, 3 Oct 2023 06:11:48 UTC (3,099 KB)
[v3] Thu, 5 Oct 2023 22:41:01 UTC (3,099 KB)
