AutoML | Literature on Neural Architecture Search

1617.

Huang, Shenyang; Francois-Lavet, Vincent; Rabusseau, Guillaume

Understanding Capacity Saturation in Incremental Learning Proceedings Article

In: The 34th Canadian Conference on Artificial Intelligence, 2021.

Links | BibTeX

1616.

Chen, Wuyang; Gong, Xinyu; Wang, Zhangyang

Neural Architecture Search on ImageNet in Four GPU Hours: A Theoretically Inspired Perspective Proceedings Article

In: ICLR 2021, 2021.

Links | BibTeX

1615.

Yang, Yuxuan; Gao, Zhongke; Li, Yanli; Wang, He

A CNN identified by reinforcement learning-based optimization framework for EEG-based state evaluation Journal Article

In: Journal of Neural Engineering, vol. 18, no. 4, pp. 046059, 2021.

Abstract | Links | BibTeX

@article{Yang_2021,

title = {A CNN identified by reinforcement learning-based optimization framework for EEG-based state evaluation},

author = {Yuxuan Yang and Zhongke Gao and Yanli Li and He Wang},

url = {https://doi.org/10.1088/1741-2552/abfa71},

doi = {10.1088/1741-2552/abfa71},

year  = {2021},

date = {2021-05-01},

journal = {Journal of Neural Engineering},

volume = {18},

number = {4},

pages = {046059},

publisher = {IOP Publishing},

abstract = {Objective. Electroencephalogram (EEG) data, as a kind of complex time-series, is one of the most widely-used information measurements for evaluating human psychophysiological states. Recently, numerous works applied deep learning techniques, especially the convolutional neural network (CNN), into EEG-based research. The design of the hyper-parameters of the CNN model has a great influence on the performance of the model. Therefore, automatically designing these hyper-parameters can save the time and labor of experts. This leads to the appearance of the neural architecture search technique. In this paper, we propose a reinforcement learning (RL)-based step-by-step framework to efficiently search for CNN models. Approach. Specifically, the deep Q network in RL is first used to determine the depth of convolutional layers and the connection modes among layers. Then particle swarm optimization algorithm is used to fine-tune the number and size of convolution kernels. Through this step-by-step strategy, the search space can be narrowed in each step for saving the overall time cost. This framework is employed for both EEG-based sleep stage classification and driver drowsiness evaluation tasks. Main results. The results show that compared with state-of-the-art methods, the high-performance CNN models identified by the proposed optimization framework, can achieve high overall accuracy and better root mean squared error in the two tasks. Significance. Therefore, the proposed optimization framework has a great potential to provide high-performance results for other kinds of classification and prediction tasks. In this way, it can greatly save researchers’ time cost and promote broader applications of CNNs.},

keywords = {},

pubstate = {published},

tppubtype = {article}

}

Close

1614.

Schoenherr, Georg P.

The Nonlinearity Coefficient - A Practical Guide to Neural Architecture Design PhD Thesis

2021.

Links | BibTeX

1613.

Zhang, Yi; Liu, Yang; Liu, X. Shirley

Neural network architecture search with AMBER Journal Article

In: Nature Machine Intelligence, pp. 372-373, 2021.

Links | BibTeX

1612.

Yuan, Zhihang; Liu, Jingze; Li, Xingchen; Yan, Longhao; Chen, Haoxiang; Wu, Bingzhe; Yang, Yuchao; Sun, Guangyu

NAS4RRAM: neural network architecture search for inference on RRAM-based accelerators Journal Article

In: Science China Information Sciences, 2021.

Links | BibTeX

1611.

Perenda, Erma; Rajendran, Sreeraj; Bovet, Gerome; Pollin, Sofie; Zheleva, Mariya

Evolutionary Optimization of Residual Neural Network Architectures for Modulation Classification Miscellaneous

2021.

Links | BibTeX

1610.

Wang, Xiaobo

Teacher Guided Neural Architecture Search for Face Recognition Journal Article

In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 35, no. 4, pp. 2817-2825, 2021.

Links | BibTeX

1609.

Anwar, Abrar

Evolving Spiking Circuit Motifs Using Weight Agnostic Neural Networks Journal Article

In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 35, no. 18, pp. 15956-15957, 2021.

Links | BibTeX

1608.

Pan, Zheyi; Ke, Songyu; Yang, Xiaodu; Liang, Yuxuan; Yu, Yong; Zhang, Junbo; Zheng, Yu

AutoSTG: Neural Architecture Search forPredictions of Spatio-Temporal Graphs Proceedings Article

In: WWW 2021, 2021.

Links | BibTeX

1607.

Kyriakides, George; Margaritis, Konstantinos

Evolving graph convolutional networks for neural architecture search Journal Article

In: Neural Computing and Applications, 2021.

Abstract | Links | BibTeX

1606.

Harikrishnan, V. K.; Gambhir, MeenuAshima

Neural AutoML with Convolutional Networks for Diabetic Retinopathy Diagnosis Journal Article

In: Machine Intelligence and Smart Systems, pp. 145-157, 2021.

Links | BibTeX

1605.

Wang, Linnan; Xie, Saining; Li, Teng; Fonseca, Rodrigo; Tian, Yuandong

Sample-Efficient Neural Architecture Search by Learning Actions for Monte Carlo Tree Search Journal Article

In: IEEE transactions on pattern analysis and machine intelligence, vol. PP, 2021, ISSN: 0162-8828.

Abstract | Links | BibTeX

1604.

Zhang, Zhentong; Shan, Yugang; Yuan, Jie

Multi-Level Cell Progressive Differentiable Architecture Search to Improve Image Classification Accuracy Journal Article

In: Journal of Signal Processing Systems, 2021.

Abstract | Links | BibTeX

1603.

Zheng, X; Ji, R; Chen, Y; Wang, Q; Zhang, B; Ye, Q; Chen, J; Huang, F; Tian, Y

MIGO-NAS: Towards Fast and Generalizable Neural Architecture Search Journal Article

In: IEEE Transactions on Pattern Analysis & Machine Intelligence, no. 01, pp. 1-1, 2021, ISSN: 1939-3539.

Links | BibTeX

1602.

Zimmer, Lucas; Lindauer, Marius; Hutter, Frank

Auto-Pytorch: Multi-Fidelity MetaLearning for Efficient and Robust AutoDL Journal Article

In: IEEE transactions on pattern analysis and machine intelligence, vol. PP, 2021, ISSN: 0162-8828.

Abstract | Links | BibTeX

1601.

Liu, Lanlan; Zhang, Yuting; Deng, Jia; Soatto, Stefano

Dynamically Grown Generative Adversarial Networks Proceedings Article

In: AAAI 2021, 2021.

Links | BibTeX

1600.

Xu, Y; Xie, L; Dai, W; Zhang, X; Chen, X; Qi, G; Xiong, H; Tian, Q

Partially-Connected Neural Architecture Search for Reduced Computational Redundancy Journal Article

In: IEEE Transactions on Pattern Analysis & Machine Intelligence, no. 01, pp. 1-1, 2021, ISSN: 1939-3539.

Abstract | Links | BibTeX

@article{9354953,

title = {Partially-Connected Neural Architecture Search for Reduced Computational Redundancy},

author = {Y Xu and L Xie and W Dai and X Zhang and X Chen and G Qi and H Xiong and Q Tian},

url = {https://www.computer.org/csdl/journal/tp/5555/01/09354953/1rgCccYlOaQ},

doi = {10.1109/TPAMI.2021.3059510},

issn = {1939-3539},

year  = {2021},

date = {2021-02-01},

journal = {IEEE Transactions on Pattern Analysis & Machine Intelligence},

number = {01},

pages = {1-1},

publisher = {IEEE Computer Society},

address = {Los Alamitos, CA, USA},

abstract = {Differentiable architecture search (DARTS) enables effective neural architecture search (NAS) using gradient descent, but suffers from high memory and computational costs. In this paper, we propose a novel approach, namely Partially-Connected DARTS (PC-DARTS), to achieve efficient and stable neural architecture search by reducing the channel and spatial redundancies of the super-network. In the channel level, partial channel connection is presented to randomly sample a small subset of channels for operation selection to accelerate the search process and suppress the over-fitting of the super-network. Side operation is introduced for bypassing (non-sampled) channels to guarantee the performance of searched architectures under extremely low sampling rates. In the spatial level, input features are down-sampled to eliminate spatial redundancy and enhance the efficiency of the mixed computation for operation selection. Furthermore, edge normalization is developed to maintain the consistency of edge selection based on channel sampling with the architectural parameters for edges. Experimental results demonstrate that the proposed approach achieves higher search speed and training stability than DARTS. PC-DARTS obtains a top-1 error rate of 2.55% on CIFAR-10 with 0.07 GPU-days for architecture search, and a state-of-the-art top-1 error rate of 24.1% on ImageNet (under the mobile setting) within 2.8 GPU-day.},

keywords = {},

pubstate = {published},

tppubtype = {article}

}

Close

1599.

Hao, Jie; Zhu, William

Architecture self-attention mechanism: nonlinear optimization for neural architecture search Journal Article

In: Journal of Nonlinear and Variational Analysis, vol. 5, pp. 119-140, 2021.

Links | BibTeX

1598.

Gao, Yanjie; Gu, Xianyu; Zhang, Hongyu; Lin, Haoxiang; Yang, Mao

Runtime Performance Prediction for Deep Learning Models with Graph Neural Network Technical Report

Microsoft no. MSR-TR-2021-3, 2021.

Abstract | Links | BibTeX

@techreport{gao2021runtime,

title = {Runtime Performance Prediction for Deep Learning Models with Graph Neural Network},

author = {Yanjie Gao and Xianyu Gu and Hongyu Zhang and Haoxiang Lin and Mao Yang},

url = {https://www.microsoft.com/en-us/research/publication/runtime-performance-prediction-for-deep-learning-models-with-graph-neural-network/},

year  = {2021},

date = {2021-02-01},

urldate = {2021-02-01},

number = {MSR-TR-2021-3},

institution = {Microsoft},

abstract = {Recently, deep learning (DL) has been widely adopted in many application domains. Predicting the runtime performance of DL models such as GPU memory consumption and training time is important to boost development productivity and reduce resource waste because improper configurations of hyperparameters and neural architectures can result in many failed training jobs or inappropriate models. However, general runtime performance prediction for DL models is challenging due to the hybrid DL programming paradigm, complicated hidden factors within the framework runtime, fairly huge model configuration space, and wide differences among models. In this paper, we propose DNNPerf, a novel and general machine learning approach to predict the runtime performance of DL models using Graph Neural Network. DNNPerf represents a DL model as a directed acyclic computation graph and designs a rich set of effective performance-related features based on the computational semantics of both nodes and edges. We also propose a new Attention-based Node-Edge Encoder to better encode the node and edge features. DNNPerf is extensively evaluated on thousands of configurations of real-world and synthetic DL models to predict their GPU memory consumption and training time. The experimental results demonstrate that DNNPerf achieves an overall error of 13.684% for the GPU memory consumption prediction and an overall error of 7.443% for the training time prediction, outperforming all the compared methods.},

keywords = {},

pubstate = {published},

tppubtype = {techreport}

}

Close

1597.

Pham, Hieu; Le, Quoc V

AutoDropout - Learning Dropout Patterns to Regularize Deep Networks Technical Report

2021.

Links | BibTeX

1596.

Huang, Yufang; Axsom, Kelly M; Lee, John; Subramanian, Lakshminarayanan; Zhang, Yiye

DICE: Deep Significance Clustering for Outcome-Aware Stratification Technical Report

2021.

Links | BibTeX

1595.

Ma, Ailong; Wan, Yuting; Zhong, Yanfei; Wang, Junjue; Zhang, Liangpei

SceneNet: Remote sensing scene classification deep learning network using multi-objective neural evolution architecture search Technical Report

2021, ISSN: 0924-2716.

Links | BibTeX

1594.

Yang, Hansi; Yao, Quanming; Kwok, James T

Tensorizing Subgraph Search in the Supernet Technical Report

2021.

Links | BibTeX

1593.

Syed, Muhtadyuzzaman; Srinivasan, Arvind Akpuram

Generalized Latency Performance Estimation for Once-For-All Neural Architecture Search Technical Report

2021.

Links | BibTeX

1592.

Huang, Hanxun; Ma, Xingjun; Erfani, Sarah M; Bailey, James

Neural Architecture Search via Combinatorial Multi-Armed Bandit Technical Report

2021.

Links | BibTeX

1591.

Tang, Tianqi; Yu, Xin; Dong, Xuanyi; Yang, Yi

Auto-Navigator: Decoupled Neural Architecture Search for Visual Navigation Technical Report

2021.

Links | BibTeX

1590.

Zheng, Shuai; Wang, Yabin; Li, Baotong; Li, Xin

A Hardware-adaptive Deep Feature Matching Pipeline for Real-time 3D Reconstruction Journal Article

In: vol. 132, pp. 102984, 2021.

Links | BibTeX

1589.

Hosseini, Ramtin; Yang, Xingyi; Xie, Pengtao

DSRNA - Differentiable Search of Robust Neural Architectures Proceedings Article

In: CVPR 2021, 2021.

Links | BibTeX

1588.

Ruchte, Michael; Zela, Arber; Siems, Julien Niklas; Grabocka, Josif; Hutter, Frank

NASLib: A Modular and Flexible Neural Architecture Search Library Technical Report

2021.

Links | BibTeX

1587.

Shaikh, Azhar; Sinha, Nishant

Learn to Bind and Grow Neural Structures Proceedings Article

In: pp. 119-126, 2021.

Links | BibTeX

1586.

Lu, Hao; Han, Hu

NAS-HR: search of neural architecture for heart-rate estimation from face videos Technical Report

2021.

Links | BibTeX

1585.

Peng, Daiyi; Dong, Xuanyi; Real, Esteban; Tan, Mingxing; Lu, Yifeng; Liu, Hanxiao; Bender, Gabriel; Kraft, Adam; Liang, Chen; Le, Quoc V

PyGlove - Symbolic Programming for Automated Machine Learning Technical Report

2021.

Links | BibTeX

1584.

Liu, Jiaheng; Zhou, Shunfeng; Wu, Yichao; Chen, Ken; Ouyang, Wanli; Xu, Dong

Block Proposal Neural Architecture Search Journal Article

In: vol. 30, pp. 15-25, 2021.

Links | BibTeX

1583.

He, Xin; Wang, Shihao; Ying, Guohao; Zhang, Jiyong; Chu, Xiaowen

Efficient Multi-objective Evolutionary 3D Neural Architecture Search for COVID-19 Detection with Chest CT Scans Technical Report

2021.

Links | BibTeX

1582.

Chu, Xiangxiang; Wang, Xiaoxing; Zhang, Bo; Lu, Shun; Wei, Xiaolin; Yan, Junchi

DARTS-: Robustly Stepping out of Performance Collapse Without Indicators Technical Report

2021.

Links | BibTeX

1581.

Liang, Xinle; Liu, Yang; Luo, Jiahuan; He, Yuanqin; Chen, Tianjian; Yang, Qiang

Self-supervised Cross-silo Federated Neural Architecture Search Technical Report

2021.

Links | BibTeX

1580.

Mittal, Govind; Korus, Pawel; Memon, Nasir D

FiFTy - Large-Scale File Fragment Type Identification Using Convolutional Neural Networks Journal Article

In: vol. 16, pp. 28-41, 2021.

Links | BibTeX

1579.

Yang, Zhao; Zhang, Shengbing; Li, Ruxu; Li, Chuxi; Wang, Miao; Wang, Danghui; Zhang, Meng

Efficient Resource-Aware Convolutional Neural Architecture Search for Edge Computing with Pareto-Bayesian Optimization Journal Article

In: vol. 21, no. 2, pp. 444, 2021.

Links | BibTeX

1578.

Wei, Tao; Wang, Changhu; Chen, Chang Wen

Modularized Morphing of Deep Convolutional Neural Networks - A Graph Approach Journal Article

In: vol. 70, no. 2, pp. 305-315, 2021.

Links | BibTeX

1577.

Song, Xingyou; Choromanski, Krzysztof; Parker-Holder, Jack; Tang, Yunhao; Peng, Daiyi; Jain, Deepali; Gao, Wenbo; Pacchiano, Aldo; Sarlós, Tamás; Yang, Yuxiang

ES-ENAS - Combining Evolution Strategies with Neural Architecture Search at No Extra Cost for Reinforcement Learning Technical Report

2021.

Links | BibTeX

1576.

Rorabaugh, Ariel Keller; -, Silvina Caíno; II, Michael Wyatt R; Johnston, Travis; Taufer, Michela

PEng4NN: An Accurate Performance Estimation Engine for Efficient Automated Neural Network Architecture Search Technical Report

2021.

Links | BibTeX

1575.

Zhang, Haokui; Gong, Chengrong; Bai, Yunpeng; Bai, Zongwen; Li, Ying

3D-ANAS: 3D Asymmetric Neural Architecture Search for Fast Hyperspectral Image Classification Miscellaneous

2021.

Links | BibTeX

1574.

Gu, Hongyang; Fu, Guangyuan; Li, Jianmin; Zhu, Jun

Auto-ReID+: Searching for a multi-branch ConvNet for person re-identification Journal Article

In: Neurocomputing, vol. 435, pp. 53-66, 2021, ISSN: 0925-2312.

Abstract | Links | BibTeX

1573.

He, Xin; Wang, Shihao; Chu, Xiaowen; Shi, Shaohuai; Tang, Jiangping; Liu, Xin; Yan, Chenggang; Zhang, Jiyong; Ding, Guiguang

Automated Model Design and Benchmarking of 3D Deep Learning Models for COVID-19 Detection with Chest CT Scans Technical Report

2021.

Links | BibTeX

1572.

Zhou, Benjia; Li, Yunan; Wan, Jun

Regional Attention with Architecture-Rebuilt 3D Network for RGB-D Gesture Recognition Technical Report

2021.

Links | BibTeX

1571.

Yang, Zhao; Zhang, Shengbing; Li, Ruxu; Li, Chuxi; Wang, Miao; Wang, Danghui; Zhang, Meng

Efficient Resource-Aware Convolutional Neural Architecture Search for Edge Computing with Pareto-Bayesian Optimization Journal Article

In: Sensors, vol. 21, no. 2, 2021, ISSN: 1424-8220.

Abstract | Links | BibTeX

@article{s21020444,

title = {Efficient Resource-Aware Convolutional Neural Architecture Search for Edge Computing with Pareto-Bayesian Optimization},

author = {Zhao Yang and Shengbing Zhang and Ruxu Li and Chuxi Li and Miao Wang and Danghui Wang and Meng Zhang},

url = {https://www.mdpi.com/1424-8220/21/2/444},

doi = {10.3390/s21020444},

issn = {1424-8220},

year  = {2021},

date = {2021-01-01},

journal = {Sensors},

volume = {21},

number = {2},

abstract = {With the development of deep learning technologies and edge computing, the combination of them can make artificial intelligence ubiquitous. Due to the constrained computation resources of the edge device, the research in the field of on-device deep learning not only focuses on the model accuracy but also on the model efficiency, for example, inference latency. There are many attempts to optimize the existing deep learning models for the purpose of deploying them on the edge devices that meet specific application requirements while maintaining high accuracy. Such work not only requires professional knowledge but also needs a lot of experiments, which limits the customization of neural networks for varied devices and application scenarios. In order to reduce the human intervention in designing and optimizing the neural network structure, multi-objective neural architecture search methods that can automatically search for neural networks featured with high accuracy and can satisfy certain hardware performance requirements are proposed. However, the current methods commonly set accuracy and inference latency as the performance indicator during the search process, and sample numerous network structures to obtain the required neural network. Lacking regulation to the search direction with the search objectives will generate a large number of useless networks during the search process, which influences the search efficiency to a great extent. Therefore, in this paper, an efficient resource-aware search method is proposed. Firstly, the network inference consumption profiling model for any specific device is established, and it can help us directly obtain the resource consumption of each operation in the network structure and the inference latency of the entire sampled network. Next, on the basis of the Bayesian search, a resource-aware Pareto Bayesian search is proposed. Accuracy and inference latency are set as the constraints to regulate the search direction. With a clearer search direction, the overall search efficiency will be improved. Furthermore, cell-based structure and lightweight operation are applied to optimize the search space for further enhancing the search efficiency. The experimental results demonstrate that with our method, the inference latency of the searched network structure reduced 94.71% without scarifying the accuracy. At the same time, the search efficiency increased by 18.18%.},

keywords = {},

pubstate = {published},

tppubtype = {article}

}

Close

With the development of deep learning technologies and edge computing, the combination of them can make artificial intelligence ubiquitous. Due to the constrained computation resources of the edge device, the research in the field of on-device deep learning not only focuses on the model accuracy but also on the model efficiency, for example, inference latency. There are many attempts to optimize the existing deep learning models for the purpose of deploying them on the edge devices that meet specific application requirements while maintaining high accuracy. Such work not only requires professional knowledge but also needs a lot of experiments, which limits the customization of neural networks for varied devices and application scenarios. In order to reduce the human intervention in designing and optimizing the neural network structure, multi-objective neural architecture search methods that can automatically search for neural networks featured with high accuracy and can satisfy certain hardware performance requirements are proposed. However, the current methods commonly set accuracy and inference latency as the performance indicator during the search process, and sample numerous network structures to obtain the required neural network. Lacking regulation to the search direction with the search objectives will generate a large number of useless networks during the search process, which influences the search efficiency to a great extent. Therefore, in this paper, an efficient resource-aware search method is proposed. Firstly, the network inference consumption profiling model for any specific device is established, and it can help us directly obtain the resource consumption of each operation in the network structure and the inference latency of the entire sampled network. Next, on the basis of the Bayesian search, a resource-aware Pareto Bayesian search is proposed. Accuracy and inference latency are set as the constraints to regulate the search direction. With a clearer search direction, the overall search efficiency will be improved. Furthermore, cell-based structure and lightweight operation are applied to optimize the search space for further enhancing the search efficiency. The experimental results demonstrate that with our method, the inference latency of the searched network structure reduced 94.71% without scarifying the accuracy. At the same time, the search efficiency increased by 18.18%.

Close

1570.

Weng, Yu; Chen, Zehua; Zhou, Tianbao

Improved differentiable neural architecture search for single image super-resolution Journal Article

In: Peer-to-Peer Networking and Applications, 2021.

Abstract | Links | BibTeX