Maintained by Difan Deng and Marius Lindauer.
The following list considers papers related to neural architecture search. It is by no means complete. If a paper is missing from the list, please let us know.
Please note that although NAS methods steadily improve, the quality of empirical evaluations in this field is still lagging behind that of other areas in machine learning, AI, and optimization. We would therefore like to share some best practices for empirical evaluations of NAS methods, which we believe will facilitate sustained and measurable progress in the field. If you are interested in a teaser, please read our blog post or jump directly to our checklist.
Transformers have gained increasing popularity in different domains. For a comprehensive list of papers focusing on Neural Architecture Search for Transformer-Based spaces, the awesome-transformer-search repo is all you need.
2023
Huai, Shuo
Enabling efficient edge intelligence: a hardware-software codesign approach PhD Thesis
2023.
@phdthesis{Huai-phd23a,
title = {Enabling efficient edge intelligence: a hardware-software codesign approach},
author = {Huai, Shuo},
url = {https://dr.ntu.edu.sg/handle/10356/172499},
year = {2023},
date = {2023-12-01},
urldate = {2023-12-01},
keywords = {},
pubstate = {published},
tppubtype = {phdthesis}
}
Wu, Shangyu; Xiong, Ying; Cui, Yufei; Liu, Xue; Tang, Buzhou; Kuo, Tei-Wei; Xue, Chun Jason
Improving Natural Language Understanding with Computation-Efficient Retrieval Representation Fusion Collection
2023.
@collection{Wu-enlsp23a,
title = {Improving Natural Language Understanding with Computation-Efficient Retrieval Representation Fusion},
author = {Shangyu Wu and Ying Xiong and Yufei Cui and Xue Liu and Buzhou Tang and Tei-Wei Kuo and Chun Jason Xue},
url = {https://neurips2023-enlsp.github.io/papers/paper_79.pdf},
year = {2023},
date = {2023-12-01},
urldate = {2023-12-01},
booktitle = {Third Workshop on Efficient Natural Language and Speech Processing (ENLSP-III), NeurIPS 2023},
keywords = {},
pubstate = {published},
tppubtype = {collection}
}
Park, Soohyun; Son, Seok Bin; Lee, Youn Kyu; Jung, Soyi; Kim, Joongheon
Two-stage architectural fine-tuning for neural architecture search in efficient transfer learning Journal Article
In: Electronics Letters, vol. 59, no. 24, 2023.
@article{Park-el23a,
title = {Two-stage architectural fine-tuning for neural architecture search in efficient transfer learning},
author = {Soohyun Park and Seok Bin Son and Youn Kyu Lee and Soyi Jung and Joongheon Kim},
url = {https://ietresearch.onlinelibrary.wiley.com/doi/pdfdirect/10.1049/ell2.13066},
year = {2023},
date = {2023-12-01},
urldate = {2023-12-01},
journal = {Electronics Letters},
volume = {59},
number = {24},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Saluky, Saluky; Nugraha, Gusti Baskara; Supangkat, Suhono Harso
Enhancing Abandoned Object Detection with Dual Background Models and Yolo-NAS Journal Article
In: International Journal of Intelligent Systems and Applications in Engineering, vol. 12, no. 2, pp. 547–554, 2023.
@article{Saluky_Nugraha_Supangkat_2023,
title = {Enhancing Abandoned Object Detection with Dual Background Models and Yolo-NAS},
author = {Saluky Saluky and Gusti Baskara Nugraha and Suhono Harso Supangkat},
url = {https://ijisae.org/index.php/IJISAE/article/view/4298},
year = {2023},
date = {2023-12-01},
urldate = {2023-12-01},
journal = {International Journal of Intelligent Systems and Applications in Engineering},
volume = {12},
number = {2},
pages = {547–554},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Bellodi, E.; Bertozzi, D.; Bizzarri, A.; Favalli, M.; Fraccaroli, M.; Zese, R.
Efficient Resource-Aware Neural Architecture Search with a Neuro-Symbolic Approach Proceedings Article
In: 2023 IEEE 16th International Symposium on Embedded Multicore/Many-core Systems-on-Chip (MCSoC), pp. 171-178, IEEE Computer Society, Los Alamitos, CA, USA, 2023.
@inproceedings{10387830,
title = {Efficient Resource-Aware Neural Architecture Search with a Neuro-Symbolic Approach},
author = {E. Bellodi and D. Bertozzi and A. Bizzarri and M. Favalli and M. Fraccaroli and R. Zese},
url = {https://doi.ieeecomputersociety.org/10.1109/MCSoC60832.2023.00034},
doi = {10.1109/MCSoC60832.2023.00034},
year = {2023},
date = {2023-12-01},
urldate = {2023-12-01},
booktitle = {2023 IEEE 16th International Symposium on Embedded Multicore/Many-core Systems-on-Chip (MCSoC)},
pages = {171-178},
publisher = {IEEE Computer Society},
address = {Los Alamitos, CA, USA},
abstract = {Hardware-aware Neural Architectural Search (NAS) is gaining momentum to enable the deployment of deep learning on edge devices with limited computing capabilities. Incorporating device-related objectives such as affordable floating point operations, latency, power, memory usage, etc. into the optimization process makes searching for the most efficient neural architecture more complicated, since both model accuracy and hardware cost should guide the search. The main concern with most state-of-the-art hardware-aware NAS strategies is that they propose for evaluation also trivially infeasible network models for the capabilities of the hardware platform at hand. Moreover, previously generated models are frequently not exploited to intelligently generate new ones, leading to prohibitive computational costs for practical relevance. This paper aims to boost the computational efficiency of hardware-aware NAS by means of a neuro-symbolic framework revolving around a Probabilistic Inductive Logic Programming module to define and exploit a set of symbolic rules. This component learns and refines the probabilities associated with the rules, allowing the framework to adapt and improve over time, thus quickly narrowing down the search space toward the most promising neural architectures.},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
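The abstract above describes learning and refining probabilities attached to symbolic rules so that the search quickly narrows toward promising architectures. A minimal sketch of that idea, assuming a simple exponential-moving-average update and invented rule names (the paper's actual probabilistic inductive logic programming component is far richer):

```python
def update_rule_probs(rule_probs, satisfied, reward, lr=0.2):
    """Move each satisfied rule's probability toward the observed reward;
    rules the evaluated architecture did not satisfy are left unchanged."""
    return {
        rule: p + lr * (reward - p) if rule in satisfied else p
        for rule, p in rule_probs.items()
    }

# Toy run: architectures satisfying "early_depthwise" keep scoring well,
# so a sampler guided by these probabilities would increasingly favour it.
probs = {"early_depthwise": 0.5, "late_wide": 0.5}
for _ in range(20):
    probs = update_rule_probs(probs, satisfied={"early_depthwise"}, reward=0.9)
```

After the loop, the satisfied rule's probability has climbed close to the reward level while the untouched rule stays at its prior.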
Kapoor, A.; Soans, R.; Dixit, S.; Ns, P.; Singh, B.; Das, M.
NASEREX: Optimizing Early Exits via AutoML for Scalable Efficient Inference in Big Image Streams Proceedings Article
In: 2023 IEEE International Conference on Big Data (BigData), pp. 5266-5271, IEEE Computer Society, Los Alamitos, CA, USA, 2023.
@inproceedings{10386502,
title = {NASEREX: Optimizing Early Exits via AutoML for Scalable Efficient Inference in Big Image Streams},
author = {A. Kapoor and R. Soans and S. Dixit and P. Ns and B. Singh and M. Das},
url = {https://doi.ieeecomputersociety.org/10.1109/BigData59044.2023.10386502},
doi = {10.1109/BigData59044.2023.10386502},
year = {2023},
date = {2023-12-01},
urldate = {2023-12-01},
booktitle = {2023 IEEE International Conference on Big Data (BigData)},
pages = {5266-5271},
publisher = {IEEE Computer Society},
address = {Los Alamitos, CA, USA},
abstract = {We investigate the problem of smart operational efficiency, at scale, in Machine Learning models for Big Data streams, in context of embedded AI applications, by learning optimal early exits. Embedded AI applications that employ deep neural models depend on efficient model inference at scale, especially on resource-constrained hardware. Recent vision/text/audio models are computationally complex with huge parameter spaces and input samples typically pass through multiple layers, each with large tensor computations, to produce valid outputs. Generally, in most real scenarios, AI applications deal with big data streams, such as streams of audio signals, static images and/or high resolution video frames. Deep ML models powering such applications have to continuously perform inference on such big data streams for varied tasks such as noise suppression, face detection, gait estimation and so on. Ensuring efficiency is challenging, even with model compression techniques since they reduce model size but often fail to achieve scalable inference efficiency over continuous streams. Early exits enable adaptive inference by extracting valid outputs from any pre-final layer of a deep model which significantly boosts efficiency at scale since many of the input instances need not be processed at all the layers of a deep model, especially for big streams. Suitable early exit structure design (number + positions) is a difficult but crucial aspect in improving efficiency without any loss in predictive performance, especially in context of big streams. Naive manual early exit design that does not consider the hardware capacity or data stream characteristics is counterproductive. We propose NASEREX framework that leverages Neural architecture Search (NAS) with a novel saliency-constrained search space and exit decision metric to learn suitable early exit structure to augment Deep Neural models for scalable efficient inference on big image streams. 
Optimized exit-augmented models perform $\approx 2.5\times$ faster, with $\approx 4\times$ lower aggregated effective FLOPs and no significant accuracy loss.},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
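The early-exit inference this abstract describes can be sketched in a few lines: run a model stage by stage and stop as soon as an auxiliary exit head is confident enough. The stage functions, exit heads, and the 0.9 threshold below are toy stand-ins, not NASEREX's searched exit structures:

```python
def adaptive_inference(x, stages, exit_heads, threshold=0.9):
    """Run stages in order; return early once an exit head is confident.

    stages     -- list of callables transforming the running features
    exit_heads -- list of callables returning (label, confidence) per stage
    threshold  -- minimum confidence needed to exit early (assumed value)
    """
    for depth, (stage, head) in enumerate(zip(stages, exit_heads), start=1):
        x = stage(x)
        label, confidence = head(x)
        if confidence >= threshold:
            return label, depth          # early exit: later stages are skipped
    return label, depth                  # fell through to the final exit

# Toy demo: the "feature" is a number, confidence grows with depth.
stages = [lambda v: v + 1] * 4
exit_heads = [
    lambda v, c=c: ("cat" if v % 2 else "dog", c)
    for c in (0.3, 0.6, 0.95, 0.99)
]
label, depth = adaptive_inference(0, stages, exit_heads)
```

On streams where most inputs exit early, the later (and costlier) stages are rarely executed, which is where the efficiency gain comes from.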
Heuillet, Alexandre
Exploring deep neural network differentiable architecture design PhD Thesis
Université Paris-Saclay, 2023.
@phdthesis{heuillet:tel-04420933,
title = {Exploring deep neural network differentiable architecture design},
author = {Alexandre Heuillet},
url = {https://hal.science/tel-04420933},
year = {2023},
date = {2023-12-01},
urldate = {2023-12-01},
number = {2023UPASG069},
school = {Université Paris-Saclay},
keywords = {},
pubstate = {published},
tppubtype = {phdthesis}
}
Chen, Hui; Li, Nannan; Chen, Rong
Ni-DehazeNet: representation learning via bilevel optimized architecture search for nighttime dehazing Journal Article
In: The Visual Computer, 2023.
@article{Chen-vc23a,
title = {Ni-DehazeNet: representation learning via bilevel optimized architecture search for nighttime dehazing},
author = {Hui Chen and Nannan Li and Rong Chen},
url = {https://link.springer.com/article/10.1007/s00371-023-03159-4},
year = {2023},
date = {2023-11-28},
journal = {The Visual Computer},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Lyu, Zonglei; Yu, Tong; Pan, Fuxi; Zhang, Yilin; Luo, Jia; Zhang, Dan; Chen, Yiren; Zhang, Bo; Li, Guangyao
A survey of model compression strategies for object detection Journal Article
In: Multimedia Tools and Applications, 2023.
@article{Lyu-mta23a,
title = {A survey of model compression strategies for object detection},
author = {Zonglei Lyu and Tong Yu and Fuxi Pan and Yilin Zhang and Jia Luo and Dan Zhang and Yiren Chen and Bo Zhang and Guangyao Li},
url = {https://link.springer.com/article/10.1007/s11042-023-17192-x},
year = {2023},
date = {2023-11-02},
urldate = {2023-11-02},
journal = {Multimedia Tools and Applications},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Poyser, Matthew
Minimizing Computational Resources for Deep Machine Learning: A Compression and Neural Architecture Search Perspective for Image Classification and Object Detection PhD Thesis
2023.
@phdthesis{POYSER-phd2023,
title = {Minimizing Computational Resources for Deep Machine Learning: A Compression and Neural Architecture Search Perspective for Image Classification and Object Detection},
author = {Poyser, Matthew},
url = {http://etheses.dur.ac.uk/15207/1/main.pdf},
year = {2023},
date = {2023-11-01},
urldate = {2023-11-01},
keywords = {},
pubstate = {published},
tppubtype = {phdthesis}
}
Liu, Shiya
Energy-efficient Neuromorphic Computing for Resource-constrained Internet of Things Devices PhD Thesis
2023.
@phdthesis{Liu-phd23a,
title = {Energy-efficient Neuromorphic Computing for Resource-constrained Internet of Things Devices},
author = {Liu, Shiya},
url = {https://vtechworks.lib.vt.edu/handle/10919/116629},
year = {2023},
date = {2023-11-01},
urldate = {2023-11-01},
keywords = {},
pubstate = {published},
tppubtype = {phdthesis}
}
Xie, Tao; Zhang, Haoming; Yang, Linqi; Wang, Ke; Dai, Kun; Li, Ruifeng; Zhao, Lijun
Point-NAS: A Novel Neural Architecture Search Framework for Point Cloud Analysis Journal Article
In: IEEE Transactions on Image Processing, vol. PP, 2023, ISSN: 1057-7149.
@article{PMID:37963007,
title = {Point-NAS: A Novel Neural Architecture Search Framework for Point Cloud Analysis},
author = {Tao Xie and Haoming Zhang and Linqi Yang and Ke Wang and Kun Dai and Ruifeng Li and Lijun Zhao},
url = {https://doi.org/10.1109/TIP.2023.3331223},
doi = {10.1109/tip.2023.3331223},
issn = {1057-7149},
year = {2023},
date = {2023-11-01},
urldate = {2023-11-01},
journal = {IEEE Transactions on Image Processing},
volume = {PP},
abstract = {Recently, point-based networks have exhibited extraordinary potential for 3D point cloud processing. However, owing to the meticulous design of both parameters and hyperparameters inside the network, constructing a promising network for each point cloud task can be an expensive endeavor. In this work, we develop a novel one-shot search framework called Point-NAS to automatically determine optimum architectures for various point cloud tasks. Specifically, we design an elastic feature extraction (EFE) module that serves as a basic unit for architecture search, which expands seamlessly alongside both the width and depth of the network for efficient feature extraction. Based on the EFE module, we devise a searching space, which is encoded into a supernet to provide a wide number of latent network structures for a particular point cloud task. To fully optimize the weights of the supernet, we propose a weight coupling sandwich rule that samples the largest, smallest, and multiple medium models at each iteration and fuses their gradients to update the supernet. Furthermore, we present a united gradient adjustment algorithm that mitigates gradient conflict induced by distinct gradient directions of sampled models and supernet, thus expediting the convergence of the supernet and assuring that it can be comprehensively trained. Pursuant to the provided techniques, the trained supernet enables a multitude of subnets to be incredibly well-optimized. Finally, we conduct an evolutionary search for the supernet under resource constraints to find promising architectures for different tasks. Experimentally, the searched Point-NAS with weights inherited from the supernet realizes outstanding results across a variety of benchmarks. i.e., 94.2% and 88.9% overall accuracy under ModelNet40 and ScanObjectNN, 68.6% mIoU under S3DIS, 63.6% and 69.3% mAP@0.25 under SUN RGB-D and ScanNet V2 datasets.},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
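The weight-coupling sandwich rule from this abstract — sample the largest, the smallest, and a few medium subnets each iteration and fuse their gradients into a single supernet update — can be illustrated on a toy "supernet" in which a subnet of width w uses only the first w weights. All names and the quadratic objective are assumptions for illustration, not Point-NAS's actual training code:

```python
import random

def sandwich_step(weights, widths, grad_fn, lr=0.1, n_middle=2):
    """One supernet update under the sandwich rule: always include the
    largest and smallest subnet plus a few random middle ones, and fuse
    (average) their gradients before a single weight update."""
    sampled = [max(widths), min(widths)]
    sampled += random.sample(sorted(widths)[1:-1], n_middle)
    fused = [0.0] * len(weights)
    for w in sampled:
        g = grad_fn(weights, w)          # gradient as seen by the width-w subnet
        fused = [f + gi for f, gi in zip(fused, g)]
    return [wt - lr * f / len(sampled) for wt, f in zip(weights, fused)]

# Toy objective: the width-w subnet's loss is the sum of squares of its
# first w weights, so its gradient only touches those weights.
def toy_grad(weights, w):
    return [2 * x if i < w else 0.0 for i, x in enumerate(weights)]

weights = [1.0, 1.0, 1.0, 1.0]
for _ in range(50):
    weights = sandwich_step(weights, widths=[1, 2, 3, 4], grad_fn=toy_grad)
```

The first weight is shared by every sampled subnet and therefore receives fused gradient from all of them, so it shrinks fastest — a small-scale picture of how weight sharing couples subnet training.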
Dimanov, Daniel
Efficient Multi-Objective NeuroEvolution in Computer Vision and Applications for Threat Identification PhD Thesis
2023.
@phdthesis{Dimanov-phd23a,
title = {Efficient Multi-Objective NeuroEvolution in Computer Vision and Applications for Threat Identification},
author = {Daniel Dimanov},
url = {http://eprints.bournemouth.ac.uk/39138/1/DIMANOV%2C%20Daniel_Ph.D._2023.pdf},
year = {2023},
date = {2023-11-01},
urldate = {2023-11-01},
keywords = {},
pubstate = {published},
tppubtype = {phdthesis}
}
Li, Yueyang; Wu, Zhouejie; Shen, Junfei; Zhang, Qican
Real-time 3D shape measurement of dynamic scenes using fringe projection profilometry: lightweight NAS-optimized dual frequency deep learning approach Journal Article
In: Opt. Express, vol. 31, no. 24, pp. 40803–40823, 2023.
@article{Li:23,
title = {Real-time 3D shape measurement of dynamic scenes using fringe projection profilometry: lightweight NAS-optimized dual frequency deep learning approach},
author = {Yueyang Li and Zhouejie Wu and Junfei Shen and Qican Zhang},
url = {https://opg.optica.org/oe/abstract.cfm?URI=oe-31-24-40803},
doi = {10.1364/OE.506343},
year = {2023},
date = {2023-11-01},
urldate = {2023-11-01},
journal = {Opt. Express},
volume = {31},
number = {24},
pages = {40803–40823},
publisher = {Optica Publishing Group},
abstract = {Achieving real-time and high-accuracy 3D reconstruction of dynamic scenes is a fundamental challenge in many fields, including online monitoring, augmented reality, and so on. On one hand, traditional methods, such as Fourier transform profilometry (FTP) and phase-shifting profilometry (PSP), are struggling to balance measuring efficiency and accuracy. On the other hand, deep learning-based approaches, which offer the potential for improved accuracy, are hindered by large parameter amounts and complex structures less amenable to real-time requirements. To solve this problem, we proposed a network architecture search (NAS)-based method for real-time processing and 3D measurement of dynamic scenes with rate equivalent to single-shot. A NAS-optimized lightweight neural network was designed for efficient phase demodulation, while an improved dual-frequency strategy was employed coordinately for flexible absolute phase unwrapping. The experiment results demonstrate that our method can effectively perform 3D reconstruction with a reconstruction speed of 58fps, and realize high-accuracy measurement of dynamic scenes based on deep learning for what we believe to be the first time with the average RMS error of about 0.08 mm.},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
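The dual-frequency strategy mentioned above builds on standard temporal phase unwrapping: a low-frequency absolute phase disambiguates the fringe order of a wrapped high-frequency phase. A sketch of that baseline computation (the paper's improved strategy and NAS-designed network are not reproduced here):

```python
import math

def unwrap_dual_frequency(phi_high, phi_low_abs, f_high, f_low):
    """Recover the absolute high-frequency phase from its wrapped value.

    phi_high    -- wrapped high-frequency phase, in [0, 2*pi)
    phi_low_abs -- absolute phase of the low-frequency pattern
    f_high/f_low scales the low-frequency phase up to the high-frequency
    level; the fringe order k is the nearest integer number of 2*pi jumps.
    """
    k = round((f_high / f_low * phi_low_abs - phi_high) / (2 * math.pi))
    return phi_high + 2 * math.pi * k

# Toy demo: a true absolute phase of 25 rad, measured wrapped.
phi_true = 25.0
wrapped = phi_true % (2 * math.pi)       # what the camera actually observes
recovered = unwrap_dual_frequency(wrapped, phi_true / 8, f_high=8.0, f_low=1.0)
```

Because only the fringe order comes from the (noisier) low-frequency phase, the recovered phase keeps the high-frequency measurement's precision.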
Xie, G.; Li, Q.; Shi, Z.; Fang, H.; Ji, S.; Jiang, Y.; Yuan, Z.; Ma, L.; Xu, M.
Generating Neural Networks for Diverse Networking Classification Tasks via Hardware-Aware Neural Architecture Search Journal Article
In: IEEE Transactions on Computers, no. 01, pp. 1-14, 2023, ISSN: 1557-9956.
@article{10323250,
title = {Generating Neural Networks for Diverse Networking Classification Tasks via Hardware-Aware Neural Architecture Search},
author = {G. Xie and Q. Li and Z. Shi and H. Fang and S. Ji and Y. Jiang and Z. Yuan and L. Ma and M. Xu},
url = {https://www.computer.org/csdl/journal/tc/5555/01/10323250/1SewSv3Y4VO},
doi = {10.1109/TC.2023.3333253},
issn = {1557-9956},
year = {2023},
date = {2023-11-01},
urldate = {2023-11-01},
journal = {IEEE Transactions on Computers},
number = {01},
pages = {1-14},
publisher = {IEEE Computer Society},
address = {Los Alamitos, CA, USA},
abstract = {Neural networks (NNs) are widely used in classification-based networking analysis to help traffic transmission and system security. However, there are heterogeneous network devices (e.g., switches and routers) in a network. Manually customizing NNs with specific device requirements (e.g., max allowed running latency) can be time-consuming and labor-intensive. Furthermore, the diverse data characteristics of different networking classification tasks add to the burden of NN customization. This paper introduces Loong, a neural architecture search (NAS) based system that automatically generates NNs for various networking tasks and devices. Loong includes a neural operation embedding module, which embeds candidate neural operations into the layer to be designed. Then, the layer-wise training is used to generate a task-specific NN layer by layer. This layer-wise scheme simultaneously trains and selects candidate neural operations using gradient feedback. Finally, only the important operations are selected to form the layer, maximizing accuracy. By incorporating multiple objectives, including deployment memory and running latency of devices, into the training and selection of NNs, Loong is able to customize NNs for heterogeneous network devices. Experiments show that Loong’s NNs outperform 13 manual-designed and NAS-based NNs, with a 4.11% improvement in F1-score. Additionally, Loong’s NNs achieve faster (7.92X) speeds on commodity devices.},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Wang, G.; Li, C.; Yuan, L.; Peng, J.; Xian, X.; Liang, X.; Chang, X.; Lin, L.
DNA Family: Boosting Weight-Sharing NAS with Block-Wise Supervisions Journal Article
In: IEEE Transactions on Pattern Analysis & Machine Intelligence, no. 01, pp. 1-18, 2023, ISSN: 1939-3539.
@article{10324326,
title = {DNA Family: Boosting Weight-Sharing NAS with Block-Wise Supervisions},
author = {G. Wang and C. Li and L. Yuan and J. Peng and X. Xian and X. Liang and X. Chang and L. Lin},
url = {https://www.computer.org/csdl/journal/tp/5555/01/10324326/1SgbFgBOI7u},
doi = {10.1109/TPAMI.2023.3335261},
issn = {1939-3539},
year = {2023},
date = {2023-11-01},
urldate = {2023-11-01},
journal = {IEEE Transactions on Pattern Analysis & Machine Intelligence},
number = {01},
pages = {1-18},
publisher = {IEEE Computer Society},
address = {Los Alamitos, CA, USA},
abstract = {Neural Architecture Search (NAS), aiming at automatically designing neural architectures by machines, has been considered a key step toward automatic machine learning. One notable NAS branch is the weight-sharing NAS, which significantly improves search efficiency and allows NAS algorithms to run on ordinary computers. Despite receiving high expectations, this category of methods suffers from low search effectiveness. By employing a generalization boundedness tool, we demonstrate that the devil behind this drawback is the untrustworthy architecture rating with the oversized search space of the possible architectures. Addressing this problem, we modularize a large search space into blocks with small search spaces and develop a family of models with the distilling neural architecture (DNA) techniques. These proposed models, namely a DNA family, are capable of resolving multiple dilemmas of the weight-sharing NAS, such as scalability, efficiency, and multi-modal compatibility. Our proposed DNA models can rate all architecture candidates, as opposed to previous works that can only access a sub- search space using heuristic algorithms. Moreover, under a certain computational complexity constraint, our method can seek architectures with different depths and widths. Extensive experimental evaluations show that our models achieve state-of-the-art top-1 accuracy of 78.9% and 83.6% on ImageNet for a mobile convolutional network and a small vision transformer, respectively. Additionally, we provide in-depth empirical analysis and insights into neural architecture ratings. Codes available: https://github.com/changlin31/DNA.},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
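The block-wise supervision underlying the DNA family — score each block's candidate operations independently against a distillation teacher, so that every architecture in the space can be rated rather than only a heuristically reachable subset — can be caricatured with one-dimensional "blocks". All functions below are toy stand-ins for real network blocks:

```python
def rate_blockwise(candidates_per_block, teacher_blocks, probe_inputs):
    """Block-wise rating in the spirit of DNA: each block's candidates are
    scored independently by how closely they reproduce the teacher block's
    output on probe inputs, and the best candidate per block is combined."""
    chosen = []
    for cands, teacher in zip(candidates_per_block, teacher_blocks):
        def block_loss(op, teacher=teacher):
            return sum((op(x) - teacher(x)) ** 2 for x in probe_inputs)
        chosen.append(min(cands, key=block_loss))
    return chosen

# Toy setup: teacher blocks double / negate; candidates include near-matches.
teacher = [lambda x: 2 * x, lambda x: -x]
candidates = [
    [lambda x: x, lambda x: 2 * x + 0.01, lambda x: 3 * x],
    [lambda x: -x, lambda x: x, lambda x: 0.0 * x],
]
best = rate_blockwise(candidates, teacher, probe_inputs=[1.0, 2.0, -1.0])
```

Scoring blocks independently keeps each block's search space small, which is exactly how modularization sidesteps the untrustworthy ratings of an oversized shared space.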
He, Xin; Chu, Xiaowen
MedPipe: End-to-End Joint Search of Data Augmentation and Neural Architecture for 3D Medical Image Classification Journal Article
In: 2023.
@article{He_2023,
title = {MedPipe: End-to-End Joint Search of Data Augmentation and Neural Architecture for 3D Medical Image Classification},
author = {Xin He and Xiaowen Chu},
url = {http://dx.doi.org/10.36227/techrxiv.19513780.v2},
doi = {10.36227/techrxiv.19513780.v2},
year = {2023},
date = {2023-11-01},
urldate = {2023-11-01},
publisher = {Institute of Electrical and Electronics Engineers (IEEE)},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Kim, Bosung; Lee, Seulki
On-NAS: On-Device Neural Architecture Search on Memory-Constrained Intelligent Embedded Systems Proceedings Article
In: ACM Conference on Embedded Networked Sensor Systems (SenSys ’23), 2023.
@inproceedings{Kim-sensys23a,
title = {On-NAS: On-Device Neural Architecture Search on Memory-Constrained Intelligent Embedded Systems},
author = {Bosung Kim and Seulki Lee},
url = {https://aigs.unist.ac.kr/filebox/item/1917192674_8d43d5a0_On-NAS_SenSys2023.pdf},
year = {2023},
date = {2023-11-01},
urldate = {2023-11-01},
booktitle = {ACM Conference on Embedded Networked Sensor Systems (SenSys ’23)},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Cai, Zhiqiang; Chen, Jialin; Xu, Ke; Wang, Lingli
Recognizing Good Variational Quantum Circuits with Monte Carlo Tree Search Technical Report
2023.
@techreport{Cai-rs23a,
title = {Recognizing Good Variational Quantum Circuits with Monte Carlo Tree Search},
author = {Zhiqiang Cai and Jialin Chen and Ke Xu and Lingli Wang},
url = {https://www.researchsquare.com/article/rs-3490986/v1},
year = {2023},
date = {2023-10-27},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
Xu, Zheng; Jain, Deepak Kumar; Shamsolmoali, Pourya; Goli, Alireza; Neelakandan, Subramani; Jain, Amar
Slime Mold optimization with hybrid deep learning enabled crowd-counting approach in video surveillance Journal Article
In: Machine Learning and Big Data Analytics for IoT Security and Privacy (SPIoT 2022), 2023.
@article{Xu-SPIoT22a,
title = {Slime Mold optimization with hybrid deep learning enabled crowd-counting approach in video surveillance},
author = {Zheng Xu and Deepak Kumar Jain and Pourya Shamsolmoali and Alireza Goli and Subramani Neelakandan and Amar Jain},
url = {https://link.springer.com/article/10.1007/s00521-023-09083-x},
year = {2023},
date = {2023-10-26},
journal = {Machine Learning and Big Data Analytics for IoT Security and Privacy (SPIoT 2022)},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Yang, Lei; Mei, Sen; Liang, Pan; Li, Yan; Ma, Ling; Gao, Jianbo; Jiang, Huiqin
A 3D prediction model for benign or malignant of pulmonary nodules based on neural architecture search Journal Article
In: Signal, Image and Video Processing, 2023.
@article{Yang-SIV23a,
title = {A 3D prediction model for benign or malignant of pulmonary nodules based on neural architecture search},
author = {Lei Yang and Sen Mei and Pan Liang and Yan Li and Ling Ma and Jianbo Gao and Huiqin Jiang},
url = {https://link.springer.com/article/10.1007/s11760-023-02807-5},
year = {2023},
date = {2023-10-18},
journal = {Signal, Image and Video Processing},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Wu, Zhenpeng; Chen, Jiamin; Al-Sabri, Raeed; Oloulade, Babatounde Moctard; Gao, Jianliang
Adaptive graph contrastive learning with joint optimization of data augmentation and graph encoder Journal Article
In: Knowledge and Information Systems, 2023.
@article{Wu-KIS23a,
title = {Adaptive graph contrastive learning with joint optimization of data augmentation and graph encoder},
author = {Zhenpeng Wu and Jiamin Chen and Raeed Al-Sabri and Babatounde Moctard Oloulade and Jianliang Gao},
url = {https://link.springer.com/article/10.1007/s10115-023-01979-3},
year = {2023},
date = {2023-10-12},
urldate = {2023-10-12},
journal = {Knowledge and Information Systems},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Ragusa, Edoardo; Dosen, Strahinja; Zunino, Rodolfo; Gastaldo, Paolo
Affordance Segmentation Using Tiny Networks for Sensing Systems in Wearable Robotic Devices Journal Article
In: IEEE Sensors Journal, 2023.
@article{Ragusa-ieeesensorsjournal,
title = {Affordance Segmentation Using Tiny Networks for Sensing Systems in Wearable Robotic Devices},
author = {Edoardo Ragusa and Strahinja Dosen and Rodolfo Zunino and Paolo Gastaldo},
url = {https://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=10235885&tag=1},
year = {2023},
date = {2023-10-01},
journal = {IEEE Sensors Journal},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
MacKinnon, Christopher; Atkinson, Robert
Designing a New Search Space for Multivariate Time-Series Neural Architecture Search Collection
2023.
@collection{MacKinnon-ecmlaaltd23a,
title = {Designing a New Search Space for Multivariate Time-Series Neural Architecture Search},
author = {Christopher MacKinnon and Robert Atkinson},
url = {https://ecml-aaltd.github.io/aaltd2023/papers/Designing_a_New_Search_Space_for_Multivariate_Time_Series_Neural_Architecture_Search___AALTD__ECML_PKDD_%20(32).pdf},
year = {2023},
date = {2023-10-01},
urldate = {2023-10-01},
booktitle = {8th Workshop on Advanced Analytics and Learning on Temporal Data (AALTD 2023), ECML 2023},
keywords = {},
pubstate = {published},
tppubtype = {collection}
}
García, Jesús Leopoldo Llano; Monroy, Raúl; Hernández, Víctor Adrián Sosa
An Experimental Protocol for Neural Architecture Search in Super-Resolution Proceedings Article
In: Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV) Workshops, pp. 4139-4146, 2023.
@inproceedings{Garcia_2023_ICCV,
title = {An Experimental Protocol for Neural Architecture Search in Super-Resolution},
author = {Jesús Leopoldo Llano García and Raúl Monroy and Víctor Adrián Sosa Hernández},
url = {https://openaccess.thecvf.com/content/ICCV2023W/LXCV/html/Garcia_An_Experimental_Protocol_for_Neural_Architecture_Search_in_Super-Resolution_ICCVW_2023_paper.html},
year = {2023},
date = {2023-10-01},
urldate = {2023-10-01},
booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV) Workshops},
pages = {4139-4146},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Soro, Bedionita; Song, Chong
Enhancing Differentiable Architecture Search: A Study on Small Number of Cell Blocks in the Search Stage, and Important Branches-Based Cells Selection Proceedings Article
In: Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV) Workshops, pp. 1253-1261, 2023.
@inproceedings{Soro_2023_ICCV,
title = {Enhancing Differentiable Architecture Search: A Study on Small Number of Cell Blocks in the Search Stage, and Important Branches-Based Cells Selection},
author = {Bedionita Soro and Chong Song},
url = {https://openaccess.thecvf.com/content/ICCV2023W/RCV/papers/Soro_Enhancing_Differentiable_Architecture_Search_A_Study_on_Small_Number_of_ICCVW_2023_paper.pdf},
year = {2023},
date = {2023-10-01},
urldate = {2023-10-01},
booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV) Workshops},
pages = {1253-1261},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Rosales, Rafael; Munoz, Pablo; Paulitsch, Michael
Assessing the Impact of Diversity on the Resilience of Deep Learning Ensembles: A Comparative Study on Model Architecture, Output, Activation, and Attribution Proceedings Article
In: Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV) Workshops, pp. 4406-4416, 2023.
@inproceedings{Rosales_2023_ICCV,
title = {Assessing the Impact of Diversity on the Resilience of Deep Learning Ensembles: A Comparative Study on Model Architecture, Output, Activation, and Attribution},
author = {Rafael Rosales and Pablo Munoz and Michael Paulitsch},
url = {https://openaccess.thecvf.com/content/ICCV2023W/OODCV/html/Rosales_Assessing_the_Impact_of_Diversity_on_the_Resilience_of_Deep_ICCVW_2023_paper.html},
year = {2023},
date = {2023-10-01},
urldate = {2023-10-01},
booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV) Workshops},
pages = {4406-4416},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Bhardwaj, Kartikeya; Cheng, Hsin-Pai; Priyadarshi, Sweta; Li, Zhuojin
ZiCo-BC: A Bias Corrected Zero-Shot NAS for Vision Tasks Proceedings Article
In: Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV) Workshops, pp. 1353-1357, 2023.
@inproceedings{Bhardwaj_2023_ICCV,
title = {ZiCo-BC: A Bias Corrected Zero-Shot NAS for Vision Tasks},
author = {Kartikeya Bhardwaj and Hsin-Pai Cheng and Sweta Priyadarshi and Zhuojin Li},
url = {https://openaccess.thecvf.com/content/ICCV2023W/RCV/papers/Bhardwaj_ZiCo-BC_A_Bias_Corrected_Zero-Shot_NAS_for_Vision_Tasks_ICCVW_2023_paper.pdf},
year = {2023},
date = {2023-10-01},
urldate = {2023-10-01},
booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV) Workshops},
pages = {1353-1357},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Cavagnero, Niccolò; Robbiano, Luca; Pistilli, Francesca; Caputo, Barbara; Averta, Giuseppe
Entropic Score Metric: Decoupling Topology and Size in Training-Free NAS Proceedings Article
In: Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV) Workshops, pp. 1459-1468, 2023.
@inproceedings{Cavagnero_2023_ICCV,
title = {Entropic Score Metric: Decoupling Topology and Size in Training-Free NAS},
author = {Niccolò Cavagnero and Luca Robbiano and Francesca Pistilli and Barbara Caputo and Giuseppe Averta},
url = {https://openaccess.thecvf.com/content/ICCV2023W/RCV/papers/Cavagnero_Entropic_Score_Metric_Decoupling_Topology_and_Size_in_Training-Free_NAS_ICCVW_2023_paper.pdf},
year = {2023},
date = {2023-10-01},
urldate = {2023-10-01},
booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV) Workshops},
pages = {1459-1468},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Sakuma, Yuiko; Ishii, Masato; Narihira, Takuya
DetOFA: Efficient Training of Once-for-All Networks for Object Detection Using Path Filter Proceedings Article
In: Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV) Workshops, pp. 1333-1342, 2023.
@inproceedings{Sakuma_2023_ICCV,
title = {DetOFA: Efficient Training of Once-for-All Networks for Object Detection Using Path Filter},
author = {Yuiko Sakuma and Masato Ishii and Takuya Narihira},
url = {https://openaccess.thecvf.com/content/ICCV2023W/RCV/papers/Sakuma_DetOFA_Efficient_Training_of_Once-for-All_Networks_for_Object_Detection_Using_ICCVW_2023_paper.pdf},
year = {2023},
date = {2023-10-01},
urldate = {2023-10-01},
booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV) Workshops},
pages = {1333-1342},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Akinola, Solomon Oluwole; Wang, Qingguo; Olukanmi, Peter; Marwala, Tshilidzi
A Boosted Evolutionary Neural Architecture Search for Timeseries Forecasting with Application to South African COVID-19 Cases Journal Article
In: International Journal of Online and Biomedical Engineering (iJOE), vol. 19, no. 14, pp. 107–130, 2023.
@article{Akinola_Qingguo_Olukanmi_Tshilidzi_2023,
title = {A Boosted Evolutionary Neural Architecture Search for Timeseries Forecasting with Application to South African COVID-19 Cases},
author = {Solomon Oluwole Akinola and Qingguo Wang and Peter Olukanmi and Tshilidzi Marwala},
url = {https://online-journals.org/index.php/i-joe/article/view/41291},
doi = {10.3991/ijoe.v19i14.41291},
year = {2023},
date = {2023-10-01},
urldate = {2023-10-01},
journal = {International Journal of Online and Biomedical Engineering (iJOE)},
volume = {19},
number = {14},
pages = {107-130},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Zhang, Lei; Chen, Zhiqian; Lu, Chang-Tien; Zhao, Liang
Fast and Adaptive Dynamics-on-Graphs to Dynamics-of-Graphs Translation Journal Article
In: Frontiers in Big Data, 2023.
@article{Zhang-secdmm23a,
title = {Fast and Adaptive Dynamics-on-Graphs to Dynamics-of-Graphs Translation},
author = {Lei Zhang and Zhiqian Chen and Chang-Tien Lu and Liang Zhao},
url = {https://www.frontiersin.org/articles/10.3389/fdata.2023.1274135/abstract},
doi = {10.3389/fdata.2023.1274135},
year = {2023},
date = {2023-10-01},
urldate = {2023-10-01},
journal = {Frontiers in Big Data},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Hoang, Quang Minh
Practical Methods for Automated Algorithm Design in Machine Learning and Computational Biology PhD Thesis
2023.
@phdthesis{Hoang-phd23a,
title = {Practical Methods for Automated Algorithm Design in Machine Learning and Computational Biology},
author = {Quang Minh Hoang},
url = {http://reports-archive.adm.cs.cmu.edu/anon/2023/abstracts/23-139.html},
year = {2023},
date = {2023-10-01},
urldate = {2023-10-01},
keywords = {},
pubstate = {published},
tppubtype = {phdthesis}
}
An, S.; Channing, G.; Schuman, C.; Taufer, M.
VINARCH: A Visual Analytics Interactive Tool for Neural Network Archaeology Proceedings Article
In: 2023 IEEE International Conference on Cluster Computing Workshops (CLUSTER Workshops), pp. 50-51, IEEE Computer Society, Los Alamitos, CA, USA, 2023.
@inproceedings{10321892,
title = {VINARCH: A Visual Analytics Interactive Tool for Neural Network Archaeology},
author = {S. An and G. Channing and C. Schuman and M. Taufer},
url = {https://doi.ieeecomputersociety.org/10.1109/CLUSTERWorkshops61457.2023.00020},
doi = {10.1109/CLUSTERWorkshops61457.2023.00020},
year = {2023},
date = {2023-10-01},
urldate = {2023-10-01},
booktitle = {2023 IEEE International Conference on Cluster Computing Workshops (CLUSTER Workshops)},
pages = {50-51},
publisher = {IEEE Computer Society},
address = {Los Alamitos, CA, USA},
abstract = {Most neural networks (NNs) generated by neural architecture search (NAS) are discarded except for the final output to limit the memory usage on high performance computing (HPC) systems on which the search is performed. However, discarded NNs are vital for understanding the NAS structure’s evolution and reproducibility. We design a visual interactive tool for NN archaeology that explores the evolution of NAS structures, finds matching subsequences in the structures, and visualizes NN similarities across NAS outputs, including discarded NNs. We demonstrate the capabilities of our tool to discover and visualize matching subsequences on a dataset of NNs generated by NSGA-Net, a genetic NAS.},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Hu, Xuemei; Huang, Lan; Zeng, Jia; Wang, Kangping; Wang, Yan
EGFA-NAS: a neural architecture search method based on explosion gravitation field algorithm Journal Article
In: Complex & Intelligent Systems, 2023.
@article{hu-cis23a,
title = {EGFA-NAS: a neural architecture search method based on explosion gravitation field algorithm},
author = {Xuemei Hu and Lan Huang and Jia Zeng and Kangping Wang and Yan Wang},
url = {https://link.springer.com/article/10.1007/s40747-023-01230-0},
year = {2023},
date = {2023-09-30},
urldate = {2023-09-30},
journal = {Complex \& Intelligent Systems},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Pan, Yang; Jin, Mingwu; Zhang, Shun-Rong; Wing, Simon; Deng, Yue
Neural Network Models for Ionospheric Electron Density Prediction: A Neural Architecture Search Study Journal Article
In: ESS Open Archive, 2023.
@article{Pan-ESS23a,
title = {Neural Network Models for Ionospheric Electron Density Prediction: A Neural Architecture Search Study},
author = {Yang Pan and Mingwu Jin and Shun-Rong Zhang and Simon Wing and Yue Deng},
url = {https://essopenarchive.org/users/281372/articles/669825-neural-network-models-for-ionospheric-electron-density-prediction-a-neural-architecture-search-study},
year = {2023},
date = {2023-09-30},
urldate = {2023-09-30},
journal = {ESS Open Archive},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
O'Neill, Damien
Evolutionary Computation for the Optimisation of Skip-Connection Structures on Dense Convolutional Neural Networks PhD Thesis
2023.
@phdthesis{Neill-phd23a,
title = {Evolutionary Computation for the Optimisation of Skip-Connection Structures on Dense Convolutional Neural Networks},
author = {O'Neill, Damien},
url = {https://openaccess.wgtn.ac.nz/articles/thesis/Evolutionary_Computation_for_the_Optimisation_of_Skip-Connection_Structures_on_Dense_Convolutional_Neural_Networks/24210165},
year = {2023},
date = {2023-09-30},
urldate = {2023-09-30},
keywords = {},
pubstate = {published},
tppubtype = {phdthesis}
}
(Ed.)
MPCViT: Searching for Accurate and Efficient MPC-Friendly Vision Transformer with Heterogeneous Attention Collection
2023.
@collection{Zeng-icv23a,
title = {MPCViT: Searching for Accurate and Efficient MPC-Friendly Vision Transformer with Heterogeneous Attention},
author = {Wenxuan Zeng and Meng Li and Wenjie Xiong and Tong Tong and Wen-jie Lu and Jin Tan and Runsheng Wang and Ru Huang},
url = {https://openaccess.thecvf.com/content/ICCV2023/papers/Zeng_MPCViT_Searching_for_Accurate_and_Efficient_MPC-Friendly_Vision_Transformer_with_ICCV_2023_paper.pdf},
year = {2023},
date = {2023-09-30},
urldate = {2023-09-30},
booktitle = {ICCV 2023},
keywords = {},
pubstate = {published},
tppubtype = {collection}
}
(Ed.)
Automated Knowledge Distillation via Monte Carlo Tree Search Collection
2023.
@collection{Li-iccv23ab,
title = {Automated Knowledge Distillation via Monte Carlo Tree Search},
author = {Lujun Li and Peijie Dong and Zimian Wei and Ya Yang},
url = {https://openaccess.thecvf.com/content/ICCV2023/papers/Li_Automated_Knowledge_Distillation_via_Monte_Carlo_Tree_Search_ICCV_2023_paper.pdf},
year = {2023},
date = {2023-09-30},
urldate = {2023-09-30},
booktitle = {ICCV 2023},
keywords = {},
pubstate = {published},
tppubtype = {collection}
}
Xu, Yang; Ma, Yongjie
Evolutionary neural architecture search combining multi-branch ConvNet and improved transformer Journal Article
In: Scientific Reports, 2023.
@article{Xu-ja23a,
title = {Evolutionary neural architecture search combining multi-branch ConvNet and improved transformer},
author = {Yang Xu and Yongjie Ma},
url = {https://www.nature.com/articles/s41598-023-42931-3},
year = {2023},
date = {2023-09-22},
urldate = {2023-09-22},
journal = {Scientific Reports},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
(Ed.)
An Evaluation of Zero-Cost Proxies - from Neural Architecture Performance Prediction to Model Robustness Collection
2023.
@collection{Lukasik-dagm,
title = {An Evaluation of Zero-Cost Proxies - from Neural Architecture Performance Prediction to Model Robustness},
author = {Jovita Lukasik and Michael Moeller and Margret Keuper},
url = {https://www.dagm-gcpr.de/fileadmin/dagm-gcpr/pictures/2023_Heidelberg/Paper_MainTrack/064.pdf},
year = {2023},
date = {2023-09-19},
urldate = {2023-09-19},
booktitle = {DAGM German Conference on Pattern Recognition},
keywords = {},
pubstate = {published},
tppubtype = {collection}
}
Rogers, Brendan; Noman, Nasimul; Chalup, Stephan; Moscato, Pablo
A comparative analysis of deep neural network architectures for sentence classification using genetic algorithm Journal Article
In: Evolutionary Intelligence, 2023.
@article{Rogers-ei23a,
title = {A comparative analysis of deep neural network architectures for sentence classification using genetic algorithm},
author = {Brendan Rogers and Nasimul Noman and Stephan Chalup and Pablo Moscato},
url = {https://link.springer.com/article/10.1007/s12065-023-00874-8},
year = {2023},
date = {2023-09-08},
urldate = {2023-09-08},
journal = {Evolutionary Intelligence},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Tang, Yi-Jun; Yan, Ke; Zhang, Xingyi; Tian, Ye; Liu, Bin
Protein intrinsically disordered region prediction by combining neural architecture search and multi-objective genetic algorithm Journal Article
In: BMC Biology, 2023.
@article{Tang-bmc23a,
title = {Protein intrinsically disordered region prediction by combining neural architecture search and multi-objective genetic algorithm},
author = {Yi-Jun Tang and Ke Yan and Xingyi Zhang and Ye Tian and Bin Liu},
url = {https://bmcbiol.biomedcentral.com/articles/10.1186/s12915-023-01672-5},
year = {2023},
date = {2023-09-07},
urldate = {2023-09-07},
journal = {BMC Biology},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Afif, Mouna; Ayachi, Riadh; Said, Yahia; Atri, Mohamed
An indoor scene recognition system based on deep learning evolutionary algorithms Journal Article
In: Soft Computing, 2023.
@article{Afif-sc23a,
title = {An indoor scene recognition system based on deep learning evolutionary algorithms},
author = {Mouna Afif and Riadh Ayachi and Yahia Said and Mohamed Atri},
url = {https://link.springer.com/article/10.1007/s00500-023-09177-7},
year = {2023},
date = {2023-09-05},
urldate = {2023-09-05},
journal = {Soft Computing},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Zhang, Junfeng; Xie, Cheng; Yu, Beibei; Yang, Rui
Adaptive Hierarchical Knowledge Distillation from GNNs to MLPs Technical Report
2023.
@techreport{Zhang-tr23a,
title = {Adaptive Hierarchical Knowledge Distillation from GNNs to MLPs},
author = {Junfeng Zhang and Cheng Xie and Beibei Yu and Rui Yang},
url = {https://www.researchsquare.com/article/rs-3258299/v1},
year = {2023},
date = {2023-09-01},
urldate = {2023-09-01},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
Cheung, Ming
Learning from the Past: Fast NAS for Tasks and Datasets Journal Article
In: ACM Trans. Multimedia Comput. Commun. Appl., 2023, ISSN: 1551-6857, (Just Accepted).
@article{10.1145/3618000,
title = {Learning from the Past: Fast NAS for Tasks and Datasets},
author = {Ming Cheung},
url = {https://doi.org/10.1145/3618000},
doi = {10.1145/3618000},
issn = {1551-6857},
year = {2023},
date = {2023-09-01},
urldate = {2023-09-01},
journal = {ACM Trans. Multimedia Comput. Commun. Appl.},
publisher = {Association for Computing Machinery},
address = {New York, NY, USA},
abstract = {Nowadays, with the advancement of technology, many retail companies require in-house data scientist teams to build machine learning tasks, such as user segmentation and item price prediction. These teams typically use a trial-and-error process to obtain a good model for a given dataset and machine learning task, which is time-consuming and requires expertise. On the other hand, the team may have built models for other tasks on different datasets. This paper proposes a framework to obtain a model architecture using the previous solved machine learning tasks and datasets. By analyzing real datasets with over 70,000 images from 11 online retail e-commerce websites, it is demonstrated that the performance of a model is related to the similarity among datasets, models, and machine learning tasks. A framework is hence proposed to obtain the model using the similarities among them. It was proven that the model was 26.6% better in accuracy, and using only 20% of the runtime while comparing to a auto network architecture search library, auto-keras, in predicting the attributes of fashion images. To the best of our knowledge, this is the first paper to obtain the best model based on the similarity among machine learning tasks, models, and datasets.},
note = {Just Accepted},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Chitty-Venkata, Krishna Teja
Hardware-aware design, search, and optimization of deep neural networks PhD Thesis
2023.
@phdthesis{Venkata-phD23a,
title = {Hardware-aware design, search, and optimization of deep neural networks},
author = {Krishna Teja Chitty-Venkata},
url = {https://www.proquest.com/openview/4ab8b9b7b6c338ab4729cc0a11279c45/1?pq-origsite=gscholar&cbl=18750&diss=y},
year = {2023},
date = {2023-09-01},
urldate = {2023-09-01},
keywords = {},
pubstate = {published},
tppubtype = {phdthesis}
}
Yang, Zhao; Sun, Qingshuang
Energy-Efficient Personalized Federated Search with Graph for Edge Computing Journal Article
In: ACM Trans. Embed. Comput. Syst., vol. 22, no. 5s, 2023, ISSN: 1539-9087.
@article{10.1145/3609435,
title = {Energy-Efficient Personalized Federated Search with Graph for Edge Computing},
author = {Zhao Yang and Qingshuang Sun},
url = {https://doi.org/10.1145/3609435},
doi = {10.1145/3609435},
issn = {1539-9087},
year = {2023},
date = {2023-09-01},
urldate = {2023-09-01},
journal = {ACM Trans. Embed. Comput. Syst.},
volume = {22},
number = {5s},
publisher = {Association for Computing Machinery},
address = {New York, NY, USA},
abstract = {Federated Learning (FL) is a popular method for privacy-preserving machine learning on edge devices. However, the heterogeneity of edge devices, including differences in system architecture, data, and co-running applications, can significantly impact the energy efficiency of FL. To address these issues, we propose an energy-efficient personalized federated search framework. This framework has three key components. Firstly, we search for partial models with high inference efficiency to reduce training energy consumption and the occurrence of stragglers in each round. Secondly, we build lightweight search controllers that control the model sampling and respond to runtime variances, mitigating new straggler issues caused by co-running applications. Finally, we design an adaptive search update strategy based on graph aggregation to improve personalized training convergence. Our framework reduces the energy consumption of the training process by lowering the training overhead of each round and speeding up the training convergence rate. Experimental results show that our approach achieves up to 5.02% accuracy and 3.45× energy efficiency improvements.},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Mousavi, Hamid; Loni, Mohammad; Alibeigi, Mina; Daneshtalab, Masoud
DASS: Differentiable Architecture Search for Sparse Neural Networks Journal Article
In: ACM Trans. Embed. Comput. Syst., vol. 22, no. 5s, 2023, ISSN: 1539-9087.
@article{10.1145/3609385,
title = {DASS: Differentiable Architecture Search for Sparse Neural Networks},
author = {Hamid Mousavi and Mohammad Loni and Mina Alibeigi and Masoud Daneshtalab},
url = {https://doi.org/10.1145/3609385},
doi = {10.1145/3609385},
issn = {1539-9087},
year = {2023},
date = {2023-09-01},
urldate = {2023-09-01},
journal = {ACM Trans. Embed. Comput. Syst.},
volume = {22},
number = {5s},
publisher = {Association for Computing Machinery},
address = {New York, NY, USA},
abstract = {The deployment of Deep Neural Networks (DNNs) on edge devices is hindered by the substantial gap between performance requirements and available computational power. While recent research has made significant strides in developing pruning methods to build a sparse network for reducing the computing overhead of DNNs, there remains considerable accuracy loss, especially at high pruning ratios. We find that the architectures designed for dense networks by differentiable architecture search methods are ineffective when pruning mechanisms are applied to them. The main reason is that the current methods do not support sparse architectures in their search space and use a search objective that is made for dense networks and does not focus on sparsity. This paper proposes a new method to search for sparsity-friendly neural architectures. It is done by adding two new sparse operations to the search space and modifying the search objective. We propose two novel parametric SparseConv and SparseLinear operations in order to expand the search space to include sparse operations. In particular, these operations make a flexible search space due to using sparse parametric versions of linear and convolution operations. The proposed search objective lets us train the architecture based on the sparsity of the search space operations. Quantitative analyses demonstrate that architectures found through DASS outperform those used in the state-of-the-art sparse networks on the CIFAR-10 and ImageNet datasets. In terms of performance and hardware effectiveness, DASS increases the accuracy of the sparse version of MobileNet-v2 from 73.44% to 81.35% (+7.91% improvement) with a 3.87× faster inference time.},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Gao, Jianliang; Oloulade, Babatounde Moctard; Al-Sabri, Raeed; Chen, Jiamin; Lyu, Tengfei; Wu, Zhenpeng
Graph neural architecture prediction Journal Article
In: Knowledge and Information Systems, 2023.
@article{Gao-kis23a,
title = {Graph neural architecture prediction},
author = {Jianliang Gao and Babatounde Moctard Oloulade and Raeed Al-Sabri and Jiamin Chen and Tengfei Lyu and Zhenpeng Wu},
url = {https://link.springer.com/article/10.1007/s10115-023-01968-6},
year = {2023},
date = {2023-08-31},
urldate = {2023-08-31},
journal = {Knowledge and Information Systems},
keywords = {},
pubstate = {published},
tppubtype = {article}
}