Related papers: Neural Network Architecture Optimization through S…

Optimizing Recurrent Neural Networks Architectures under Time Constraints

The architecture of a recurrent neural network (RNN) is a key factor influencing its performance. We propose algorithms to optimize hidden sizes under a running-time constraint. We convert the discrete optimization into a subset selection problem.…

Machine Learning · Statistics 2018-02-22 Junqi Jin, Ziang Yan, Kun Fu, Nan Jiang, Changshui Zhang

Resource Constrained Neural Network Architecture Search: Will a Submodularity Assumption Help?

The design of neural network architectures is frequently either based on human expertise using trial/error and empirical feedback or tackled via large scale reinforcement learning strategies performed over distinct discrete architecture…

Computer Vision and Pattern Recognition · Computer Science 2019-09-06 Yunyang Xiong, Ronak Mehta, Vikas Singh

Deep Genetic Network

Optimizing a neural network's performance is a tedious and time-consuming process; this iterative process has no defined solution that works for all problems. Optimization can be roughly categorized into - Architecture and…

Machine Learning · Computer Science 2019-12-16 Siddhartha Dhar Choudhury, Shashank Pandey, Kunal Mehrotra

Towards Searching Efficient and Accurate Neural Network Architectures in Binary Classification Problems

In recent years, deep neural networks have had great success in machine learning and pattern recognition. Architecture size for a neural network contributes significantly to the success of any neural network. In this study, we optimize the…

Machine Learning · Computer Science 2021-01-19 Yigit Alparslan, Ethan Jacob Moyer, Isamu Mclean Isozaki, Daniel Schwartz, Adam Dunlop, Shesh Dave, Edward Kim

Computational issues in Optimization for Deep networks

The paper aims to investigate relevant computational issues of deep neural network architectures with an eye to the interaction between the optimization algorithm and the classification performance. In particular, we aim to analyze the…

Optimization and Control · Mathematics 2024-05-06 Corrado Coppola, Lorenzo Papa, Marco Boresta, Irene Amerini, Laura Palagi

Learning Time/Memory-Efficient Deep Architectures with Budgeted Super Networks

We propose to focus on the problem of discovering neural network architectures efficient in terms of both prediction quality and cost. For instance, our approach is able to solve the following tasks: learn a neural network able to predict…

Machine Learning · Computer Science 2018-05-24 Tom Veniat, Ludovic Denoyer

Distributed Training and Optimization Of Neural Networks

Deep learning models are yielding increasingly better performance thanks to multiple factors. To be successful, a model may have a large number of parameters or a complex architecture and be trained on a large dataset. This leads to large…

Machine Learning · Computer Science 2022-12-20 Jean-Roch Vlimant, Junqi Yin

Reframing Neural Networks: Deep Structure in Overcomplete Representations

In comparison to classical shallow representation learning techniques, deep neural networks have achieved superior performance in nearly every application benchmark. But despite their clear empirical advantages, it is still not well…

Machine Learning · Computer Science 2022-01-11 Calvin Murdock, George Cazenavette, Simon Lucey

Efficient Deep Neural Networks

The success of deep neural networks (DNNs) is attributable to three factors: increased compute capacity, more complex models, and more data. These factors, however, are not always present, especially for edge applications such as autonomous…

Computer Vision and Pattern Recognition · Computer Science 2019-08-26 Bichen Wu

Design Automation for Efficient Deep Learning Computing

Efficient deep learning computing requires algorithm and hardware co-design to enable specialization: we usually need to change the algorithm to reduce memory footprint and improve energy efficiency. However, the extra degree of freedom…

Machine Learning · Computer Science 2019-04-25 Song Han, Han Cai, Ligeng Zhu, Ji Lin, Kuan Wang, Zhijian Liu, Yujun Lin

A scalable constructive algorithm for the optimization of neural network architectures

We propose a new scalable method to optimize the architecture of an artificial neural network. The proposed algorithm, called Greedy Search for Neural Network Architecture, aims to determine a neural network with minimal number of layers…

Machine Learning · Computer Science 2021-04-30 Massimiliano Lupo Pasini, Junqi Yin, Ying Wai Li, Markus Eisenbach

Neural Architecture Search by Estimation of Network Structure Distributions

The influence of deep learning is continuously expanding across different domains, and its new applications are ubiquitous. The question of neural network design thus increases in importance, as traditional empirical approaches are reaching…

Neural and Evolutionary Computing · Computer Science 2021-01-29 Anton Muravev, Jenni Raitoharju, Moncef Gabbouj

Neural Architecture Design and Robustness: A Dataset

Deep learning models have proven to be successful in a wide range of machine learning tasks. Yet, they are often highly sensitive to perturbations on the input data which can lead to incorrect decisions with high confidence, hampering their…

Machine Learning · Computer Science 2023-06-13 Steffen Jung, Jovita Lukasik, Margret Keuper

Accelerating Deep Learning with Fixed Time Budget

The success of modern deep learning is attributed to two key elements: huge amounts of training data and large model sizes. Where a vast amount of data allows the model to learn more features, the large model architecture boosts the…

Machine Learning · Computer Science 2024-10-08 Muhammad Asif Khan, Ridha Hamila, Hamid Menouar

Growth strategies for arbitrary DAG neural architectures

Deep learning has shown impressive results obtained at the cost of training huge neural networks. However, the larger the architecture, the higher the computational, financial, and environmental costs during training and inference. We aim…

Machine Learning · Computer Science 2025-02-17 Stella Douka, Manon Verbockhaven, Théo Rudkiewicz, Stéphane Rivaud, François P. Landes, Sylvain Chevallier, Guillaume Charpiat

Scalable Greedy Feature Selection via Weak Submodularity

Greedy algorithms are widely used for problems in machine learning such as feature selection and set function optimization. Unfortunately, for large datasets, the running time of even greedy algorithms can be quite high. This is because for…

Machine Learning · Statistics 2017-03-09 Rajiv Khanna, Ethan Elenberg, Alexandros G. Dimakis, Sahand Negahban, Joydeep Ghosh

Nonmonotone submodular maximization under routing constraints

In machine learning and big data, the optimization objectives based on set-cover, entropy, diversity, influence, feature selection, etc. are commonly modeled as submodular functions. Submodular (function) maximization is generally NP-hard,…

Data Structures and Algorithms · Computer Science 2022-12-13 Haotian Zhang, Rao Li, Zewei Wu, Guodong Sun

A Survey on Neural Architecture Search

The growing interest in both the automation of machine learning and deep learning has inevitably led to the development of a wide variety of automated methods for neural architecture search. The choice of the network architecture has proven…

Machine Learning · Computer Science 2019-06-19 Martin Wistuba, Ambrish Rawat, Tejaswini Pedapati

GradNets: Dynamic Interpolation Between Neural Architectures

In machine learning, there is a fundamental trade-off between ease of optimization and expressive power. Neural Networks, in particular, have enormous expressive power and yet are notoriously challenging to train. The nature of that…

Machine Learning · Computer Science 2015-11-24 Diogo Almeida, Nate Sauder

A Review of Meta-Reinforcement Learning for Deep Neural Networks Architecture Search

Deep neural networks are efficient and flexible models that perform well on a variety of tasks such as image and speech recognition and natural language understanding. In particular, convolutional neural networks (CNN) generate a keen…

Machine Learning · Computer Science 2018-12-20 Yesmina Jaafra, Jean Luc Laurent, Aline Deruyver, Mohamed Saber Naceur