Related papers: Automated Deep Learning Optimization via DSL-Based…

Deep API Programmer: Learning to Program with APIs

We present DAPIP, a Programming-By-Example system that learns to program with APIs to perform data transformation tasks. We design a domain-specific language (DSL) that allows for arbitrary concatenations of API outputs and constant…

Artificial Intelligence · Computer Science 2017-04-17 Surya Bhupatiraju , Rishabh Singh , Abdel-rahman Mohamed , Pushmeet Kohli

AutoDrop: Training Deep Learning Models with Automatic Learning Rate Drop

Modern deep learning (DL) architectures are trained using variants of the SGD algorithm that is run with a $\textit{manually}$ defined learning rate schedule, i.e., the learning rate is dropped at the pre-defined epochs, typically when the…

Machine Learning · Computer Science 2021-12-14 Yunfei Teng , Jing Wang , Anna Choromanska

Woodpecker-DL: Accelerating Deep Neural Networks via Hardware-Aware Multifaceted Optimizations

Accelerating deep model training and inference is crucial in practice. Existing deep learning frameworks usually concentrate on optimizing training speed and pay fewer attentions to inference-specific optimizations. Actually, model…

Distributed, Parallel, and Cluster Computing · Computer Science 2020-08-12 Yongchao Liu , Yue Jin , Yong Chen , Teng Teng , Hang Ou , Rui Zhao , Yao Zhang

ADEPT: Continual Pretraining via Adaptive Expansion and Dynamic Decoupled Tuning

Conventional continual pretraining (CPT) for large language model (LLM) domain adaptation often suffers from catastrophic forgetting and limited domain capacity. Existing strategies adopt layer expansion, introducing additional trainable…

Machine Learning · Computer Science 2025-10-14 Jinyang Zhang , Yue Fang , Hongxin Ding , Weibin Liao , Muyang Ye , Xu Chu , Junfeng Zhao , Yasha Wang

DPAdapter: Improving Differentially Private Deep Learning through Noise Tolerance Pre-training

Recent developments have underscored the critical role of \textit{differential privacy} (DP) in safeguarding individual data for training machine learning models. However, integrating DP oftentimes incurs significant model performance…

Machine Learning · Computer Science 2024-03-06 Zihao Wang , Rui Zhu , Dongruo Zhou , Zhikun Zhang , John Mitchell , Haixu Tang , XiaoFeng Wang

Learning Efficient Detector with Semi-supervised Adaptive Distillation

Knowledge Distillation (KD) has been used in image classification for model compression. However, rare studies apply this technology on single-stage object detectors. Focal loss shows that the accumulated errors of easily-classified samples…

Computer Vision and Pattern Recognition · Computer Science 2019-01-15 Shitao Tang , Litong Feng , Wenqi Shao , Zhanghui Kuang , Wei Zhang , Yimin Chen

Fine-Tuning Pre-Trained Language Models Effectively by Optimizing Subnetworks Adaptively

Large-scale pre-trained language models have achieved impressive results on a wide range of downstream tasks recently. However, fine-tuning an extremely large-scale pre-trained language model on limited target datasets is often plagued by…

Computation and Language · Computer Science 2022-11-04 Haojie Zhang , Ge Li , Jia Li , Zhongjin Zhang , Yuqi Zhu , Zhi Jin

Generalized Operating Procedure for Deep Learning: an Unconstrained Optimal Design Perspective

Deep learning (DL) has brought about remarkable breakthrough in processing images, video and speech due to its efficacy in extracting highly abstract representation and learning very complex functions. However, there is seldom operating…

Machine Learning · Computer Science 2021-01-01 Shen Chen , Mingwei Zhang , Jiamin Cui , Wei Yao

Supporting Meta-model-based Language Evolution and Rapid Prototyping with Automated Grammar Optimization

In model-driven engineering, developing a textual domain-specific language (DSL) involves constructing a meta-model, which defines an underlying abstract syntax, and a grammar, which defines the concrete syntax for the DSL. Language…

Software Engineering · Computer Science 2024-02-01 Weixing Zhang , Jörg Holtmann , Daniel Strüber , Regina Hebig , Jan-Philipp Steghöfer

ADELT: Transpilation Between Deep Learning Frameworks

We propose the Adversarial DEep Learning Transpiler (ADELT), a novel approach to source-to-source transpilation between deep learning frameworks. ADELT uniquely decouples code skeleton transpilation and API keyword mapping. For code…

Computation and Language · Computer Science 2024-05-09 Linyuan Gong , Jiayi Wang , Alvin Cheung

Automatic Speech Recognition using Advanced Deep Learning Approaches: A survey

Recent advancements in deep learning (DL) have posed a significant challenge for automatic speech recognition (ASR). ASR relies on extensive training datasets, including confidential ones, and demands substantial computational and storage…

Sound · Computer Science 2024-04-19 Hamza Kheddar , Mustapha Hemis , Yassine Himeur

DLOPT: Deep Learning Optimization Library

Deep learning hyper-parameter optimization is a tough task. Finding an appropriate network configuration is a key to success, however most of the times this labor is roughly done. In this work we introduce a novel library to tackle this…

Machine Learning · Computer Science 2018-07-11 Andrés Camero , Jamal Toutouh , Enrique Alba

Dual-layer Image Compression via Adaptive Downsampling and Spatially Varying Upconversion

Ultra high resolution (UHR) images are almost always downsampled to fit small displays of mobile end devices and upsampled to its original resolution when exhibited on very high-resolution displays. This observation motivates us on jointly…

Image and Video Processing · Electrical Eng. & Systems 2023-02-14 Xi Zhang , Xiaolin Wu

OSP: Boosting Distributed Model Training with 2-stage Synchronization

Distributed deep learning (DDL) is a promising research area, which aims to increase the efficiency of training deep learning tasks with large size of datasets and models. As the computation capability of DDL nodes continues to increase,…

Distributed, Parallel, and Cluster Computing · Computer Science 2023-07-11 Zixuan Chen , Lei Shi , Xuandong Liu , Jiahui Li , Sen Liu , Yang Xu

The ANTAREX Domain Specific Language for High Performance Computing

The ANTAREX project relies on a Domain Specific Language (DSL) based on Aspect Oriented Programming (AOP) concepts to allow applications to enforce extra functional properties such as energy-efficiency and performance and to optimize…

Distributed, Parallel, and Cluster Computing · Computer Science 2019-01-21 Cristina Silvano , Giovanni Agosta , Andrea Bartolini , Andrea R. Beccari , Luca Benini , Loïc Besnard , João Bispo , Radim Cmar , João M. P. Cardoso , Carlo Cavazzoni , Daniele Cesarini , Stefano Cherubin , Federico Ficarelli , Davide Gadioli , Martin Golasowski , Antonio Libri , Jan Martinovič , Gianluca Palermo , Pedro Pinto , Erven Rohou , Kateřina Slaninová , Emanuele Vitali

Jointly Learning Structured Analysis Discriminative Dictionary and Analysis Multiclass Classifier

In this paper, we propose an analysis mechanism based structured Analysis Discriminative Dictionary Learning (ADDL) framework. ADDL seamlessly integrates the analysis discriminative dictionary learning, analysis representation and analysis…

Computer Vision and Pattern Recognition · Computer Science 2019-05-29 Zhao Zhang , Weiming Jiang , Jie Qin , Li Zhang , Fanzhang Li , Min Zhang , Shuicheng Yan

Alignment Adapter to Improve the Performance of Compressed Deep Learning Models

Compressed Deep Learning (DL) models are essential for deployment in resource-constrained environments. But their performance often lags behind their large-scale counterparts. To bridge this gap, we propose Alignment Adapter (AlAd): a…

Machine Learning · Computer Science 2026-02-17 Rohit Raj Rai , Abhishek Dhaka , Amit Awekar

AdaEAGLE: Optimizing Speculative Decoding via Explicit Modeling of Adaptive Draft Structures

Speculative Decoding (SD) is a popular lossless technique for accelerating the inference of Large Language Models (LLMs). We show that the decoding speed of SD frameworks with static draft structures can be significantly improved by…

Artificial Intelligence · Computer Science 2024-12-30 Situo Zhang , Hankun Wang , Da Ma , Zichen Zhu , Lu Chen , Kunyao Lan , Kai Yu

DILBERT: Customized Pre-Training for Domain Adaptation withCategory Shift, with an Application to Aspect Extraction

The rise of pre-trained language models has yielded substantial progress in the vast majority of Natural Language Processing (NLP) tasks. However, a generic approach towards the pre-training procedure can naturally be sub-optimal in some…

Computation and Language · Computer Science 2021-09-03 Entony Lekhtman , Yftah Ziser , Roi Reichart

On the Use of Deep Learning in Software Defect Prediction

Context: Automated software defect prediction (SDP) methods are increasingly applied, often with the use of machine learning (ML) techniques. Yet, the existing ML-based approaches require manually extracted features, which are cumbersome,…

Software Engineering · Computer Science 2022-10-06 Görkem Giray , Kwabena Ebo Bennin , Ömer Köksal , Önder Babur , Bedir Tekinerdogan