Related papers: Automated Deep Learning Optimization via DSL-Based…

200 papers

We present DAPIP, a Programming-By-Example system that learns to program with APIs to perform data transformation tasks. We design a domain-specific language (DSL) that allows for arbitrary concatenations of API outputs and constant…

Artificial Intelligence · Computer Science 2017-04-17 Surya Bhupatiraju, Rishabh Singh, Abdel-rahman Mohamed, Pushmeet Kohli

Modern deep learning (DL) architectures are trained using variants of the SGD algorithm that is run with a manually defined learning rate schedule, i.e., the learning rate is dropped at the pre-defined epochs, typically when the…

Machine Learning · Computer Science 2021-12-14 Yunfei Teng, Jing Wang, Anna Choromanska

Accelerating deep model training and inference is crucial in practice. Existing deep learning frameworks usually concentrate on optimizing training speed and pay fewer attentions to inference-specific optimizations. Actually, model…

Distributed, Parallel, and Cluster Computing · Computer Science 2020-08-12 Yongchao Liu, Yue Jin, Yong Chen, Teng Teng, Hang Ou, Rui Zhao, Yao Zhang

Conventional continual pretraining (CPT) for large language model (LLM) domain adaptation often suffers from catastrophic forgetting and limited domain capacity. Existing strategies adopt layer expansion, introducing additional trainable…

Machine Learning · Computer Science 2025-10-14 Jinyang Zhang, Yue Fang, Hongxin Ding, Weibin Liao, Muyang Ye, Xu Chu, Junfeng Zhao, Yasha Wang

Recent developments have underscored the critical role of differential privacy (DP) in safeguarding individual data for training machine learning models. However, integrating DP oftentimes incurs significant model performance…

Machine Learning · Computer Science 2024-03-06 Zihao Wang, Rui Zhu, Dongruo Zhou, Zhikun Zhang, John Mitchell, Haixu Tang, XiaoFeng Wang

Knowledge Distillation (KD) has been used in image classification for model compression. However, rare studies apply this technology on single-stage object detectors. Focal loss shows that the accumulated errors of easily-classified samples…

Computer Vision and Pattern Recognition · Computer Science 2019-01-15 Shitao Tang, Litong Feng, Wenqi Shao, Zhanghui Kuang, Wei Zhang, Yimin Chen

Large-scale pre-trained language models have achieved impressive results on a wide range of downstream tasks recently. However, fine-tuning an extremely large-scale pre-trained language model on limited target datasets is often plagued by…

Computation and Language · Computer Science 2022-11-04 Haojie Zhang, Ge Li, Jia Li, Zhongjin Zhang, Yuqi Zhu, Zhi Jin

Deep learning (DL) has brought about remarkable breakthrough in processing images, video and speech due to its efficacy in extracting highly abstract representation and learning very complex functions. However, there is seldom operating…

Machine Learning · Computer Science 2021-01-01 Shen Chen, Mingwei Zhang, Jiamin Cui, Wei Yao

In model-driven engineering, developing a textual domain-specific language (DSL) involves constructing a meta-model, which defines an underlying abstract syntax, and a grammar, which defines the concrete syntax for the DSL. Language…

Software Engineering · Computer Science 2024-02-01 Weixing Zhang, Jörg Holtmann, Daniel Strüber, Regina Hebig, Jan-Philipp Steghöfer

We propose the Adversarial DEep Learning Transpiler (ADELT), a novel approach to source-to-source transpilation between deep learning frameworks. ADELT uniquely decouples code skeleton transpilation and API keyword mapping. For code…

Computation and Language · Computer Science 2024-05-09 Linyuan Gong, Jiayi Wang, Alvin Cheung

Recent advancements in deep learning (DL) have posed a significant challenge for automatic speech recognition (ASR). ASR relies on extensive training datasets, including confidential ones, and demands substantial computational and storage…

Sound · Computer Science 2024-04-19 Hamza Kheddar, Mustapha Hemis, Yassine Himeur

Deep learning hyper-parameter optimization is a tough task. Finding an appropriate network configuration is a key to success, however most of the times this labor is roughly done. In this work we introduce a novel library to tackle this…

Machine Learning · Computer Science 2018-07-11 Andrés Camero, Jamal Toutouh, Enrique Alba

Ultra high resolution (UHR) images are almost always downsampled to fit small displays of mobile end devices and upsampled to its original resolution when exhibited on very high-resolution displays. This observation motivates us on jointly…

Image and Video Processing · Electrical Eng. & Systems 2023-02-14 Xi Zhang, Xiaolin Wu

Distributed deep learning (DDL) is a promising research area, which aims to increase the efficiency of training deep learning tasks with large size of datasets and models. As the computation capability of DDL nodes continues to increase,…

Distributed, Parallel, and Cluster Computing · Computer Science 2023-07-11 Zixuan Chen, Lei Shi, Xuandong Liu, Jiahui Li, Sen Liu, Yang Xu

The ANTAREX project relies on a Domain Specific Language (DSL) based on Aspect Oriented Programming (AOP) concepts to allow applications to enforce extra functional properties such as energy-efficiency and performance and to optimize…

In this paper, we propose an analysis mechanism based structured Analysis Discriminative Dictionary Learning (ADDL) framework. ADDL seamlessly integrates the analysis discriminative dictionary learning, analysis representation and analysis…

Computer Vision and Pattern Recognition · Computer Science 2019-05-29 Zhao Zhang, Weiming Jiang, Jie Qin, Li Zhang, Fanzhang Li, Min Zhang, Shuicheng Yan

Compressed Deep Learning (DL) models are essential for deployment in resource-constrained environments. But their performance often lags behind their large-scale counterparts. To bridge this gap, we propose Alignment Adapter (AlAd): a…

Machine Learning · Computer Science 2026-02-17 Rohit Raj Rai, Abhishek Dhaka, Amit Awekar

Speculative Decoding (SD) is a popular lossless technique for accelerating the inference of Large Language Models (LLMs). We show that the decoding speed of SD frameworks with static draft structures can be significantly improved by…

Artificial Intelligence · Computer Science 2024-12-30 Situo Zhang, Hankun Wang, Da Ma, Zichen Zhu, Lu Chen, Kunyao Lan, Kai Yu

The rise of pre-trained language models has yielded substantial progress in the vast majority of Natural Language Processing (NLP) tasks. However, a generic approach towards the pre-training procedure can naturally be sub-optimal in some…

Computation and Language · Computer Science 2021-09-03 Entony Lekhtman, Yftah Ziser, Roi Reichart

Context: Automated software defect prediction (SDP) methods are increasingly applied, often with the use of machine learning (ML) techniques. Yet, the existing ML-based approaches require manually extracted features, which are cumbersome,…

Software Engineering · Computer Science 2022-10-06 Görkem Giray, Kwabena Ebo Bennin, Ömer Köksal, Önder Babur, Bedir Tekinerdogan