Related papers: ADELT: Transpilation Between Deep Learning Framewo…

Deep Transfer Learning for Multi-source Entity Linkage via Domain Adaptation

Multi-source entity linkage focuses on integrating knowledge from multiple sources by linking the records that represent the same real world entity. This is critical in high-impact applications such as data cleaning and user stitching. The…

Machine Learning · Computer Science 2021-10-28 Di Jin , Bunyamin Sisman , Hao Wei , Xin Luna Dong , Danai Koutra

Adversarial Knowledge Transfer from Unlabeled Data

While machine learning approaches to visual recognition offer great promise, most of the existing methods rely heavily on the availability of large quantities of labeled training data. However, in the vast majority of real-world settings,…

Computer Vision and Pattern Recognition · Computer Science 2020-08-14 Akash Gupta , Rameswar Panda , Sujoy Paul , Jianming Zhang , Amit K. Roy-Chowdhury

Automated Deep Learning Optimization via DSL-Based Source Code Transformation

As deep learning models become increasingly bigger and more complex, it is critical to improve model training and inference efficiency. Though a variety of highly optimized libraries and packages (known as DL kernels) have been developed,…

Software Engineering · Computer Science 2024-08-22 Ruixin Wang , Minghai Lu , Cody Hao Yu , Yi-Hsiang Lai , Tianyi Zhang

Adversarial Robustness through Dynamic Ensemble Learning

Adversarial attacks pose a significant threat to the reliability of pre-trained language models (PLMs) such as GPT, BERT, RoBERTa, and T5. This paper presents Adversarial Robustness through Dynamic Ensemble Learning (ARDEL), a novel scheme…

Cryptography and Security · Computer Science 2025-05-14 Hetvi Waghela , Jaydip Sen , Sneha Rakshit

Adaptive Meta-Domain Transfer Learning (AMDTL): A Novel Approach for Knowledge Transfer in AI

This paper presents Adaptive Meta-Domain Transfer Learning (AMDTL), a novel methodology that combines principles of meta-learning with domain-specific adaptations to enhance the transferability of artificial intelligence models across…

Machine Learning · Computer Science 2024-09-12 Michele Laurelli

Alignment Adapter to Improve the Performance of Compressed Deep Learning Models

Compressed Deep Learning (DL) models are essential for deployment in resource-constrained environments. But their performance often lags behind their large-scale counterparts. To bridge this gap, we propose Alignment Adapter (AlAd): a…

Machine Learning · Computer Science 2026-02-17 Rohit Raj Rai , Abhishek Dhaka , Amit Awekar

Factor Augmented Supervised Learning with Text Embeddings

Large language models (LLMs) generate text embeddings from text data, producing vector representations that capture the semantic meaning and contextual relationships of words. However, the high dimensionality of these embeddings often…

Computation and Language · Computer Science 2025-08-12 Zhanye Luo , Yuefeng Han , Xiufan Yu

TEDL: A Text Encryption Method Based on Deep Learning

Recent years have seen an increasing emphasis on information security, and various encryption methods have been proposed. However, for symmetric encryption methods, the well-known encryption techniques still rely on the key space to…

Cryptography and Security · Computer Science 2020-03-12 Xiang Li , Peng Wang

ADePT: Adaptive Decomposed Prompt Tuning for Parameter-Efficient Fine-tuning

Prompt Tuning (PT) enables the adaptation of Pre-trained Large Language Models (PLMs) to downstream tasks by optimizing a small amount of soft virtual tokens, which are prepended to the input token embeddings. Recently, Decomposed Prompt…

Computation and Language · Computer Science 2025-12-23 Pengwei Tang , Xiaolin Hu , Yong Liu

AKD : Adversarial Knowledge Distillation For Large Language Models Alignment on Coding tasks

The widespread adoption of Large Language Models (LLMs) for code generation, exemplified by GitHub Copilot\footnote{A coding extension powered by a Code-LLM to assist in code completion tasks} surpassing a million users, highlights the…

Software Engineering · Computer Science 2025-05-13 Ilyas Oulkadda , Julien Perez

DeepMEL: A Multi-Agent Collaboration Framework for Multimodal Entity Linking

Multimodal Entity Linking (MEL) aims to associate textual and visual mentions with entities in a multimodal knowledge graph. Despite its importance, current methods face challenges such as incomplete contextual information, coarse…

Computation and Language · Computer Science 2025-08-25 Fang Wang , Tianwei Yan , Zonghao Yang , Minghao Hu , Jun Zhang , Zhunchen Luo , Xiaoying Bai

Multimodal Transfer Deep Learning with Applications in Audio-Visual Recognition

We propose a transfer deep learning (TDL) framework that can transfer the knowledge obtained from a single-modal neural network to a network with a different modality. Specifically, we show that we can leverage speech data to fine-tune the…

Neural and Evolutionary Computing · Computer Science 2016-02-19 Seungwhan Moon , Suyoun Kim , Haohan Wang

CodePivot: Bootstrapping Multilingual Transpilation in LLMs via Reinforcement Learning without Parallel Corpora

Transpilation, or code translation, aims to convert source code from one programming language (PL) to another. It is beneficial for many downstream applications, from modernizing large legacy codebases to augmenting data for low-resource…

Software Engineering · Computer Science 2026-04-21 Shangyu Li , Juyong Jiang , Meibo Ren , Sizhe Zhong , Huiri Tan , Yunhao Gou , Xu Han , Chun Yong Chong , Yun Peng , Jiasi Shen

DEPT: Decoupled Embeddings for Pre-training Language Models

Language Model pre-training uses broad data mixtures to enhance performance across domains and languages. However, training on such heterogeneous text corpora requires extensive and expensive efforts. Since these data sources vary…

Machine Learning · Computer Science 2025-04-08 Alex Iacob , Lorenzo Sani , Meghdad Kurmanji , William F. Shen , Xinchi Qiu , Dongqi Cai , Yan Gao , Nicholas D. Lane

Text embedding models can be great data engineers

Data engineering pipelines are essential - albeit costly - components of predictive analytics frameworks requiring significant engineering time and domain expertise for carrying out tasks such as data ingestion, preprocessing, feature…

Machine Learning · Computer Science 2025-05-22 Iman Kazemian , Paritosh Ramanan , Murat Yildirim

AccelAT: A Framework for Accelerating the Adversarial Training of Deep Neural Networks through Accuracy Gradient

Adversarial training is exploited to develop a robust Deep Neural Network (DNN) model against the malicious altered data. These attacks may have catastrophic effects on DNN models but are indistinguishable for a human being. For example, an…

Machine Learning · Computer Science 2022-10-14 Farzad Nikfam , Alberto Marchisio , Maurizio Martina , Muhammad Shafique

AdapLeR: Speeding up Inference by Adaptive Length Reduction

Pre-trained language models have shown stellar performance in various downstream tasks. But, this usually comes at the cost of high latency and computation, hindering their usage in resource-limited settings. In this work, we propose a…

Computation and Language · Computer Science 2022-03-18 Ali Modarressi , Hosein Mohebbi , Mohammad Taher Pilehvar

Towards Improving Adversarial Training of NLP Models

Adversarial training, a method for learning robust deep neural networks, constructs adversarial examples during training. However, recent methods for generating NLP adversarial examples involve combinatorial search and expensive sentence…

Computation and Language · Computer Science 2021-09-14 Jin Yong Yoo , Yanjun Qi

Adaptive Label Error Detection: A Bayesian Approach to Mislabeled Data Detection

Machine learning classification systems are susceptible to poor performance when trained with incorrect ground truth labels, even when data is well-curated by expert annotators. As machine learning becomes more widespread, it is…

Machine Learning · Computer Science 2026-01-16 Zan Chaudhry , Noam H. Rotenberg , Brian Caffo , Craig K. Jones , Haris I. Sair

Assimilation Matters: Model-level Backdoor Detection in Vision-Language Pretrained Models

Vision-language pretrained models (VLPs) such as CLIP have achieved remarkable success, but are also highly vulnerable to backdoor attacks. Given a model fine-tuned by an untrusted third party, determining whether the model has been…

Computer Vision and Pattern Recognition · Computer Science 2025-12-02 Zhongqi Wang , Jie Zhang , Shiguang Shan , Xilin Chen