Related papers: Variable Assignment Invariant Neural Networks for …

Symbolic AI for XAI: Evaluating LFIT Inductive Programming for Fair and Explainable Automatic Recruitment

Machine learning methods are growing in relevance for biometrics and personal information processing in domains such as forensics, e-health, recruitment, and e-learning. In these domains, white-box (human-readable) explanations of systems…

Artificial Intelligence · Computer Science 2020-12-02 Alfonso Ortega , Julian Fierrez , Aythami Morales , Zilong Wang , Tony Ribeiro

Variational Neural Machine Translation with Normalizing Flows

Variational Neural Machine Translation (VNMT) is an attractive framework for modeling the generation of target translations, conditioned not only on the source sentence but also on some latent random variables. The latent variable modeling…

Computation and Language · Computer Science 2020-05-29 Hendra Setiawan , Matthias Sperber , Udhay Nallasamy , Matthias Paulik

Variational Transfer Learning using Cross-Domain Latent Modulation

To successfully apply trained neural network models to new domains, powerful transfer learning solutions are essential. We propose to introduce a novel cross-domain latent modulation mechanism to a variational autoencoder framework so as to…

Machine Learning · Computer Science 2024-02-01 Jinyong Hou , Jeremiah D. Deng , Stephen Cranefield , Xuejie Din

Deep Natural Language Feature Learning for Interpretable Prediction

We propose a general method to break down a main complex task into a set of intermediary easier sub-tasks, which are formulated in natural language as binary questions related to the final target task. Our method allows for representing…

Computation and Language · Computer Science 2024-02-02 Felipe Urrutia , Cristian Buc , Valentin Barriere

Interpretable Model-based Hierarchical Reinforcement Learning using Inductive Logic Programming

Recently deep reinforcement learning has achieved tremendous success in wide ranges of applications. However, it notoriously lacks data-efficiency and interpretability. Data-efficiency is important as interacting with the environment is…

Machine Learning · Computer Science 2021-06-23 Duo Xu , Faramarz Fekri

Universal Language Model Fine-tuning for Text Classification

Inductive transfer learning has greatly impacted computer vision, but existing approaches in NLP still require task-specific modifications and training from scratch. We propose Universal Language Model Fine-tuning (ULMFiT), an effective…

Computation and Language · Computer Science 2018-05-24 Jeremy Howard , Sebastian Ruder

The (In)Effectiveness of Intermediate Task Training For Domain Adaptation and Cross-Lingual Transfer Learning

Transfer learning from large language models (LLMs) has emerged as a powerful technique to enable knowledge-based fine-tuning for a number of tasks, adaptation of models for different domains and even languages. However, it remains an open…

Computation and Language · Computer Science 2022-11-08 Sovesh Mohapatra , Somesh Mohapatra

Learning Robust Representations via Bidirectional Transition for Visual Reinforcement Learning

Visual reinforcement learning has proven effective in solving control tasks with high-dimensional observations. However, extracting reliable and generalizable representations from vision-based observations remains a central challenge.…

Computer Vision and Pattern Recognition · Computer Science 2025-09-11 Xiaobo Hu , Youfang Lin , Yue Liu , Jinwen Wang , Shuo Wang , Hehe Fan , Kai Lv

Neural Variational Inference For Estimating Uncertainty in Knowledge Graph Embeddings

Recent advances in Neural Variational Inference allowed for a renaissance in latent variable models in a variety of domains involving high-dimensional data. While traditional variational methods derive an analytical approximation for the…

Machine Learning · Computer Science 2019-08-20 Alexander I. Cowen-Rivers , Pasquale Minervini , Tim Rocktaschel , Matko Bosnjak , Sebastian Riedel , Jun Wang

Learning to Transform Dynamically for Better Adversarial Transferability

Adversarial examples, crafted by adding perturbations imperceptible to humans, can deceive neural networks. Recent studies identify the adversarial transferability across various models, \textit{i.e.}, the cross-model attack ability of…

Computer Vision and Pattern Recognition · Computer Science 2024-07-25 Rongyi Zhu , Zeliang Zhang , Susan Liang , Zhuo Liu , Chenliang Xu

PDiT: Interleaving Perception and Decision-making Transformers for Deep Reinforcement Learning

Designing better deep networks and better reinforcement learning (RL) algorithms are both important for deep RL. This work studies the former. Specifically, the Perception and Decision-making Interleaving Transformer (PDiT) network is…

Machine Learning · Computer Science 2023-12-27 Hangyu Mao , Rui Zhao , Ziyue Li , Zhiwei Xu , Hao Chen , Yiqun Chen , Bin Zhang , Zhen Xiao , Junge Zhang , Jiangjin Yin

Neuro-Symbolic Inductive Logic Programming with Logical Neural Networks

Recent work on neuro-symbolic inductive logic programming has led to promising approaches that can learn explanatory rules from noisy, real-world data. While some proposals approximate logical operators with differentiable operators from…

Artificial Intelligence · Computer Science 2021-12-08 Prithviraj Sen , Breno W. S. R. de Carvalho , Ryan Riegel , Alexander Gray

On the equivalence between functionally affine LPV state-space representations and LFT models

We propose a transformation algorithm for a class of Linear Parameter-Varying (LPV) systems with functional affine dependence on parameters, where the system matrices depend affinely on nonlinear functions of the scheduling varable, into…

Optimization and Control · Mathematics 2025-06-27 Mihály Petreczky , Ziad Alkhoury , Guillaume Mercère

Beyond Fine-Tuning: Transferring Behavior in Reinforcement Learning

Designing agents that acquire knowledge autonomously and use it to solve new tasks efficiently is an important challenge in reinforcement learning. Knowledge acquired during an unsupervised pre-training phase is often transferred by…

Machine Learning · Computer Science 2021-06-09 Víctor Campos , Pablo Sprechmann , Steven Hansen , Andre Barreto , Steven Kapturowski , Alex Vitvitskyi , Adrià Puigdomènech Badia , Charles Blundell

Learning Invariants through Soft Unification

Human reasoning involves recognising common underlying principles across many examples. The by-products of such reasoning are invariants that capture patterns such as "if someone went somewhere then they are there", expressed using…

Machine Learning · Computer Science 2020-10-27 Nuri Cingillioglu , Alessandra Russo

Input Switched Affine Networks: An RNN Architecture Designed for Interpretability

There exist many problem domains where the interpretability of neural network models is essential for deployment. Here we introduce a recurrent architecture composed of input-switched affine transformations - in other words an RNN without…

Artificial Intelligence · Computer Science 2017-06-14 Jakob N. Foerster , Justin Gilmer , Jan Chorowski , Jascha Sohl-Dickstein , David Sussillo

Learning Optimal Features via Partial Invariance

Learning models that are robust to distribution shifts is a key concern in the context of their real-life applicability. Invariant Risk Minimization (IRM) is a popular framework that aims to learn robust models from multiple environments.…

Machine Learning · Computer Science 2023-04-04 Moulik Choraria , Ibtihal Ferwana , Ankur Mani , Lav R. Varshney

TLR: Transfer Latent Representation for Unsupervised Domain Adaptation

Domain adaptation refers to the process of learning prediction models in a target domain by making use of data from a source domain. Many classic methods solve the domain adaptation problem by establishing a common latent space, which may…

Machine Learning · Computer Science 2018-08-21 Pan Xiao , Bo Du , Jia Wu , Lefei Zhang , Ruimin Hu , Xuelong Li

DART-ing Through the Drift: Dynamic Tracing of Knowledge Neurons for Adaptive Inference-Time Pruning

Large Language Models (LLMs) exhibit substantial parameter redundancy, particularly in Feed-Forward Networks (FFNs). Existing pruning methods suffer from two primary limitations. First, reliance on dataset-specific calibration introduces…

Computation and Language · Computer Science 2026-02-02 Abhishek Tyagi , Yunuo Cen , Shrey Dhorajiya , Bharadwaj Veeravalli , Xuanyao Fong

Weight Learning in a Probabilistic Extension of Answer Set Programs

LPMLN is a probabilistic extension of answer set programs with the weight scheme derived from that of Markov Logic. Previous work has shown how inference in LPMLN can be achieved. In this paper, we present the concept of weight learning in…

Artificial Intelligence · Computer Science 2018-10-10 Joohyung Lee , Yi Wang