English
Related papers

Related papers: Knowledge-Driven Machine Learning: Concept, Model …

200 papers

Machine learning enables the extraction of useful information from large, diverse datasets. However, despite many successful applications, machine learning continues to suffer from performance and transparency issues. These challenges can…

Machine Learning · Computer Science 2025-07-08 V. C. Storey , J. Parsons , A. Castellanos , M. Tremblay , R. Lukyanenko , W. Maass , A. Castillo

Conventional machine learning (ML) relies heavily on manual design from machine learning experts to decide learning tasks, data, models, optimization algorithms, and evaluation metrics, which is labor-intensive, time-consuming, and cannot…

Machine Learning · Computer Science 2022-01-11 Wenwu Zhu , Xin Wang , Pengtao Xie

Deep Neural Networks (DNNs) have achieved notable performance in the fields of computer vision and natural language processing with various applications in both academia and industry. However, with recent advancements in DNNs and…

Machine Learning (ML) models are very effective in many learning tasks, due to the capability to extract meaningful information from large data sets. Nevertheless, there are learning problems that cannot be easily solved relying on pure…

Machine Learning · Computer Science 2021-01-29 Andrea Borghesi , Federico Baldo , Michele Lombardi , Michela Milano

Neural machine translation (NMT) offers a novel alternative formulation of translation that is potentially simpler than statistical approaches. However to reach competitive performance, NMT models need to be exceedingly large. In this paper…

Computation and Language · Computer Science 2016-09-23 Yoon Kim , Alexander M. Rush

In recent years, deep neural networks have been successful in both industry and academia, especially for computer vision tasks. The great success of deep learning is mainly due to its scalability to encode large-scale data and to maneuver…

Machine Learning · Computer Science 2021-05-21 Jianping Gou , Baosheng Yu , Stephen John Maybank , Dacheng Tao

Pre-trained language models (PLMs) have emerged as powerful tools for code understanding. However, deploying these PLMs in large-scale applications faces practical challenges due to their computational intensity and inference latency.…

Software Engineering · Computer Science 2025-08-22 Ruiqi Wang , Zezhou Yang , Cuiyun Gao , Xin Xia , Qing Liao

Collaborative learning has emerged as a key paradigm in large-scale intelligent systems, enabling distributed agents to cooperatively train their models while addressing their privacy concerns. Central to this paradigm is knowledge…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-12-24 Pengchao Han , Xi Huang , Yi Fang , Guojun Han

This paper presents an overview of scientific modeling and discusses the complementary strengths and weaknesses of ML methods for scientific modeling in comparison to process-based models. It also provides an introduction to the current…

Machine Learning · Computer Science 2024-05-03 Anuj Karpatne , Xiaowei Jia , Vipin Kumar

Informed Machine Learning has emerged as a viable generalization of Machine Learning (ML) by building a unified conceptual and algorithmic setting for constructing models on a unified basis of knowledge and data. Physics-informed ML…

Machine Learning · Computer Science 2026-04-02 Chuyi Dai , Witold Pedrycz , Suping Xu , Ding Liu , Xianmin Wang

Limited by the scale and diversity of time series data, the neural networks trained on time series data often overfit and show unsatisfacotry performances. In comparison, large language models (LLMs) recently exhibit impressive…

Machine Learning · Computer Science 2024-06-14 Ruibing Jin , Qing Xu , Min Wu , Yuecong Xu , Dan Li , Xiaoli Li , Zhenghua Chen

Continual learning refers to a dynamical framework in which a model receives a stream of non-stationary data over time and must adapt to new data while preserving previously acquired knowledge. Unluckily, neural networks fail to meet these…

Audio and Speech Processing · Electrical Eng. & Systems 2023-05-24 Umberto Cappellazzo , Daniele Falavigna , Alessio Brutti

Knowledge Distillation (KD) as a research area has gained a lot of traction in recent years as a compression tool to address challenges related to ever-larger models in NLP. Remarkably, Machine Translation (MT) offers a much more nuanced…

Computation and Language · Computer Science 2026-02-19 Ona de Gibert , Joseph Attieh , Timothee Mickus , Yves Scherrer , Jörg Tiedemann

Knowledge distillation is a popular machine learning technique that aims to transfer knowledge from a large 'teacher' network to a smaller 'student' network and improve the student's performance by training it to emulate the teacher. In…

Machine Learning · Computer Science 2022-10-19 Sushil Thapa

Knowledge distillation (KD) is a promising technique for model compression in neural machine translation. However, where the knowledge hides in KD is still not clear, which may hinder the development of KD. In this work, we first unravel…

Computation and Language · Computer Science 2024-07-18 Songming Zhang , Yunlong Liang , Shuaibo Wang , Wenjuan Han , Jian Liu , Jinan Xu , Yufeng Chen

The demand for a huge amount of data for machine learning (ML) applications is currently a bottleneck in an empirically dominated field. We propose a method to combine prior knowledge with data-driven methods to significantly reduce their…

Machine Learning · Computer Science 2023-03-06 Xia Chen , Manav Mahan Singh , Philipp Geyer

The conventional model aggregation-based federated learning (FL) approach requires all local models to have the same architecture, which fails to support practical scenarios with heterogeneous local models. Moreover, frequent model exchange…

Signal Processing · Electrical Eng. & Systems 2023-03-23 Zhixiong Chen , Wenqiang Yi , Yuanwei Liu , Arumugam Nallanathan

Deep neural models in recent years have been successful in almost every field, including extremely complex problem statements. However, these models are huge in size, with millions (and even billions) of parameters, thus demanding more…

Computer Vision and Pattern Recognition · Computer Science 2021-06-18 Lin Wang , Kuk-Jin Yoon

Multi-instance learning (MIL) is a widely-applied technique in practical applications that involve complex data structures. MIL can be broadly categorized into two types: traditional methods and those based on deep learning. These…

Artificial Intelligence · Computer Science 2023-04-25 Yu-Xuan Zhang , Zhengchun Zhou , Xingxing He , Avik Ranjan Adhikary , Bapi Dutta

Knowledge Distillation (KD) transfers the knowledge from a high-capacity teacher network to strengthen a smaller student. Existing methods focus on excavating the knowledge hints and transferring the whole knowledge to the student. However,…

Computer Vision and Pattern Recognition · Computer Science 2022-07-13 Chenxin Li , Mingbao Lin , Zhiyuan Ding , Nie Lin , Yihong Zhuang , Yue Huang , Xinghao Ding , Liujuan Cao
‹ Prev 1 2 3 10 Next ›