Related papers: Knowledge-Based Version Incompatibility Detection …

Decide: Knowledge-Based Version Incompatibility Detection in Deep Learning Stacks

Version incompatibility issues are prevalent when reusing or reproducing deep learning (DL) models and applications. Compared with official API documentation, which is often incomplete or out-of-date, Stack Overflow (SO) discussions possess…

Software Engineering · Computer Science 2024-08-06 Zihan Zhou , Zhongkai Zhao , Bonan Kou , Tianyi Zhang

DepOwl: Detecting Dependency Bugs to Prevent Compatibility Failures

Applications depend on libraries to avoid reinventing the wheel. Libraries may have incompatible changes during evolving. As a result, applications will suffer from compatibility failures. There has been much research on addressing…

Software Engineering · Computer Science 2021-02-18 Zhouyang Jia , Shanshan Li , Tingting Yu , Chen Zeng , Erci Xu , Xiaodong Liu , Ji Wang , Xiangke Liao

Small Changes, Big Trouble: Demystifying and Parsing License Variants for Incompatibility Detection in the PyPI Ecosystem

Open-source licenses establish the legal foundation for software reuse, yet license variants, including both modified standard licenses and custom-created alternatives, introduce significant compliance complexities. Despite their prevalence…

Software Engineering · Computer Science 2025-11-03 Weiwei Xu , Hengzhi Ye , Kai Gao , Minghui Zhou

Deep Learning Algorithms for Rotating Machinery Intelligent Diagnosis: An Open Source Benchmark Study

With the development of deep learning (DL) techniques, rotating machinery intelligent diagnosis has gone through tremendous progress with verified success and the classification accuracies of many DL-based intelligent diagnosis algorithms…

Signal Processing · Electrical Eng. & Systems 2020-08-20 Zhibin Zhao , Tianfu Li , Jingyao Wu , Chuang Sun , Shibin Wang , Ruqiang Yan , Xuefeng Chen

Knowledge Distillation Detection for Open-weights Models

We propose the task of knowledge distillation detection, which aims to determine whether a student model has been distilled from a given teacher, under a practical setting where only the student's weights and the teacher's API are…

Machine Learning · Computer Science 2025-10-03 Qin Shi , Amber Yijia Zheng , Qifan Song , Raymond A. Yeh

Knowledge-Augmented Language Model Verification

Recent Language Models (LMs) have shown impressive capabilities in generating texts with the knowledge internalized in parameters. Yet, LMs often generate the factually incorrect responses to the given queries, since their knowledge may be…

Computation and Language · Computer Science 2023-10-20 Jinheon Baek , Soyeong Jeong , Minki Kang , Jong C. Park , Sung Ju Hwang

Vul-RAG: Enhancing LLM-based Vulnerability Detection via Knowledge-level RAG

Although LLMs have shown promising potential in vulnerability detection, this study reveals their limitations in distinguishing between vulnerable and similar-but-benign patched code (only 0.06 - 0.14 accuracy). It shows that LLMs struggle…

Software Engineering · Computer Science 2025-06-18 Xueying Du , Geng Zheng , Kaixin Wang , Yi Zou , Yujia Wang , Wentai Deng , Jiayi Feng , Mingwei Liu , Bihuan Chen , Xin Peng , Tao Ma , Yiling Lou

KnowTuning: Knowledge-aware Fine-tuning for Large Language Models

Despite their success at many natural language processing (NLP) tasks, large language models still struggle to effectively leverage knowledge for knowledge-intensive tasks, manifesting limitations such as generating incomplete, non-factual,…

Computation and Language · Computer Science 2024-10-03 Yougang Lyu , Lingyong Yan , Shuaiqiang Wang , Haibo Shi , Dawei Yin , Pengjie Ren , Zhumin Chen , Maarten de Rijke , Zhaochun Ren

Correct Code, Vulnerable Dependencies: A Large Scale Measurement Study of LLM-Specified Library Versions

Large language models (LLMs) are now largely involved in software development workflows, and the code they generate routinely includes third-party library (TPL) imports annotated with specific version identifiers. These version choices can…

Software Engineering · Computer Science 2026-05-08 Chengjie Wang , Jingzheng Wu , Xiang Ling , Tianyue Luo , Chen Zhao

Deep Learning based Vulnerability Detection: Are We There Yet?

Automated detection of software vulnerabilities is a fundamental problem in software security. Existing program analysis techniques either suffer from high false positives or false negatives. Recent progress in Deep Learning (DL) has…

Software Engineering · Computer Science 2020-09-16 Saikat Chakraborty , Rahul Krishna , Yangruibo Ding , Baishakhi Ray

VersionRAG: Version-Aware Retrieval-Augmented Generation for Evolving Documents

Retrieval-Augmented Generation (RAG) systems fail when documents evolve through versioning-a ubiquitous characteristic of technical documentation. Existing approaches achieve only 58-64% accuracy on version-sensitive questions, retrieving…

Information Retrieval · Computer Science 2025-10-10 Daniel Huwiler , Kurt Stockinger , Jonathan Fürst

PCREQ: Automated Inference of Compatible Requirements for Python Third-party Library Upgrades

Python third-party libraries (TPLs) are essential in modern software development, but upgrades often cause compatibility issues, leading to system failures. These issues fall into two categories: version compatibility issues (VCIs) and code…

Software Engineering · Computer Science 2026-03-31 Huashan Lei , Guanping Xiao , Yepang Liu , Zheng Zheng

KaFT: Knowledge-aware Fine-tuning for Boosting LLMs' Domain-specific Question-Answering Performance

Supervised fine-tuning (SFT) is a common approach to improve the domain-specific question-answering (QA) performance of large language models (LLMs). However, recent literature reveals that due to the conflicts between LLMs' internal…

Computation and Language · Computer Science 2025-05-29 Qihuang Zhong , Liang Ding , Xiantao Cai , Juhua Liu , Bo Du , Dacheng Tao

Deep Learning-Based Identification of Inconsistent Method Names: How Far Are We?

Concise and meaningful method names are crucial for program comprehension and maintenance. However, method names may become inconsistent with their corresponding implementations, causing confusion and errors. Several deep learning…

Software Engineering · Computer Science 2025-01-23 Taiming Wang , Yuxia Zhang , Lin Jiang , Yi Tang , Guangjie Li , Hui Liu

Don't Hallucinate, Abstain: Identifying LLM Knowledge Gaps via Multi-LLM Collaboration

Despite efforts to expand the knowledge of large language models (LLMs), knowledge gaps -- missing or outdated information in LLMs -- might always persist given the evolving nature of knowledge. In this work, we study approaches to identify…

Computation and Language · Computer Science 2024-07-02 Shangbin Feng , Weijia Shi , Yike Wang , Wenxuan Ding , Vidhisha Balachandran , Yulia Tsvetkov

An Empirical Study of the Imbalance Issue in Software Vulnerability Detection

Vulnerability detection is crucial to protect software security. Nowadays, deep learning (DL) is the most promising technique to automate this detection task, leveraging its superior ability to extract patterns and representations within…

Software Engineering · Computer Science 2026-02-13 Yuejun Guo , Qiang Hu , Qiang Tang , Yves Le Traon

Automatic Fault Detection for Deep Learning Programs Using Graph Transformations

Nowadays, we are witnessing an increasing demand in both corporates and academia for exploiting Deep Learning (DL) to solve complex real-world problems. A DL program encodes the network structure of a desirable DL model and the process by…

Software Engineering · Computer Science 2021-07-08 Amin Nikanjam , Houssem Ben Braiek , Mohammad Mehdi Morovati , Foutse Khomh

Error Detection in a Large-Scale Lexical Taxonomy

Knowledge base (KB) is an important aspect in artificial intelligence. One significant challenge faced by KB construction is that it contains many noises, which prevents its effective usage. Even though some KB cleansing algorithms have…

Artificial Intelligence · Computer Science 2018-08-07 Sifan Liu , Hongzhi Wang

Uncertainty-based Visual Question Answering: Estimating Semantic Inconsistency between Image and Knowledge Base

Knowledge-based visual question answering (KVQA) task aims to answer questions that require additional external knowledge as well as an understanding of images and questions. Recent studies on KVQA inject an external knowledge in a…

Computer Vision and Pattern Recognition · Computer Science 2022-07-28 Jinyeong Chae , Jihie Kim

Knowledge-based Consistency Testing of Large Language Models

In this work, we systematically expose and measure the inconsistency and knowledge gaps of Large Language Models (LLMs). Specifically, we propose an automated testing framework (called KonTest) which leverages a knowledge graph to construct…

Computation and Language · Computer Science 2025-08-15 Sai Sathiesh Rajan , Ezekiel Soremekun , Sudipta Chattopadhyay