Related papers: DocTer: Documentation Guided Fuzzing for Testing D…

Detecting Multi-Parameter Constraint Inconsistencies in Python Data Science Libraries

Modern AI- and Data-intensive software systems rely heavily on data science and machine learning libraries that provide essential algorithmic implementations and computational frameworks. These libraries expose complex APIs whose correct…

Software Engineering · Computer Science 2024-11-20 Xiufeng Xu, Fuman Xie, Chenguang Zhu, Guangdong Bai, Sarfraz Khurshid, Yi Li

ACETest: Automated Constraint Extraction for Testing Deep Learning Operators

Deep learning (DL) applications are prevalent nowadays as they can help with multiple tasks. DL libraries are essential for building DL applications. Furthermore, DL operators are the important building blocks of the DL libraries, that…

Software Engineering · Computer Science 2023-06-06 Jingyi Shi, Yang Xiao, Yuekang Li, Yeting Li, Dongsong Yu, Chendong Yu, Hui Su, Yufeng Chen, Wei Huo

Improving Deep Learning Library Testing with Machine Learning

Deep Learning (DL) libraries like TensorFlow and PyTorch simplify machine learning (ML) model development but are prone to bugs due to their complex design. Bug-finding techniques exist, but without precise API specifications, they produce…

Software Engineering · Computer Science 2026-02-04 Facundo Molina, M. M. Abid Naziri, Feiran Qin, Alessandra Gorla, Marcelo d'Amorim

Automatically Generating Dockerfiles via Deep Learning: Challenges and Promises

Containerization allows developers to define the execution environment in which their software needs to be installed. Docker is the leading platform in this field, and developers that use it are required to write a Dockerfile for their…

Software Engineering · Computer Science 2023-03-29 Giovanni Rosa, Antonio Mastropaolo, Simone Scalabrino, Gabriele Bavota, Rocco Oliveto

DocChecker: Bootstrapping Code Large Language Model for Detecting and Resolving Code-Comment Inconsistencies

Comments within source code are essential for developers to comprehend the code's purpose and ensure its correct usage. However, as codebases evolve, maintaining an accurate alignment between the comments and the code becomes increasingly…

Software Engineering · Computer Science 2024-02-06 Anh T. V. Dau, Jin L. C. Guo, Nghi D. Q. Bui

Your Fix Is My Exploit: Enabling Comprehensive DL Library API Fuzzing with Large Language Models

Deep learning (DL) libraries, widely used in AI applications, often contain vulnerabilities like buffer overflows and use-after-free errors. Traditional fuzzing struggles with the complexity and API diversity of DL libraries such as…

Software Engineering · Computer Science 2025-01-09 Kunpeng Zhang, Shuai Wang, Jitao Han, Xiaogang Zhu, Xian Li, Shaohua Wang, Sheng Wen

Automatic Unit Test Generation for Deep Learning Frameworks based on API Knowledge

Many automatic unit test generation tools that can generate unit test cases with high coverage over a program have been proposed. However, most of these tools are ineffective on deep learning (DL) frameworks due to the fact that many of…

Software Engineering · Computer Science 2023-07-04 Arunkaleeshwaran Narayanan, Nima Shiri Harzevili, Junjie Wang, Lin Shi, Moshi Wei, Song Wang

Fuzzing Deep-Learning Libraries via Automated Relational API Inference

A growing body of research has been dedicated to DL model testing. However, there is still limited work on testing DL libraries, which serve as the foundations for building, training, and running DL models. Prior work on fuzzing DL…

Software Engineering · Computer Science 2022-07-13 Yinlin Deng, Chenyuan Yang, Anjiang Wei, Lingming Zhang

Enhancing Differential Testing With LLMs For Testing Deep Learning Libraries

Differential testing offers a promising strategy to alleviate the test oracle problem by comparing the test results between alternative implementations. However, existing differential testing techniques for deep learning (DL) libraries are…

Software Engineering · Computer Science 2025-05-09 Meiziniu Li, Dongze Li, Jianmeng Liu, Jialun Cao, Yongqiang Tian, Shing-Chi Cheung

Security Knowledge-Guided Fuzzing of Deep Learning Libraries

Recently, many Deep Learning fuzzers have been proposed for testing of DL libraries. However, they either perform unguided input generation (e.g., not considering the relationship between API arguments when generating inputs) or only…

Cryptography and Security · Computer Science 2023-12-27 Nima Shiri Harzevili, Mohammad Mahdi Mohajer, Moshi Wei, Hung Viet Pham, Song Wang

Checker Bug Detection and Repair in Deep Learning Libraries

Checker bugs in Deep Learning (DL) libraries are critical yet not well-explored. These bugs are often concealed in the input validation and error-checking code of DL libraries and can lead to silent failures, incorrect results, or…

Software Engineering · Computer Science 2024-10-10 Nima Shiri Harzevili, Mohammad Mahdi Mohajer, Jiho Shin, Moshi Wei, Gias Uddin, Jinqiu Yang, Junjie Wang, Song Wang, Zhen Ming Jiang, Nachiappan Nagappan

DocCGen: Document-based Controlled Code Generation

Recent developments show that Large Language Models (LLMs) produce state-of-the-art performance on natural language (NL) to code generation for resource-rich general-purpose languages like C++, Java, and Python. However, their practical…

Software Engineering · Computer Science 2024-07-04 Sameer Pimparkhede, Mehant Kammakomati, Srikanth Tamilselvam, Prince Kumar, Ashok Pon Kumar, Pushpak Bhattacharyya

XAMT: Cross-Framework API Matching for Testing Deep Learning Libraries

Deep learning powers critical applications such as autonomous driving, healthcare, and finance, where the correctness of underlying libraries is essential. Bugs in widely used deep learning APIs can propagate to downstream systems, causing…

Software Engineering · Computer Science 2025-08-19 Bin Duan, Ruican Dong, Naipeng Dong, Dan Dongseong Kim, Guowei Yang

COMET: Coverage-guided Model Generation For Deep Learning Library Testing

Recent deep learning (DL) applications are mostly built on top of DL libraries. The quality assurance of these libraries is critical to the dependable deployment of DL applications. Techniques have been proposed to generate various DL…

Software Engineering · Computer Science 2024-06-14 Meiziniu Li, Jialun Cao, Yongqiang Tian, Tsz On Li, Ming Wen, Shing-Chi Cheung

Automatically Identifying Parameter Constraints in Complex Web APIs: A Case Study at Adyen

Web APIs may have constraints on parameters, such that not all parameters are either always required or always optional. Moreover, the presence or value of one parameter could cause another parameter to be required, or parameters could have…

Software Engineering · Computer Science 2021-02-02 Henk Grent, Aleksei Akimov, Maurício Aniche

Understanding Performance Problems in Deep Learning Systems

Deep learning (DL) has been widely applied to many domains. Unique challenges in engineering DL systems are posed by the programming paradigm shift from traditional systems to DL systems, and performance is one of the challenges.…

Software Engineering · Computer Science 2022-11-01 Junming Cao, Bihuan Chen, Chao Sun, Longjie Hu, Shuaihong Wu, Xin Peng

Detecting and Fixing API Misuses of Data Science Libraries Using Large Language Models

Data science libraries, such as scikit-learn and pandas, specialize in processing and manipulating data. The data-centric nature of these libraries makes the detection of API misuse in them more challenging. This paper introduces DSCHECKER,…

Software Engineering · Computer Science 2025-10-01 Akalanka Galappaththi, Francisco Ribeiro, Sarah Nadi

Benchmarking Deep Learning Fuzzers

In this work, we set out to conduct the first ground-truth empirical evaluation of state-of-the-art DL fuzzers. Specifically, we first manually created an extensive DL bug benchmark dataset, which includes 627 real-world DL bugs from…

Software Engineering · Computer Science 2023-10-12 Nima Shiri Harzevili, Hung Viet Pham, Song Wang

ConFL: Constraint-guided Fuzzing for Machine Learning Framework

As machine learning gains prominence in various sectors of society for automated decision-making, concerns have risen regarding potential vulnerabilities in machine learning (ML) frameworks. Nevertheless, testing these frameworks is a…

Software Engineering · Computer Science 2023-07-13 Zhao Liu, Quanchen Zou, Tian Yu, Xuan Wang, Guozhu Meng, Kai Chen, Deyue Zhang

Evaluating the Effectiveness of Coverage-Guided Fuzzing for Testing Deep Learning Library APIs

Deep Learning (DL) libraries such as PyTorch provide the core components to build major AI-enabled applications. Finding bugs in these libraries is important and challenging. Prior approaches have tackled this by performing either API-level…

Software Engineering · Computer Science 2025-09-19 Feiran Qin, M. M. Abid Naziri, Hengyu Ai, Saikat Dutta, Marcelo d'Amorim