Related papers: Lightweight Multilingual Software Analysis

Lightweight Call-Graph Construction for Multilingual Software Analysis

Analysis of multilingual codebases is a topic of increasing importance. In prior work, we have proposed the MLSA (MultiLingual Software Analysis) architecture, an approach to the lightweight analysis of multilingual codebases, and have…

Software Engineering · Computer Science 2018-08-06 Anne Marie Bogar, Damian M. Lyons, David Baird

Large Language Models (LLMs) for Source Code Analysis: applications, models and datasets

Large language models (LLMs) and transformer-based architectures are increasingly utilized for source code analysis. As software systems grow in complexity, integrating LLMs into code analysis workflows becomes essential for enhancing…

Software Engineering · Computer Science 2025-03-25 Hamed Jelodar, Mohammad Meymani, Roozbeh Razavi-Far

Towards Analyzing N-language Polyglot Programs

Polyglot programming is gaining popularity as developers integrate multiple programming languages to harness their individual strengths. With the recent popularity of platforms like GraalVM and other multi-language runtimes, creating and…

Software Engineering · Computer Science 2026-02-03 Jyoti Prakash, Abhishek Tiwari, Mikkel Baun Kjærgaard

Static Code Analysis of Multilanguage Software Systems

Identifying dependency call graphs of multilanguage software systems using static code analysis is challenging. The different languages used in developing today's systems often have different lexical, syntactical, and semantic rules that…

Software Engineering · Computer Science 2019-06-04 Anas Shatnawi, Hafedh Mili, Manel Abdellatif, Yann-Gaël Guéhéneuc, Naouel Moha, Geoffrey Hecht, Ghizlane El Boussaidi, Jean Privat

LOLA -- An Open-Source Massively Multilingual Large Language Model

This paper presents LOLA, a massively multilingual large language model trained on more than 160 languages using a sparse Mixture-of-Experts Transformer architecture. Our architectural and implementation choices address the challenge of…

Computation and Language · Computer Science 2025-02-04 Nikit Srivastava, Denis Kuchelev, Tatiana Moteu Ngoli, Kshitij Shetty, Michael Röder, Hamada Zahera, Diego Moussallem, Axel-Cyrille Ngonga Ngomo

Software Architecture Meets LLMs: A Systematic Literature Review

Large Language Models (LLMs) are used for many different software engineering tasks. In software architecture, they have been applied to tasks such as classification of design decisions, detection of design patterns, and generation of…

Software Engineering · Computer Science 2025-05-23 Larissa Schmid, Tobias Hey, Martin Armbruster, Sophie Corallo, Dominik Fuchß, Jan Keim, Haoyu Liu, Anne Koziolek

Towards Scalable Schema Mapping using Large Language Models

The growing need to integrate information from a large number of diverse sources poses significant scalability challenges for data integration systems. These systems often rely on manually written schema mappings, which are complex,…

Databases · Computer Science 2025-06-02 Christopher Buss, Mahdis Safari, Arash Termehchy, Stefan Lee, David Maier

Harnessing Large Language Models for Software Vulnerability Detection: A Comprehensive Benchmarking Study

Despite various approaches being employed to detect vulnerabilities, the number of reported vulnerabilities shows an upward trend over the years. This suggests the problems are not caught before the code is released, which could be caused…

Cryptography and Security · Computer Science 2025-02-14 Karl Tamberg, Hayretdin Bahsi

A Contemporary Survey of Large Language Model Assisted Program Analysis

The increasing complexity of software systems has driven significant advancements in program analysis, as traditional methods are unable to meet the demands of modern software development. To address these limitations, deep learning techniques,…

Software Engineering · Computer Science 2025-02-27 Jiayimei Wang, Tao Ni, Wei-Bin Lee, Qingchuan Zhao

LLM-Based Multi-Agent Systems for Software Engineering: Literature Review, Vision and the Road Ahead

Integrating Large Language Models (LLMs) into autonomous agents marks a significant shift in the research landscape by offering cognitive abilities that are competitive with human planning and reasoning. This paper explores the…

Software Engineering · Computer Science 2025-07-21 Junda He, Christoph Treude, David Lo

AI-Guided Exploration of Large-Scale Codebases

Understanding large-scale, complex software systems is a major challenge for developers, who spend a significant portion of their time on program comprehension. Traditional tools such as static visualizations and reverse engineering…

Software Engineering · Computer Science 2025-08-11 Yoseph Berhanu Alebachew

GALLa: Graph Aligned Large Language Models for Improved Source Code Understanding

Programming languages possess rich semantic information - such as data flow - that is represented by graphs and not available from the surface form of source code. Recent code language models have scaled to billions of parameters, but model…

Computation and Language · Computer Science 2025-09-24 Ziyin Zhang, Hang Yu, Shijie Li, Peng Di, Jianguo Li, Rui Wang

Exploring Code Analysis: Zero-Shot Insights on Syntax and Semantics with LLMs

Code analysis is fundamental in Software Engineering, supporting debugging, optimization, and security assessment. Human developers approach it through syntax parsing, static semantics inference, and dynamic reasoning. Traditional tools are…

Software Engineering · Computer Science 2026-05-22 Wei Ma, Zhihao Lin, Shangqing Liu, Qiang Hu, Ye Liu, Wenhan Wang, Cen Zhang, Liming Nie, Li Li, Yang Liu, Lingxiao Jiang

LLM Benchmarking with LLaMA2: Evaluating Code Development Performance Across Multiple Programming Languages

The rapid evolution of large language models (LLMs) has opened new possibilities for automating various tasks in software development. This paper evaluates the capabilities of the Llama 2-70B model in automating these tasks for scientific…

Software Engineering · Computer Science 2025-07-09 Patrick Diehl, Nojoud Nader, Maxim Moraru, Steven R. Brandt

Bridging Code Property Graphs and Language Models for Program Analysis

Large Language Models (LLMs) face critical challenges when analyzing security vulnerabilities in real-world codebases: token limits prevent loading entire repositories, code embeddings fail to capture interprocedural data flows, and LLMs…

Cryptography and Security · Computer Science 2026-03-27 Ahmed Lekssays

MLaGA: Multimodal Large Language and Graph Assistant

Large Language Models (LLMs) have demonstrated substantial efficacy in advancing graph-structured data analysis. Prevailing LLM-based graph methods excel in adapting LLMs to text-rich graphs, wherein node attributes are text descriptions.…

Artificial Intelligence · Computer Science 2025-06-04 Dongzhe Fan, Yi Fang, Jiajin Liu, Djellel Difallah, Qiaoyu Tan

Tower: An Open Multilingual Large Language Model for Translation-Related Tasks

While general-purpose large language models (LLMs) demonstrate proficiency on multiple tasks within the domain of translation, approaches based on open LLMs are competitive only when specializing on a single task. In this paper, we propose…

Computation and Language · Computer Science 2024-02-28 Duarte M. Alves, José Pombal, Nuno M. Guerreiro, Pedro H. Martins, João Alves, Amin Farajian, Ben Peters, Ricardo Rei, Patrick Fernandes, Sweta Agrawal, Pierre Colombo, José G. C. de Souza, André F. T. Martins

A Cross-Language Investigation into Jailbreak Attacks in Large Language Models

Large Language Models (LLMs) have become increasingly popular for their advanced text generation capabilities across various domains. However, like any software, they face security challenges, including the risk of 'jailbreak' attacks that…

Cryptography and Security · Computer Science 2024-01-31 Jie Li, Yi Liu, Chongyang Liu, Ling Shi, Xiaoning Ren, Yaowen Zheng, Yang Liu, Yinxing Xue

The Emergence of Large Language Models in Static Analysis: A First Look through Micro-Benchmarks

The application of Large Language Models (LLMs) in software engineering, particularly in static analysis tasks, represents a paradigm shift in the field. In this paper, we investigate the role that current LLMs can play in improving…

Software Engineering · Computer Science 2024-02-28 Ashwin Prasad Shivarpatna Venkatesh, Samkutty Sabu, Amir M. Mir, Sofia Reis, Eric Bodden

Large Language Models for Multilingual Code Intelligence: A Survey

Large language models have transformed AI-assisted software engineering, but current research remains biased toward high-resource languages such as Python, with weaker performance in languages like Rust and OCaml. Since real-world systems…

Software Engineering · Computer Science 2026-04-30 Chao Jiang, Dugang Liu, Cheng Wen, Zhiwu Xu, Hua Zheng, Muhammad Sadiq, Jawwad Ahmed Shamsi, Shengchao Qin, Zhong Ming