Related papers: Towards Analyzing N-language Polyglot Programs

Challenges of Multilingual Program Specification and Analysis

Multilingual programs, whose implementations are made of different languages, are gaining traction especially in domains, such as web programming, that particularly benefit from the additional flexibility brought by using multiple…

Programming Languages · Computer Science 2024-07-17 Carlo A. Furia , Abhishek Tiwari

Exploring Aspects of Polyglot High-Performance Virtual Machine GraalVM

Contemporary software often becomes vastly complex, and we are required to use a variety of technologies and different programming languages for its development. As interoperability between programming languages could cause high overhead…

Programming Languages · Computer Science 2021-12-30 M. Šipek , B. Mihaljević , A. Radovan

Large Language Models for Multilingual Code Intelligence: A Survey

Large language models have transformed AI-assisted software engineering, but current research remains biased toward high-resource languages such as Python, with weaker performance in languages like Rust and OCaml. Since real-world systems…

Software Engineering · Computer Science 2026-04-30 Chao Jiang , Dugang Liu , Cheng Wen , Zhiwu Xu , Hua Zheng , Muhammad Sadiq , Jawwad Ahmed Shamsi , Shengchao Qin , Zhong Ming

PolyDebug: A Framework for Polyglot Debugging

As software grows increasingly complex, the quantity and diversity of concerns to be addressed also rises. To answer this diversity of concerns, developers may end up using multiple programming languages in a single software project, a…

Programming Languages · Computer Science 2025-03-03 Philémon Houdaille , Djamel Eddine Khelladi , Benoit Combemale , Gunter Mussbacher , Tijs van der Storm

Unifying Pointer Analyses for Polyglot Inter-operations through Summary Specialization

Modular analysis of polyglot applications is challenging because heap object flows across language boundaries must be resolved. The state-of-the-art analyses for polyglot applications have two fundamental limitations. First, they assume…

Software Engineering · Computer Science 2023-05-09 Jyoti Prakash , Abhishek Tiwari , Christian Hammer

Lightweight Multilingual Software Analysis

Developer preferences, language capabilities and the persistence of older languages contribute to the trend that large software codebases are often multilingual, that is, written in more than one computer language. While developers can…

Software Engineering · Computer Science 2018-08-06 Damian M. Lyons , Anne Marie Bogar , David Baird

Live Multi-language Development and Runtime Environments

Context: Software development tools should work and behave consistently across different programming languages, so that developers do not have to familiarize themselves with new tooling for new languages. Also, being able to combine…

Programming Languages · Computer Science 2018-03-28 Fabio Niephaus , Tim Felgentreff , Tobias Pape , Robert Hirschfeld , Marcel Taeumel

Bridging the Gap: Dynamic Learning Strategies for Improving Multilingual Performance in LLMs

Large language models (LLMs) are at the forefront of transforming numerous domains globally. However, their inclusivity and effectiveness remain limited for non-Latin scripts and low-resource languages. This paper tackles the imperative…

Computation and Language · Computer Science 2025-01-08 Somnath Kumar , Vaibhav Balloli , Mercy Ranjit , Kabir Ahuja , Tanuja Ganu , Sunayana Sitaram , Kalika Bali , Akshay Nambi

Multilingual European Language Models: Benchmarking Approaches and Challenges

The breakthrough of generative large language models (LLMs) that can solve different tasks through chat interaction has led to a significant increase in the use of general benchmarks to assess the quality or performance of these models…

Computation and Language · Computer Science 2025-04-03 Fabio Barth , Georg Rehm

A Contemporary Survey of Large Language Model Assisted Program Analysis

The increasing complexity of software systems has driven significant advancements in program analysis, as traditional methods unable to meet the demands of modern software development. To address these limitations, deep learning techniques,…

Software Engineering · Computer Science 2025-02-27 Jiayimei Wang , Tao Ni , Wei-Bin Lee , Qingchuan Zhao

A Technical Report for Polyglot-Ko: Open-Source Large-Scale Korean Language Models

Polyglot is a pioneering project aimed at enhancing the non-English language performance of multilingual language models. Despite the availability of various multilingual models such as mBERT (Devlin et al., 2019), XGLM (Lin et al., 2022),…

Computation and Language · Computer Science 2023-06-07 Hyunwoong Ko , Kichang Yang , Minho Ryu , Taekyoon Choi , Seungmu Yang , Jiwung Hyun , Sungho Park , Kyubyong Park

PolyLM: An Open Source Polyglot Large Language Model

Large language models (LLMs) demonstrate remarkable ability to comprehend, reason, and generate following nature language instructions. However, the development of LLMs has been primarily focused on high-resource languages, such as English,…

Computation and Language · Computer Science 2023-07-13 Xiangpeng Wei , Haoran Wei , Huan Lin , Tianhao Li , Pei Zhang , Xingzhang Ren , Mei Li , Yu Wan , Zhiwei Cao , Binbin Xie , Tianxiang Hu , Shangjie Li , Binyuan Hui , Bowen Yu , Dayiheng Liu , Baosong Yang , Fei Huang , Jun Xie

Next-generation Web Applications with WebAssembly and TruffleWasm

In modern software development, the JavaScript ecosystem of various frameworks and libraries used to develop contemporary web applications presents many advantages. JavaScript is a widely known interpreted programming language, simple to…

Software Engineering · Computer Science 2021-12-16 M. Sipek , D. Muharemagic , B. Mihaljevic , A. Radovan

Multi-Lingual Development & Programming Languages Interoperability: An Empirical Study

As part of a research on a novel in-process multiprogramming-language interoperability system, this study investigates the interoperability and usage of multiple programming languages within a large dataset of GitHub projects and Stack…

Programming Languages · Computer Science 2024-11-14 Tsvi Cherny-Shahar , Amiram Yehudai

How does a Multilingual LM Handle Multiple Languages?

Multilingual language models have significantly advanced due to rapid progress in natural language processing. Models like BLOOM 1.7B, trained on diverse multilingual datasets, aim to bridge linguistic gaps. However, their effectiveness in…

Computation and Language · Computer Science 2026-02-03 Santhosh Kakarla , Gautama Shastry Bulusu Venkata , Aishwarya Gaddam , Maheedhar Sai Omtri Mohan

Lost in Translation: Large Language Models in Non-English Content Analysis

In recent years, large language models (e.g., Open AI's GPT-4, Meta's LLaMa, Google's PaLM) have become the dominant approach for building AI systems to analyze and generate language online. However, the automated systems that increasingly…

Computation and Language · Computer Science 2023-06-14 Gabriel Nicholas , Aliya Bhatia

What Is Missing in Multilingual Visual Reasoning and How to Fix It

NLP models today strive for supporting multiple languages and modalities, improving accessibility for diverse users. In this paper, we evaluate their multilingual, multimodal capabilities by testing on a visual reasoning task. We observe…

Computation and Language · Computer Science 2025-02-11 Yueqi Song , Simran Khanuja , Graham Neubig

PolyMath: Evaluating Mathematical Reasoning in Multilingual Contexts

In this paper, we introduce PolyMath, a multilingual mathematical reasoning benchmark covering 18 languages and 4 easy-to-hard difficulty levels. Our benchmark ensures difficulty comprehensiveness, language diversity, and high-quality…

Computation and Language · Computer Science 2025-11-04 Yiming Wang , Pei Zhang , Jialong Tang , Haoran Wei , Baosong Yang , Rui Wang , Chenshu Sun , Feitong Sun , Jiran Zhang , Junxuan Wu , Qiqian Cang , Yichang Zhang , Fei Huang , Junyang Lin , Fei Huang , Jingren Zhou

Benchmarking Concept-Spilling Across Languages in LLMs

Multilingual Large Language Models (LLMs) exhibit remarkable cross-lingual abilities, yet often exhibit a systematic bias toward the representations from other languages, resulting in semantic interference when generating content in…

Computation and Language · Computer Science 2026-01-21 Ilia Badanin , Daniil Dzenhaliou , Imanol Schlag

Towards Multilingual LLM Evaluation for European Languages

The rise of Large Language Models (LLMs) has revolutionized natural language processing across numerous languages and tasks. However, evaluating LLM performance in a consistent and meaningful way across multiple European languages remains…

Computation and Language · Computer Science 2024-10-18 Klaudia Thellmann , Bernhard Stadler , Michael Fromm , Jasper Schulze Buschhoff , Alex Jude , Fabio Barth , Johannes Leveling , Nicolas Flores-Herr , Joachim Köhler , René Jäkel , Mehdi Ali