Related papers: TIGER: A Generating-Then-Ranking Framework for Pra…

Generative Type Inference for Python

Python is a popular dynamic programming language, evidenced by its ranking as the second most commonly used language on GitHub. However, its dynamic type system can lead to potential type errors, leading researchers to explore automatic…

Software Engineering · Computer Science 2023-07-19 Yun Peng , Chaozheng Wang , Wenxuan Wang , Cuiyun Gao , Michael R. Lyu

Combining Type Inference and Automated Unit Test Generation for Python

Automated unit test generation is an established research field that has so far focused on statically-typed programming languages. The lack of type information in dynamically-typed programming languages, such as Python, inhibits test…

Software Engineering · Computer Science 2025-07-03 Lukas Krodinger , Stephan Lukasczyk , Gordon Fraser

ToNER: Type-oriented Named Entity Recognition with Generative Language Model

In recent years, the fine-tuned generative models have been proven more powerful than the previous tagging-based or span-based models on named entity recognition (NER) task. It has also been found that the information related to entities,…

Computation and Language · Computer Science 2024-06-12 Guochao Jiang , Ziqin Luo , Yuchen Shi , Dixuan Wang , Jiaqing Liang , Deqing Yang

TIGeR: Text-Instructed Generation and Refinement for Template-Free Hand-Object Interaction

Pre-defined 3D object templates are widely used in 3D reconstruction of hand-object interactions. However, they often require substantial manual efforts to capture or source, and inherently restrict the adaptability of models to…

Computer Vision and Pattern Recognition · Computer Science 2025-06-03 Yiyao Huang , Zhedong Zheng , Yu Ziwei , Yaxiong Wang , Tze Ho Elden Tse , Angela Yao

Text-to-Image GAN with Pretrained Representations

Generating desired images conditioned on given text descriptions has received lots of attention. Recently, diffusion models and autoregressive models have demonstrated their outstanding expressivity and gradually replaced GAN as the favored…

Computer Vision and Pattern Recognition · Computer Science 2025-01-03 Xiaozhou You , Jian Zhang

TIGER: Text-Informed Generalized Enzyme-Reaction Retrieval

Enzyme-reaction retrieval is a fundamental problem in computational biology, underpinning enzyme characterization, reaction mechanism elucidation, and the rational design of metabolic pathways and biocatalysts. As a bidirectional task, it…

Artificial Intelligence · Computer Science 2026-05-26 Yuhang Zhang , Keyan Ding , Peilin Chen , Han Liu , Can Lin , Ruixi Chen , Shiqi Wang , Qi Song

Large Scale Generation of Labeled Type Data for Python

Recently, dynamically typed languages, such as Python, have gained unprecedented popularity. Although these languages alleviate the need for mandatory type annotations, types still play a critical role in program understanding and…

Programming Languages · Computer Science 2022-02-08 Ibrahim Abdelaziz , Julian Dolby , Kavitha Srinivas

Getting Python Types Right with RightTyper

Python type annotations enable static type checking, but most code remains untyped because manual annotation is time-consuming and tedious. Past approaches to automatic type inference fall short: static methods struggle with dynamic…

Programming Languages · Computer Science 2026-03-12 Juan Altmayer Pizzorno , Emery D. Berger

Infer As You Train: A Symmetric Paradigm of Masked Generative for Click-Through Rate Prediction

Generative models are increasingly being explored in click-through rate (CTR) prediction field to overcome the limitations of the conventional discriminative paradigm, which rely on a simple binary classification objective. However,…

Information Retrieval · Computer Science 2025-11-19 Moyu Zhang , Yujun Jin , Yun Chen , Jinxin Hu , Yu Zhang , Xiaoyi Zeng

PYInfer: Deep Learning Semantic Type Inference for Python Variables

Python type inference is challenging in practice. Due to its dynamic properties and extensive dependencies on third-party libraries without type annotations, the performance of traditional static analysis techniques is limited. Although…

Software Engineering · Computer Science 2021-06-29 Siwei Cui , Gang Zhao , Zeyu Dai , Luochao Wang , Ruihong Huang , Jeff Huang

Joint Generator-Ranker Learning for Natural Language Generation

Generate-then-rank is a widely used mechanism for text generation, where a generator produces multiple text candidates and a ranker chooses the best one among the text candidates. However, existing methods usually train the generator and…

Computation and Language · Computer Science 2023-05-30 Weizhou Shen , Yeyun Gong , Yelong Shen , Song Wang , Xiaojun Quan , Nan Duan , Weizhu Chen

Type4Py: Practical Deep Similarity Learning-Based Type Inference for Python

Dynamic languages, such as Python and Javascript, trade static typing for developer flexibility and productivity. Lack of static typing can cause run-time exceptions and is a major factor for weak IDE support. To alleviate these issues, PEP…

Machine Learning · Computer Science 2022-01-20 Amir M. Mir , Evaldas Latoskinas , Sebastian Proksch , Georgios Gousios

Automating Human Tutor-Style Programming Feedback: Leveraging GPT-4 Tutor Model for Hint Generation and GPT-3.5 Student Model for Hint Validation

Generative AI and large language models hold great promise in enhancing programming education by automatically generating individualized feedback for students. We investigate the role of generative AI models in providing human tutor-style…

Artificial Intelligence · Computer Science 2024-08-07 Tung Phung , Victor-Alexandru Pădurean , Anjali Singh , Christopher Brooks , José Cambronero , Sumit Gulwani , Adish Singla , Gustavo Soares

POINTER: Constrained Progressive Text Generation via Insertion-based Generative Pre-training

Large-scale pre-trained language models, such as BERT and GPT-2, have achieved excellent performance in language representation learning and free-form text generation. However, these models cannot be directly employed to generate text under…

Computation and Language · Computer Science 2020-09-29 Yizhe Zhang , Guoyin Wang , Chunyuan Li , Zhe Gan , Chris Brockett , Bill Dolan

Static Inference Meets Deep Learning: A Hybrid Type Inference Approach for Python

Type inference for dynamic programming languages such as Python is an important yet challenging task. Static type inference techniques can precisely infer variables with enough static constraints but are unable to handle variables with…

Software Engineering · Computer Science 2022-02-10 Yun Peng , Cuiyun Gao , Zongjie Li , Bowei Gao , David Lo , Qirun Zhang , Michael Lyu

Learning to Rank in Generative Retrieval

Generative retrieval stands out as a promising new paradigm in text retrieval that aims to generate identifier strings of relevant passages as the retrieval target. This generative paradigm taps into powerful generative language models,…

Computation and Language · Computer Science 2023-12-19 Yongqi Li , Nan Yang , Liang Wang , Furu Wei , Wenjie Li

DyRRen: A Dynamic Retriever-Reranker-Generator Model for Numerical Reasoning over Tabular and Textual Data

Numerical reasoning over hybrid data containing tables and long texts has recently received research attention from the AI community. To generate an executable reasoning program consisting of math and table operations to answer a question,…

Computation and Language · Computer Science 2022-11-24 Xiao Li , Yin Zhu , Sichen Liu , Jiangzhou Ju , Yuzhong Qu , Gong Cheng

GReF: A Unified Generative Framework for Efficient Reranking via Ordered Multi-token Prediction

In a multi-stage recommendation system, reranking plays a crucial role in modeling intra-list correlations among items. A key challenge lies in exploring optimal sequences within the combinatorial space of permutations. Recent research…

Information Retrieval · Computer Science 2025-10-30 Zhijie Lin , Zhuofeng Li , Chenglei Dai , Wentian Bao , Shuai Lin , Enyun Yu , Haoxiang Zhang , Liang Zhao

Type-aware LLM-based Regression Test Generation for Python Programs

Automated regression test generation has been extensively explored, yet generating high-quality tests for Python programs remains particularly challenging. Because of the Python's dynamic typing features, existing approaches, ranging from…

Software Engineering · Computer Science 2025-10-23 Runlin Liu , Zhe Zhang , Yunge Hu , Yuhang Lin , Xiang Gao , Hailong Sun

TRIGO: Benchmarking Formal Mathematical Proof Reduction for Generative Language Models

Automated theorem proving (ATP) has become an appealing domain for exploring the reasoning ability of the recent successful generative language models. However, current ATP benchmarks mainly focus on symbolic inference, but rarely involve…

Computation and Language · Computer Science 2023-10-25 Jing Xiong , Jianhao Shen , Ye Yuan , Haiming Wang , Yichun Yin , Zhengying Liu , Lin Li , Zhijiang Guo , Qingxing Cao , Yinya Huang , Chuanyang Zheng , Xiaodan Liang , Ming Zhang , Qun Liu