Related papers: Semantic Code Classification for Automated Machine…

Interpretable Sequence Classification via Discrete Optimization

Sequence classification is the task of predicting a class label given a sequence of observations. In many applications such as healthcare monitoring or intrusion detection, early classification is crucial to prompt intervention. In this…

Machine Learning · Computer Science 2020-10-07 Maayan Shvo , Andrew C. Li , Rodrigo Toro Icarte , Sheila A. McIlraith

Seeking Diverse Reasoning Logic: Controlled Equation Expression Generation for Solving Math Word Problems

To solve Math Word Problems, human students leverage diverse reasoning logic that reaches different possible equation solutions. However, the mainstream sequence-to-sequence approach of automatic solvers aims to decode a fixed solution…

Computation and Language · Computer Science 2022-12-01 Yibin Shen , Qianying Liu , Zhuoyuan Mao , Zhen Wan , Fei Cheng , Sadao Kurohashi

Semantic Probabilistic Control of Language Models

Semantic control entails steering LM generations towards satisfying subtle non-lexical constraints, e.g., toxicity, sentiment, or politeness, attributes that can be captured by a sequence-level verifier. It can thus be viewed as sampling…

Machine Learning · Computer Science 2025-05-06 Kareem Ahmed , Catarina G Belem , Padhraic Smyth , Sameer Singh

Machine Learning Based Source Code Classification Using Syntax Oriented Features

As of today the programming language of the vast majority of the published source code is manually specified or programmatically assigned based on the sole file extension. In this paper we show that the source code programming language…

Machine Learning · Computer Science 2017-03-23 Shaul Zevin , Catherine Holzem

SGM: Sequence Generation Model for Multi-label Classification

Multi-label classification is an important yet challenging task in natural language processing. It is more complex than single-label classification in that the labels tend to be correlated. Existing methods tend to ignore the correlations…

Computation and Language · Computer Science 2018-06-18 Pengcheng Yang , Xu Sun , Wei Li , Shuming Ma , Wei Wu , Houfeng Wang

Lexical Complexity Controlled Sentence Generation

Text generation rarely considers the control of lexical complexity, which limits its more comprehensive practical application. We introduce a novel task of lexical complexity controlled sentence generation, which aims at keywords to…

Computation and Language · Computer Science 2022-11-29 Jinran Nie , Liner Yang , Yun Chen , Cunliang Kong , Junhui Zhu , Erhong Yang

Toward Code Generation: A Survey and Lessons from Semantic Parsing

With the growth of natural language processing techniques and demand for improved software engineering efficiency, there is an emerging interest in translating intention from human languages to programming languages. In this survey paper,…

Software Engineering · Computer Science 2021-05-20 Celine Lee , Justin Gottschlich , Dan Roth

Controlled Language Generation for Language Learning Items

This work aims to employ natural language generation (NLG) to rapidly generate items for English language learning applications: this requires both language models capable of generating fluent, high-quality English, and to control the…

Computation and Language · Computer Science 2022-11-30 Kevin Stowe , Debanjan Ghosh , Mengxuan Zhao

Automated Process Planning Based on a Semantic Capability Model and SMT

In research of manufacturing systems and autonomous robots, the term capability is used for a machine-interpretable specification of a system function. Approaches in this research area develop information models that capture all information…

Artificial Intelligence · Computer Science 2024-02-15 Aljosha Köcher , Luis Miguel Vieira da Silva , Alexander Fay

A Machine Learning Approach to the Classification of Dialogue Utterances

The purpose of this paper is to present a method for automatic classification of dialogue utterances and the results of applying that method to a corpus. Superficial features of a set of training utterances (which we will call cues) are…

cmp-lg · Computer Science 2008-02-03 Toine Andernach

Combining Dynamic Symbolic Execution, Machine Learning and Search-Based Testing to Automatically Generate Test Cases for Classes

This article discusses a new technique to automatically generate test cases for object oriented programs. At the state of the art, the problem of generating adequate sets of complete test cases has not been satisfactorily solved yet. There…

Software Engineering · Computer Science 2020-05-20 Matteo Modonato

Text classification using machine learning methods

In this paper we present the results of an experiment aimed to use machine learning methods to obtain models that can be used for the automatic classification of products. In order to apply automatic classification methods, we transformed…

Computation and Language · Computer Science 2025-02-28 Bogdan Oancea

LLM-assisted Labeling Function Generation for Semantic Type Detection

Detecting semantic types of columns in data lake tables is an important application. A key bottleneck in semantic type detection is the availability of human annotation due to the inherent complexity of data lakes. In this paper, we propose…

Databases · Computer Science 2024-08-30 Chenjie Li , Dan Zhang , Jin Wang

Neural Machine Translation for Code Generation

Neural machine translation (NMT) methods developed for natural language processing have been shown to be highly successful in automating translation from one natural language to another. Recently, these NMT methods have been adapted to the…

Computation and Language · Computer Science 2023-05-24 Dharma KC , Clayton T. Morrison

Syntactic and Semantic Control of Large Language Models via Sequential Monte Carlo

A wide range of LM applications require generating text that conforms to syntactic or semantic constraints. Imposing such constraints can be naturally framed as probabilistic conditioning, but exact generation from the resulting…

Computation and Language · Computer Science 2025-04-22 João Loula , Benjamin LeBrun , Li Du , Ben Lipkin , Clemente Pasti , Gabriel Grand , Tianyu Liu , Yahya Emara , Marjorie Freedman , Jason Eisner , Ryan Cotterell , Vikash Mansinghka , Alexander K. Lew , Tim Vieira , Timothy J. O'Donnell

Machine Learning in Automated Text Categorization

The automated categorization (or classification) of texts into predefined categories has witnessed a booming interest in the last ten years, due to the increased availability of documents in digital form and the ensuing need to organize…

Information Retrieval · Computer Science 2021-09-21 Fabrizio Sebastiani

Toward Controlled Generation of Text

Generic generation and manipulation of text is challenging and has limited success compared to recent deep generative modeling in visual domain. This paper aims at generating plausible natural language sentences, whose attributes are…

Machine Learning · Computer Science 2018-09-14 Zhiting Hu , Zichao Yang , Xiaodan Liang , Ruslan Salakhutdinov , Eric P. Xing

Human-machine cooperation for semantic feature listing

Semantic feature norms, lists of features that concepts do and do not possess, have played a central role in characterizing human conceptual knowledge, but require extensive human labor. Large language models (LLMs) offer a novel avenue for…

Computation and Language · Computer Science 2023-04-12 Kushin Mukherjee , Siddharth Suresh , Timothy T. Rogers

Linearly Controlled Language Generation with Performative Guarantees

The increasing prevalence of Large Language Models (LMs) in critical applications highlights the need for controlled language generation strategies that are not only computationally efficient but that also enjoy performance guarantees. To…

Computation and Language · Computer Science 2026-03-16 Emily Cheng , Carmen Amo Alonso

Sequential Monte Carlo Steering of Large Language Models using Probabilistic Programs

Even after fine-tuning and reinforcement learning, large language models (LLMs) can be difficult, if not impossible, to control reliably with prompts alone. We propose a new inference-time approach to enforcing syntactic and semantic…

Artificial Intelligence · Computer Science 2023-11-28 Alexander K. Lew , Tan Zhi-Xuan , Gabriel Grand , Vikash K. Mansinghka