Related papers: Automatic Rule Generation for Time Expression Norm…

TweeTime: A Minimally Supervised Method for Recognizing and Normalizing Time Expressions in Twitter

We describe TweeTIME, a temporal tagger for recognizing and normalizing time expressions in Twitter. Most previous work in social media analysis has to rely on temporal resolvers that are designed for well-edited text, and therefore suffer…

Information Retrieval · Computer Science 2020-11-17 Jeniya Tabassum , Alan Ritter , Wei Xu

Temporal expression normalisation in natural language texts

Automatic annotation of temporal expressions is a research challenge of great interest in the field of information extraction. In this report, I describe a novel rule-based architecture, built on top of a pre-existing system, which is able…

Computation and Language · Computer Science 2012-06-12 Michele Filannino

A Semantic Parsing Framework for End-to-End Time Normalization

Time normalization is the task of converting natural language temporal expressions into machine-readable representations. It underpins many downstream applications in information retrieval, question answering, and clinical decision-making.…

Computation and Language · Computer Science 2025-07-10 Xin Su , Sungduk Yu , Phillip Howard , Steven Bethard

Multilingual Normalization of Temporal Expressions with Masked Language Models

The detection and normalization of temporal expressions is an important task and preprocessing step for many applications. However, prior work on normalization is rule-based, which severely limits the applicability in real-world…

Computation and Language · Computer Science 2023-02-13 Lukas Lange , Jannik Strötgen , Heike Adel , Dietrich Klakow

Data Extraction via Semantic Regular Expression Synthesis

Many data extraction tasks of practical relevance require not only syntactic pattern matching but also semantic reasoning about the content of the underlying text. While regular expressions are very well suited for tasks that require only…

Programming Languages · Computer Science 2023-08-28 Qiaochu Chen , Arko Banerjee , Çağatay Demiralp , Greg Durrett , Isil Dillig

On Synthesis of Timed Regular Expressions

Timed regular expressions serve as a formalism for specifying real-time behaviors of Cyber-Physical Systems. In this paper, we consider the synthesis of timed regular expressions, focusing on generating a timed regular expression consistent…

Formal Languages and Automata Theory · Computer Science 2025-09-12 Ziran Wang , Jie An , Naijun Zhan , Miaomiao Zhang , Zhenya Zhang

ARISE: Iterative Rule Induction and Synthetic Data Generation for Text Classification

We propose ARISE, a framework that iteratively induces rules and generates synthetic data for text classification. We combine synthetic data generation and automatic rule induction, via bootstrapping, to iteratively filter the generated…

Computation and Language · Computer Science 2025-02-11 Yashwanth M. , Vaibhav Singh , Ayush Maheshwari , Amrith Krishna , Ganesh Ramakrishnan

ManTIME: Temporal expression identification and normalization in the TempEval-3 challenge

This paper describes a temporal expression identification and normalization system, ManTIME, developed for the TempEval-3 challenge. The identification phase combines the use of conditional random fields along with a post-processing…

Computation and Language · Computer Science 2013-05-01 Michele Filannino , Gavin Brown , Goran Nenadic

STEER: Simple Temporal Regularization For Neural ODEs

Training Neural Ordinary Differential Equations (ODEs) is often computationally expensive. Indeed, computing the forward pass of such models involves solving an ODE which can become arbitrarily complex during training. Recent works have…

Machine Learning · Computer Science 2020-11-03 Arnab Ghosh , Harkirat Singh Behl , Emilien Dupont , Philip H. S. Torr , Vinay Namboodiri

Integrating Regular Expressions with Neural Networks via DFA

Human-designed rules are widely used to build industry applications. However, it is infeasible to maintain thousands of such hand-crafted rules. So it is very important to integrate the rule knowledge into neural networks to build a hybrid…

Computation and Language · Computer Science 2021-09-08 Shaobo Li , Qun Liu , Xin Jiang , Yichun Yin , Chengjie Sun , Bingquan Liu , Zhenzhou Ji , Lifeng Shang

Regularizing Output Distribution of Abstractive Chinese Social Media Text Summarization for Improved Semantic Consistency

Abstractive text summarization is a highly difficult problem, and the sequence-to-sequence model has shown success in improving the performance on the task. However, the generated summaries are often inconsistent with the source content in…

Computation and Language · Computer Science 2018-05-11 Bingzhen Wei , Xuancheng Ren , Xu Sun , Yi Zhang , Xiaoyan Cai , Qi Su

Modeling Others' Minds as Code

Accurate prediction of human behavior is essential for robust and safe human-AI collaboration. However, existing approaches for modeling people are often data-hungry and brittle because they either make unrealistic assumptions about…

Artificial Intelligence · Computer Science 2025-10-03 Kunal Jha , Aydan Yuenan Huang , Eric Ye , Natasha Jaques , Max Kleiman-Weiner

Discourse-Aware In-Context Learning for Temporal Expression Normalization

Temporal expression (TE) normalization is a well-studied problem. However, the predominately used rule-based systems are highly restricted to specific settings, and upcoming machine learning approaches suffer from a lack of labeled data. In…

Computation and Language · Computer Science 2024-04-12 Akash Kumar Gautam , Lukas Lange , Jannik Strötgen

Normalizing Text using Language Modelling based on Phonetics and String Similarity

Social media networks and chatting platforms often use an informal version of natural text. Adversarial spelling attacks also tend to alter the input text by modifying the characters in the text. Normalizing these texts is an essential step…

Computation and Language · Computer Science 2020-06-26 Fenil Doshi , Jimit Gandhi , Deep Gosalia , Sudhir Bagul

Deciding Regularity of the Set of Instances of a Set of Terms with Regular Constraints is EXPTIME-Complete

Finite-state tree automata are a well studied formalism for representing term languages. This paper studies the problem of determining the regularity of the set of instances of a finite set of terms with variables, where each variable is…

Symbolic Computation · Computer Science 2009-11-20 Omer Giménez , Guillem Godoy , Sebastian Maneth

Automatic Extraction of Rules for Generating Synthetic Patient Data From Real-World Population Data Using Glioblastoma as an Example

The generation of synthetic data is a promising technology to make medical data available for secondary use in a privacy-compliant manner. A popular method for creating realistic patient data is the rule-based Synthea data generator.…

Machine Learning · Computer Science 2025-12-19 Arno Appenzeller , Nick Terzer , André Homeyer , Jan-Philipp Redlich , Sabine Luttmann , Friedrich Feuerhake , Nadine S. Schaadt , Timm Intemann , Sarah Teuber-Hanselmann , Stefan Nikolin , Joachim Weis , Klaus Kraywinkel , Pascal Birnstill

Pushing the Performance Limit of Scene Text Recognizer without Human Annotation

Scene text recognition (STR) attracts much attention over the years because of its wide application. Most methods train STR model in a fully supervised manner which requires large amounts of labeled data. Although synthetic data contributes…

Computer Vision and Pattern Recognition · Computer Science 2022-05-24 Caiyuan Zheng , Hui Li , Seon-Min Rhee , Seungju Han , Jae-Joon Han , Peng Wang

Synthesis of Shared Control Protocols with Provable Safety and Performance Guarantees

We formalize synthesis of shared control protocols with correctness guarantees for temporal logic specifications. More specifically, we introduce a modeling formalism in which both a human and an autonomy protocol can issue commands to a…

Robotics · Computer Science 2016-10-28 Nils Jansen , Murat Cubuktepe , Ufuk Topcu

gMeta: Template-based Regular Expression Generation over Noisy Examples

Regular expressions (regexes) are widely used in different fields of computer science, such as programming languages, string processing, and databases. However, existing tools for synthesizing or repairing regexes always assume that the…

Software Engineering · Computer Science 2022-11-02 Shujun Wang , Yongqiang Tian andDengcheng He

Temporal Reasoning on Implicit Events from Distant Supervision

We propose TRACIE, a novel temporal reasoning dataset that evaluates the degree to which systems understand implicit events -- events that are not mentioned explicitly in natural language text but can be inferred from it. This introduces a…

Computation and Language · Computer Science 2021-05-11 Ben Zhou , Kyle Richardson , Qiang Ning , Tushar Khot , Ashish Sabharwal , Dan Roth