English
Related papers

Related papers: Improving Structured Text Recognition with Regular…

200 papers

Structured text understanding on Visually Rich Documents (VRDs) is a crucial part of Document Intelligence. Due to the complexity of content and layout in VRDs, structured text understanding has been a challenging task. Most existing…

Computer Vision and Pattern Recognition · Computer Science 2021-11-09 Yulin Li , Yuxi Qian , Yuchen Yu , Xiameng Qin , Chengquan Zhang , Yan Liu , Kun Yao , Junyu Han , Jingtuo Liu , Errui Ding

Since regular expressions (abbrev. regexes) are difficult to understand and compose, automatically generating regexes has been an important research problem. This paper introduces TransRegex, for automatically constructing regexes from both…

Programming Languages · Computer Science 2021-03-03 Yeting Li , Shuaimin Li , Zhiwu Xu , Jialun Cao , Zixuan Chen , Yun Hu , Haiming Chen , Shing-Chi Cheung

This paper introduces MixTex, an end-to-end LaTeX OCR model designed for low-bias multilingual recognition, along with its novel data collection method. In applying Transformer architectures to LaTeX text recognition, we identified specific…

Computer Vision and Pattern Recognition · Computer Science 2024-07-12 Renqing Luo , Yuhan Xu

The recent development of diffusion models has led to significant progress in solving inverse problems by leveraging these models as powerful generative priors. However, challenges persist due to the ill-posed nature of such problems, often…

Computer Vision and Pattern Recognition · Computer Science 2025-03-12 Jeongsol Kim , Geon Yeong Park , Hyungjin Chung , Jong Chul Ye

Feature attribution methods highlight the important input tokens as explanations to model predictions, which have been widely applied to deep neural networks towards trustworthy AI. However, recent works show that explanations provided by…

Computation and Language · Computer Science 2024-01-01 Dongfang Li , Baotian Hu , Qingcai Chen , Shan He

Recent systems on structured prediction focus on increasing the level of structural dependencies within the model. However, our study suggests that complex structures entail high overfitting risks. To control the structure-based…

Machine Learning · Computer Science 2017-11-29 Xu Sun , Weiwei Sun , Shuming Ma , Xuancheng Ren , Yi Zhang , Wenjie Li , Houfeng Wang

Existing datasets for regular expression (regex) generation from natural language are limited in complexity; compared to regex tasks that users post on StackOverflow, the regexes in these datasets are simple, and the language used to…

Computation and Language · Computer Science 2020-05-05 Xi Ye , Qiaochu Chen , Isil Dillig , Greg Durrett

Reading text from natural images is challenging due to the great variety in text font, color, size, complex background and etc.. The perspective distortion and non-linear spatial arrangement of characters make it further difficult. While…

Computer Vision and Pattern Recognition · Computer Science 2019-11-12 Shangbang Long , Yushuo Guan , Bingxuan Wang , Kaigui Bian , Cong Yao

Deepfake detection, the task of automatically discriminating machine-generated text, is increasingly critical with recent advances in natural language generative models. Existing approaches to deepfake detection typically represent…

Computation and Language · Computer Science 2020-10-16 Wanjun Zhong , Duyu Tang , Zenan Xu , Ruize Wang , Nan Duan , Ming Zhou , Jiahai Wang , Jian Yin

Automated interpretability aims to translate large language model (LLM) features into human understandable descriptions. However, natural language feature descriptions can be vague, inconsistent, and require manual relabeling. In response,…

Computation and Language · Computer Science 2026-03-02 Angie Boggust , Donghao Ren , Yannick Assogba , Dominik Moritz , Arvind Satyanarayan , Fred Hohman

This article proposes a convenient tool for decoding the output of neural networks trained by Connectionist Temporal Classification (CTC) for handwritten text recognition. We use regular expressions to describe the complex structures…

Neural and Evolutionary Computing · Computer Science 2016-03-31 Tobias Strauß , Gundram Leifert , Tobias Grüning , Roger Labahn

Currently, the destruction of the sequence structure in handwritten text has become one of the main bottlenecks restricting the recognition task. The typical situations include additional specific markers (the text swapping modification)…

Computer Vision and Pattern Recognition · Computer Science 2024-10-16 Zi-Rui Wang

When working with textual data, a natural application of disentangled representations is fair classification where the goal is to make predictions without being biased (or influenced) by sensitive attributes that may be present in the data…

Computation and Language · Computer Science 2022-10-10 Pierre Colombo , Guillaume Staerman , Nathan Noiry , Pablo Piantanida

In this paper, we present a method for enhancing the accuracy of scene text recognition tasks by judging whether the image and text match each other. While previous studies focused on generating the recognition results from input images,…

Computer Vision and Pattern Recognition · Computer Science 2024-04-10 Masato Fujitake

A new language model for speech recognition inspired by linguistic analysis is presented. The model develops hidden hierarchical structure incrementally and uses it to extract meaningful information from the word history - thus enabling the…

Computation and Language · Computer Science 2007-05-23 Ciprian Chelba , Frederick Jelinek

Many approaches have recently been proposed to detect irregular scene text and achieved promising results. However, their localization results may not well satisfy the following text recognition part mainly because of two reasons: 1)…

Computer Vision and Pattern Recognition · Computer Science 2021-10-26 Liang Qiao , Sanli Tang , Zhanzhan Cheng , Yunlu Xu , Yi Niu , Shiliang Pu , Fei Wu

Recognizing text in natural images is a challenging task with many unsolved problems. Different from those in documents, words in natural images often possess irregular shapes, which are caused by perspective distortion, curved character…

Computer Vision and Pattern Recognition · Computer Science 2016-04-20 Baoguang Shi , Xinggang Wang , Pengyuan Lyu , Cong Yao , Xiang Bai

Neural sequence-to-sequence systems deliver state-of-the-art performance for automatic speech recognition. When using appropriate modeling units, e.g., byte-pair encoding, these systems are in principle open vocabulary systems. In practice,…

Computation and Language · Computer Science 2026-03-05 Christian Huber , Alexander Waibel

In this work, our objective is to address the problems of generalization and flexibility for text recognition in documents. We introduce a new model that exploits the repetitive nature of characters in languages, and decouples the visual…

Computer Vision and Pattern Recognition · Computer Science 2020-09-15 Chuhan Zhang , Ankush Gupta , Andrew Zisserman

Framing is a process of emphasizing a certain aspect of an issue over the others, nudging readers or listeners towards different positions on the issue even without making a biased argument. {Here, we propose FrameAxis, a method for…

Computation and Language · Computer Science 2021-07-26 Haewoon Kwak , Jisun An , Elise Jing , Yong-Yeol Ahn
‹ Prev 1 2 3 10 Next ›