Related papers: An Interpretable Deep Learning Approach for Morpho…

An Approach to the Analysis of the South Slavic Medieval Labels Using Image Texture

The paper presents a new script classification method for the discrimination of the South Slavic medieval labels. It consists in the textural analysis of the script types. In the first step, each letter is coded by the equivalent script…

Computer Vision and Pattern Recognition · Computer Science 2015-09-08 Darko Brodic , Alessia Amelio , Zoran N. Milivojevic

Application of deep learning approaches for medieval historical documents transcription

Handwritten text recognition and optical character recognition solutions show excellent results with processing data of modern era, but efficiency drops with Latin documents of medieval times. This paper presents a deep learning method to…

Computer Vision and Pattern Recognition · Computer Science 2025-12-23 Maksym Voloshchuk , Bohdana Zarembovska , Mykola Kozlenko

The Learnable Typewriter: A Generative Approach to Text Analysis

We present a generative document-specific approach to character analysis and recognition in text lines. Our main idea is to build on unsupervised multi-object segmentation methods and in particular those that reconstruct images based on a…

Computer Vision and Pattern Recognition · Computer Science 2023-04-17 Ioannis Siglidis , Nicolas Gonthier , Julien Gaubil , Tom Monnier , Mathieu Aubry

Labeling, Cutting, Grouping: an Efficient Text Line Segmentation Method for Medieval Manuscripts

This paper introduces a new way for text-line extraction by integrating deep-learning based pre-classification and state-of-the-art segmentation methods. Text-line extraction in complex handwritten documents poses a significant challenge,…

Computer Vision and Pattern Recognition · Computer Science 2019-07-02 Michele Alberti , Lars Vögtlin , Vinaychandran Pondenkandath , Mathias Seuret , Rolf Ingold , Marcus Liwicki

Quantifying Scripts: Defining metrics of characters for quantitative and descriptive analysis

Analysis of scripts plays an important role in paleography and in quantitative linguistics. Especially in the field of digital paleography quantitative features are much needed to differentiate glyphs. We describe an elaborate set of…

Computation and Language · Computer Science 2015-01-09 Vinodh Rajan

Towards Improved Model Design for Authorship Identification: A Survey on Writing Style Understanding

Authorship identification tasks, which rely heavily on linguistic styles, have always been an important part of Natural Language Understanding (NLU) research. While other tasks based on linguistic style understanding benefit from deep…

Computation and Language · Computer Science 2020-10-01 Weicheng Ma , Ruibo Liu , Lili Wang , Soroush Vosoughi

Visual Script and Language Identification

In this paper we introduce a script identification method based on hand-crafted texture features and an artificial neural network. The proposed pipeline achieves near state-of-the-art performance for script identification of video-text and…

Computer Vision and Pattern Recognition · Computer Science 2016-01-11 Anguelos Nicolaou , Andrew Bagdanov , Lluis Gomez-Bigorda , Dimosthenis Karatzas

A Hybrid Deep Learning Model for Arabic Text Recognition

Arabic text recognition is a challenging task because of the cursive nature of Arabic writing system, its joint writing scheme, the large number of ligatures and many other challenges. Deep Learning DL models achieved significant progress…

Computer Vision and Pattern Recognition · Computer Science 2020-09-07 Mohammad Fasha , Bassam Hammo , Nadim Obeid , Jabir Widian

Morphological Reconstruction for Word Level Script Identification

A line of a bilingual document page may contain text words in regional language and numerals in English. For Optical Character Recognition (OCR) of such a document page, it is necessary to identify different script forms before running an…

Computer Vision and Pattern Recognition · Computer Science 2011-07-05 B. V. Dhandra , Mallikarjun Hangarge

Deep Learning the Indus Script

Standardized corpora of undeciphered scripts, a necessary starting point for computational epigraphy, requires laborious human effort for their preparation from raw archaeological records. Automating this process through machine learning…

Computer Vision and Pattern Recognition · Computer Science 2017-02-03 Satish Palaniappan , Ronojoy Adhikari

Advanced Deep Learning Approaches for Automated Recognition of Cuneiform Symbols

This paper presents a thoroughly automated method for identifying and interpreting cuneiform characters via advanced deep-learning algorithms. Five distinct deep-learning models were trained on a comprehensive dataset of cuneiform…

Computation and Language · Computer Science 2025-05-09 Shahad Elshehaby , Alavikunhu Panthakkan , Hussain Al-Ahmad , Mina Al-Saad

Transcribing Medieval Manuscripts for Machine Learning

This article focuses on the transcription of medieval manuscripts. Whereas problems of transcription have long interested medievalists, few workable options in the era of printed editions were available besides normalisation. The automation…

Digital Libraries · Computer Science 2024-08-07 Estelle Guéville , David Joseph Wrisley

Proof of Concept: Automatic Type Recognition

The type used to print an early modern book can give scholars valuable information about the time and place of its production as well as its producer. Recognizing such type is currently done manually using both the character shapes of `M'…

Computer Vision and Pattern Recognition · Computer Science 2020-10-21 Vincent Christlein , Nikolaus Weichselbaumer , Saskia Limbach , Mathias Seuret

Ancient Script Image Recognition and Processing: A Review

Ancient scripts, e.g., Egyptian hieroglyphs, Oracle Bone Inscriptions, and Ancient Greek inscriptions, serve as vital carriers of human civilization, embedding invaluable historical and cultural information. Automating ancient script image…

Computer Vision and Pattern Recognition · Computer Science 2025-06-25 Xiaolei Diao , Rite Bo , Yanling Xiao , Lida Shi , Zhihan Zhou , Hao Xu , Chuntao Li , Xiongfeng Tang , Massimo Poesio , Cédric M. John , Daqian Shi

Multi-Stage Prototype Learning for Interpretable Time Series Classification

Deep learning methods are powerful tools in classifying multivariate time series data. Despite their high performance, these methods are hard to interpret, which diminishes their applications in high-risk domains such as healthcare. In this…

Machine Learning · Computer Science 2026-05-11 Bhavesh Kalisetti , Vincent Wang , Gaurav R. Ghosal , Maryam Bijanzadeh , Reza Abbasi-Asl

Classifying Fonts and Calligraphy Styles Using Complex Wavelet Transform

Recognizing fonts has become an important task in document analysis, due to the increasing number of available digital documents in different fonts and emphases. A generic font-recognition system independent of language, script and content…

Computer Vision and Pattern Recognition · Computer Science 2014-07-11 Alican Bozkurt , Pinar Duygulu , A. Enis Cetin

A Deep Factorization of Style and Structure in Fonts

We propose a deep factorization model for typographic analysis that disentangles content from style. Specifically, a variational inference procedure factors each training glyph into the combination of a character-specific content embedding…

Machine Learning · Computer Science 2020-05-19 Nikita Srivatsan , Jonathan T. Barron , Dan Klein , Taylor Berg-Kirkpatrick

TIPICAL -- Type Inference for Python In Critical Accuracy Level

Type inference methods based on deep learning are becoming increasingly popular as they aim to compensate for the drawbacks of static and dynamic analysis approaches, such as high uncertainty. However, their practical application is still…

Software Engineering · Computer Science 2023-08-08 Jonathan Elkobi , Bernd Gruner , Tim Sonnekalb , Clemens-Alexander Brust

Towards Improved and Interpretable Deep Metric Learning via Attentive Grouping

Grouping has been commonly used in deep metric learning for computing diverse features. However, current methods are prone to overfitting and lack interpretability. In this work, we propose an improved and interpretable grouping method to…

Computer Vision and Pattern Recognition · Computer Science 2021-08-26 Xinyi Xu , Zhengyang Wang , Cheng Deng , Hao Yuan , Shuiwang Ji

Indic Handwritten Script Identification using Offline-Online Multimodal Deep Network

In this paper, we propose a novel approach of word-level Indic script identification using only character-level data in training stage. The advantages of using character level data for training have been outlined in section I. Our method…

Computer Vision and Pattern Recognition · Computer Science 2019-10-17 Ayan Kumar Bhunia , Subham Mukherjee , Aneeshan Sain , Ankan Kumar Bhunia , Partha Pratim Roy , Umapada Pal