Related papers: Using statistical smoothing to date medieval manus…

Dating medieval English charters

Deeds, or charters, dealing with property rights, provide a continuous documentation which can be used by historians to study the evolution of social, economic and political changes. This study is concerned with charters (written in Latin)…

Applications · Statistics 2013-01-21 Gelila Tilahun, Andrey Feuerverger, Michael Gervers

Probabilistic Dating of Historical Manuscripts via Evidential Deep Regression on Visual Script Features

We introduce a probabilistic approach for dating historical manuscript pages from visual features alone. Instead of aggregating centuries into classes as is standard in the previous literature, we pose dating as an evidential deep…

Artificial Intelligence · Computer Science 2026-05-18 Ranjith Chodavarapu

Application of deep learning approaches for medieval historical documents transcription

Handwritten text recognition and optical character recognition solutions show excellent results with processing data of modern era, but efficiency drops with Latin documents of medieval times. This paper presents a deep learning method to…

Computer Vision and Pattern Recognition · Computer Science 2025-12-23 Maksym Voloshchuk, Bohdana Zarembovska, Mykola Kozlenko

Dating ancient manuscripts using radiocarbon and AI-based writing style analysis

Determining the chronology of ancient handwritten manuscripts is essential for reconstructing the evolution of ideas. For the Dead Sea Scrolls, this is particularly important. However, there is an almost complete lack of date-bearing…

Digital Libraries · Computer Science 2024-10-21 Mladen Popović, Maruf A. Dhali, Lambert Schomaker, Johannes van der Plicht, Kaare Lund Rasmussen, Jacopo La Nasa, Ilaria Degano, Maria Perla Colombini, Eibert Tigchelaar

Manuscripts in Time and Space: Experiments in Scriptometrics on an Old French Corpus

Witnesses of medieval literary texts, preserved in manuscript, are layered objects, being almost exclusively copies of copies. This results in multiple and hard to distinguish linguistic strata -- the author's scripta interacting with the…

Computation and Language · Computer Science 2018-02-06 Jean-Baptiste Camps

Probing the statistical properties of unknown texts: application to the Voynich Manuscript

While the use of statistical physics methods to analyze large corpora has been useful to unveil many patterns in texts, no comprehensive investigation has been performed investigating the properties of statistical measurements across…

Physics and Society · Physics 2013-07-04 Diego R. Amancio, Eduardo G. Altmann, Diego Rybski, Osvaldo N. Oliveira, Luciano da F. Costa

Writing Style Invariant Deep Learning Model for Historical Manuscripts Alignment

Historical manuscript alignment is a widely known problem in document analysis. Finding the differences between manuscript editions is mostly done manually. In this paper, we present a writer independent deep learning model which is trained…

Computer Vision and Pattern Recognition · Computer Science 2018-06-12 Majeed Kassis, Jumana Nassour, Jihad El-Sana

Image-based material analysis of ancient historical documents

Researchers continually perform corroborative tests to classify ancient historical documents based on the physical materials of their writing surfaces. However, these tests, often performed on-site, require actual access to the manuscript…

Computer Vision and Pattern Recognition · Computer Science 2023-04-13 Thomas Reynolds, Maruf A. Dhali, Lambert Schomaker

Recognizing Handwriting Styles in a Historical Scanned Document Using Unsupervised Fuzzy Clustering

The forensic attribution of the handwriting in a digitized document to multiple scribes is a challenging problem of high dimensionality. Unique handwriting styles may be dissimilar in a blend of several factors including character size,…

Computer Vision and Pattern Recognition · Computer Science 2023-06-30 Sriparna Majumdar, Aaron Brick

Labeling, Cutting, Grouping: an Efficient Text Line Segmentation Method for Medieval Manuscripts

This paper introduces a new way for text-line extraction by integrating deep-learning based pre-classification and state-of-the-art segmentation methods. Text-line extraction in complex handwritten documents poses a significant challenge,…

Computer Vision and Pattern Recognition · Computer Science 2019-07-02 Michele Alberti, Lars Vögtlin, Vinaychandran Pondenkandath, Mathias Seuret, Rolf Ingold, Marcus Liwicki

PHD: Pixel-Based Language Modeling of Historical Documents

The digitisation of historical documents has provided historians with unprecedented research opportunities. Yet, the conventional approach to analysing historical documents involves converting them from images to text using OCR, a process…

Computation and Language · Computer Science 2023-11-07 Nadav Borenstein, Phillip Rust, Desmond Elliott, Isabelle Augenstein

Open Source Handwritten Text Recognition on Medieval Manuscripts using Mixed Models and Document-Specific Finetuning

This paper deals with the task of practical and open source Handwritten Text Recognition (HTR) on German medieval manuscripts. We report on our efforts to construct mixed recognition models which can be applied out-of-the-box without any…

Computer Vision and Pattern Recognition · Computer Science 2022-01-20 Christian Reul, Stefan Tomasek, Florian Langhanki, Uwe Springmann

Historical Document Processing: A Survey of Techniques, Tools, and Trends

Historical Document Processing is the process of digitizing written material from the past for future use by historians and other scholars. It incorporates algorithms and software tools from various subfields of computer science, including…

Computer Vision and Pattern Recognition · Computer Science 2020-09-14 James P. Philips, Nasseh Tabrizi

A data science and machine learning approach to continuous analysis of Shakespeare's plays

The availability of quantitative text analysis methods has provided new ways of analyzing literature in a manner that was not available in the pre-information era. Here we apply comprehensive machine learning analysis to the work of William…

Computation and Language · Computer Science 2024-02-14 Charles Swisher, Lior Shamir

Neural Word Search in Historical Manuscript Collections

We address the problem of segmenting and retrieving word images in collections of historical manuscripts given a text query. This is commonly referred to as "word spotting". To this end, we first propose an end-to-end trainable model based…

Computer Vision and Pattern Recognition · Computer Science 2020-04-02 Tomas Wilkinson, Jonas Lindström, Anders Brun

Unsupervised Statistical Learning for Die Analysis in Ancient Numismatics

Die analysis is an essential numismatic method, and an important tool of ancient economic history. Yet, manual die studies are too labor-intensive to comprehensively study large coinages such as those of the Roman Empire. We address this…

Computer Vision and Pattern Recognition · Computer Science 2021-12-02 Andreas Heinecke, Emanuel Mayer, Abhinav Natarajan, Yoonju Jung

Record Counting in Historical Handwritten Documents with Convolutional Neural Networks

In this paper, we investigate the use of Convolutional Neural Networks for counting the number of records in historical handwritten documents. With this work we demonstrate that training the networks only with synthetic images allows us to…

Computer Vision and Pattern Recognition · Computer Science 2017-11-21 Samuele Capobianco, Simone Marinai

A Spatial Modeling Approach for Linguistic Object Data: Analysing dialect sound variations across Great Britain

Dialect variation is of considerable interest in linguistics and other social sciences. However, traditionally it has been studied using proxies (transcriptions) rather than acoustic recordings directly. We introduce novel statistical…

Methodology · Statistics 2018-07-02 Shahin Tavakoli, Davide Pigoli, John A. D. Aston, John S. Coleman

A Generic Image Retrieval Method for Date Estimation of Historical Document Collections

Date estimation of historical document images is a challenging problem, with several contributions in the literature that lack the ability to generalize from one dataset to others. This paper presents a robust date estimation system…

Computer Vision and Pattern Recognition · Computer Science 2022-04-11 Adrià Molina, Lluis Gomez, Oriol Ramos Terrades, Josep Lladós

Distance for Functional Data Clustering Based on Smoothing Parameter Commutation

We propose a novel method to determine the dissimilarity between subjects for functional data clustering. Spline smoothing or interpolation is common to deal with data of such type. Instead of estimating the best-representing curve for each…

Methodology · Statistics 2021-03-23 ShengLi Tzeng, Christian Hennig, Yu-Fen Li, Chien-Ju Lin